Metadata-Version: 2.0
Name: The-Curator
Version: 0.1.2
Summary: UNKNOWN
Home-page: https://github.com/fny/thecurator
Author: Faraz Yashar
Author-email: faraz.yashar@gmail.com
License: Proprietary
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: Topic :: Software Development
Classifier: Topic :: Utilities
Classifier: Programming Language :: Python :: 3
Requires-Dist: jsonschema
Requires-Dist: pyaml
Requires-Dist: sqlalchemy
Provides-Extra: dev
Requires-Dist: dateparser; extra == 'dev'
Requires-Dist: pandas; extra == 'dev'
Requires-Dist: pytest; extra == 'dev'

The Curator 🖼
==============

.. image:: https://travis-ci.org/fny/thecurator.svg?branch=master
   :target: https://travis-ci.org/fny/thecurator
   :alt: Build Status

.. image:: https://badge.fury.io/py/thecurator.svg
   :target: https://pypi.python.org/pypi/thecurator
   :alt: The Curator on PyPI


The Curator helps you define pipelines for transforming dirty data into consumable databases.

Usage
-----

.. code:: python

  from thecurator import Curator

  # Paths to files describing different tables
  table_descriptions = ['patient.yml', 'lab.yml']
  curator = Curator(sqlalchemy_engine, table_descriptions)

  # Transform a pandas DataFrame according to the descriptions
  curator.transform_df('patient', patient_df)

  # Transform a dictionary array according to the descriptions
  curator.transform_dicts('patient', patient_dicts)

  # Transform and insert a according to the descriptions
  curator.insert_dicts('lab', lab_dicts)


See the tests. More coming soon...

Development
-----------

 - Install development requirements `pip install -r dev-requirements.txt`
 - Make changes
 - Run the tests `pytest tests`

Release
-------


https://packaging.python.org/tutorials/distributing-packages/
gpg --detach-sign -a dist/package-1.0.1.tar.gz
twine upload dist/*

twine upload --repository-url https://test.pypi.org/legacy/ dist/*


