Metadata-Version: 2.0
Name: ChemDataExtractor
Version: 1.3.0
Summary: A toolkit for extracting chemical information from the scientific literature.
Home-page: https://github.com/mcs07/ChemDataExtractor
Author: Matt Swain
Author-email: m.swain@me.com
License: MIT
Keywords: text-mining mining chemistry cheminformatics nlp html xml science scientific
Platform: UNKNOWN
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Topic :: Internet :: WWW/HTTP :: Indexing/Search
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Classifier: Topic :: Scientific/Engineering :: Chemistry
Classifier: Topic :: Text Processing
Classifier: Topic :: Text Processing :: Linguistic
Classifier: Topic :: Text Processing :: Markup :: HTML
Requires-Dist: DAWG
Requires-Dist: PyYAML
Requires-Dist: appdirs
Requires-Dist: beautifulsoup4
Requires-Dist: click
Requires-Dist: cssselect
Requires-Dist: lxml
Requires-Dist: nltk
Requires-Dist: pdfminer.six
Requires-Dist: python-crfsuite
Requires-Dist: python-dateutil
Requires-Dist: requests
Requires-Dist: six

ChemDataExtractor
=================

.. image:: http://img.shields.io/pypi/v/ChemDataExtractor.svg?style=flat-square
    :target: https://pypi.python.org/pypi/ChemDataExtractor

.. image:: http://img.shields.io/pypi/l/ChemDataExtractor.svg?style=flat-square
    :target: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE

.. image:: http://img.shields.io/travis/mcs07/ChemDataExtractor.svg?style=flat-square
    :target: https://travis-ci.org/mcs07/ChemDataExtractor

ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.


Features
--------

- HTML, XML and PDF document readers
- Chemistry-aware natural language processing pipeline
- Chemical named entity recognition
- Rule-based parsing grammars for property and spectra extraction
- Table parser for extracting tabulated data
- Document processing to resolve data interdependencies


Installation
------------

To install ChemDataExtractor, run::

    pip install chemdataextractor

Or, if you use Anaconda, run::

    conda install -c chemdataextractor chemdataextractor

Alternatively, try one of the other `installation options`_.


Documentation
-------------

Full documentation is available at http://chemdataextractor.org/docs


License
-------

ChemDataExtractor is licensed under the `MIT license`_, a permissive, business-friendly license for open-source
software.


.. _`installation options`: http://chemdataextractor.org/docs/install
.. _`MIT license`: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE


