Metadata-Version: 2.1
Name: BlueWhale3-Text
Version: 1.5.1
Summary: 用于文本挖掘的蓝鲸附加组件。
Home-page: https://github.com/biolab/orange3-text
Download-URL: https://github.com/biolab/orange3-text/tarball/1.6.0
Author: 大圣实验楼
Author-email: dashenglab@163.com
Keywords: orange3-text,data mining,orange3 add-on
Description-Content-Type: text/markdown
Provides-Extra: test
Provides-Extra: doc
License-File: LICENSE

BlueWhale3 Text
============

Orange add-on for text mining. It provides access to publicly available data,
like NY Times, Twitter and PubMed. Further, it provides tools for preprocessing,
constructing vector spaces (like bag-of-words, topic modeling and word2vec) and
visualizations like word cloud end geo map. All features can be combined with
powerful data mining techniques from the Orange data mining framework.

See [documentation](http://orange3-text.readthedocs.org/).

Features
--------
#### Access to data
* Load a corpus of text documents
* Access publicly available data (The Guardian, NY Times, Twitter, Wikipedia, PubMed)

#### Text analysis
* Preprocess corpus
* Generate bag of words
* Embed documents into vector space
* Perform sentiment analysis
* Detect emotions in tweets
* Discover topics in the text
* Compute document statistics
* Visualize frequent words in the word cloud
* Find words that enrich selected documents
