Metadata-Version: 2.1
Name: bixin
Version: 0.0.3
Summary: UNKNOWN
Home-page: UNKNOWN
License: UNKNOWN
Platform: UNKNOWN
Description-Content-Type: text/markdown
Requires-Dist: cppjieba-py
Provides-Extra: dev
Requires-Dist: prefixtree (>=0.2.5) ; extra == 'dev'
Requires-Dist: chardet (>=3.0.4) ; extra == 'dev'
Requires-Dist: opencc-python-reimplemented (>=0.1.3) ; extra == 'dev'
Provides-Extra: test
Requires-Dist: spec (>=1.4.1) ; extra == 'test'
Requires-Dist: nose (>=1.3.7) ; extra == 'test'

bixin
=====
[![Build Status](https://travis-ci.org/bung87/bixin.svg?branch=master)](https://travis-ci.org/bung87/bixin) [![PyPI](https://img.shields.io/pypi/v/bixin.svg)](https://pypi.python.org/pypi/bixin)  

Chinese sentiment analysis based on dictionaries and rules.

Installation
============
``> pip3 install git+https://github.com/bung87/bixin``

Usage
=====

```python
from bixin import predict

text = "幸福每时每刻都会像路边的乞丐一样出现在你面前。要是你觉得你所梦想的幸福不是这样的，因而断言你的幸福已死亡，你只接受符合你的原则和心愿的幸福，那么你就会落得不幸。"
# Quoted from André Gide, "The Fruits of the Earth" (《人间食粮》)
predict(text)
# sentiment score: 0.42
```
The sentiment score ranges from -1 to 1.
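
To compare a score against the **n**/**p** tags used in the test data, it has to be turned into a label. The following is a minimal sketch, not part of bixin: `label_from_score` and its `neutral_band` parameter are hypothetical, and it assumes that scores above zero lean positive and scores below zero lean negative.

```python
def label_from_score(score, neutral_band=0.0):
    """Map a score in [-1, 1] to 'p', 'n', or None (treated as neutral).

    Assumption: positive scores mean positive sentiment. The README only
    states the range, so verify this against your own data.
    """
    if not -1.0 <= score <= 1.0:
        raise ValueError("score must be within [-1, 1]")
    if score > neutral_band:
        return "p"
    if score < -neutral_band:
        return "n"
    return None  # inside the neutral band


print(label_from_score(0.42))
```

In practice the argument would be the value returned by `predict(text)`.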

``predict`` loads the dictionary data on its first call; to load it ahead of time, call ``predict.classifier.initialize()``.

Accuracy
========

Tested on a corpus of 6226 tagged texts, a mix of shopping reviews, Sina Weibo tweets, hotel reviews, news, and financial news.

accuracy: **0.827771**

**Notice**: all neutral texts are ignored.

Details about the dataset: [https://github.com/bung87/bixin/wiki](https://github.com/bung87/bixin/wiki)

Development
===========

``> pip3 install -e ".[dev]" git+https://github.com/bung87/bixin``




    ./dictionaries              dictionaries from various sources
    ./data                      dictionaries processed by ./scripts/tagger.py
    ./scripts/release_data.py   releases data into the package

``./scripts/score.py``

All data archives: [https://github.com/bung87/bixin/releases/tag/v0.0.1](https://github.com/bung87/bixin/releases/tag/v0.0.1)

Run accuracy testing with all .txt files under the **test_data** directory. Each file holds one sentence per line, and each line ends with a space followed by a tag, **n** or **p**.
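
The line format described above can be read with a few lines of Python. This is only an illustrative sketch, not the project's scoring script: `parse_tagged_line`, `accuracy`, and the toy classifier are hypothetical names, and the sample sentences are made up.

```python
def parse_tagged_line(line):
    """Split 'sentence tag' into (sentence, tag); return None if malformed."""
    line = line.rstrip("\r\n")
    # The tag is whatever follows the last space on the line.
    sentence, sep, tag = line.rpartition(" ")
    if not sep or not sentence or tag not in ("n", "p"):
        return None
    return sentence, tag


def accuracy(lines, classify):
    """Fraction of well-formed lines for which classify(sentence) equals the tag."""
    total = correct = 0
    for line in lines:
        parsed = parse_tagged_line(line)
        if parsed is None:
            continue  # skip lines without a valid n/p tag
        sentence, tag = parsed
        total += 1
        if classify(sentence) == tag:
            correct += 1
    return correct / total if total else 0.0


sample = ["这家酒店很干净 p", "物流太慢了 n", "no tag here"]


def toy_classifier(sentence):
    # Stand-in for demonstration only; this is not bixin's model.
    return "p" if "干净" in sentence else "n"


print(accuracy(sample, toy_classifier))
```

To score real predictions, `classify` would wrap `predict` and convert its score into an **n** or **p** label.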

Test
====
`nosetests -c nose.cfg` for single python version  
`tox` for multiple python versions

