Metadata-Version: 2.1
Name: NIADic-py
Version: 0.0.1
Summary: Unofficial Python Wrapper for NIADic dataset
Author-email: minwook-shin <minwook0106@gmail.com>
Project-URL: Homepage, https://github.com/minwook-shin/niadic_py
Project-URL: Bug Tracker, https://github.com/minwook-shin/niadic_py/issues
Classifier: Programming Language :: Python :: 3
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.9
Description-Content-Type: text/markdown
Requires-Dist: pandas ==2.1.1
Requires-Dist: openpyxl ==3.1.2
Requires-Dist: chardet ==5.2.0
Provides-Extra: dev
Requires-Dist: build ==1.0.3 ; extra == 'dev'
Requires-Dist: pdoc ==14.1.0 ; extra == 'dev'

# NIADic_Py

Unofficial Python Wrapper for NIADic dataset

# Description

This is Python wrapper for the NIADic provided by the K-ICT Big Data Center.

# Column

* term : korean words
* tag : word class
  * Using [Korean POS tags comparison chart](https://docs.google.com/spreadsheets/d/1OGAjUvalBuX-oZvZ_-9tEfYD2gQe7hTGsgUpiiBSXI8/edit#gid=0)
* category: Category of words

# Tokenizer Example

* input : "저는 오늘 아침에 하나의 빵을 먹고 학교로 급하게 갔습니다."
* output : ['저', '오늘', '아침', '하나', '빵', '먹', '학교', '급하', '갔습니']

# License 

'NIADic_py' contains the bundled NIADic files and available under the CC BY-SA 2.0 license.
