Metadata-Version: 2.1
Name: anko
Version: 0.2.8
Summary: Toolkit for performing anomaly detection algorithm on time series.
Home-page: https://github.com/tanlin2013/anko
Author: tao-lin
Author-email: tanlin2013@gmail.com
License: MIT
Download-URL: https://github.com/tanlin2013/anko/archive/v0.2.8.tar.gz
Description: ![](docs/anko_logo.png)
        # anko
        Toolkit for performing anomaly detection algorithm on 1D time series based on numpy, scipy.
        
        Conventional approaches that based on statistical analysis have been implemented, with mainly two approaches included:
        1. Normal Distribution  
        Data samples are presumably been generated by normal distribution, and therefore anomalous data points can be targeted by analysing the standard deviation.  
        
        2. Fitting Ansatz  
        Data samples are fitted by several ansatzs, and in accordance with the residual, anomalous data points can be selected. 
        
        Regarding model selections, models are adopted dynamically by performing normal test and by computing the (Akaike/Bayesian) information criterion.
        By default, the algorithm will first try to fit in the data into normal distribution, if it passed [normal test](https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.normaltest.html).
        If this attempt suffers from the loss of convergence or it did not pass [normal test](https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.normaltest.html) from begining, 
        then the algorithm will pass data into the second methods and try to execute all the available fitting ansatzs simultaneously. 
        The best fitting ansatz will be selected by information criterion, and finally the algorithm will pick up anomalous points in accordance with the residual.
        [click here to see all available methods.](https://tanlin2013.github.io/anko/build/html/anko.html#anko.anomaly_detector.AnomalyDetector.models)   
        
        Future development will also include methods that are based on deep learning techniques, such as isolation forest, support vector machine, etc.
        
        ## Requirements
        * python >= 3.6.0
        * numpy >= 1.16.4
        * scipy >= 1.2.1
        
        ## Installation 
        ```
        pip install anko
        ```
        For current release version please refer to [PyPI - anko homepage](https://pypi.org/project/anko/).
        
        ## Documentation
        For details about anko API, see the [reference documentation](https://tanlin2013.github.io/anko/index.html).
        
        ## Jupyter Notebook Tutorial (in dev)
        Run **anko_tutorial.ipynb** on your local Jupyter Notebook or host on [google colab](https://colab.research.google.com/github/tanlin2013/anko/blob/master/anko_tutorial.ipynb).
        
        ## Basic Usage
        1. Call AnomalyDetector
        ```
        from anko.anomaly_detector import AnomalyDetector  
        agent = AnomalyDetector(t, series)
        ```
        
        2. Define policies and threshold values (optional)
        ```
        agent.thres_params["linregress_res"] = 1.5  
        agent.apply_policies["z_normalization"] = True  
        agent.apply_policies["info_criterion"] = 'AIC'
        ```
        for the use of [**AnomalyDetector.thres_params**](https://tanlin2013.github.io/anko/build/html/anko.html#anko.anomaly_detector.AnomalyDetector.thres_params) 
        and [**AnomalyDetector.apply_policies**](https://tanlin2013.github.io/anko/build/html/anko.html#anko.anomaly_detector.AnomalyDetector.apply_policies), 
        please refer to the documentation.
        
        3. Run check
        ```
        check_result = agent.check()
        ```
        
        The type of output **check_result** is [**CheckResult**](https://tanlin2013.github.io/anko/build/html/anko.html#anko.anomaly_detector.CheckResult), which is basically a dictionary that contains the following attributes:
        > model: 'increase_step_func'  
        > popt: [220.3243250055105, 249.03846355234577, 74.00000107457113]  
        > perr: [0.4247789247961187, 0.7166253174634686, 0.0]  
        > anomalous_data: [(59, 209)]  
        > residual: [10.050378152592119]  
        > extra_info: ['Info: AnomalyDetector is using z normalization.', 'Info: There are more than 1 discontinuous points detected.']        
        
        * model (str): The best fit model been selected by algorithm.
        * popt (list): Estimated fitting parameters. 
        * perr (list): Corresponding errors of popt.
        * anomalous_data (list\[tuple(float, float)\]): Return a list of anomalous data points (t, series(t)), or an empty list if all data points are in order. 
        * residual (list): Residual of anomalous data.
        * extra_info (list): All convergence errors, warnings, informations during the execution are stored here.
        
        
        ## Run Test
        ```
        python -m unittest discover -s test -p '*_test.py'
        ```
        or simply
        ```
        make test
        ```
Keywords: statistics,time series,anomaly detection
Platform: UNKNOWN
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Topic :: Scientific/Engineering :: Mathematics
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Description-Content-Type: text/markdown
