Metadata-Version: 2.1
Name: aita
Version: 0.1.2
Summary: AI Powered Data Platform
Home-page: https://www.project-aita.com
License: Apache Software License 2.0
Keywords: Artificial Intelligence,Data Analysis,Code Generation
Author: aita
Author-email: contact@project-aita.com
Requires-Python: >=3.9,<4.0
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: Apache Software License
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Provides-Extra: aitalab
Provides-Extra: dynamodb
Provides-Extra: postgresql
Provides-Extra: pyspark
Provides-Extra: snowflake
Requires-Dist: adbc-driver-postgresql (>=1.0.0,<2.0.0) ; extra == "postgresql"
Requires-Dist: adbc-driver-snowflake (>=1.0.0,<2.0.0) ; extra == "snowflake"
Requires-Dist: adbc-driver-sqlite (>=1.0.0,<2.0.0)
Requires-Dist: boto3 (>=1.34.117,<2.0.0) ; extra == "dynamodb"
Requires-Dist: findspark (>=2.0.1,<3.0.0) ; extra == "pyspark"
Requires-Dist: ipython (==8.14)
Requires-Dist: jupyter-ai-magics (>=2.14.0,<3.0.0)
Requires-Dist: langchain (==0.1.20)
Requires-Dist: langchain-openai (>=0.0.7,<0.0.8)
Requires-Dist: langgraph (>=0.0.48,<0.0.49)
Requires-Dist: mock (>=5.1.0,<6.0.0)
Requires-Dist: openai (>=1.12.0,<2.0.0)
Requires-Dist: pandas (>=2.2.2,<3.0.0)
Requires-Dist: pyarrow (>=16.1.0,<17.0.0)
Requires-Dist: pydantic (>=2.7.1,<3.0.0)
Requires-Dist: pyspark (>=3.2.0,<4.0.0) ; extra == "pyspark"
Requires-Dist: typer (>=0.12.3,<0.13.0)
Requires-Dist: types-setuptools (>=70.0.0.20240524,<71.0.0.0)
Project-URL: Repository, https://github.com/project-aita/aita
Description-Content-Type: text/markdown

# aita
An AI-powered data agent for data analysis, data engineering, and visualization.


[![Build status](https://github.com/project-aita/aita/workflows/build/badge.svg)](https://github.com/project-aita/aita/actions/workflows/build.yml?query=workflow%3Abuild)
[![Python Version](https://img.shields.io/pypi/pyversions/aita.svg)](https://pypi.org/project/aita/)
[![Dependencies Status](https://img.shields.io/badge/dependencies-up%20to%20date-brightgreen.svg)](https://github.com/project-aita/aita/pulls?utf8=%E2%9C%93&q=is%3Apr%20author%3Aapp%2Fdependabot)

[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
[![Security: bandit](https://img.shields.io/badge/security-bandit-green.svg)](https://github.com/PyCQA/bandit)
[![Pre-commit](https://img.shields.io/badge/pre--commit-enabled-brightgreen?logo=pre-commit&logoColor=white)](https://github.com/project-aita/aita/blob/main/.pre-commit-config.yaml)
[![License](https://img.shields.io/github/license/project-aita/aita)](https://github.com/project-aita/aita/blob/main/LICENSE)
[![Coverage Report](assets/images/coverage.svg)](https://github.com/project-aita/aita/blob/main/assets/images/coverage.svg)


## Tech Stack

The platform builds on the following technologies and frameworks:

- **[LangChain](https://www.langchain.com/)**: Integrates language models into application workflows.
- **[Reflex](https://reflex.dev/)**: An open-source framework for quickly building interactive web applications in pure Python.
- **[Apache Arrow](https://arrow.apache.org/)**: A cross-language development platform for in-memory data. It specifies a standardized, language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware such as CPUs and GPUs.
- **[Jupyter AI Magics](https://github.com/jupyterlab/jupyter-ai)**: Part of the Jupyter AI project; provides the `%ai` and `%%ai` magics for working with AI models in notebooks.

## Supported Models
Model providers are supported through Jupyter AI Magics. Each provider needs the environment variables and Python packages listed below.

| Provider            | Provider ID          | Environment variable(s)    | Python package(s)               |
|---------------------|----------------------|----------------------------|---------------------------------|
| AI21                | `ai21`               | `AI21_API_KEY`             | `ai21`                          |
| Anthropic           | `anthropic`          | `ANTHROPIC_API_KEY`        | `langchain-anthropic`           |
| Anthropic (chat)    | `anthropic-chat`     | `ANTHROPIC_API_KEY`        | `langchain-anthropic`           |
| Bedrock             | `bedrock`            | N/A                        | `boto3`                         |
| Bedrock (chat)      | `bedrock-chat`       | N/A                        | `boto3`                         |
| Cohere              | `cohere`             | `COHERE_API_KEY`           | `cohere`                        |
| ERNIE-Bot           | `qianfan`            | `QIANFAN_AK`, `QIANFAN_SK` | `qianfan`                       |
| Gemini              | `gemini`             | `GOOGLE_API_KEY`           | `langchain-google-genai`        |
| GPT4All             | `gpt4all`            | N/A                        | `gpt4all`                       |
| Hugging Face Hub    | `huggingface_hub`    | `HUGGINGFACEHUB_API_TOKEN` | `huggingface_hub`, `ipywidgets`, `pillow` |
| NVIDIA              | `nvidia-chat`        | `NVIDIA_API_KEY`           | `langchain_nvidia_ai_endpoints` |
| OpenAI              | `openai`             | `OPENAI_API_KEY`           | `langchain-openai`              |
| OpenAI (chat)       | `openai-chat`        | `OPENAI_API_KEY`           | `langchain-openai`              |
| SageMaker           | `sagemaker-endpoint` | N/A                        | `boto3`                         |
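
Before selecting a provider, it can help to check that its environment variables are set. The following is a minimal sketch, not part of aita or Jupyter AI Magics: the `REQUIRED_ENV_VARS` mapping is copied from the table above, and the helper name `missing_env_vars` is hypothetical.

```python
import os

# Environment variables per provider ID, copied from the table above.
# Providers listed as N/A in the table map to an empty list.
REQUIRED_ENV_VARS = {
    "ai21": ["AI21_API_KEY"],
    "anthropic": ["ANTHROPIC_API_KEY"],
    "anthropic-chat": ["ANTHROPIC_API_KEY"],
    "bedrock": [],
    "bedrock-chat": [],
    "cohere": ["COHERE_API_KEY"],
    "qianfan": ["QIANFAN_AK", "QIANFAN_SK"],
    "gemini": ["GOOGLE_API_KEY"],
    "gpt4all": [],
    "huggingface_hub": ["HUGGINGFACEHUB_API_TOKEN"],
    "nvidia-chat": ["NVIDIA_API_KEY"],
    "openai": ["OPENAI_API_KEY"],
    "openai-chat": ["OPENAI_API_KEY"],
    "sagemaker-endpoint": [],
}


def missing_env_vars(provider_id, environ=None):
    """Return the variables from the table that are unset or empty.

    `environ` defaults to os.environ; pass a dict to check another mapping.
    Hypothetical helper for illustration only.
    """
    env = os.environ if environ is None else environ
    if provider_id not in REQUIRED_ENV_VARS:
        raise ValueError(f"unknown provider ID: {provider_id!r}")
    return [name for name in REQUIRED_ENV_VARS[provider_id] if not env.get(name)]


if __name__ == "__main__":
    # Report what is still missing for the OpenAI chat provider.
    print(missing_env_vars("openai-chat"))
```

An empty list means every variable the table names for that provider is present in the environment.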

## Supported Data Sources
- [x] Snowflake
- [x] SQLite
- [ ] BigQuery
- [x] PostgreSQL
- [ ] MySQL
- [ ] Redshift
- [ ] DynamoDB
- [x] File (CSV, Excel, Parquet, etc.)
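
Some data source drivers are shipped as optional extras in the package metadata (`postgresql`, `snowflake`), while the SQLite driver is a regular dependency. The sketch below builds the requirement string to pass to `pip install` for a given source. The helper name `install_requirement` and the accepted source names are hypothetical; only the extra names come from the metadata.

```python
# Extras declared as Provides-Extra in the package metadata.
# None means the driver is already a regular dependency of aita.
EXTRA_FOR_SOURCE = {
    "snowflake": "snowflake",
    "postgresql": "postgresql",
    "postgres": "postgresql",  # convenience alias (assumption)
    "sqlite": None,
}


def install_requirement(source):
    """Return the pip requirement string for a data source.

    Hypothetical helper for illustration only.
    """
    key = source.strip().lower()
    if key not in EXTRA_FOR_SOURCE:
        raise ValueError(f"no known extra for data source: {source!r}")
    extra = EXTRA_FOR_SOURCE[key]
    return "aita" if extra is None else f"aita[{extra}]"


if __name__ == "__main__":
    for name in ("Snowflake", "Postgres", "SQLite"):
        print(name, "->", install_requirement(name))
```

The resulting string can be quoted and passed to pip, for example `pip install "aita[postgresql]"`.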

## Supported Engines
- [x] Apache Spark
- [ ] Apache Flink
- [ ] Presto
- [ ] [Ray](https://ray.io/): A distributed computing framework that scales AI tasks and data processing across clusters.

## 🛡 License

[![License](https://img.shields.io/github/license/project-aita/aita)](https://github.com/project-aita/aita/blob/main/LICENSE)

This project is licensed under the `Apache Software License 2.0`. See [LICENSE](https://github.com/project-aita/aita/blob/main/LICENSE) for more details.
