Metadata-Version: 2.4
Name: benchwise
Version: 0.1.0a0
Summary: The GitHub of LLM Evaluation - Python SDK
Author-email: Bhuvnesh Sharma <bhuvnesh875@gmail.com>
License: MIT
Project-URL: Homepage, https://github.com/Benchwise/benchwise
Project-URL: Repository, https://github.com/Benchwise/benchwise
Project-URL: Issues, https://github.com/Benchwise/benchwise/issues
Keywords: llm,evaluation,benchmarking,ai,ml
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Testing
Requires-Python: >=3.11
Description-Content-Type: text/markdown
Requires-Dist: pydantic>=2.0.0
Requires-Dist: httpx>=0.24.0
Requires-Dist: numpy>=1.24.0
Requires-Dist: pandas>=2.0.0
Requires-Dist: requests>=2.28.0
Requires-Dist: pytest>=7.0.0
Requires-Dist: pytest-asyncio>=0.21.0
Provides-Extra: metrics
Requires-Dist: rouge-score>=0.1.2; extra == "metrics"
Requires-Dist: sacrebleu>=2.3.0; extra == "metrics"
Requires-Dist: bert-score>=0.3.13; extra == "metrics"
Requires-Dist: nltk>=3.8.0; extra == "metrics"
Provides-Extra: llm-apis
Requires-Dist: openai>=1.0.0; extra == "llm-apis"
Requires-Dist: anthropic>=0.7.0; extra == "llm-apis"
Requires-Dist: google-generativeai>=0.3.0; extra == "llm-apis"
Provides-Extra: transformers
Requires-Dist: transformers>=4.30.0; extra == "transformers"
Requires-Dist: torch>=2.0.0; extra == "transformers"
Requires-Dist: sentence-transformers>=2.2.0; extra == "transformers"
Provides-Extra: dev
Requires-Dist: pytest>=7.0.0; extra == "dev"
Requires-Dist: pytest-asyncio>=0.21.0; extra == "dev"
Requires-Dist: pytest-mock>=3.10.0; extra == "dev"
Requires-Dist: pytest-cov>=4.0.0; extra == "dev"
Requires-Dist: ruff>=0.1.6; extra == "dev"
Requires-Dist: pre-commit>=3.0.0; extra == "dev"
Requires-Dist: mypy>=1.0.0; extra == "dev"
Requires-Dist: psutil>=5.9.0; extra == "dev"
Provides-Extra: all
Requires-Dist: rouge-score>=0.1.2; extra == "all"
Requires-Dist: sacrebleu>=2.3.0; extra == "all"
Requires-Dist: bert-score>=0.3.13; extra == "all"
Requires-Dist: nltk>=3.8.0; extra == "all"
Requires-Dist: openai>=1.0.0; extra == "all"
Requires-Dist: anthropic>=0.7.0; extra == "all"
Requires-Dist: google-generativeai>=0.3.0; extra == "all"
Requires-Dist: transformers>=4.30.0; extra == "all"
Requires-Dist: torch>=2.0.0; extra == "all"
Requires-Dist: sentence-transformers>=2.2.0; extra == "all"
Requires-Dist: pytest>=7.0.0; extra == "all"
Requires-Dist: pytest-asyncio>=0.21.0; extra == "all"
Requires-Dist: pytest-mock>=3.10.0; extra == "all"
Requires-Dist: pytest-cov>=4.0.0; extra == "all"
Requires-Dist: ruff>=0.1.6; extra == "all"
Requires-Dist: pre-commit>=3.0.0; extra == "all"
Requires-Dist: mypy>=1.0.0; extra == "all"
Requires-Dist: psutil>=5.9.0; extra == "all"

# BenchWise 🎯

**The GitHub of LLM Evaluation**

BenchWise is an open-source platform that democratizes LLM evaluation by making it as easy as writing unit tests. Create, share, and run custom LLM evaluations with PyTest-like simplicity.

---

Built with ❤️ by Bhuvnesh Sharma
