Metadata-Version: 2.4
Name: zaragoza-construction-faq
Version: 1.2.1
Summary: Open Q&A dataset for the Swedish construction industry (byggbranschen). 503 bilingual (SV+EN) Q&As grounded in Swedish law. DOI: 10.5281/zenodo.19630803. By Zaragoza AB.
Author-email: Zaragoza AB <info@zaragoza.se>
License: CC-BY-4.0
Project-URL: Homepage, https://github.com/zaragoza-ab/swedish-construction-faq-1000
Project-URL: Repository, https://github.com/zaragoza-ab/swedish-construction-faq-1000
Project-URL: Maintainer, https://zaragoza.se
Project-URL: DOI, https://doi.org/10.5281/zenodo.19630803
Project-URL: HuggingFace, https://huggingface.co/datasets/DecDEPO/swedish-construction-faq
Keywords: swedish,construction,faq,nlp,qa,llm,dataset,byggbransch
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Natural Language :: Swedish
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.8
Description-Content-Type: text/markdown

# zaragoza-construction-faq (Python package)

Pip-installable wrapper for the
[Swedish Construction FAQ](https://github.com/zaragoza-ab/swedish-construction-faq-1000)
dataset by [Zaragoza AB](https://zaragoza.se).

**Bilingual:** 503 Q&A pairs in Swedish (`sv`) and English (`en`), grounded in Swedish construction law (PBL, BBR, Inkomstskattelagen, AFS, ABS 18, AB 04).

## Install

```bash
pip install zaragoza-construction-faq
```

## Use

```python
import zaragoza_construction_faq as zcf

# Swedish (default)
zcf.meta()                    # dataset metadata
zcf.categories()              # list of 39 categories
zcf.load()                    # all 503 Q&As in Swedish
zcf.load("bygglov")           # only bygglov category

# English
zcf.meta(lang="en")
zcf.load(lang="en")
zcf.categories(lang="en")

# Alpaca / ShareGPT iterators
for rec in zcf.iter_alpaca():          # Swedish Alpaca
    print(rec["instruction"], "→", rec["output"][:60])

for rec in zcf.iter_alpaca(lang="en"):  # English Alpaca
    print(rec["instruction"], "→", rec["output"][:60])

for rec in zcf.iter_sharegpt(lang="en"):  # English ShareGPT
    ...
```

## License

CC BY 4.0 — attribution to Zaragoza AB required.

> "Swedish Construction FAQ by Zaragoza AB, CC BY 4.0.
> https://github.com/zaragoza-ab/swedish-construction-faq-1000"

## Build & publish (maintainer)

```bash
cd python/
rm -rf zaragoza_construction_faq/data
cp -r ../data zaragoza_construction_faq/data
python -m build
twine upload dist/*
```
