Metadata-Version: 2.4
Name: avrocurio
Version: 1.0.0
Summary: Apache Avro serialization/deserialization with Confluent Schema Registry framing and Apicurio integration
Project-URL: source, https://github.com/castoredc/avrocurio/
Project-URL: changelog, https://github.com/castoredc/avrocurio/blob/main/CHANGELOG.md
Author-email: Castor <dev@castoredc.com>
Maintainer-email: Nick Groenen <nick.groenen@castoredc.com>
License-Expression: BSD-2-Clause-Patent
License-File: LICENSE
Requires-Python: >=3.10
Requires-Dist: cachetools>=5.0.0
Requires-Dist: dataclasses-avroschema>=0.62.0
Requires-Dist: fastavro>=1.9.0
Requires-Dist: httpx>=0.27.0
Requires-Dist: typing-extensions>=4.0.0
Description-Content-Type: text/markdown

# AvroCurio

[Apache Avro] serialization/deserialization with [Confluent Schema Registry framing] using [Apicurio Schema Registry].

## Installation

Using `uv`, `poetry`, or `pip`:

```bash
uv add avrocurio
```

```bash
poetry add avrocurio
```

```bash
pip install avrocurio
```

## Quick Start

### 1. Define your schema using [dataclasses-avroschema]

```python
from dataclasses import dataclass
from dataclasses_avroschema import AvroModel

@dataclass
class User(AvroModel):
    name: str
    age: int
    email: str
```

### 2. Serialize and deserialize data

```python
import asyncio
from avrocurio import AvroSerializer, ApicurioClient, ApicurioConfig

async def main():
    # Configure connection to Apicurio Registry
    config = ApicurioConfig(base_url="http://localhost:8080")

    # Create client and serializer
    async with ApicurioClient(config) as client:
        serializer = AvroSerializer(client)

        # Create a user instance
        user = User(name="John Doe", age=30, email="john@example.com")

        # Serialize the user to an Avro binary with Confluent registry framing.
        # Under the hood this will perform a lookup against Apicurio to get the
        # artifact ID for the schema, which is then prepended to the Avro binary
        # (along with a magic byte).
        serialized = await serializer.serialize(user)

        # Deserialize the binary back to a User instance.
        deserialized_user = await serializer.deserialize(serialized, User)

asyncio.run(main())
```

## Confluent Schema Registry Wire Format

AvroCurio implements the Confluent Schema Registry wire format:

```
+----------------+------------------+------------------+
| Magic Byte     | Schema ID        | Avro Payload     |
| (1 byte = 0x0) | (4 bytes, BE)    | (remaining)      |
+----------------+------------------+------------------+
```

- **Magic Byte**: Always `0x0` to identify Confluent wire format
- **Schema ID**: 4-byte big-endian integer referencing the schema in the registry
- **Avro Payload**: Standard Avro binary-encoded data

### Schema Caching

Schema caching is handled automatically by the ApicurioClient for performance.

## Development

### Requirements

Integration tests require a running Apicurio Registry.
Running it through Docker or Podman using [Compose][docker-compose] is easiest:

```bash
docker compose up
```

Port 8080 is assumed by default, but you can set `APICURIO_URL` to point to a different instance.

### Running Tests

```bash
# Run all tests
uv run pytest

# Run specific test file
uv run pytest tests/test_serializer.py

# Run with verbose output
uv run pytest -v

# Skip integration tests
uv run pytest -m "not integration"
```

## License

AvroCurio is open-source software released under the [BSD-2-Clause Plus Patent License].
This license is designed to provide: a) a simple permissive license; b) that is compatible with the GNU General Public License (GPL), version 2; and c) which also has an express patent grant included.

Please review the [LICENSE] file for the full text of the license.

[Apache Avro]: https://avro.apache.org/
[Apicurio Schema Registry]: https://www.apicur.io/registry/
[BSD-2-Clause Plus Patent License]: https://spdx.org/licenses/BSD-2-Clause-Patent.html
[Confluent Schema Registry framing]: https://docs.confluent.io/platform/current/schema-registry/fundamentals/serdes-develop/index.html#wire-format
[LICENSE]: LICENSE
[dataclasses-avroschema]: https://marcosschroh.github.io/dataclasses-avroschema/
[docker-compose]: https://docs.docker.com/compose/
