====================================================================================================
NER PII Benchmark — Comparative Summary
====================================================================================================

Dataset: nvidia-pii
--------------------------------------------------------------------------------
System                                F1-macro   F1-micro  Entity-F1  Latency(ms)  Tier
----------------------------------------------------------------------------------
NerGuard Hybrid V2 (qwen2.5:7b)         0.5051     0.7009     0.6618       563.67     2 (16 labels)
NerGuard Hybrid V2 (gpt-oss:20b)        0.5028     0.7012     0.6640      3139.28     2 (16 labels)
NerGuard Hybrid V2 (deepseek-r1:14      0.5008     0.6970     0.6606      7566.13     2 (16 labels)
NerGuard Hybrid V2 (llama3.1:8b)        0.4972     0.6973     0.6583       706.52     2 (16 labels)
NerGuard Hybrid V2 (phi4:14b)           0.4778     0.6981     0.6595      1251.25     2 (16 labels)
NerGuard Hybrid V2 (mistral-nemo:1      0.4774     0.6967     0.6582       734.33     2 (16 labels)
NerGuard Hybrid V2 (qwen2.5:14b)        0.4773     0.6975     0.6619       981.33     2 (16 labels)
