====================================================================================================
NER PII Benchmark — Comparative Summary
====================================================================================================

Dataset: nvidia-pii
--------------------------------------------------------------------------------
System                                F1-macro   F1-micro  Entity-F1  Latency(ms)  Tier
----------------------------------------------------------------------------------
NerGuard Hybrid V2 (gpt-4o)             0.5069     0.7015     0.6634        41.36     2 (16 labels)
NerGuard Hybrid (gpt-4o)                0.4943     0.6862     0.6475        31.31     2 (16 labels)
Presidio                                0.4933     0.5493     0.6680        86.09     2 (15 labels)
Piiranha                                0.4731     0.6501     0.6195        30.91     2 (14 labels)
NerGuard Base                           0.4175     0.6105     0.6076        33.23     2 (16 labels)
spaCy (en_core_web_trf)                 0.3607     0.4175     0.5527       144.22     2 (8 labels)
dslim/bert-base-NER                     0.3331     0.4821     0.6225        37.59     2 (4 labels)
