faker-ph
Copyright 2026 Daniel Roxas

This product includes software developed by Daniel Roxas
(https://github.com/danielquillanroxas/faker-ph).

Licensed under the Apache License, Version 2.0.

-------------------------------------------------------------------------------
Bundled reference data and upstream credits
-------------------------------------------------------------------------------

Geographic data (src/faker_ph/data/psgc_*.csv)
  Derived from the Philippine Standard Geographic Code (PSGC), published by
  the Philippine Statistics Authority (PSA). Source redistribution via:
    - jgngo/psgc-data — https://github.com/jgngo/psgc-data (MIT License)
    - altcoder/philippines-psgc-shapefiles —
      https://github.com/altcoder/philippines-psgc-shapefiles
  PSA PSGC data is published under PSA's open data terms.

Mobile carrier prefix data (src/faker_ph/data/mobile_prefixes.json)
  Compiled from:
    - 0xbitx/PH_Mobile_Number_Prefixes —
      https://github.com/0xbitx/PH_Mobile_Number_Prefixes
    - Wikipedia article "Telephone numbers in the Philippines"
      (CC BY-SA 4.0)

Landline area code data (src/faker_ph/data/landline_area_codes.json)
  Compiled from the Wikipedia article "Telephone numbers in the Philippines"
  (CC BY-SA 4.0).

Philippine name inventory (src/faker_ph/data/ph_names.json)
  Hand-compiled from publicly observable surname distributions and given-name
  frequency tables; origin buckets (Spanish, Chinese, native Tagalog,
  Anglo) for synthetic generation. NOT a complete or representative
  inventory — see README caveats.

-------------------------------------------------------------------------------
Disclaimer on identifier realism
-------------------------------------------------------------------------------

All identifiers generated by this library (PSN, TIN, SSS, PhilHealth PIN,
UMID, GSIS, PRC, driver's license, passport, account numbers, mobile
numbers) are FORMAT-VALID but are NOT guaranteed to correspond to any real
person, entity, or issued document. Checksum algorithms for several
agencies (e.g. PhilHealth, SSS) are not publicly documented, so checksum
digits are emitted as format-valid random digits rather than verified
checksums. This library must not be used to forge, impersonate, or
validate real-world credentials. It is intended for synthetic-data
generation in NLP, privacy research, and software testing.
