A Java application that uses Phileas to identify PII (Personally Identifiable Information) in text across a wide variety of file formats. Types of PII are scored by magnitude, density, and confidence. A list of files suggested for redaction testing will be generated.
The goal of Phinder is to provide a comprehensive analysis of PII to help you take the next step to redact it with Philter. Note that Phinder may support more file types than Philter.
Visit http://philterd.github.io/phinder for documentation and more information.
mvn clean installjava -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/input.txtTo process a directory recursively:
java -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/ -RNote
Processing images requires tesseract-ocr to be installed.
At the completion of the scan, report.json and report.html files will be generated in the current directory.
Phinder can turn a scan into a starter Philter / Phileas redaction policy that enables the entity types it found, so discovery and redaction become one workflow. Use --emit-policy with an output file:
java -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/ -R --emit-policy starter-policy.jsonThe output is redaction-policy JSON that loads unchanged into Philter or Phileas. Each detected type is enabled with a REDACT strategy. Apply it directly, or tune it first (change strategies, add conditions, ignore terms) and re-run. Use a .phisql file extension to get the policy as PhiSQL instead, which compiles to the same JSON.
This is a starting point to review, tune, and measure (for example with Philter Scope) before you rely on it. Redaction is probabilistic: it reduces how much sensitive data gets through, it does not catch every instance, and you are responsible for validating the policy against your own data. Types that need a supplied custom policy to detect (custom identifiers, sections, or PhEye name detection) are reported as skipped rather than guessed at.
To store the report history in MongoDB, use the --mongodb CLI option:
java -jar target/phinder-1.0.0-SNAPSHOT.jar -i src/test/resources/input.txt --mongodb "mongodb://localhost:27017/phinder"For more examples and detailed usage, please refer to the documentation.
Copyright 2026 Philterd, LLC.
This project is licensed under the Apache License 2.0.
