Developing an Automated Redaction Pipeline with Local LLMs
MedDB Group @ Brown CS
Neurosurgery @ RI Hospital
LLMs
Prompt Engineering
Fine Tuning
Medical Database
Current Challenges & Practices
Handling sensitive patient information requires strict adherence to ethical and legal requirements such as HIPAA. Under HIPAA, 18 types of identifiers are classified as PHI, and failing to anonymize them can have significant legal and ethical consequences. Traditional methods (manual redaction, rule-based systems, and conventional NLP models) struggle with context-dependent PHI, where whether a phrase identifies a patient depends on the surrounding text, so identifiers can slip through. The LLM-based pipeline addresses these limitations by integrating local LLMs into the redaction process, which keeps patient data on local hardware while the model uses context to decide what to redact.
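To make the limitation of rule-based redaction concrete, here is a minimal sketch in Python. It is not part of the pipeline described here: the patterns, the function name `redact_with_rules`, and the sample note are all hypothetical and made up for illustration. The rules catch identifiers with a fixed surface form (dates, phone numbers, record numbers) but leave behind a location mentioned in free text, which is the kind of context-dependent PHI a pattern list tends to miss.

```python
import re

# Hypothetical patterns covering a few identifiers with a fixed surface form.
# A real rule set would be far larger; this is for illustration only.
PATTERNS = {
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),
    "MRN": re.compile(r"\bMRN[:\s]*\d+\b"),
}


def redact_with_rules(text: str) -> str:
    """Replace every pattern match with a bracketed label such as [DATE]."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


# Fabricated sample note (no real patient data).
note = (
    "Pt seen 03/14/2023, MRN: 482913. "
    "Daughter works at the bakery on Hope Street; call 401-555-0142."
)

redacted = redact_with_rules(note)
print(redacted)
```

In this sketch the date, record number, and phone number are replaced, but "the bakery on Hope Street" passes through untouched because no pattern describes it. Adding more patterns helps only for identifiers that have a predictable shape.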
Feel free to reach out if you’re curious about more details or updates!