Build a human-in-the-loop audit system for clinical trial data extraction. Focus on provenance: every cell in an extracted table must point back to a specific PDF page/region.