Publications

(2026). Hair-Trigger Alignment: Black-Box Evaluation Cannot Guarantee Post-Update Alignment. arXiv.

PDF Source Document

(2025). TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs. EMNLP 2025.

PDF Code Source Document DOI

(2025). Reject Only Critical Tokens: Pivot-Aware Speculative Decoding. NeurIPS 2025 Workshop.

PDF Code Source Document

(2025). Uncertainty as Feature Gaps: Epistemic Uncertainty Quantification of LLMs in Contextual Question-Answering. ICLR 2026.

PDF Source Document

(2025). Reconsidering LLM Uncertainty Estimation Methods in the Wild. ACL 2025.

PDF Code Source Document DOI

(2025). Do Not Design, Learn: A Trainable Scoring Function for Uncertainty Estimation in Generative LLMs. NAACL 2025, Findings.

PDF Cite

(2024). CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning. ECCV 2024.

PDF Cite Code

(2024). MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs. ACL 2024.

PDF Cite Code

(2024). Federated Orthogonal Training: Mitigating Global Catastrophic Forgetting in Continual Federated Learning. ICLR 2024.

PDF Cite Code

(2023). Federated Alternate Training (FAT): Leveraging Unannotated Data Silos in Federated Segmentation for Medical Imaging. ISBI 2023.

PDF Cite DOI