Publications

My research spans agents and applications, multilingual NLP, LLM post-training, and benchmarks & evaluation — published across ACL, EMNLP, NAACL, COLING, NeurIPS and AACL, alongside top finishes in shared tasks.

Google Scholar

Last updated: Dec 23, 2025.

Papers

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

NeurIPS 2026
Robust and Fine-Grained Detection of AI Generated Texts Robust and Fine-Grained Detection of AI Generated Texts

Robust and Fine-Grained Detection of AI Generated Texts

TBD
Uncovering Cultural Representation Disparities in Vision-Language Models Uncovering Cultural Representation Disparities in Vision-Language Models

Uncovering Cultural Representation Disparities in Vision-Language Models

AACL 2025 Findings
DSBC: Data Science task Benchmarking with Context engineering DSBC: Data Science task Benchmarking with Context engineering

DSBC: Data Science task Benchmarking with Context engineering

AACL 2025 Main
Improving Multilingual Capabilities with Cultural and Local Knowledge in LLMs While Enhancing Native Performance Improving Multilingual Capabilities with Cultural and Local Knowledge in LLMs While Enhancing Native Performance

Improving Multilingual Capabilities with Cultural and Local Knowledge in LLMs While Enhancing Native Performance

AACL 2025 Main
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia

Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia

NeurIPS 2025
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

ICLR 2026
Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering

Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering

ICDM 2025 Workshops

Shared Tasks & Competitions

Cross-lingual emotion detection through Large Language Models Cross-lingual emotion detection through Large Language Models

Cross-lingual emotion detection through Large Language Models

ACL 20241st / 72
Self Reported Health Text Classification through Ensembles Self Reported Health Text Classification through Ensembles

Self Reported Health Text Classification through Ensembles

ACL 20242nd / 37
Black-Box Word-Level Text Boundary Detection in Partially AI Generated Texts Black-Box Word-Level Text Boundary Detection in Partially AI Generated Texts

Black-Box Word-Level Text Boundary Detection in Partially AI Generated Texts

NAACL 20241st, 6th / 308
Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering

Lexical Reranking of Semantic Retrieval (LeSeR) for Regulatory Question Answering

COLING 20253rd / 38
Sequential Learning for Claim Verification and Explanation Generation in Financial Domains Sequential Learning for Claim Verification and Explanation Generation in Financial Domains

Sequential Learning for Claim Verification and Explanation Generation in Financial Domains

COLING 20253rd / 43
NLU of Devanagari Script Languages: Detection of Language, Hate Speech, and Targets using LLMs NLU of Devanagari Script Languages: Detection of Language, Hate Speech, and Targets using LLMs

NLU of Devanagari Script Languages: Detection of Language, Hate Speech, and Targets using LLMs

COLING 20252nd / 57
Multi-class Emotion detection on highly imbalanced data Multi-class Emotion detection on highly imbalanced data

Multi-class Emotion detection on highly imbalanced data

ACL 20231st / 81

Questions about a paper?

Happy to chat about any of the work above.

Get in touch