Profile Picture I am a third-year PhD student in the Computer Science Department at the University of North Carolina at Chapel Hill advised by Prof. Mohit Bansal. At UNC Chapel Hill, I am a part of the MURGe-Lab and the broader UNC-NLP group.

My research interests lie in the fields of Machine Learning, and Natural Language Processing. I maintain a list of my publications and research projects under the Research tab.

I have spent 5 (more like 3.5) excellent years at the Indian Institute of Technology, Bombay from where I received my dual-degree (B.Tech. + M.Tech.) in 2021. I worked closely with Prof. Preethi Jyothi and other members of the CSALT group. I have also spent wonderful summers as an intern at Adobe Research in 2019, 2022 and at the Allen Institute for AI (AI2) in 2023.

Email: archiki@cs.unc.edu

Updates

Nov 2024: Preprint, ScPO from my internship at Meta on self-alignment for reasoning in unsupervised and semi-supervised manner is out now.
Sept 2024: Preprint, LASeR out on learning to select reward models during LLM training.
Sept 2024: New preprint, MAgICoRe on multi-agent, iterative, coarse-to-fine refinement for math reasoning.
Sept 2024: New preprint, AdaCAD, introduces a dynamic decoding strategy to deal with variable amounts of knowledge conflict.
Jul 2024:Preprint on Sytem-1.x on training LLMs to balance fast and slow planning is out.
May 2024:Our paper Soft-SC on soft-voting across actions and trajectories of LLM agents to ACL 2024.
May 2024:Our paper ReGAL on learning generalizable and reusable abstractions in programs accepted to ICML 2024.
Apr 2024:Joining FAIR Labs, Meta in NYC as Research Scientist Intern with Jason Weston and Maryam Fazel-Zarandi for Summer 2024.
Mar 2024:Work on enabling LLM-agents to dynamically adapt (recursively decompose) to task complexity & LLM capabilities: ADaPT accepted to NAACL 2024 (findings). Camera-ready coming soon!
Feb 2024:Preprint on Soft-SC a continous relaxation of self-consistency for superior performance on interactive benchmarks code.
Jan 2024:Preprint on ReGAL an approach for learning generalizable and reusable abstractions in programs is out with code.
Jan 2024:Our paper RepARE: a framework incorporating visual information in questions for VQA accepted at ICLR 2024.
Nov 2023:Preprint on ADaPT an approach to enable LLM-agents to dynamically adapt (recursively decompose) to task complexity & LLM capabilities "as-needed" is out along with code.
Oct 2023:Preprint on RepARE: a framework incorporating visual information in questions for VQA out along with code.
Oct 2023:Reasoning chain/CoT rationales evaluation work: ReCEval accepted to EMNLP 2023 (main conference). Camera-ready coming soon!
May 2023:Internship work at Adobe: "MeetingQA" on answering questions asked by meeting participants accepted to ACL 2023 (main conference). Paper, code and data release upcoming soon.
Apr 2023:Pre-print on evaluating reasoning chains based on desirable properties is out now along with code.
Feb 2023:Joining Allen Institute for AI (AI2) as Research Intern with Aristo team for Summer 2023.
Jan 2023:GrIPS paper accepted to EACL 2023.
Mar 2022:Pre-print on searching good instructions for large language models is out now along with code.
Mar 2022:Joining Adobe Research (US) as Research Scientist Intern for Summer 2022 to work with Franck Dernoncourt, David Yoon, and Trung Bui.
Nov 2021:Code-switching paper wins best paper honorable mention 🏆 at MRL 2021.
Sep 2021:Paper on intermediate-task training in code-switched languages accepted to appear in the Workshop on Multilingual Representation Learning (MRL 2021) at EMNLP 2021.
Aug 2021:Joined the PhD program of the Computer Science Department at UNC Chapel Hill.
Jul 2021:New paper on intermediate-task training for natuaral understanding benchmarks in code-switched languages is out on arxiv.
Apr 2021:Accepted the PhD admit from the Computer Science Department at UNC Chapel Hill. Awarded the Inclusive Excellence Fellowship.
Jan 2021:Paper investigating robustness in end-to-end speech recognition systems accepted at IEEE- International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021. Camera-ready version is now available!
Dec 2020:Paper on minimization of Age-of-Information with decentralized Multiplayer Multi-Armed Bandits accepted at IEEE-Wireless Communications and Networking Conference (WCNC) 2021 , camera-ready version is now available!
Dec 2020:Attended the Amazon Research Days 2020 India virtually (invite-only)
Oct 2020:Paper investigating robustness in end-to-end speech recognition systems submitted to IEEE- International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021 (pre-print available).
Sep 2020:Paper on minimization of Age-of-Information with decentralized Multiplayer Multi-Armed Bandits submitted to IEEE-Wireless Communications and Networking Conference (WCNC) 2021 , here is a detailed pre-print with proofs.
Aug 2020:Awarded the Institute Academic Prize by IIT Bombay for outstanding academic performance in the academic year 2019-2020
Aug 2020:Selected in the 150 students to attend the Google AI Summer School organized by Google Research India
Jun 2020:Paper on Understanding Confounding Effect of Accents in End-to-End ASR accepted at ACL 2020. Paper link out!
May 2020:Filed a patent at the United States Patent and Trademark Office (USPTO) on Key-Value Memory Networks for predicting time series metrics
Apr 2020:Paper on Understanding Confounding Effect of Accents in End-to-End ASR accepted at ACL 2020. Paper link to be out very soon!
Apr 2020:Poster of my work on Cold-Start Time Series Forecasting was presented at the Web Conference (WWW), 2020
Jul 2019:Joined CSALT Lab, IIT Bombay conducting research on machine learning and its applications to NLP and ASR
May 2019:Internship at the Big Data Experience Lab, Adobe Research, Bangalore, India
May 2018:Internship at CIBIV Research Lab working with Prof. Arndt Von Haeseler