Intro
I am Atharva Naik a first year PhD (and former MLT) student advised by Prof. Carolyn Rose at the Language Technologies Institute at Carnegie Mellon University.
My research interests lie around creating LLM agents that can collaborate with humans for software engineering while facilitating on-the-job learning about code quality, security and efficiency.
I'm currently working on a benchmark for version specific best practice recommendation for Python and using code generation capabilities of LLMs for automated phonological reasoning.
I recently finished a project where I created CRScore an automated metric for evaluating code review comments in a reference free manner using LLMs and static analysis tools like code smell detectors.
In the past, I have published work on data augmentation for code translation, using curriculum learning to improve generalization in code search, and using LLMs to provide support to students for reflection-based learning in collaborative programming activities.
Education
Aug 2024 (incoming) |
PhD in Language Technologies
Carnegie Mellon University
|
Aug 2022 - May 2024 |
Masters in Language Technologies
Carnegie Mellon University
GPA: 4.08/4
|
July 2018 - May 2022 |
Bachelor of Technology, Computer Science and Engineering
Indian Institute of Technology, Kharagpur
GPA: 9.66/10
|
Experience
Jan 2023 |
Research Assistant, Carnegie Mellon University
|
Aug 2021 - Dec 2021 |
Research Assistant Intern, Technische Universität Darmstadt
|
Apr 2021 - Sep 2021 |
Research Assistant Intern, University of Alberta
|
May 2021 - Aug 2021 |
Research Intern, Adobe, Bangalore
|
Feb 2019 - Mar 2020 |
Undergraduate Student Researcher, Autonomous Ground Vehicle Research Group, IIT Kharagpur
|
2024
CRScore: Grounding the Evaluation of Code Review Comments in Code Claims and Smells
Atharva Naik, Marcus Alenius, Daniel Fried, Carolyn Rosé
Arxiv
|
Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction
Atharva Naik, Kexun Zhang, Nathaniel Robinson, Aravind Mysore, Clayton Marr, Hong Sng, Rebecca Byrnes, Anna Cai, Kalvin Chang, David Mortensen
Arxiv
|
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
Arxiv
|
Generating Situated Reflection Triggers about Alternative Solution Paths: A Case Study of Generative AI for Computer-Supported Collaborative Learning
Atharva Naik, Jessica Ruhan Yin, Anusha Kamath, Qianou Ma, Sherry Tongshuang Wu, Charles Murray, Christopher Bogart, Majd Sakr, Carolyn P. Rose
AIED (2024)
|
Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks
Abhinav Rao, Atharva Naik* Sachin Vashistha*, Somak Aditya, Monojit Choudhury
LREC-COLLING (2024): 16802-16830
(* - indicates equal contribution)
|
SkillCLIP: Skill Aware Modality Fusion Visual Question Answering (Student Abstract)
Atharva Naik, Yash Parag Butala, Navaneethan Vaikunthan, Raghav Kapoor
AAAI Student Abstracts (2024): 23592-23593
(* - indicates equal contribution)
|
2023
Data Augmentation for Code Translation with Comparable Corpora and Multiple References
Yiqing Xie, Atharva Naik, Daniel Fried, Carolyn Rose
EMNLP (2023) Findings
|
SYNC: A Structurally guided Hard Negative Curriculum for Generalizable Neural Code Search
Atharva Naik, Soumitra Das, Jyothi Vedurada, Somak Aditya
AACL (2023)
|
Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic
Zijun Wu, Zi Xuan Zhang, Atharva Naik, Zhijian Mei, Mauajama Firdaus, Lili Mou
ICLR (2023)
|
2022
Super-NaturalInstructions: Generalization via declarative instructions on 1600+ nlp tasks
Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, ... , Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
NAACL (2022): 1718–1734
|
Representation Learning for Conversational Data using Discourse Mutual Information Maximization
Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal
NAACL (2022): 1718–1734
|
Towards Enabling Synchronous Digital Creative Collaboration: Codifying Conflicts in Co-Coloring
Suryateja BV, Jeet Patel, Atharva Naik, Yash Parag Butala, Sristy Sharma, Niyati Chhaya
CHI Extended Abstracts (2022/4/27): 1-7
|
2021
Email :
arnaik [at] andrew [dot] cmu [dot] edu