Sandesh Swamy

Senior Applied Scientist - NLP, AWS AI Labs
Formerly
Computer Science and Engineering
The Ohio State University
Email: sanswamy (at) amazon (dot) com
Here is how my name is pronounced.


I am currently a Senior Applied Scientist at AWS AI Labs, where I work on conversational agents, personalization, and Large Language Model (LLM) applications that improve AWS customer experiences. Prior to my stint at AWS, I was at Amazon Alexa from 2017 to 2021, where I worked on the in-house deep learning toolkit and on traditional and neural Intent Classification and Slot Recognition models, and architected data pipelines and modeling pipelines for 100k skills. I was also part of the core set of initial contributors who started Semantic Parsing based utterance recognition at Alexa. I have extensive experience working with (and deploying at scale) traditional Machine Learning models (max-ent models, linear chain CRFs), Deep Learning models (LSTM-based models), and Large Language Models (LLMs ~1B parameters). Before joining Amazon, I obtained my Master's degree from the Department of Computer Science at The Ohio State University in 2017. I was advised by Dr. Alan Ritter and Dr. Marie-Catherine de Marneffe (I still consider them the best advisers ever! :)). My research interests include Natural Language Processing, Conversational agents, Semantic Parsing, Large Language Models (LLMs), Text Generation, Personalization, and Social Media data. I was previously the Teaching Associate for Introduction to Computer Programming in C++ for Engineers and Scientists.

I have significant programming experience in Python, Java, and C (code samples and HackerRank profile can be found in the Navigation bar). I have dabbled with web programming sporadically. I also have extensive experience deploying real-world Machine Learning systems that have served millions of customers, built with frameworks such as MXNet, PyTorch, HuggingFace, and an in-house Amazon framework. At Alexa and AWS, I have been a core contributor to the launch of traditional ML models for all Alexa skills, the launch of DNN-based tiny models for Alexa skills, the release of Semantic Parsing based models for utterance recognition, the international expansion of Alexa skills, and a feature that lets customers seamlessly request resource information using natural language on AWS' chat agent.

I am fascinated by the amount of information generated on Social Media. Analyzing Twitter data is a big interest of mine, since it gives a better understanding of the kinds of things that people are really interested in and want to talk about.


Publications, Patents, and Blogs
  • Sandesh Swamy, Rashmi Gangadharaiah, James W. Horsley, Abhijit S Barde, Jonathan James Pezzino - "Provider network user console with natural language querying feature", US Patent number: 12455905, [Link]
  • Sandesh Swamy, Rashmi Gangadharaiah - "Automatic user console question generation", US Patent number: 12461953, [Link]
  • Devang Kulshreshtha, Wanyu Du, Raghav Jain, Srikanth Doss, Hang Su, Sandesh Swamy, Yanjun Qi - "The Subtle Art of Defection: Understanding Uncooperative Behaviors in LLM based Multi-Agent Systems", arXiv 2025. [Paper]
  • Tanqiu Jiang, Min Bai, Nikolaos Pappas, Yanjun Qi - "Cross-Modal Content Optimization for Steering Web Agent Preferences", arXiv 2025. [Paper]
  • Jing-Jing Li, Jianfeng He, Chao Shang, Devang Kulshreshtha, Xun Xian, Yi Zhang, Hang Su, Sandesh Swamy, Yanjun Qi - "STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents", arXiv 2025. [Paper]
  • Binwei Yao, Chao Shang, Wanyu Du, Jianfeng He, Ruixue Lian, Yi Zhang, Hang Su, Sandesh Swamy, Yanjun Qi - "Peacemaker or Troublemaker: How Sycophancy Shapes Multi-Agent Debate", arXiv 2025. [Paper]
  • Dhruv Agarwal, Manoj Ghuhan Arivazhagan, Rajarshi Das, Sandesh Swamy et al. - "Searching for Optimal Solutions with LLMs via Bayesian Optimization", ICLR 2025. [Paper]
  • Narges Tabari, Sandesh Swamy, and Rashmi Gangadharaiah - "User Persona Identification and New Service Adaptation Recommendation", arXiv. [Paper]
  • Sandesh Swamy, Narges Tabari, Chacha Chen, and Rashmi Gangadharaiah - "Contextual Dynamic Prompting for Response Generation in Task-oriented Dialog Systems", EACL 2023. [Paper] [Amazon Science coverage]
  • Konstantine Arkoudas, Nicolas Guenon des Mesnards, Melanie Rubino, Sandesh Swamy, Saarthak Khanna, Weiqi Sun, Khan Haidar - "PIZZA: A new benchmark for complex end-to-end task-oriented parsing", arXiv 2022. [Paper] [Amazon Science coverage]
  • Blog post on Amazon Science about my work on releasing Deep Neural Networks for 100,000 Alexa skills.
  • Sandesh Swamy, Alan Ritter, and Marie-Catherine de Marneffe - “i have a feeling trump will win..................”: Forecasting Winners and Losers from User Predictions on Twitter, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP). [code] [data] [demo] (Talk given by Prof. Ritter at EMNLP 2017)
  • Sandesh Swamy - "Forecasting event outcomes from user predictions on Twitter", Master's thesis at The Ohio State University.
Skills and Expertise
  • Programming Languages: Java, Python, C, C++
  • Frameworks: HuggingFace, PyTorch, MXNet, Keras
  • Tools and IDEs: Eclipse, Visual Studio, RStudio, Android Studio, Sublime, PyCharm, Jupyter, GitHub, Octave
  • Although I do not claim to know all the Linux commands off the top of my head, I do love working on Linux and the charm of the plain old Terminal cannot be beaten by any IDE
  • My favorite programming language/language of choice is Python!
Other Activities
  • Reviewer, ACL & ACL Rolling Review (2021, 2022, 2023), COLING (2018, 2020, 2022), W-NUT (2020), ECNLP (2022), NAACL (2022), EMNLP (2021)
  • Program Committee, ACL SRW, 2018.
  • Session Chair, NAACL 2022, Seattle
  • I have also been a reviewer for the Amazon internal conference (2017-present)