Riju Marwah
Trustworthy & Reliable LLMs · Token-level Dynamics · Interpretability · Alignment
I am a Research Intern at the IRT Group at the University of South Carolina, advised by Dr. Amit Sheth and Vishal Pallagani. I will complete my Bachelor's in Computer Science in June 2026.
My research centers on making large language models trustworthy, robust, and reliable through a focus on token-level dynamics. I study how technical signals and linguistic cues interact during long-context generation, and design retrain-free interventions to mitigate degradation. My work sits at the intersection of robustness, interpretability, and alignment.
I am currently working on formalizing cognitive fatigue as a measurable latent state in autoregressive transformers — see the project page →
I am always open to discussing research, collaborations, or anything else — feel free to reach out!
News
- Apr 2026
- 🎉 Paper accepted at ICML 2026 — Cognitive Fatigue in Autoregressive Transformers: Formalization and Measurement.
- Jan 2026
- 💻 Presented Chatsparent at AAAI 2026 in Singapore; served as a Student Scholar Volunteer.
- Oct 2025
- 🎉 Paper accepted at the AAAI 2026 Demonstration Track — Chatsparent: An Interactive System for Detecting and Mitigating Cognitive Fatigue in LLMs (28% acceptance rate).
Publications
-
Cognitive Fatigue in Autoregressive Transformers: Formalization and Measurement
ICML 2026
Riju Marwah*, Ritvik Garimella, Vishal Pallagani, Atishay Jain, Michael Stewart, Amit ShethFormalizes cognitive fatigue as a runtime state variable grounded in three token-level signals. Introduces the Fatigue Index with five explicit axioms and validates it across nine models (1B–13B) on HotpotQA, TriviaQA, and SQuAD under long-context, positional, and precision stress conditions. [Project Page]
-
Chatsparent: An Interactive System for Detecting and Mitigating Cognitive Fatigue in LLMs
AAAI 2026 Demo
Riju Marwah*, Vishal Pallagani, Ritvik Garimella, Amit ShethIntroduces Chatsparent, an interactive system that detects and mitigates cognitive fatigue in LLMs via token-level signals and retrain-free interventions, improving reliability and transparency during dialogue. [OpenReview]
-
MicroDetect-Net (MDN): Leveraging Deep Learning to Detect Microplastics in Clam Blood
Springer / ICICC 2025
8th International Conference on Innovative Computing and Communication, Springer NatureProposes MDN, combining fluorescence microscopy (Nile Red staining) and deep learning to scan blood samples for microplastics — a step toward human blood analysis. [Preprint]
Experience
-
Research Intern, AI Institute, University of South Carolina
· Apr 2025 – Present
- Advisors: Dr. Amit Sheth (NCR Chair & Director, AIISC), Vishal Pallagani
- Designed a framework to detect and mitigate cognitive fatigue in LLMs using token-level signals and real-time interventions.
- First-authored an accepted AAAI 2026 Demonstration paper; extended to full paper accepted at ICML 2026.
- Conducting research on long-context reliability, entropy collapse, and attention decay in LLMs.
-
Research Collaborator, University of Illinois Urbana-Champaign
· Nov 2025 – Mar 2026
- Advisor: Soorya Ram Shemgekar
- Studied politeness framing and reward leakage in LLMs across structured tasks and instruction-following settings.
- Performed mechanistic interpretability analysis including early-token probing and activation patching.
-
Generative AI Intern, EY (Ernst & Young)
· Jan 2025 – Mar 2025
- Built a low-code platform for agentic AI workflows using modular DAGs, vector DBs, and LLM toolchains.
- Implemented Celery–Redis task execution with production-grade scalability.
- Integrated semantic agent routing, memory components, and external API/tool support (OpenAI, FAISS).
-
Software Developer Intern, National Thermal Power Corporation
· Jul 2024 – Sep 2024
- Developed ASP.NET Core applications using MVC and Entity Framework for enterprise automation.
- Optimized MySQL and MongoDB CRUD operations via LINQ, ensuring ACID compliance.
- Implemented secure authentication using JWT, Identity Framework, Google reCAPTCHA, and SMTP.
-
Strategy & IS Intern, Indian Oil Corporation Limited
- QA for Digital Tender Management: black-box testing, UI/UX review, and performance testing.
- Root-cause analysis and test design to improve system stability and user experience.
Projects
-
Early Stage Lung Cancer Detection Using Deep CNNs
Built a CNN model on the LIDC-IDRI dataset to detect pulmonary nodules in CT scans, enabling earlier diagnosis. Integrated Grad-CAM and SHAP to assist radiologists with model-driven decision support.
-
Bhoomi: AI-Powered Agricultural Productivity
Full-stack platform integrating drone technology and mobile solutions to enhance agricultural productivity. Developed AI models for crop disease detection and farmer assistance to improve yield and sustainability.
-
Microphone Array-Based Direction-of-Arrival for Gunshot Detection
— Smart India Hackathon 2024 Nominee
Six-mic array with TDoA algorithms, FPGA processing, band-pass filtering, and KNN/STFT classification. Reduces false positives and improves situational awareness in public safety applications.
-
Dynamically Optimized Cycling Navigation with Real-Time Adaptive Routing
Real-time route optimization using OSM/OSMnx/NetworkX and Monte Carlo rollouts with dynamic safety and traffic weights, Dijkstra/A* fallback, and SCC connectivity checks.
Honors & Service
- AAAI Student Volunteer Scholarship — USD 1,100 travel grant, AAAI 2026
- McKinsey Forward Program — selected for global initiative on problem-solving, business, and leadership
- Media coverage — Times of India, Hindustan Times (Ordin@trix), Dwarka Parichay, Brainfeed (Quarantech)
- Student Scholar Volunteer, AAAI 2026, Singapore
- Delegate, Harvard Project for Asian & International Relations (HPAIR), 2025
- Volunteer, Rotaract Club of New Delhi — led community health initiatives in Delhi's urban slums, 2025
Education
B.Tech, Computer Science Engineering · Guru Gobind Singh Indraprastha University
Nov 2022 – Jun 2026 (Expected) · GPA: 8.12 / 10.0
MIT OpenCourseWare — Graduate & Advanced Coursework (Part-time)
6.825 Techniques in AI / Deep Learning for NLP ·
6.864 Advanced NLP ·
6.867 Machine Learning (Advanced) ·
6.006 Introduction to Algorithms
Relevant Undergraduate Coursework · Prerequisite Mapping Syllabi →
- Artificial Intelligence
- Data Structures & Algorithms
- Design & Analysis of Algorithms
- Operating Systems
- Computer Networks
- Compiler Design
- Theory of Computation
- Software Engineering
- DBMS
- OOP in C++
- Java & Advanced Java
- Digital Logic & Computer Design
- Probability, Statistics & Linear Programming