CAIS 2026 Workshop

Accepted Papers

AI Agents for Discovery in the Wild · May 26, 2026

37
Accepted Papers
3
Oral Presentations

Oral Presentations

Featured Orals

  1. AI-PROPELLER: Warehouse-Scale Interprocedural Code Layout Optimization with AlphaEvolve

    Chaitanya Mamatha Ananda, Rajiv Gupta, Mircea Trofin, Aiden Grossman, Sriraman Tallam, Xinliang Li, Amir Yazdanbakhsh

  2. Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks

    Young-Jun Lee, Seungone Kim, Minki Kang, Alistair Cheong Liang Chuen, Zerui Chen, Seungho Han, Taehee Jung, Dongyeop Kang

  3. Meta-Harness: Harness Search for Agents Under Expensive Evaluation

    Yoonho Lee, Roshen Sanjay Nair, Qizheng Zhang, Kangwook Lee, Omar Khattab, Chelsea Finn

All Accepted Papers

  1. Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization Alexander Blasberg, Vasilis Kypriotis, Dimitrios Skarlatos
  2. AI-PROPELLER: Warehouse-Scale Interprocedural Code Layout Optimization with AlphaEvolve Oral Chaitanya Mamatha Ananda, Rajiv Gupta, Mircea Trofin, Aiden Grossman, Sriraman Tallam, Xinliang Li, Amir Yazdanbakhsh
  3. AttackEvolve: Using In-Context Learning Enhanced Searches to Generate Multi-modal Attacks on Autonomous Vehicles Marsalis Gibson, Claire Tomlin, S. Shankar Sastry
  4. Autonomous Agent Learning in Production Xinhao Cheng, Jianan Ji, Zhihao Jia, Vasilis Kypriotis, Dimitrios Skarlatos, Eliot H. Solomon, Zhihao Zhang, Yu Zhou
  5. Beyond Fault Injection: Leveraging LLMs for Autonomous Chaos Engineering Gerard Matthew, Philip Godfrey
  6. BIORESEARCHER: Scenario-Guided Multi-Agent for Translational Medicine Remigiusz Kinas, Joanna Krawczyk, Rafal Powalski, Przemysław Pietrzak, Agnieszka Kowalewska, Krzysztof Kolmus, Maciej Sypetkowski, Łukasz Smoliński, Tomasz Jetka
  7. CadAgent: A Multi-Agent System for Manufacturing Process Classification from 2D Engineering Drawings, with Audit-Gated Bounded Autonomy for In-the-Wild Deployment Jaerim choi
  8. Can AI Agents Discover Tractable Statistical Mechanics Mapping for Physics Problems? Wanyu Zhao
  9. Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows Hardy Chen, Nancy Lau, Haoqin Tu, Shuo Yan, Xiangyan Liu, Zijun Wang, Juncheng Wu, Michael Qizhe Shieh, Alvaro A. Cardenas, Cihang Xie, Yuyin Zhou
  10. Context or Capability? Debugging Agentic Workflows Paulina Toro Isaza, Saurabh Jha, Yu Deng
  11. Declarative Data Services: Structured Agentic Discovery for Composing Data Systems Shanshan Ye, Duo Lu
  12. DeepRoot: A KG-Coordinated Multi-Agent System for Therapeutic Reasoning over Historical Medical Texts Zijian Carl Ma, Sean J. Wang, Sijbren Manuel Kramer
  13. Deploying Agents in the Wild: Failure Modes from Healthcare Access Optimization Diego Estuar
  14. Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas Victor Gallego
  15. Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics Jishnu Sethumadhavan Nair, Patrice Bechard, Rishabh Maheshwary, SRAVAN RAMACHANDRAN, Surajit Dasgupta, Aakash Bhagat, Shruthan Radhakrishna, Pulkit Pattnaik, Johan Obando-Ceron, Shiva Krishna Reddy Malay, Sagar Davasam, Seganrasan Subramanian, Vipul Mittal, Sridhar Krishna Nemala, Christopher Pal, Srinivas Sunkara, Sai Rajeswar
  16. Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks Oral Young-Jun Lee, Seungone Kim, Minki Kang, Alistair Cheong Liang Chuen, Zerui Chen, Seungho Han, Taehee Jung, Dongyeop Kang
  17. Foundry: Host-Owned Trust and Memory for Long-Horizon Agent Swarms Monishwaran Maheswaran, Leon Lakhani, Shu Liu, Yuqing Jian, Tianyi Zhang, Kurt Keutzer, James Zou, Aditya Akella, Ben Athiwaratkun, Chenfeng Xu
  18. How Do Tool-Augmented LLM Agents Perform on Real-World Energy Analytics Tasks? David Akinpelu, Akintonde Abbas, RERELOLUWA VICTOR ALIMI, Ayodeji Lana
  19. Interpretable Early Termination of Web Navigation Agents via Closed Sequential Pattern Mining Sergio Talavera
  20. Knowing When to Ask: Self-Gated Clarification for Hierarchical Language Agents Aijing Gao, Yiming Kang, Mengdie Flora Wang, Jae Oh Woo
  21. LEVI: Stronger Search Architectures Can Substitute for Larger LLMs in Evolutionary Search Temoor Ali
  22. LiteSR: Literature-Guided Agentic Retrieval for Symbolic Regression ZALISH MAHMUD, Anantaa Kotal, Lixin Jin, Anthony Darrouzet-Nardi, Aritran Piplai, Nan Jiang
  23. MatPref: Training the Reasoning Backbone of Materials Discovery Agents with Verifiable Rewards Sarrah Mikhail Leung, Taehan Kim, Jeongbin Park
  24. Meta-Harness: Harness Search for Agents Under Expensive Evaluation Oral Yoonho Lee, Roshen Sanjay Nair, Qizheng Zhang, Kangwook Lee, Omar Khattab, Chelsea Finn
  25. PACEvolve++: Improving Continual Learning for Evolutionary Search Agents Minghao Yan, Bo Peng, Benjamin Coleman, Ziqi Chen, Zhouhang Xie, Shuo Chen, Zhankui He, Noveen Sachdeva, Weili Wang, Ed H. Chi, Shivaram Venkataraman, Wang-Cheng Kang, Derek Zhiyuan Cheng, Beidou Wang
  26. PaperDoctor: Evidence-Grounded and Actionable Feedback for Scientific Papers in Progress Kevin Qinghong Lin, Siyuan Hu, Pan Lu, Yu Chen, Yanzhe Chen, Owen Queen, Yupeng Chen, Jialin Yu, Junchi Yu, Zifeng Ding, Yuanfeng Ji, Sheng Liu, Jindong Gu, Linjie Li, Mike Zheng Shou, Philip Torr, James Zou
  27. PromptKV: A Workflow for Building AI-Driven Distributed KV Stores Anthony Tafoya, Keshab Agarwal
  28. Red-Teaming Claude and ChatGPT-based Security Advisors for Trusted Execution Environments Kunal Mukherjee, Spandan Mukherjee
  29. ScientistOne: Verifiable Autonomous Research via Chain-of-Evidence Rui Meng, Bhavana Dalvi Mishra, Jiefeng Chen, Chun-Liang Li, Palash Goyal, Mihir Parmar, Yiwen Song, Yale Song, Rajarishi Sinha, Parthasarathy Ranganathan, Burak Gokturk, Jinsung Yoon, Tomas Pfister
  30. Shepherd: A Runtime Substrate Empowering Meta-Agents with a Formalized Execution Trace Simon Yu, Derek Chong, Ananjan Nandi, Dilara Soylu, Jiuding Sun, Christopher D Manning, Weiyan Shi
  31. Side Effects Are the Output: Evaluating AI Agents That Act on Live Systems Ganeshkumar Ashokavardhanan
  32. Spilling the TE: Lessons from AI-driven evolution of Traffic Engineering Rahul Bothra, Alexander Krentsel, Philip Godfrey, Sylvia Ratnasamy
  33. Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution Monishwaran Maheswaran, Leon Lakhani, Zhongzhu Zhou, Shijia Yang, Junxiong Wang, Coleman Richard Charles Hooper, Yuezhou Hu, Rishabh Tiwari, Jue WANG, Harman Singh, Qingyang Wu, Yuqing Jian, Ce Zhang, Kurt Keutzer, Tri Dao, Xiaoxia Wu, Ben Athiwaratkun, James Zou, Chenfeng Xu
  34. Stage–Audit: Auditable Source-Frontier Discovery for Cross-Wiki Tables Chen Shen, Eser Kandogan
  35. Stochastic Agent Descent: Adaptive Agents for the Future of Non-Convex Optimization Justin Singh Kang
  36. The Partial Testimony of Logs: Evaluation of Language Model Generation under Confounded Model Choice Jikai Jin, Vasilis Syrgkanis
  37. Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw Zijun Wang, Haoqin Tu, Letian Zhang, Hardy Chen, Juncheng Wu, Xiangyan Liu, Zhenlong Yuan, Tianyu Pang, Michael Qizhe Shieh, Fengze Liu, Zeyu Zheng, Huaxiu Yao, Yuyin Zhou, Cihang Xie