關於大語言模型代理的必讀論文。
“這是您可能感興趣的其他一些紙質清單:
提示4 ReasoningPaper:通過語言模型提示論文的推理。
?知識培訓:關於大語模型的知識編輯的必讀論文。
我們真誠地邀請您深入研究這些論文和資源的收藏,每個論文和資源都提供了獨特的探索和發現旅程。 ? ”
互動自然語言處理
Zekun Wang, Ge Zhang, Kexin Yang, Ning Shi, Wangchunshu Zhou, Shaochun Hao, Guangzheng Xiong, Yizhi Li, Mong Yuan Sim, Xiuying Chen, Qingqing Zhu, Zhenzhu Yang, Adam Nik, Qi Liu, Chenghua Lin, Shi Wang, Ruibo Liu, Wenhu Chen,Ke Xu,Dayiheng Liu,Yike Guo,Jie Fu。 [ABS],2023.5
一項基於大語言模型的自主代理的調查
Lei Wang,Chen Ma,Xueyang Feng,Zeyu Zhang,Hao Yang,Jingsen Zhang,Zhiyuan Chen,Jiakai Tang,Xu Chen,Yankai Lin,Wayne Zhao Zhao,Zhewei Wei,Ji-Rong Wen。 [ABS],2023.8
基於大語模型的代理人的興起和潛力:調查
Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang劉,Zhangyue Yin,Shihan Dou,Rongxiang Weng,Wensen Cheng,Qi Zhang,Wenjuan Qin,Yongyan Zheng,Xipeng Qiu,Xuanjing Huang,Tao Gui。 [ABS],2023.9
如果LLM是嚮導,那麼代碼是魔杖:關於代碼如何賦予大型語言模型的調查
Ke Yang,Jiateng Liu,John Wu,Chaoqi Yang,Yi R. Fung,Sha Li,Zixuan Huang,Xu Cao,Xingyao Wang,Yiquan Wang,Heng Ji,Chengxiang Zhai。 [ABS],2024.1
代理AI:測量多模式相互作用的視野
Zane Durante,Qiuyuan Huang,Naoki Wake,Ran Gong,Jae Sung Park,Bidipta Sarkar,Rohan Taori,Yusuke Noda,Demetri Terzopoulos,Yejin Choi,Yejin Choi,Katsushi Ikeuchi,Hoi Ikeuchi,Hoi vo,Li Fei Fei-Fei,Jianfeng Gao,Jianfeng Gao。 [ABS],2024.1
個人LLM代理:有關功能,效率和安全性的見解和調查
Yuanchun Li, Hao Wen, Weijun Wang, Xiangyu Li, Yizhen Yuan, Guohong Liu, Jiacheng Liu, Wenxing Xu, Xiang Wang, Yi Sun, Rui Kong, Yile Wang, Hanfei Geng, Jian Luan, Xuefeng Jin, Zilong Ye, Guanjing Xiong, Fan Zhang, Xiang Li, Mengwei Xu,Zhijun Li,Peng Li,Yang Liu,Ya-Qin Zhang,Yunxin Liu。 [ABS],2024.1
神經法規智能的調查:範式,進步及以後
Qiushi Sun,Zhirui Chen,Fangzhi Xu,Kanzhi Cheng,Chang Ma,Zhangyue Yin,Jianing Wang,Chengcheng Han,Renyu Zhu,Shuai Yuan,Shuai Yuan,Qipeng Guo,Qipeng Guo,Qiiu,Xipeng Qiu,Xengcheng Yin,Peengcheng Yin,fei Yian,Ziaoli Li,Ziaygian,Ziapen,lingipen,lingpen,lingpen,吳。 [ABS],2024.3
精神理論可能自發地出現在大語言模型中
Michal Kosinski。 [ABS],2023.2
chatgpt中的毒性:分析人格分配的語言模型
Ameet Deshpande,Vishvak Murahari,Tanmay Rajpurohit,Ashwin Kalyan,Karthik Narasimhan。 [ABS],2023.4
用大語言模型玩重複的遊戲
Elif Akata,Lion Schulz,Julian Coda-Forno,Seong Joon Oh,Matthias Bethge,Eric Schulz。 [ABS],2023.5
專家宣傳:指導大型語言模型成為傑出的專家
Benfeng Xu,Yang,Junyang Lin,Quan Wang,Chang Zhou,Yongdong Zhang,Zhendong Mao。 [ABS],2023.5
大型語言模型的角色扮演
Murray Shanahan,Kyle McDonell,Laria Reynolds。 [ABS],2023.5
Tidybot:大型語言模型的個性化機器人協助
Jimmy Wu,Rika Antonova,Adam Kan,Marion Lepert,Andy Zeng,Shuran Song,Jeannette Bohg,Szymon Rusinkiwiewicz,Thomas Funkhouser。 [ABS],2023.5
大語言模型中的人格特徵
Mustafa Safdari,Greg Serapio-García,ClémentCrepy,Stephen Fitz,Peter Romero,Luning Sun,Marwa Abdulhai,Aleksandra Faust,MajaMatarić。 [ABS],2023.7
LLM有個性嗎?使MBTI測試成為大型語言模型的驚人評估
Keyu Pan,Yawen Zeng。 [ABS],2023.7
人工智能中的意識:意識科學的見解
Patrick Butlin, Robert Long, Eric Elmoznino, Yoshua Bengio, Jonathan Birch, Axel Constant, George Deane, Stephen M. Fleming, Chris Frith, Xu Ji, Ryota Kanai, Colin Klein, Grace Lindsay, Matthias Michel, Liad Mudrik, Megan AK Peters, Eric Schwitzgebel, Jonathan Simon,魯芬·範魯倫(Rufin VanRullen)。 [ABS],2023.8
脫離上下文:衡量LLMS中的情境意識
Lukas Berglund,Asa Cooper Stickland,Mikita Balesni,Max Kaufmann,Meg Tong,Tomasz Korbak,Daniel Kokotajlo,Owain Evans。 [ABS],2023.9
大型語言模型代理可以模擬人類的信任行為嗎?
Chengxing Xie,Canyu Chen,Feiran Jia,Ziyu Ye,Kai Shu,Adel Bibi,Ziniu Hu,Philip Torr,Bernard Ghanem,Guohao Li。 [ABS],2024.02
COLT5:具有條件計算的更快的遠程變壓器
Joshua Ainslie,Tao Lei,Michiel de Jong,SantiagoOntañón,Siddhartha Brahma,Yury Yury Zemlyanskiy,David Uthus,Mandy Guo,James Lee-Thorp,James Lee-Thorp,Yi Tay,Yun-Hsuan,Yun-Hsuan Sung,Sumit Sanghai。 [ABS],2023.3
大語言模型中的新興和可預測的記憶
Stella Biderman,USVSN Sai Prashanth,Lintang Sutawika,Hailey Schoelkopf,Quentin Anthony,Shivanshu Purohit,Edward Raff。 [ABS],2023.4
具有自控內存系統的大規模語言模型的無限長度輸入能力
Xinnian Liang,Bing Wang,Hui Huang,Shuangzhi Wu,Peihao Wu,Lu Lu,Zejun MA,Zhoujun li。 [ABS],2023.4
聊天案:錄製和分析跨時間
Shangqing Tu,Chunyang Li,Jifan Yu,Xiaozhi Wang,Lei Hou,Juanzi Li。 [ABS],2023.4
學會推理和記住自稱
Jack Lanchantin,Shubham Toshniwal,Jason Weston,Arthur Szlam,Sainbayar Sukhbaatar。 [ABS],2023.5
無形者:無限長度輸入的遠程變壓器
Amanda Bertsch,Uri Alon,Graham Neubig,Matthew R. Gormley。 [ABS],2023.5
小型型號是大型語言模型的寶貴插件
Canwen Xu,Yichong Xu,Shuohang Wang,Yang Liu,Chenguang Zhu,Julian McAuley。 [ABS],2023.5
記憶庫:增強具有長期記憶的大型語言模型
Wanjun Zhong,Lianghong Guo,Qiqi Gao,He Ye,Yanlin Wang。 [ABS],2023.5
Toolkengpt:通過工具嵌入使用大量工具來增強冷凍語言模型
Shibo Hao,Tianyang Liu,Zhen Wang,Zhiting Hu。 [ABS],2023.5
recurrentgpt:(任意)長文本的互動生成
Wangchunshu Zhou,Yuchen Eleanor Jiang,Peng Cui,Tiannan Wang,Zhenxin Xiao,Yifan Hou,Ryan Cotterell,Mrinmaya Sachan。 [ABS],2023.5
ret-llm:邁向大型語言模型的一般閱讀寫入記憶
Ali Modarressi,Ayyoob Imani,Mohsen Fayyaz,HinrichSchütze。 [ABS],2023.5
適應語言模型以壓縮上下文
Alexis Chevalier,Alexander Wettig,Anirudh Ajith,Danqi Chen。 [ABS],2023.5
重新訪問並行上下文窗口:令人沮喪的簡單替代方案和經過思考鏈惡化
Kejuan Yang,小劉,Kaiwen Men,Aohan Zeng,Yuxiao Dong,Jie Tang。 [ABS],2023.5
具有里程碑意義的關注:變壓器的隨機訪問無限上下文長度
Amirkeivan Mohtashami,Martin Jaggi。 [ABS],2023.5
隨機位置編碼增強變壓器的長度泛化
Anian Ruoss,GrégoireDeLétang,Tim Genewein,Jordi Grau-Moya,RóbertCsordás,Mehdi Bennani,Shane Legg,Joel Veness。 [ABS],2023.5
長度概括的單調位置關注
Jishnu Ray Chowdhury,Cornelia Caragea。 [ABS],2023.5
CHATDB:用數據庫作為其符號內存增強LLM
Chenxu Hu,Jie Fu,Chenzhuang DU,Simian Luo,Junbo Zhao,Hang Zhao。 [ABS],2023.6
語言代理的認知體系結構
Theodore Sumers,Shunyu Yao,Karthik Narasimhan,Thomas L. Griffiths [ABS],2023.9
JARVIS-1:帶有內存增強多模式模型的開放世界多任務代理
Zihao Wang,Shaofei Cai,Anji Liu,Yonggang Jin,Jinbing Hou,Bowei Zhang,Haowei Lin,Zhaofeng HE,Zilong Zheng,Zilong Zheng,Yaodong Yang Yang,Xiaojian Ma,Yitao Liang 。 [ABS],2023.11
一項關於基於大語言模型代理的記憶機制的調查
Zeyu Zhang,Xiaohe BO,Chen MA,Rui Li,Xu Chen,Quanyu Dai,Jieming Zhu,Zhenhua dong,ji-rong wen 。 [ABS],2024.4
Hipporag:神經生物學啟發了大語言模型的長期記憶
BernalJiménezGutiérrez,Yiheng Shu,Yu Gu,Michihiro Yasunaga,Yu Su。 [ABS],2024.5
思想緩衝:具有大語言模型的思想增強推理
Ling Yang,Zhaochen Yu,Tianjun Zhang,Shiyi Cao,Minkai Xu,Winao Zhang,Joseph E. Gonzalez,Bin Cui。 [ABS],2024,6
語言模型作為零拍的計劃者:為具體代理提取可行的知識
Wenlong Huang,Pieter Abbeel,Deepak Pathak,Igor Mordatch 。 [ABS],2022.1
內部獨白:通過使用語言模型進行計劃的體現推理
Wenlong Huang , Fei Xia , Ted Xiao , Harris Chan, Jacky Liang, Pete Florence, Andy Zeng, Jonathan Tompson, Igor Mordatch, Yevgen Chebotar, Pierre Sermanet, Noah Brown, Tomas Jackson, Linda Luu, Sergey Levine, Karol Hausman, Brian Ichter . [ABS],2022.7
反應:在語言模型中協同推理和作用
Shunyu Yao,Jeffrey Zhao,Dian Yu,Nan Du,Izhak Shafran,Karthik Narasimhan,Yuan Cao。 [ABS],2022.10
思想的眼睛:通過模擬的基礎語言模型推理
Ruibo Liu,Jason Wei,Shixiang Shane Gu,Te-Yen Wu,Soroush Vosoughi,Claire Cui,Denny Zhou,Andrew M. Dai。 [ABS],2022.10
LLM-Planner:具有大語言模型的具體代理的基礎計劃很少
Chan Hee Song,Jian Wu,Clayton Washington,Brian M. Sadler,Wei-Lun Chao,Yu Su 。 [ABS],2022.12
不要產生,歧視:將語言模型接地到現實世界環境的建議
Yu Gu,Xiang Deng,Yu su。 [ABS],2022.12
體現的代理商是否夢想著像素化綿羊的夢想? :使用語言指導世界建模的具體決策
Kolby Nottingham,Prithviraj Ammanabrolu,Alane Suhr,Yejin Choi,Hannaneh Hajishirzi,Sameer Singh,Roy Fox 。 [ABS],2023.1
描述,解釋,計劃和選擇:與大語言模型的互動計劃啟用開放世界多任務代理
Zihao Wang,Shaofei Cai,Anji Liu,Xiaojian MA,Yitao Liang 。 [ABS],2023.2
Palm-E:一種具體的多模式模型
Danny Driess, Fei Xia, Mehdi SM Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke,Karol Hausman,Marc Toussaint,Klaus Greff,Andy Zeng,Igor Mordatch,Pete Florence。 [ABS],2023.3
反射:語言加強學習的語言代理商
Noah Shinn,Federico Cassano,Beck Labash,Ashwin Gopinath,Karthik Narasimhan,Shunyu Yao。 [ABS],2023.3
與環境聊天:使用大語言模型的交互式多模式感知
Xufeng Zhao,Mengdi Li,Cornelius Weber,Muhammad Burhan Hafez,Stefan Wermter 。 [ABS],2023.3
PLAN4MC:開放世界的Minecraft任務的技能增強學習和計劃
Haoqi Yuan,Chi Zhang,Hongcheng Wang,Feiyang Xie,Penglin Cai,Hao Dong,Zongqing Lu。 [ABS],2023.3
自我refine:迭代精緻和自我反饋
Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter克拉克。 [ABS],2023.3
向自我挑剔傳授大型語言模型
Xinyun Chen,Maxwell Lin,NathanaelSchärli,Denny Zhou。 [ABS],2023.4
wizardlm:授權大語言模型遵循複雜的說明
Can Xu,Qingfeng Sun,Kai Zheng,Xiubo Geng,Pu Zhao,Jiazhan Feng,Chongyang Tao,Daxin Jiang。 [ABS],2023.4
frugalgpt:如何使用大語言模型,同時降低成本和提高性能
Lingjiao Chen,Matei Zaharia,James Zou。 [ABS],2023.5
思想樹:大型語言模型的故意解決問題
Shunyu Yao,Dian Yu,Jeffrey Zhao,Izhak Shafran,Thomas L. Griffiths,Yuan Cao,Karthik Narasimhan。 [ABS],2023.5
計劃,消除和跟踪 - 語言模型是體現代理商的好老師
Yue Wu,So Yeon Min,Yonatan Bisk,Ruslan Salakhutdinov,Amos Azaria,Yuanzhi Li,Tom Mitchell,Shrimai Prabhumoye 。 [ABS],2023.5
互動文本遊戲的知識增強代理
Prateek Chhikara,Jiarui Zhang,Filip Ilievski,Jonathan Francis,Kaixin MA。 [ABS],2023.5
Voyager:具有大語言模型的開放式體現代理
Guanzhi Wang,Yuqi Xie,Yunfan Jiang,Ajay Mandlekar,Chaowei Xiao,Yuke Zhu,Linxi Fan,Anima Anandkumar 。 [ABS],2023.5
Swiftsage:一種具有快速和緩慢思考的生成代理,用於復雜的互動任務
Bill Yuchen Lin,Yicheng Fu,Karina Yang,Prithviraj Ammanabrolu,Faeze Brahman,Shiyu Huang,Chandra Bhagavatula,Yejin Choi,Xiang Ren。 [ABS],2023.5
語言模型符合世界模型:體現體驗增強語言模型
Jiannan Xiang,Tianhua Tao,Yi Gu,Tianmin Shu,Zirui Wang,Zichao Yang,Zhiting Hu。 [ABS],2023.5
Minecraft中的幽靈:通過具有基於文本的知識和記憶的大語言模型,開放世界環境的通常具有能力的代理
Xizhou Zhu,Yuntao Chen,Hao Tian,Chenxin Tao,Weijie Su,Chenyu Yang,Gao Huang,Bin Li,Lewei Lu,Xiaogang Wang,Yu Qiao,Zhaoxiang Zhang Zhang,Jifeng Dai。 [ABS],2023.5
Adaplanner:反饋與語言模型的自適應計劃
Haotian Sun,Yuchen Zhuang,Lingkai Kong,Bo Dai,Chao Zhang。 [ABS],2023.5
語言模型的推理是通過世界模型計劃
Shibo Hao,Yi Gu,Haodi MA,Joshua Jiahua Hong,Zhen Wang,Daisy Zhe Wang,Zhiting Hu。 [ABS],2023.5
計劃和解決提示:通過大型語言模型改善零擊鍊鍊的推理
Lei Wang,Wanyu Xu,Yihuai Lan,Zhiqiang Hu,Yunshi Lan,Roy Ka-Wei Lee,Ee-Peng Lim。 [ABS],2023.5
實現代理與LLM之間的智能互動:一種加強學習方法
Bin Hu,Chenyang Zhao,Pu Zhang,Zihao Zhou,Yuanhang Yang,Zenglin Xu,Bin Liu。 [ABS],2023.6
遞歸:推薦系統的新型模擬範式
Lei Wang,Jingsen Zhang,Xu Chen,Yankai Lin,Ruihua Song,Wayne Xin Zhao,Ji-Rong Wen。 [ABS],2023.6
邁向具有基礎模型的統一代理。
諾曼·迪·帕洛(Norman Di Palo),阿魯庫瑪·拜拉萬(Arunkumar Byravan),倫納德·哈斯克勒(Leonard Hasenclever),馬庫斯·沃夫梅爾(Markus Wulfmeier),尼古拉斯·海斯(Nicolas Heess),馬丁·里德米勒(Martin Riedmiller)。 [ABS],2023.7
pangu-coder2:通過排名反饋來提高代碼的大型語言模型
Bo Shen,Jiaxin Zhang,Taihong Chen,Daoguang Zan,Bing Geng,An Fu,Muhan Zeng,Ailun Yu,Jichuan JI,Jingyang Zhao,Yuenan Guo,Qianxiang Wang。 [ABS],2023.7
一個現實世界中的涉及計劃,長篇小說理解和程序綜合
Izzeddin Gur,Hiroki Furuta,Austin Huang,Mustafa Safdari,Yutaka Matsuo,Douglas Eck,Aleksandra Faust。 [ABS],2023.7
改造器:具有政策梯度優化的回顧性大語言代理
Weiran Yao,Shelby Heinecke,Juan Carlos Niebles,Zhiwei Liu,Yihao Feng,Le Xue,Rithesh Murthy,Zeyuan Chen,Jianguo Zhang,Jianguo Zhang,Devansh Arpit,ran arpit,Ran Xu,Phil Mui,Phil Mui,Huan Wang,Caiming Ximing Xiong,Silvio Savarese。 [ABS],2023.8
自我檢查:使用LLMS零射擊檢查自己的分步推理
Ning Miao,Yee Whye Teh,Tom Rainforth。 [ABS],2023.8
開除:LLM代理是經驗學習者
Andrew Zhao,Daniel Huang,Quentin Xu,Matthieu Lin,Yong-Jin Liu,Gao Huang。 [ABS],2023.8
自我驅動的接地:具有自動語言對準技能學習的大型語言模型代理
Shaohui Peng,Xing Hu,Qi Yi,Rui Zhang,Jiaming Guo,Di Huang,Zikang Tian,Ruizhi Chen,Zidong du,Qi Guo,Yunji Chen,Ling Li。 [ABS],2023.9
JARVIS-1:帶有內存增強多模式模型的開放世界多任務代理
Zihao Wang,Shaofei Cai,Anji Liu,Yonggang Jin,Jinbing Hou,Bowei Zhang,Haowei Lin,Zhaofeng HE,Zilong Zheng,Zilong Zheng,Yaodong Yang Yang,Xiaojian Ma,Yitao Liang 。 [ABS],2023.11
獅子座:3D世界中具有體現的通才特工
Jiangyong Huang,Silong Yong,Siaojian MA,Xiongkun Linghu ,Puhao Li,Yan Wang,Qing Li,Song-Chun Zhu,Baoxiong Jia,Siyuan Huang* [ABS],2023.11,
代碼鏈:使用語言模型的代碼模擬器推理
Chengshu Li,Jacky Liang,Andy Zeng,Xinyun Chen,Karol Hausman,Dorsa Sadigh,Sergey Levine,Li Fei-Fei,Fei Xia,Brian Ichter。 [ABS],2023.12
REST滿足React:多步推理LLM代理的自我完善
Renat Aksitov,Sobhan Miryoosefi,Zonglin Li,Daliang Li,Sheila Babayan,Kavya Kopparapu,Zachary Fisher,Ruiqi Guo,Sushant Prakash,Pranesh Srinivasan,Manzil Zaheer,Felix Yu,Sanjiv Yu,Sanjiv Kumar。 [ABS],2023.12
自我對比:通過不一致的解決觀點更好地思考
Wenqi Zhang,Yongliang Shen,Linjuan Wu,Qiuying Peng,Jun Wang,Yueting Zhuang,Weiming Lu。 [ABS],2024.01
自動手術:自動代理通過自我計劃從頭開始學習
Shuofei Qiao,Ningyu Zhang,Runnan Fang,Yujie Luo,Wangchunshu Zhou,Yuchen Eleanor Jiang,Chengfei LV,Huajun Chen。 [ABS],2024.01
TravelPlanner:與語言代理商的現實世界計劃的基準
Jian Xie,Kai Zhang,Jiangjie Chen,Tinghui Zhu,Renze Lou,Yuandong Tian,Yanghua Xiao,Yu su。 [ABS],2024.02
Agent-Pro:學習通過政策級別的反思和優化發展
Wenqi Zhang,Ke Tang,Hai Wu,Mengna Wang,Yongliang Shen,Guiyang Hou,Zeqi Tan,Peng Li,Yueting Zhuang,Weiming Lu。 [ABS],2024.02
Knowagent:基於LLM的代理商的知識增強計劃
Yuqi Zhu,Shuofei Qiao,Yixin OU,Shumin Deng,Ningyu Zhang,Shiwei Lyu,Yue Shen,Lei Liang,Jinjie Gu,Huajun Chen。 [ABS],2024.03
Sotopia-π:社會智能語言代理的互動學習
Ruiyi Wang,Haofei Yu,Wenxin Zhang,Zhengyang Qi,Maarten SAP,Graham Neubig,Yonatan Bisk,Hao Zhu。 [ABS],2024.03
自動化:大語模型代理的自動生成和選擇國家感知指南
Yao Fu,Dong-Ki Kim,Jaekyeom Kim,Sungryull Sohn,Lajanugen Logeswaran,Kyunghoon Bae,Honglak Lee。 [ABS],2024.03
通過行動學習賦予大型語言模型代理
海廷趙,張馬,吉琳·王,王蘇,林彭·孔,吉金Xu,張洪鄧,洪楊。 [ABS],2024.02
魔鬼的擁護者:LLM代理商的預期反思
Haoyu Wang,Tao Li,Zhiwei Deng,Dan Roth,Yang Li。 [ABS],2024.05
與世界知識模型的代理計劃
Shuofei Qiao,Runnan Fang,Ningyu Zhang,Yuqi Zhu,Xiang Chen,Shumin Deng,Yong Jiang,Pengjun Xie,Fei Huang,Huajun Chen。 [ABS],2024.05
智能探索:站在巨型基礎模型的肩膀上
康盧,申格蘭胡,傑夫·克萊恩。 [ABS],2024.05
忠實的邏輯推理通過象徵性思想鏈
Jundong Xu,Hao Fei,Liangming Pan,Qian Liu,Mong-Li Lee,Wynne Hsu。 [ABS],2024.05
愛麗絲夢遊仙境:簡單任務顯示最先進的大語言模型中的完整推理故障
Marianna Nezhurina,Lucia Cipolina-Kun,Mehdi Cherti,Jenia Jitsev。 [ABS],2024.06
TextGrad:通過文本自動“差異”
Mert Yuksekgonul,Federico Bianchi,Joseph Boen,Sheng Liu,Zhi Huang,Carlos Guestrin,James Zou。 [ABS],2024.06
符號學習使自我發展的代理人
Wangchunshu Zhou,Yixin OU,Shengwei ding,Lon Li,Jialong Wu,Tiannan Wang,Jiamin Chen,Shuai Wang,Xiaohua Xu,Ningyu Zhang,Huajun Chen,Yuchen Eleanor Jiang。 [ABS],2024.06
OS-CopiLot:朝著具有自我完善的通才計算機代理商
Wu,Chengcheng Han,Zichen Ding,Zhenmin Weng,Zhoumianze Liu,Shunyu Yao,Tao Yu,Lingpeng Kong。 [ABS],2024.02
Seeclick:利用高級視覺GUI代理的GUI接地
Kanzhi Cheng,Qiushi Sun,Yougang Chu,Fangzhi Xu,Yantao Li,Jianbing Zhang,Zhiyong Wu。 [ABS],2024.01
WebGPT:通過人類反饋的瀏覽器協助提問
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman. [ABS],2021.12
工具形式:語言模型可以教會自己使用工具
Timo Schick,Jane Dwivedi-Yu,RobertoDessì,Roberta Realeanu,Maria Lomeli,Luke Zettlemoyer,Nicola Cancedda,Thomas Scialom。 [ABS],2023.2
MM反應:提示CHATGPT進行多模式推理和行動
Zhengyuan Yang,Linjie Li,Jianfeng Wang,Kevin Lin,Ehsan Azarnasab,Faisal Ahmed,Zicheng Liu,Ce Liu,Michael Zeng,Lijuan Wang。 [ABS],2023.3
Hugginggpt:與Chatgpt及其朋友在擁抱臉上解決AI任務
Yongliang Shen,Kaitao Song,Xu Tan,Dongsheng Li,Weiming Lu,Yueting Zhuang。 [ABS],2023.3
Visual Chatgpt:使用視覺基礎模型說話,繪畫和編輯
Chenfei Wu,Shengming Yin,Weizhen Qi,Xiaodong Wang,Zecheng Tang,Nan Duan。 [ABS],2023.3
藝術:大型語言模型的自動多步推理和工具使用
Bhargavi Paranjape,Scott Lundberg,Sameer Singh,Hannaneh Hajishirzi,Luke Zettlemoyer,Marco Tulio Ribeiro。 [ABS],2023.3
taskmatrix.ai:通過將基礎模型與數百萬API連接起來完成任務
Yaobo Liang,Chenfei Wu,Ting Song,Wenshan Wu,Yan Xia,Yu Liu,Yang Ou,Shuai Lu,Lei JI,Shaoguang Mao,Yun Wang,Linjun Shou,Ming Shou,Ming Gong,Nan Duan。 [ABS],2023.3
變色龍:大型語言模型的插件構圖推理
Pan Lu,Baolin Peng,Hao Cheng,Michel Galley,Kai-Wei Chang,Ying Nian Wu,Song-Chun Zhu,Jianfeng Gao。 [ABS],2023.4
Chemcrow:使用化學工具增強大型模型
Andres M Bran,Sam Cox,Andrew D White,Philippe Schwaller。 [ABS],2023.4
TALM:工具增強語言模型
亞倫·帕里西(Aaron Parisi),趙趙(Yao Zhao),諾亞·菲德爾(Noah Fiedel)。 [ABS],2022.5
評論家:大型語言模型可以通過工具相互作用的批評自我糾正
Zhibin Gou,Zhihong Shao,Yeyun Gong,Yelong Shen,Yujiu Yang,Minlie Huang,Nan Duan,Weizhu Chen。 [ABS] [代碼],2023.5
通過執行反饋使語言模型更好
Shuofei Qiao,Honghao Gui,Huajun Chen,Ningyu Zhang。 [ABS],2023.5
CHATCOT:基於聊天的大語言模型的工具增強鏈的推理
Zhipeng Chen,Kun Zhou,Beichen Zhang,Zheng Gong,Wayne Xin Zhao,Ji-Rong Wen。 [ABS],2023.5
大猩猩:與大型API相連的大語言模型
Shishir G. Patil,Tianjun Zhang,Xin Wang,Joseph E. Gonzalez。 [ABS],2023.5
TOOLLLM:促進大型語言模型掌握16000多個現實世界中的API
Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun. [ABS],2023.7
裝備:具有可推廣和高效的工具分辨率的增強語言模型
lu lu,yu,丹尼爾·卡沙比(Daniel Khashabi)。 [ABS],2023.7
Gentopia:一個工具增強LLMS的協作平台
Binfeng Xu,Xukun Liu,Hua Shen,Zeyu Han,Yuhan Li,Murong Yue,Zhiyuan Peng,Yuchen Liu,Ziyu Yao,Dongkuan Xu。 [ABS],2023.8
用LM含有LM的沙盒確定LM劑的風險
Yangjun Ruan,Honghua Dong,Andrew Wang,Silviu Pitis,Yongchao Zhou,Jimmy BA,Yann Dubois,Chris J. Maddison,Tatsunori Hashimoto。 [ABS],2023.9
利用預先培訓的大型語言模型來構建和利用世界模型進行基於模型的任務計劃
Lin Guan,Karthik Valmeekam,Sarath Sreedharan,Subbarao Kambhampati [ABS],2023.5
數據 - 操作:橋接數十億個數據和人類具有自主工作流程
Wenqi Zhang,Yongliang Shen,Weiming Lu,Yueting Zhuang [abs],2023.6
Clova:使用工具使用和更新的閉環視覺助手
Zhi Gao,Yuntao du,Xintong Zhang,Xiaojian MA,Wenjuan Han,Song-Chun Zhu,Qing Li [abs],2023.12
gitagent:用工具擴展促進使用github的自主劑
Bohan Lyu,Xin Cong,Heyang Yu,Pan Yang,Yujia Qin,Yining Ye,Yaxi Lu,Zhong Zhang,Yukun Yan,Yukun Yan,Yankai Lin,Yankai Lin,Zhiyuan Liu,Maosong Sun。 [ABS],2023.12
EasyTool:使用簡潔的工具指令增強基於LLM的代理
Siyu Yuan,Kaitao Song,Jiangjie Chen,Xu Tan,Yongliang Shen,Kan Ren,Dongsheng Li,Deqing Yang。 [ABS],2024.1
符號-llm:邁向大型語言模型的基礎符號界面
Fangzhi Xu,Zhiyong Wu,Qiushi Sun,Siyu Ren,Fei Yuan,Shuai Yuan,Qika Lin,Yu Qiao,Jun Liu。 [ABS],2023.11
鬱金香代理 - 使基於LLM的代理使用大型工具庫解決任務
Felix Ocker,Daniel Tanneberg,Julian Eggert,Michael Gienger。 [ABS],2024.07
OneGen:LLM的有效的一通統一生成和檢索
Jintian Zhang,Cheng Peng,Mengshu Sun,Xiang Chen,Lei Liang,Zhiqiang Zhang,Jun Zhou,Huajun Chen,Ningyu Zhang。 [ABS],2024.09
語言模型級聯
David Dohan,Winnie Xu,Aitor Lewkowycz,Jacob Austin,David Bieber,Raphael Gontijo Lopes,Yuhuai Wu,Henryk Michalewski,Rif A. Saurous,Jascha Sohl-Dickstein,Kevin Murphy,Kevin Murphy,Charles Sutton。 [ABS],2022.7
與語言模型合作用於具體推理
Ishita Dasgupta,Christine Kaeser-Chen,Kenneth Marino,Arun Ahuja,Sheila Babayan,Felix Hill,Rob Fergus。 [ABS],2023.2
駱駝:大規模語言模型社會的“思維”探索的交流代理商
Guohao Li,Hasan Abed Al Kader Hammoud,Hani Itani,Dmitrii Khizbullin,Bernard Ghanem。 [ABS],2023.3
多方聊天:與人類和模型的小組設置中的對話代理
Jimmy Wei,Kurt Shuster,Arthur Szlam,Jason Weston,Jack Urbanek,Mojtaba Komeili。 [ABS],2023.4
Chatllm網絡:更多的大腦,更多的智能
Rui Hao,Linmei Hu,Weijian Qi,Qingliu Wu,Yirui Zhang,Liqiang Nie。 [ABS],2023.4
通過chatgpt生成的自我合作代碼
Yihong Dong,Xue Jiang,Zhi Jin,Ge Li。 [ABS],2023.4
大型語言模型的新興自主科學研究能力
Daniil A. Boiko,Robert Macknight,Gabe Gomes。 [ABS],2023.4
CHATGPT/GPT-4用於知識圖構建和推理:最近的功能和未來機會
Yuqi Zhu,Xiaohan Wang,Jing Chen,Shuofei Qiao,Yixin OU,Yunzhi Yao,Shumin Deng,Huajun Chen,Ningyu Zhang。 [ABS],2023.5
大型語言模型作為工具製造商
Tianle Cai,Xuezhi Wang,Tengyu MA,Xinyun Chen,Denny Zhou 。 [ABS],2023.5
從行動和指示中推斷出傳達代理的目標
Lance Ying,Tan Zhi-Xuan,Vikash Mansinghka,Joshua B. Tenenbaum。 [ABS],2023.6
無線多代理生成AI:從連接的智能到集體智能
Hang Zou,Qiyang Zhao,Lina Bariah,Mehdi Bennis,Merououane Debbah。 [ABS],2023.7
Roco:與大語言模型的辯證法多機器人合作
Zhao Mandi,Shreeya Jain,Shuran Song。 [ABS],2023.7
在大語模型中釋放認知協同作用:通過多人自行車解決任務的代理
Zhenhailong Wang,Shaoguang Mao,Wenshan Wu,Tao Ge,Furu Wei,Heng JI。 [ABS],2023.7
軟件開發的交流代理
Chen Qian,Xin Cong,Cheng Yang,Weize Chen,Yusheng Su,Juyuan Xu,Zhiyuan Liu,Maosong Sun。 [ABS],2023.7
到Infinity及以後:多代理模擬中的Show-1和Showrunner代理
Philipp Maas,Frank Carey,Chris Wheeler,Edward Saatchi,Pete Billington,Jessica Yaffa Shamash。 [ABS],2023.7
METAGPT:用於多代理協作框架的元編程
Sirui Hong,Xiawu Zheng,Jonathan Chen,Yuheng Cheng,Ceyao Zhang,Zili Wang,Steven Ka Shing Yau,Zijuan Lin,Liyang Zhou,Chenyu Ran,Lingfeng ran,Lingfeng Xiao,Chenglin Wu。 [ABS],2023.8
通過自我播放和文本中的自我反饋學習來改善語言模型談判
Yao Fu,Hao Peng,Tushar Khot,Mirella Lapata。 [ABS],2023.5
多代理協作:利用智能LLM代理的力量
Yashar Talebirad,Amirhossein Nadiri。 [ABS],2023.6
RESTGPT:通過RESTFUL API連接大型語言模型與現實世界應用程序
Yifan Song,Weimin Xiong,Dawei Zhu,Cheng Li,Ke Wang,Ye Tian,Sujian Li 。 [ABS],2023.6
用大語言模型模塊化建造合作的體現代理
Hongxin Zhang,Weihua du,Jiaming Shan,Qinhong Zhou,Yilun DU,Joshua B. Tenenbaum,Tianmin Shu,Chuang Gan。 [ABS],2023.7
互動:探索Chatgpt作為合作社的潛力
Po-Lin Chen,Cheng-Shang Chang。 [ABS],2023.8
Autogen:通過多代理對話框架啟用下一代LLM應用程序
青牛,加根·班薩爾,傑尤張,Yiran Wu,Shaokun Zhang,Erkang Zhu,Beibin Li,Li Jiang,Xiaoyun Zhang,Chi Wang。 [ABS],2023.8
通過及時工程探索大語模型和基於代理的建模的交集
愛德華·詹普朗(Edward Junprung)。 [ABS],2023.8
神經攤銷的推斷,用於嵌套多代理推理
Kunal Jha,Tuan Anh Le,Chuanyang Jin,Yen-Ling Kuo,Joshua B. Tenenbaum,Tianmin Shu。 [ABS],2023.8
GPT-in-the-limop:多型系統的自適應決策
Nathalia Nascimento,Paulo Alencar,Donald Cowan。 [ABS],2023.8
主動:建立主動合作AI,具有大語言模型
Ceyao Zhang,Kaijie Yang,Siyi Hu,Zihao Wang,Guanghe Li,Yihang Sun,Cheng Zhang,Zhaowei Zhang,Anji Liu,Song-Chun Zhu,Xiaojun Zhu,Xiaojun Zhun,Jiage Zhang Zhang,Junge Zhang,Feng Yin,Yitao Liang,Yaodong,Yaodong,Yaodong。 [ABS],2023.8
Mindagent:新興遊戲互動
Ran Gong,Qiuyuan Huang,Xiaojian MA,Hoi VO,Zane Durante Yusuke Noda,Zilong Zheng,Song-Chun Zhu Zhu demetri Terzopoulos,li fei fei,li fei fei,jianfeng gao。 [ABS],2023.9
探索LLM代理商的協作機制:一種社會心理學觀點
Jintian Zhang,Xin Xu,Shumin Deng。 [ABS],2023.10
Lumos:具有統一數據,模塊化設計和開源LLM的學習代理商
Da Yin,Faeze Brahman,Abhilasha Ravichander,Khyathi Chandu,Kai-Wei Chang,Yejin Choi,Bill Yuchen Lin。 [ABS],2023.11
自動手術:自動代理通過自我計劃從頭開始學習
Shuofei Qiao,Ningyu Zhang,Runnan Fang,Yujie Luo,Wangchunshu Zhou,Yuchen Eleanor Jiang,Chengfei LV,Huajun Chen。 [ABS],2024.01
COREX:通過多模型協作來推動複雜推理的界限
Qiushi Sun,Zhangyue Yin,Xiang Li,Zhiyong Wu,Xipeng Qiu,Lingpeng Kong。 [ABS],2023.10
COMM:協作多代理,多樣性路徑,促使復雜的問題解決
Pei Chen,Boran Han,Shuai Zhang。 [ABS],2024.4
進入未知的未知數:通過參與語言模型代理對話的人類學習
Yucheng Jiang,Yijia Shao,Dekun MA,Sina J. Semnani,Monica S. Lam。 [ABS],2024.8
通過多代理辯論在大型語言模型中鼓勵不同的思維
田liang,Zhiwei He,Wenxiang Jiao,Xing Wang,Yan Wang,Rui Wang,Yujiu Yang,Zhaopeng Tu,Shuming Shi。 [ABS],2023.5
通過多種辯論改善語言模型中的事實和推理
Yilun Du,Shuang Li,Antonio Torralba,Joshua B. Tenenbaum,Igor Mordatch。 [ABS],2023.5
通過自我播放和文本中的自我反饋學習來改善語言模型談判
Yao Fu,Hao Peng,Tushar Khot,Mirella Lapata。 [ABS],2023.5
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate
Chi-Min Chan, Weize Chen, Yusheng Su, Jianxuan Yu, Wei Xue, Shanghang Zhang, Jie Fu, Zhiyuan Liu. [abs], 2023.8
How susceptible are LLMs to Logical Fallacies?
Amirreza Payandeh, Dan Pluth, Jordan Hosier, Xuesu Xiao, Vijay K. Gurbani. [abs], 2023.8
Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto. [abs], 2023.9
Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View
Jintian Zhang, Xin Xu, Shumin Deng. [abs], 2023.10
Generative Agents: Interactive Simulacra of Human Behavior
Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein. [abs], 2023.4
Training Socially Aligned Language Models in Simulated Human Society.
Ruibo Liu, Ruixin Yang, Chenyan Jia, Ge Zhang, Denny Zhou, Andrew M. Dai, Diyi Yang, Soroush Vosoughi. [abs], 2023.5
The Role of Summarization in Generative Agents: A Preliminary Perspective
Xiachong Feng, Xiaocheng Feng, Bing Qin. [abs], 2023.5
Epidemic Modeling with Generative Agents.
Ross Williams, Niyousha Hosseinichimeh, Aritra Majumdar, Navid Ghaffarzadegan. [abs], 2023.7
S^3: Social-network Simulation System with Large Language Model-Empowered Agents
Chen Gao, Xiaochong Lan, Zhihong Lu, Jinzhu Mao, Jinghua Piao, Huandong Wang, Depeng Jin, Yong Li. [abs],2023.7
AgentSims: An Open-Source Sandbox for Large Language Model Evaluation
Jiaju Lin, Haoran Zhao, Aochi Zhang, Yiting Wu, Huqiuyue Ping, Qin Chen . [abs], 2023.8
CGMI: Configurable General Multi-Agent Interaction Framework
Shi Jinxin, Zhao Jiabao, Wang Yilei, Wu Xingjiao, Li Jiawen, He Liang. [abs], 2023.8
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education
Yuhao Dan, Zhikai Lei, Yiyang Gu, Yong Li, Jianghao Yin, Jiaju Lin, Linhao Ye, Zhiyan Tie, Yougen Zhou, Yilei Wang, Aimin Zhou, Ze Zhou, Qin Chen, Jie Zhou, Liang He, Xipeng Qiu. [abs], 2023.8
SuperAgent: A Customer Service Chatbot for E-commerce Websites
Lei Cui, Shaohan Huang, Furu Wei, Chuanqi Tan, Chaoqun Duan, Ming Zhou. [paper], 2017
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Yonatan Bisk, Daniel Fried, Uri Alon, Graham Neubig. [abs], 2023.7
LLM As DBA
Xuanhe Zhou, Guoliang Li, Zhiyuan Liu. [abs], 2023.8
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
Homanga Bharadhwaj, Jay Vakil, Mohit Sharma, Abhinav Gupta, Shubham Tulsiani, Vikash Kumar. [paper], 2023
Is There Any Social Principle for LLM-Based Agents?
Jitao Bai, Simiao Zhang, Zhonghao Chen. [abs], 2023.8
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Minlie Huang, Nan Duan, Weizhu Chen. [abs] [code], 2023.9
Agentic Skill Discovery
Xufeng Zhao, Cornelius Weber, Stefan Wermter [abs] [code], 2024.5
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
Yijia Shao, Yucheng Jiang, Theodore A. Kanell, Peter Xu, Omar Khattab, Monica S. Lam. [abs], [code], 2024.4
Agents: An Open-source Framework for Autonomous Language Agents
Wangchunshu Zhou, Yuchen Eleanor Jiang, Long Li, Jialong Wu, Tiannan Wang, Shi Qiu, Jintian Zhang, Jing Chen, Ruipu Wu, Shuai Wang, Shiding Zhu, Jiyu Chen, Wentao Zhang, Ningyu Zhang, Huajun Chen, Peng Cui, Mrinmaya Sachan. [abs], 2023.9
Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
Zijun Liu, Yanzhe Zhang, Peng Li, Yang Liu, Diyi Yang. [abs], 2023.10
OpenAgents: An Open Platform for Language Agents in the Wild
Tianbao Xie, Fan Zhou, Zhoujun Cheng, Peng Shi, Luoxuan Weng, Yitao Liu, Toh Jing Hua, Junning Zhao, Qian Liu, Che Liu, Leo Z. Liu, Yiheng Xu, Hongjin Su, Dongchan Shin, Caiming Xiong, Tao Yu. [abs], 2023.10
AutoAct: Automatic Agent Learning from Scratch via Self-Planning
Shuofei Qiao, Ningyu Zhang, Runnan Fang, Yujie Luo, Wangchunshu Zhou, Yuchen Eleanor Jiang, Chengfei Lv, Huajun Chen. [abs], 2024.01
An Interactive Agent Foundation Model
Zane Durante, Bidipta Sarkar, Ran Gong, Rohan Taori, Yusuke Noda, Paul Tang, Ehsan Adeli, Shrinidhi Kowshika Lakshmikanth, Kevin Schulman, Arnold Milstein, Demetri Terzopoulos, Ade Famoti, Noboru Kuno, Ashley Llorens, Hoi Vo, Katsu Ikeuchi, Li Fei-Fei, Jianfeng Gao, Naoki Wake, Qiuyuan Huang. [abs], 2024.02
Emergence of Social Norms in Generative Agent Societies: Principles and Architecture
Siyue Ren, Zhiyao Cui, Ruiqi Song, Zhen Wang, Shuyue Hu. [abs], 2024.03
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models
Fangzhi Xu, Qiushi Sun, Kanzhi Cheng, Jun Liu, Yu Qiao, Zhiyong Wu. [abs], 2024.06
AgentSquare: Automatic LLM Agent Search in Modular Design Space
Yu Shang, Yu Li, Keyu Zhao, Likai Ma, Jiahe Liu, Fengli Xu, Yong Li [abs], 2024.10
Enhancing Trust in LLM-Based AI Automation Agents: New Considerations and Future Challenges
Sivan Schwartz, Avi Yaeli, Segev Shlomov. [abs], 2023.8
Mind2Web: Towards a Generalist Agent for the Web
Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel Stevens, Boshi Wang, Huan Sun, Yu Su. [abs], 2023.6
The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions
Yujia Peng , Jiaheng Han, Zhenliang Zhang , Lifeng Fan , Tengyu Liu, Siyuan Qi, Xue Feng, Yuxi Ma, Yizhou Wang, Song-Chun Zhu. [abs], 2023.7
AgentBench: Evaluating LLMs as Agents
Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, Yuxiao Dong, Jie Tang . [abs], 2023.8
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents.
Zhiwei Liu, Weiran Yao, Jianguo Zhang, Le Xue, Shelby Heinecke, Rithesh Murthy, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese. [abs], 2023.8
Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto. [abs], 2023.9
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao. [abs], 2023.12
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Jian Xie, Kai Zhang, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su. [abs], 2024.02
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Chang Ma, Junlei Zhang, Zhihao Zhu, Cheng Yang, Yujiu Yang, Yaohui Jin, Zhenzhong Lan, Lingpeng Kong, Junxian He. [abs], 2024.01
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu. [abs], 2024.04
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
Jaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim. [abs], 2024.05
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Harsh Trivedi, Tushar Khot, Mareike Hartmann, Ruskin Manku, Vinty Dong, Edward Li, Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian. [abs], 2024.07
Benchmarking Agentic Workflow Generation
Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen . [abs], 2024.10
| 類型 | 工具 |
|---|---|
| Agent with tool | AutoGPT、LangChain、Transformer Agents、WorkGPT、AutoChain 、Langroid、 WebArena、GPT Researcher、BMTools、ToolBench 、AgentGPT、xlang |
| Multi-Agent | CAMEL、GPTeam、AgentVerse、MetaGPT、Langroid、SocraticAI、AutoGen、Agents |
| 其他的 | AutoAgents 、GPT Engineer |
Auto-GPT. An experimental open-source attempt to make GPT-4 fully autonomous.
LangChain. Building applications with LLMs through composability.
駱駝。 Communicative Agents for “Mind” Exploration of Large Scale Language Model Society.
GPTeam. GPTeam: An open-source multi-agent simulation.
Transformer Agents. In short, it provides a natural language API on top of transformers: we define a set of curated tools and design an agent to interpret natural language and to use these tools.
AgentVerse . A Framework for Multi-LLM Environment Simulation.
AutoAgents. Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.
GPT Engineer . Specify what you want it to build, the AI asks for clarification, and then builds it.
MetaGPT. The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
WorkGPT. A GPT agent framework for invoking APIs.
AutoChain. Build lightweight, extensible, and testable LLM Agents.
Langroid. Harness LLMs with Multi-Agent Programming.
SocraticAI. Problem solving by engaging multiple AI agents in conversation with each other and the user.
WebArena. A Realistic Web Environment for Building Autonomous Agents.
GPT Researcher. GPT based autonomous agent that does online comprehensive research on any given topic.
BMTools. Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
ToolBench. An open platform for training, serving, and evaluating large language model for tool learning.
AgentGPT. Assemble, configure, and deploy autonomous AI Agents in your browser.
xlang. An open-source framework for building and evaluating language model agents via executable language grounding
Agently. A fast way to build LLM Agent based Application ? A light weight framework helps developers to create amazing LLM based applications.
Lagent. A lightweight framework for building LLM-based agents.
ToolEmu An LLM-based emulation framework for testing and identifying the risks of LLM-based agents
storm A knowledge agent that researches a topic and generates a full-length report with citations.
" Join us in improving this repository! If you know of any important works we've missed, please contribute. Your efforts are highly valued! "