Conversational models have started to access the web and back up their claims with sources (a.k.a. attribution). These chatbots can thus arguably be considered information retrieval machines, competing against or even replacing traditional search engines. We would like to dedicate a space to these models, but also to the more general field of generative information retrieval. We tentatively split the field into two main topics: Grounded Answer Generation and Generative Document Retrieval. We also include generative recommendation, generative grounded summarization, etc.
Pull-Requests Welcome!
Deterministic Quoting: Making LLMs Safer for Healthcare
Matt Yeung
Personal blog - Apr 2024 [link]
Retrieval Augmented Generation Research: 2017-2024
Moritz Mallawitsch
Scaling Knowledge - Feb 2024 [link]
Mastering RAG: How To Architect An Enterprise RAG System
Pratik Bhavsar
Galileo Labs - Jan 2024 [link]
Running Mixtral 8x7 locally with LlamaIndex
LlamaIndex
LlamaIndex blog - Dec 2023 [link]
Advanced RAG Techniques: an Illustrated Overview
Ivan Ilin
Towards AI - Dec 2023 [link]
Multimodal RAG pipeline with LlamaIndex and Neo4j
Tomaz Bratanic
LlamaIndex blog - Dec 2023 [link]
Benchmarking RAG on tables
LangChain
LangChain blog - Dec 2023 [link]
Advanced RAG 01: Small-to-Big Retrieval
Sophia Yang
Towards Data Science - Nov 2023 [link]
Query Transformations
LangChain
LangChain blog - Oct 2023 [link]
What Makes a Dialog Agent Useful?
Nazneen Rajani, Nathan Lambert, Victor Sanh, Thomas Wolf
Hugging Face blog - Jan 2023 [link]
Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk
Josh A. Goldstein, Girish Sastry, Micah Musser, Renée DiResta, Matthew Gentzel, Katerina Sedova
OpenAI blog - Jan 2023 [link]
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation
Satyapriya Krishna, Kalpesh Krishna, Anhad Mohananey, Steven Schwarcz, Adam Stambler, Shyam Upadhyay, Manaal Faruqui
arXiv - Sep 2024 [paper] [data]
LitSearch: A Retrieval Benchmark for Scientific Literature Search
Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao
arXiv - Jul 2023 [paper] [data]
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Hongjin Su, Howard Yen, Mengzhou Xia, Weijia Shi, Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi, Zachary S. Siegel, Michael Tang, Ruoxi Sun, Jinsung Yoon, Sercan O. Arik, Danqi Chen, Tao Yu
arXiv - Oct 2023 [paper] [data] [code]
FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, Thang Luong
arXiv - Oct 2023 [paper] [code]
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, Margaret Hagan, Megan Ma, Michael Livermore, Nikon Rasumov-Rahe, Nils Holzenberger, Noam Kolt, Peter Henderson, Sean Rehaag, Sharad Goel, Shang Gao, Spencer Williams, Sunny Gandhi, Tom Zur, Varun Iyer, Zehua Li
arXiv - Aug 2023 [paper] [dataset]
OpenAssistant Conversations - Democratizing Large Language Model Alignment
Andreas Köpf, Yannic Kilcher, Dimitri von Rütte, Sotiris Anagnostidis, Zhi-Rui Tam, Keith Stevens, Abdullah Barhoum, Nguyen Minh Duc, Oliver Stanley, Richárd Nagyfi, Shahul ES, Sameer Suri, David Glushkov, Arnav Dantuluri, Andrew Maguire, Christoph Schuhmann, Huu Nguyen, Alexander Mattick
arXiv - Apr 2023 [paper]
ChatGPT-RetrievalQA
Arian Askari, Mohammad Aliannejadi, Evangelos Kanoulas, Suzan Verberne
GitHub - Feb 2023 [code]
KAMEL: Knowledge Analysis with Multitoken Entities in Language Models
Jan-Christoph Kalo, Leandra Fichtel
AKBC 22 - [paper]
TruthfulQA: Measuring How Models Mimic Human Falsehoods
Stephanie Lin, Jacob Hilton, Owain Evans
arXiv - Sep 2021 [paper] [code]
Complex Answer Retrieval
Laura Dietz, Manisha Verma, Filip Radlinski, Nick Craswell, Ben Gamari, Jeff Dalton, John Foley
TREC - 2017-2019 [link]
GraphRAG
Jonathan Larson, Steven Truitt
Microsoft - Feb 2024 [code]
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers
Gal Yona, Roee Aharoni, Mor Geva
arXiv - Jan 2024 [paper]
DHS LLM Workshop - Module 6
Sourab Mangrulkar
GitHub - Dec 2023 [code]
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development
Avirup Sil, Jaydeep Sen, Bhavani Iyer, Martin Franz, Kshitij Fadnis, Mihaela Bornea, Sara Rosenthal, Scott McCarley, Rong Zhang, Vishwajeet Kumar, Yulong Li, Md Arafat Sultan, Riyaz Bhat, Radu Florian, Salim Roukos
arXiv - Jan 2023 [paper] [code]
TRL: Transformer Reinforcement Learning
Leandro von Werra, Younes Belkada, Lewis Tunstall, Edward Beeching, Tristan Thrush, Nathan Lambert, Shengyi Huang
GitHub - 2020 [code]
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-Tau Yih, Pang Wei Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi
PyPI - May 2023 [paper] [code]
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng, Vidhisha Balachandran, Yuyang Bai, Yulia Tsvetkov
arXiv - May 2023 [paper] [code]
Evaluating Verifiability in Generative Search Engines
Nelson F. Liu, Tianyi Zhang, Percy Liang
arXiv - Apr 2023 [paper] [code]
Workshop on Generative AI for Recommender Systems and Personalization
Narges Tabari, Aniket Deshmukh, Wang-Cheng Kang, Rashmi Gangadharaiah, Hamed Zamani, Julian McAuley, George Karypis
KDD 24 - Aug 2024 [link]
Second Workshop on Generative Information Retrieval
Gabriel Bénédict, Ruqing Zhang, Donald Metzler, Andrew Yates, Ziyan Jiang
SIGIR 24 - Jul 2024 [link]
Personalized Generative AI
Zheng Chen, Ziyan Jiang, Fan Yang, Zhankui He, Yupeng Hou, Eunah Cho, Julian McAuley, Aram Galstyan, Xiaohua Hu, Jie Yang
CIKM 23 - Oct 2023 [link]
First Workshop on Recommendation with Generative Models
Wenjie Wang, Yong Liu, Yang Zhang, Weiwen Liu, Fuli Feng, Xiangnan He, Aixin Sun
CIKM 23 - Oct 2023 [link]
First Workshop on Generative Information Retrieval
Gabriel Bénédict, Ruqing Zhang, Donald Metzler
SIGIR 23 - Jul 2023 [link]
Retrieval-based Language Models and Applications
Akari Asai, Sewon Min, Zexuan Zhong, Danqi Chen
ACL 23 - Jul 2023 [link]
Agentic Information Retrieval
Weinan Zhang, Junwei Liao, Ning Li, Kounianhua Du
arXiv - Oct 2024 [paper]
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien, Jyothir S V, Mohammad Aflah Khan, Jaydeep Borkar, Christopher A. Choquette-Choo, Jacob Ray Fuehne, Stella Biderman, Tracy Ke, Katherine Lee, Naomi Saphra
arXiv - Jun 2024 [paper]
ChatGPT is bullshit
Michael Townsen Hicks, James Humphries, Joe Slater
Ethics Inf Technol - Jun 2024 [paper]
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai, Pichao Wang, Tianjun Xiao, Tong He, Zongbo Han, Zheng Zhang, Mike Zheng Shou
arXiv - Apr 2024 [paper]
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yuyao Zhang, Peitian Zhang, Yutao Zhu, Zhicheng Dou
arXiv - Apr 2024 [paper]
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu, Zehan Qi, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu
arXiv - Mar 2024 [paper]
Report on the 1st Workshop on Generative Information Retrieval (Gen-IR 2023) at SIGIR 2023
Gabriel Bénédict, Ruqing Zhang, Donald Metzler, Andrew Yates, Romain Deffayet, Philipp Hager, Sami Jullien
SIGIR Forum - Dec 2023 [paper]
Report on the 1st Workshop on Task Focused IR in the Era of Generative AI
Chirag Shah, Ryen W. White
SIGIR Forum - Dec 2023 [paper]
Towards Generative Search and Recommendation: A Keynote at RecSys 2023
Tat-Seng Chua
SIGIR Forum - Dec 2023 [paper]
Large Search Model: Redefining Search Stack in the Era of LLMs
Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei
SIGIR Forum - Dec 2023 [paper]
Large Language Models for Generative Information Extraction: A Survey
Derong Xu, Wei Chen, Wenjun Peng, Chao Zhang, Tong Xu, Xiangyu Zhao, Xian Wu, Yefeng Zheng, Enhong Chen
arXiv - Dec 2023 [paper]
Dense Text Retrieval based on Pretrained Language Models: A Survey
Wayne Xin Zhao, Jing Liu, Ruiyang Ren, Ji-Rong Wen
TOIS - Dec 2023 [paper]
Retrieval-Augmented Generation for Large Language Models: A Survey
Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Haofen Wang
arXiv - Dec 2023 [paper]
Calibrated Language Models Must Hallucinate
Adam Tauman Kalai, Santosh S. Vempala
arXiv - Nov 2023 [paper]
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi
arXiv - Sep 2023 [paper]
The False Promise of Imitating Proprietary LLMs
Arnav Gudibande, Eric Wallace, Charlie Snell, Xinyang Geng, Hao Liu, Pieter Abbeel, Sergey Levine, Dawn Song
arXiv - May 2023 [paper]
Generative Recommendation: Towards Next-generation Recommender Paradigm
Fengji Zhang, Bei Chen, Yue Zhang, Jin Liu, Daoguang Zan, Yi Mao, Jian-Guang Lou, Weizhu Chen
arXiv - Apr 2023 [paper]
Augmented Language Models: a Survey
Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-Yu, Asli Celikyilmaz, Edouard Grave, Yann LeCun, Thomas Scialom
arXiv - Feb 2023 [paper]
Generative Language Models and Automated Influence Operations: Emerging Threats and Potential Mitigations
Josh A. Goldstein, Girish Sastry, Micah Musser, Renée DiResta, Matthew Gentzel, Katerina Sedova
arXiv - Jan 2023 [paper]
Conversational Information Seeking. An Introduction to Conversational Search, Recommendation, and Question Answering
Hamed Zamani, Johanne R. Trippas, Jeff Dalton and Filip Radlinski
arXiv - Jan 2023 [paper]
Facts
Kevin Mulligan and Fabrice Correia
The Stanford Encyclopedia of Philosophy - Winter 2021 [url]
Truthful AI: Developing and governing AI that does not lie
Owain Evans, Owen Cotton-Barratt, Lukas Finnveden, Adam Bales, Avital Balwit, Peter Wills, Luca Righetti, William Saunders
arXiv - Oct 2021 [paper]
Rethinking Search: Making Domain Experts out of Dilettantes
Donald Metzler, Yi Tay, Dara Bahri, Marc Najork
SIGIR Forum 2021 - May 2021 [paper]
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, Kellie Webster
arXiv - Dec 2022 [paper]
external grounding/retrieval at inference time
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Parth Sarthi, Salman Abdullah, Aditi Tuli, Shubh Khanna, Anna Goldie, Christopher D. Manning
ICLR 24 - Jan 2024 [paper]
Corrective Retrieval Augmented Generation
Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling
arXiv - Jan 2024 [paper]
It's About Time: Incorporating Temporality in Retrieval Augmented Language Models
Anoushka Gade, Jorjeta Jetcheva
arXiv - Jan 2024 [paper]
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture
Angels Balaguer, Vinamra Benara, Renato Luiz de Freitas Cunha, Roberto de M. Estevão Filho, Todd Hendry, Daniel Holstein, Jennifer Marsman, Nick Mecklenburg, Sara Malvar, Leonardo O. Nunes, Rafael Padilha, Morris Sharp, Bruno Silva, Swati Sharma, Vijay Aski, Ranveer Chandra
arXiv - Jan 2024 [paper]
Sequencing Matters: A Generate-Retrieve-Generate Model for Building Conversational Agents
Quinn Patwardhan, Grace Hui Yang
TREC 23 - Nov 2023 [paper]
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Anonymous
ICLR 24 - Oct 2023 [paper]
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Anonymous
ICLR 24 - Oct 2023 [paper]
In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models
Anonymous
ICLR 24 - Oct 2023 [paper]
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Anonymous
ICLR 24 - Oct 2023 [paper]
Retrieval meets Long Context Large Language Models
Anonymous
ICLR 24 - Oct 2023 [paper]
Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise
Anonymous
ICLR 24 - Oct 2023 [paper]
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining
Anonymous
ICLR 24 - Oct 2023 [paper]
SuRe: Improving Open-domain Question Answering of LLMs via Summarized Retrieval
Anonymous
ICLR 24 - Oct 2023 [paper]
RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation
Anonymous
ICLR 24 - Oct 2023 [paper]
Retrieval is Accurate Generation
Anonymous
ICLR 24 - Oct 2023 [paper]
PaperQA: Retrieval-Augmented Generative Agent for Scientific Research
Anonymous
ICLR 24 - Oct 2023 [paper]
Understanding Retrieval Augmentation for Long-Form Question Answering
Anonymous
ICLR 24 - Oct 2023 [paper]
Personalized Language Generation via Bayesian Metric Augmented Retrieval
Anonymous
ICLR 24 - Oct 2023 [paper]
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts
arXiv - Oct 2023 [paper] [code]
RA-DIT: Retrieval-Augmented Dual Instruction Tuning
Xi Victoria Lin, Xilun Chen, Mingda Chen, Weijia Shi, Maria Lomeli, Rich James, Pedro Rodriguez, Jacob Kahn, Gergely Szilvasy, Mike Lewis, Luke Zettlemoyer, Scott Yih
arXiv - Aug 2023 [paper]
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh, Si-An Chen, Chun-Liang Li, Yasuhisa Fujii, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister
arXiv - Aug 2023 [paper]
ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models
Jianyi Zhang, Aashiq Muhamed, Aditya Anantharaman, Guoyin Wang, Changyou Chen, Kai Zhong, Qingjun Cui, Yi Xu, Belinda Zeng, Trishul Chilimbi, Yiran Chen
ACL 23 - Jul 2023 [paper]
Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models
Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann, Richard Johansson
ACL 23 - Jul 2023 [paper]
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models
Zhiyuan Peng, Xuyang Wu, Yi Fang
arXiv - Jun 2023 [paper]
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit
Jiongnan Liu, Jiajie Jin, Zihan Wang, Jiehan Cheng, Zhicheng Dou, Ji-Rong Wen
arXiv - Jun 2023 [paper]
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Xiao Liu, Hanyu Lai, Hao Yu, Yifan Xu, Aohan Zeng, Zhengxiao Du, Peng Zhang, Yuxiao Dong, Jie Tang
arXiv - Jun 2023 [paper]
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia
Sina J. Semnani, Violet Z. Yao, Heidi C. Zhang, Monica S. Lam
Findings of EMNLP 2023 - May 2023 [paper] [code] [demo]
RET-LLM: Towards a General Read-Write Memory for Large Language Models
Ali Modarressi, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze
arXiv - May 2023 [paper]
Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez
arXiv - May 2023 [paper] [code]
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study
Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro
arXiv - Apr 2023 [paper] [code]
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback
Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
arXiv - Feb 2023 [paper] [code]
Toolformer: Language Models Can Teach Themselves to Use Tools
Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Luke Zettlemoyer, Nicola Cancedda, Thomas Scialom
arXiv - Feb 2023 [paper]
REPLUG: Retrieval-Augmented Black-Box Language Models
Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-Tau Yih
arXiv - Jan 2023 [paper]
In-Context Retrieval-Augmented Language Models
Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham
AI21 Labs - Jan 2023 [paper] [code]
Recipes for building an open-domain chatbot
Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Eric Michael Smith, Y-Lan Boureau, Jason Weston
EACL 2021 - Apr 2021 [paper]
AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation
Hamed Zamani, Johanne R. Trippas, Jeff Dalton and Filip Radlinski
arXiv - Jan 2023 [paper]
RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models
Shitao Xiao, Zheng Liu
arXiv - Nov 2023 [paper]
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, Matei Zaharia
arXiv - Dec 2022 [paper]
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre
arXiv - Feb 2022 [paper]
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre
arXiv - Dec 2021 [paper]
WebGPT: Browser-assisted question-answering with human feedback
Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
arXiv - Dec 2021 [paper]
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA
Nora Kassner, Hinrich Schütze
EMNLP 2020 - Nov 2020 [paper]
REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, Ming-Wei Chang
ICML 2020 - Jul 2020 [paper]
A Hybrid Retrieval-Generation Neural Conversation Model
Liu Yang, Junjie Hu, Minghui Qiu, Chen Qu, Jianfeng Gao, W. Bruce Croft, Xiaodong Liu, Yelong Shen, Jingjing Liu
arXiv - Apr 2019 [paper]
Grounded in internal model weights at inference time
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
Hoyeon Chang, Jinho Park, Seonghyeon Ye, Sohee Yang, Youngkyung Seo, Du-Seong Chang, Minjoon Seo
arXiv - Jun 2024 [paper]
Fine-tuning Language Models for Factuality
Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher D. Manning, Chelsea Finn
arXiv - Nov 2023 [paper]
R-Tuning: Teaching Large Language Models to Refuse Unknown Questions
Hanning Zhang, Shizhe Diao, Yong Lin, Yi R. Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, Tong Zhang
arXiv - Nov 2023 [paper]
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models
Peng Wang, Ningyu Zhang, Xin Xie, Yunzhi Yao, Bozhong Tian, Mengru Wang, Zekun Xi, Siyuan Cheng, Kangwei Liu, Guozhou Zheng, Huajun Chen
arXiv - Aug 2023 [paper]
Inspecting and Editing Knowledge Representations in Language Models
Evan Hernandez, Belinda Z. Li, Jacob Andreas
arXiv - Apr 2023 [paper] [code]
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
Gautier Izacard, Edouard Grave
arXiv - Feb 2023 [paper]
Discovering Latent Knowledge in Language Models Without Supervision
Collin Burns, Haotian Ye, Dan Klein, Jacob Steinhardt
ICLR 23 - Feb 2023 [paper] [code]
Galactica: A Large Language Model for Science
Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic
Galactica.org - 2022 [paper]
BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Kurt Shuster, Jing Xu, Mojtaba Komeili, Da Ju, Eric Michael Smith, Stephen Roller, Megan Ung, Moya Chen, Kushal Arora, Joshua Lane, Morteza Behrooz, William Ngan, Spencer Poff, Naman Goyal, Arthur Szlam, Y-Lan Boureau, Melanie Kambadur, Jason Weston
arXiv - Aug 2022 [paper]
Generate rather than Retrieve: Large Language Models are Strong Context Generators
Wenhao Yu, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya Sanyal, Chenguang Zhu, Michael Zeng, Meng Jiang
ICLR 2023 - Sep 2022 [paper]
Recitation-Augmented Language Models
Zhiqing Sun, Xuezhi Wang, Yi Tay, Yiming Yang, Denny Zhou
ICLR 2023 - Sep 2022 [paper]
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese, Nat McAleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-Gillingham, Jonathan Uesato, Po-Sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving
arXiv - Sep 2022 [paper]
LaMDA: Language Models for Dialog Applications
Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-Ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-Hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-John, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-Arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
arXiv - Jan 2022 [paper]
Language Models As or For Knowledge Bases
Simon Razniewski, Andrew Yates, Nora Kassner, Gerhard Weikum
DL4KG 2021 - Oct 2021 [paper]
Generalization through Memorization: Nearest Neighbor Language Models
Urvashi Khandelwal, Omer Levy, Dan Jurafsky, Luke Zettlemoyer, Mike Lewis
ICLR 2020 - Sep 2019 [paper] [code]
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Wenhao Yu, Hongming Zhang, Xiaoman Pan, Kaixin Ma, Hongwei Wang, Dong Yu
arXiv - Nov 2023 [paper]
Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers
Weiwei Sun, Zheng Chen, Xinyu Ma, Lingyong Yan, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
arXiv 2023 - Nov 2023 [paper]
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen
ICLR 2024 - Jan 2024 [paper]
A Stitch in Time Saves Nine: Detecting and Mitigating Hallucinations of LLMs by Validating Low-Confidence Generation
Neeraj Varshney, Wenlin Yao, Hongming Zhang, Jianshu Chen, Dong Yu
arXiv - Aug 2023 [paper]
RARR: Researching and Revising What Language Models Say, Using Language Models
Luyu Gao, Zhuyun Dai, Panupong Pasupat, Anthony Chen, Arun Tejasvi Chaganty, Yicheng Fan, Vincent Zhao, Ni Lao, Hongrae Lee, Da-Cheng Juan, Kelvin Guu
ACL 2023 - Jul 2023 [paper]
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Ruochen Zhao, Xingxuan Li, Shafiq Joty, Chengwei Qin, Lidong Bing
ACL 2023 - Jul 2023 [paper]
Active Retrieval Augmented Generation
Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig
arXiv - May 2023 [paper] [code]
Improving Language Models via Plug-and-Play Retrieval Feedback
Wenhao Yu, Zhihan Zhang, Zhenwen Liang, Meng Jiang, Ashish Sabharwal
arXiv - May 2023 [paper]
Linguistic Calibration of Long-Form Generations
Neil Band, Xuechen Li, Tengyu Ma, Tatsunori Hashimoto
arXiv 2024 - Jun 2024 [paper]
To Believe or Not to Believe Your LLM
Yasin Abbasi Yadkori, Ilja Kuzborskij, András György, Csaba Szepesvári
arXiv 2024 - Jun 2024 [paper]
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao
arXiv 2024 - May 2024 [paper]
Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs
Daniel D. Johnson, Daniel Tarlow, David Duvenaud, Chris J. Maddison
arXiv 2024 - Feb 2024 [paper]
Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models
Anonymous
ICLR 24 - Oct 2023 [paper]
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James Glass, Pengcheng He
ICLR 24 - Sep 2023 [paper]
A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models
Stefan Hegselmann, Shannon Zejiang Shen, Florian Gierse, Monica Agrawal, David Sontag, Xiaoyi Jiang
arXiv 24 - Feb 2024 [paper]
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
Hamed Zamani, Michael Bendersky
arXiv 24 - May 2024 [paper]
Constitutional AI: Harmlessness from AI Feedback
Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosiute, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Jared Kaplan
Anthropic.com - Dec 2022 [paper]
Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback
Jing Xu, Megan Ung, Mojtaba Komeili, Kushal Arora, Y-Lan Boureau, Jason Weston
arXiv - Aug 2022 [paper]
Retrieval-Augmented Multimodal Language Modeling
Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-Tau Yih
arXiv - Nov 2022 [paper]
RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training
Zheng Yuan, Qiao Jin, Chuanqi Tan, Zhengyun Zhao, Hongyi Yuan, Fei Huang, Songfang Huang
arXiv - Mar 2023 [paper]
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal
ACL 23 - Jul 2023 [paper]
ReAct: Synergizing Reasoning and Acting in Language Models
Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
arXiv - Oct 2022 [paper]
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
Fengji Zhang, Bei Chen, Yue Zhang, Jin Liu, Daoguang Zan, Yi Mao, Jian-Guang Lou, Weizhu Chen
arXiv - Mar 2023 [paper]
DocPrompting: Generating Code by Retrieving the Docs
Shuyan Zhou, Uri Alon, Frank F. Xu, Zhiruo Wang, Zhengbao Jiang, Graham Neubig
ICLR 23 - Jul 2022 [paper] [code] [data]
Generate, Filter, and Fuse: Query Expansion via Multi-Step Keyword Generation for Zero-Shot Neural Rankers
Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky
arXiv - Nov 2023 [paper]
Agent4Ranking: Semantic Robust Ranking via Personalized Query Rewriting Using Multi-agent LLM
Xiaopeng Li, Lixin Su, Pengyue Jia, Xiangyu Zhao, Suqi Cheng, Junfeng Wang, Dawei Yin
arXiv - Dec 2023 [paper]
Unified Generative & Dense Retrieval for Query Rewriting in Sponsored Search
Akash Kumar Mohankumar, Bhargav Dodla, Gururaj K, Amit Singh
arXiv - Sep 2022 [paper]
Generating Factually Consistent Sport Highlights Narrations
Noah Sarfati, Ido Yerushalmy, Michael Chertok, Yosi Keller
MMSports 2023 - Oct 23 [paper]
Genetic Generative Information Retrieval
Hrishikesh Kulkarni, Zachary Young, Nazli Goharian, Ophir Frieder, Sean MacAvaney
DocEng 23 - Aug 23 [paper]
Learning to summarize from human feedback
Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano
NeurIPS 2020 – Sep 2020 [paper]
On Faithfulness and Factuality in Abstractive Summarization
Joshua Maynez, Shashi Narayan, Bernd Bohnet, Ryan McDonald
ACL 2020 – May 2020 [paper]
Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion
Yujian Liu, Jiabao Ji, Tong Yu, Ryan Rossi, Sungchul Kim, Handong Zhao, Ritwik Sinha, Yang Zhang, Shiyu Chang
arXiv – Jan 2024 [paper]
We jump-started this section by reusing the content of awesome-generative-retrieval-models and give full credit to Chriskuei for that! We have since added some content on top.
De-DSI: Decentralised Differentiable Search Index
Petru Neague, Marcel Gregoriadis, Johan Pouwelse
EuroMLSys 24 – Apr 2024 [paper]
Listwise Generative Retrieval Models via a Sequential Learning Process
Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen, Xueqi Cheng
TOIS 2024 – Mar 2024 [Paper]
Distillation Enhanced Generative Retrieval
Yongqi Li, Zhen Zhang, Wenjie Wang, Liqiang Nie, Wenjie Li, Tat-Seng Chua
arXiv 2024 – Feb 2024 [Paper]
Self-Retrieval: Building an Information Retrieval System with One Large Language Model
Qiaoyu Tang, Jiawei Chen, Bowen Yu, Yaojie Lu, Cheng Fu, Haiyang Yu, Hongyu Lin, Fei Huang, Ben He, Xianpei Han, Le Sun, Yongbin Li
arXiv 2024 – Feb 2024 [Paper]
Generative Dense Retrieval: Memory Can Be a Burden
Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda Wang, Xupeng Miao, Kan Li
EACL 2024 - Jan 2024 [paper] [code]
Auto Search Indexer for End-to-End Document Retrieval
Tianchi Yang, Minghui Song, Zihan Zhang, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang
EMNLP 2023 – Dec 2023 [paper]
DiffusionRet: Diffusion-Enhanced Generative Retriever using Constrained Decoding
Shanbao Qiao, Xuebing Liu, Seung-Hoon Na
EMNLP Findings 2023 – Dec 2023 [paper]
Scalable and Effective Generative Information Retrieval
Hansi Zeng, Chen Luo, Bowen Jin, Sheikh Muhammad Sarwar, Tianxin Wei, Hamed Zamani
WWW 2024 - Nov 2023 [paper] [code]
Nonparametric Decoding for Generative Retrieval
Hyunji Lee, JaeYoung Kim, Hoyeon Chang, Hanseok Oh, Sohee Yang, Vladimir Karpukhin, Yi Lu, Minjoon Seo
ACL Findings 2023 – Jul 2023 [paper]
Model-enhanced Vector Index
Hailin Zhang, Yujing Wang, Qi Chen, Ruiheng Chang, Ting Zhang, Ziming Miao, Yingyan Hou, Yang Ding, Xupeng Miao, Haonan Wang, Bochen Pang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Xing Xie, Mao Yang, Bin Cui
NeurIPS 2023 – May 2023 [paper] [code]
Continual Learning for Generative Retrieval over Dynamic Corpora
Jiangui Chen, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen, Yixing Fan, Xueqi Cheng
CIKM 2023 - Aug 2023 [paper]
Learning to Rank in Generative Retrieval
Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li
arXiv – Jun 2023 [paper]
Large Language Models are Built-in Autoregressive Search Engines
Noah Ziems, Wenhao Yu, Zhihan Zhang, Meng Jiang
ACL Findings 2023 – May 2023 [paper]
Multiview Identifiers Enhanced Generative Retrieval
Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li
ACL 2023 – May 2023 [paper]
How Does Generative Retrieval Scale to Millions of Passages?
Ronak Pradeep, Kai Hui, Jai Gupta, Adam D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran
arXiv – May 2023 [paper]
TOME: A Two-stage Approach for Model-based Retrieval
Ruiyang Ren, Wayne Xin Zhao, Jing Liu, Hua Wu, Ji-Rong Wen, Haifeng Wang
ACL 2023 - May 2023 [paper]
Understanding Differential Search Index for Text Retrieval
Xiaoyang Chen, Yanjiang Liu, Ben He, Le Sun, Yingfei Sun
ACL Findings 2023 - May 2023 [paper]
Learning to Tokenize for Generative Retrieval
Weiwei Sun, Lingyong Yan, Zheng Chen, Shuaiqiang Wang, Haichao Zhu, Pengjie Ren, Zhumin Chen, Dawei Yin, Maarten de Rijke, Zhaochun Ren
arXiv – Apr 2023 [paper]
DynamicRetriever: A Pre-trained Model-based IR System Without an Explicit Index
Yu-Jia Zhou, Jing Yao, Zhi-Cheng Dou, Ledell Wu, Ji-Rong Wen
Machine Intelligence Research – Jan 2023 [paper]
DSI++: Updating Transformer Memory with New Documents
Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Jinfeng Rao, Marc Najork, Emma Strubell, Donald Metzler
arXiv – Dec 2022 [paper]
CodeDSI: Differentiable Code Search
Usama Nadeem, Noah Ziems, Shaoen Wu
arXiv – Oct 2022 [paper]
Contextualized Generative Retrieval
Hyunji Lee, Jaeyoung Kim, Hoyeon Chang, Hanseok Oh, Sohee Yang, Vlad Karpukhin, Yi Lu, Minjoon Seo
arXiv – Oct 2022 [paper]
Transformer Memory as a Differentiable Search Index
Yi Tay, Vinh Q. Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Gupta, Tal Schuster, William W. Cohen, Donald Metzler
NeurIPS 2022 – Oct 2022 [paper] [Video] [third-party code]
A Neural Corpus Indexer for Document Retrieval
Wang et al.
arXiv 2022 [paper]
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon, Daxin Jiang
arXiv 2022 [paper] [Code]
DynamicRetriever: A Pre-training Model-based IR System with Neither Sparse nor Dense Index
Zhou et al.
arXiv 2022 [paper]
Ultron: An Ultimate Retriever on Corpus with a Model-based Indexer
Zhou et al.
arXiv 2022 [paper]
Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous Decoding
Hansi Zeng, Chen Luo, Hamed Zamani
arXiv – Apr 2024 [paper] [Code]
NOVO: Learnable and Interpretable Document Identifiers for Model-Based IR
Zihan Wang, Yujia Zhou, Yiteng Tu, Zhicheng Dou
CIKM 2023 – Oct 2023 [paper]
Generative Retrieval as Multi-Vector Dense Retrieval
Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren
SIGIR 2024 – Mar 2024 [paper] [Code]
Re3val: Reinforced and Reranked Generative Retrieval
EuiYul Song, Sangryul Kim, Haeju Lee, Joonkee Kim, James Thorne
EACL Findings 2024 – Jan 2024 [paper]
GLEN: Generative Retrieval via Lexical Index Learning
Sunkyung Lee, Minjin Choi, Jongwuk Lee
EMNLP 2023 – Dec 2023 [paper] [Code]
Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback
Yujia Zhou, Zhicheng Dou, Ji-Rong Wen
EMNLP 2023 – Dec 2023 [paper]
Generative Retrieval with Large Language Models
Anonymous
ICLR 24 – Oct 2023 [paper]
Semantic-Enhanced Differentiable Search Index Inspired by Learning Strategies
Yubao Tang, Ruqing Zhang, Jiafeng Guo, Jiangui Chen, Zuowei Zhu, Shuaiqiang Wang, Dawei Yin, Xueqi Cheng
KDD 2023 – May 2023 [paper]
Term-Sets Can Be Strong Document Identifiers For Auto-Regressive Search Engines
Peitian Zhang, Zheng Liu, Yujia Zhou, Zhicheng Dou, Zhao Cao
arXiv – May 2023 [paper] [Code]
A Unified Generative Retriever for Knowledge-Intensive Language Tasks via Prompt Learning
Jiangui Chen, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yiqun Liu, Yixing Fan, Xueqi Cheng
SIGIR 2023 – Apr 2023 [paper] [Code]
CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks
Jiangui Chen, Ruqing Zhang, Jiafeng Guo, Yiqun Liu, Yixing Fan, Xueqi Cheng
CIKM 2022 – Aug 2022 [paper] [Code]
Autoregressive Search Engines: Generating Substrings as Document Identifiers
Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Wen-tau Yih, Sebastian Riedel, Fabio Petroni
arXiv – Apr 2022 [paper] [Code]
Autoregressive Entity Retrieval
Nicola De Cao, Gautier Izacard, Sebastian Riedel, Fabio Petroni
ICLR 2021 – Oct 2020 [paper] [Code]
Data-Efficient Autoregressive Document Retrieval for Fact Verification
James Thorne
SustaiNLP@EMNLP 2022 – Nov 2022 [paper]
GERE: Generative Evidence Retrieval for Fact Verification
Jiangui Chen, Ruqing Zhang, Jiafeng Guo, Yixing Fan, Xueqi Cheng
SIGIR 2022 [paper] [Code]
Generative Multi-hop Retrieval
Hyunji Lee, Sohee Yang, Hanseok Oh, Minjoon Seo
arXiv – Apr 2022 [paper]
Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens
Ting-Ji Huang, Jia-Qi Yang, Chunxu Shen, Kai-Qi Liu, De-Chuan Zhan, Han-Jia Ye
arXiv – Jun 2024 [paper]
Plug-in Diffusion Model for Sequential Recommendation
Haokai Ma, Ruobing Xie, Lei Meng, Xin Chen, Xu Zhang, Leyu Lin, Zhanhui Kang
arXiv – Jan 2024 [paper]
Towards Graph-Aware Diffusion Modeling For Collaborative Filtering
Yunqin Zhu, Chao Wang, Hui Xiong
arXiv – Nov 2023 [paper]
RecMind: Large Language Model Powered Agent For Recommendation
Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
arXiv – Aug 2023 [paper]
Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation
Jizhi Zhang, Keqin Bao, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He
RecSys 2023 – Jul 2023 [paper]
RecFusion: A Binomial Diffusion Process for 1D Data for Recommendation
Gabriel Bénédict, Olivier Jeunen, Samuele Papa, Samarth Bhargav, Daan Odijk, Maarten de Rijke
arXiv – Jun 2023 [paper]
A First Look at LLM-Powered Generative News Recommendation
Qijiong Liu, Nuo Chen, Tetsuya Sakai, Xiao-Ming Wu
arXiv – Jun 2023 [paper]
Large Language Models as Zero-Shot Conversational Recommenders
Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian McAuley, Wayne Xin Zhao
arXiv – May 2023 [paper]
DiffuRec: A Diffusion Model for Sequential Recommendation
Zihao Li, Aixin Sun, Chenliang Li
arXiv – Apr 2023 [paper]
Diffusion Recommender Model
Wenjie Wang, Yiyan Xu, Fuli Feng, Xinyu Lin, Xiangnan He, Tat-Seng Chua
SIGIR 2023 – Apr 2023 [paper]
Blurring-Sharpening Process Models for Collaborative Filtering
Jeongwhan Choi, Seoyoung Hong, Noseong Park, Sung-Bae Cho
SIGIR 2023 – Apr 2023 [paper] [code]
Recommender Systems with Generative Retrieval
Shashank Rajput, Nikhil Mehta, Anima Singh, Raghunandan Keshavan, Trung Vu, Lukasz Heldt, Lichan Hong, Yi Tay, Vinh Q. Tran, Jonah Samost, Maciej Kula, Ed H. Chi, Maheswaran Sathiamoorthy
non-archival – Mar 2023 [paper]
Pre-train, Prompt and Recommendation: A Comprehensive Survey of Language Modelling Paradigm Adaptations in Recommender Systems
Peng Liu, Lemei Zhang, Jon Atle Gulla
arXiv – Feb 2023 [paper]
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet, Thibaut Thonet, Jean-Michel Renders, and Maarten de Rijke
WSDM 2023 – Feb 2023 [paper]
Recommendation via Collaborative Diffusion Generative Model
Joojo Walker, Ting Zhong, Fengli Zhang, Qiang Gao, Fan Zhou
KSEM 2022 – Aug 2022 [paper]
DocGraphLM: Documental Graph Language Model for Information Extraction
Dongsheng Wang, Zhiqiang Ma, Armineh Nourbakhsh, Kang Gu, Sameena Shah
arXiv – Jan 2024 [paper]
KBFormer: A Diffusion Model for Structured Entity Completion
Ouail Kitouni, Niklas Nolte, James Hensman, Bhaskar Mitra
arXiv – Dec 2023 [paper]
From Retrieval to Generation: Efficient and Effective Entity Set Expansion
Shulin Huang, Shirong Ma, Yangning Li, Yinghui Li, Hai-Tao Zheng, Yong Jiang
arXiv – Apr 2023 [paper]
Crawling the Internal Knowledge-Base of Language Models
Roi Cohen, Mor Geva, Jonathan Berant, Amir Globerson
arXiv – Jan 2023 [paper]
Prompt Tuning or Fine-Tuning - Investigating Relational Knowledge in Pre-Trained Language Models
Leandra Fichtel, Jan-Christoph Kalo, Wolf-Tilo Balke
AKBC 2021 – [paper]
Language Models as Knowledge Bases?
Fabio Petroni, Tim Rocktäschel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel
EMNLP 2019 – Sep 2019 [paper]
Although some of these are not accompanied by a paper, they might be useful to other Generative IR researchers for empirical studies or interface design considerations.
⚡️ Gemini Dec 2023 [live]
⚡️ factiverse Jun 2023 [live]
⚡️ devmarizer Mar 2023 [live]
⚡️ TaxGenius Mar 2023 [live]
⚡️ doc-gpt Mar 2023 [live]
⚡️ book-gpt Feb 2023 [live]
⚡️ Neeva Feb 2023 [live]
⚡️ Golden Retriever Feb 2023 [live]
⚡️ Bing – Prometheus Feb 2023 [waitlist]
⚡️ Google – Bard Feb 2023 [only in certain countries]
⚡️ Paper QA Feb 2023 [code] [demo]
⚡️ DocsGPT Feb 2023 [live] [code]
⚡️ DocAsker Jan 2023 [live]
⚡️ Lexii.ai Jan 2023 [live]
⚡️ YOU.com Dec 2022 [live]
⚡️ arXivGPT Dec 2022 [Chrome extension]
⚡️ GPT Index Nov 2022 [API]
⚡️ BlenderBot Aug 2022 [live (USA)] [model weights] [code] [paper1] [paper2]
⚡️ PHIND date? [live]
⚡️ Perplexity date? [live]
⚡️ Galactica date? [demo] [API] [paper]
⚡️ Elicit date? [live]
⚡️ ZetaAlpha date? [live] uses OpenAI API
To get just the paper titles, run `grep '\*\*' README.md | sed 's/\*\*//g'` (the asterisks must be escaped; an unescaped `**` is not a literal match in grep or sed).