You can't pick your neighbors, or can you? When and how to rely on retrieval in the kNN-LM

A Drozdov, S Wang, R Rahimi, A McCallum… - arXiv preprint arXiv …, 2022 - arxiv.org
arXiv preprint arXiv:2210.15859, 2022 - arxiv.org
Retrieval-enhanced language models (LMs), which condition their predictions on text retrieved from large external datastores, have recently shown significant perplexity improvements compared to standard LMs. One such approach, the kNN-LM, interpolates any existing LM's predictions with the output of a k-nearest neighbors model and requires no additional training. In this paper, we explore the importance of lexical and semantic matching in the context of items retrieved by the kNN-LM. We find two trends: (1) the presence of large overlapping n-grams between the datastore and evaluation set is an important factor in strong performance, even when the datastore is derived from the training data; and (2) the kNN-LM is most beneficial when retrieved items have high semantic similarity with the query. Based on our analysis, we define a new formulation of the kNN-LM that uses retrieval quality to assign the interpolation coefficient. We empirically measure the effectiveness of our approach on two English language modeling datasets, Wikitext-103 and PG-19. Our re-formulation of the kNN-LM is beneficial in both cases, and leads to nearly 4% improvement in perplexity on the Wikitext-103 test set.
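
For reference, the standard kNN-LM (Khandelwal et al., 2020) interpolates the base LM's next-token distribution with a distribution built from retrieved datastore entries, p(w|x) = lam * p_kNN(w|x) + (1 - lam) * p_LM(w|x), and the abstract's reformulation assigns the interpolation coefficient from retrieval quality rather than keeping it fixed. The NumPy sketch below illustrates both ideas; the adaptive_lambda function and its distance-based form are illustrative assumptions, since the abstract does not specify the exact coefficient function.

```python
import numpy as np

def knn_lm_interpolate(p_lm, neighbor_dists, neighbor_tokens, vocab_size,
                       lam=0.25, temperature=1.0):
    # Softmax over negative neighbor distances gives the kNN distribution;
    # probability mass is aggregated onto each neighbor's stored next token.
    weights = np.exp(-np.asarray(neighbor_dists, dtype=float) / temperature)
    weights /= weights.sum()
    p_knn = np.zeros(vocab_size)
    for w, tok in zip(weights, neighbor_tokens):
        p_knn[tok] += w
    # Standard kNN-LM interpolation:
    # p(w|x) = lam * p_kNN(w|x) + (1 - lam) * p_LM(w|x)
    return lam * p_knn + (1.0 - lam) * p_lm

def adaptive_lambda(neighbor_dists, scale=10.0):
    # Hypothetical retrieval-quality gate: closer (more semantically similar)
    # neighbors lead to a larger interpolation coefficient. The paper assigns
    # the coefficient from retrieval quality, but this exact form is an
    # assumption for illustration only.
    min_dist = float(np.min(neighbor_dists))
    return float(np.exp(-min_dist / scale))  # in (0, 1]

# Toy usage: four retrieved neighbors for the current context.
vocab_size = 50_000
p_lm = np.full(vocab_size, 1.0 / vocab_size)   # placeholder base-LM distribution
dists = [3.2, 4.1, 9.8, 12.0]                  # distances in the datastore key space
tokens = [17, 17, 942, 17]                     # next-token ids stored with those keys
lam = adaptive_lambda(dists)
p = knn_lm_interpolate(p_lm, dists, tokens, vocab_size, lam=lam)
print(lam, p[17], p[942])
```

When the nearest retrieved neighbor is close to the query, the sketch shifts weight toward the kNN distribution; when all neighbors are far, it falls back toward the base LM, which mirrors the abstract's finding that retrieval helps most under high semantic similarity.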