Blog

Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval - TACL2023

Motivation: Recent work has shown that models such as BERT are not "structurally ready" to aggregate textual information into a [CLS] vector for dense passage retrieval (DPR). This "lack of readiness" results from the gap between language model pre-training and DPR fine-tuning. Methods: In this work, we instead propose to fully exploit knowledge in a pretrained language model for DPR by aggregating the contextualized token embeddings into a dense vector, which we call agg*. Experiments: By concatenating vectors from the [CLS] token and agg*, our agg* retriever model substantially improves the effectiveness of dense retrieval models on both in-domain and zero-shot evaluations without introducing substantial training overhead.
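The summary above only says that token embeddings are aggregated into one dense vector and then concatenated with the [CLS] vector. As a rough illustration of that shape of computation, here is a minimal sketch in plain Python. The element-wise max pooling, the function names (`max_pool`, `build_passage_vector`), and the toy numbers are all assumptions made for illustration; this is not the paper's actual agg* construction, which is not spelled out here.

```python
from typing import List

Vector = List[float]


def max_pool(token_embeddings: List[Vector]) -> Vector:
    """Element-wise max over token vectors (an assumed aggregation choice)."""
    if not token_embeddings:
        raise ValueError("need at least one token embedding")
    # zip(*...) groups the values of each dimension across all tokens.
    return [max(dim_values) for dim_values in zip(*token_embeddings)]


def build_passage_vector(cls_vector: Vector, token_embeddings: List[Vector]) -> Vector:
    """Concatenate the [CLS] vector with the aggregated token vector."""
    return list(cls_vector) + max_pool(token_embeddings)


def dot(a: Vector, b: Vector) -> float:
    """Dot-product relevance score between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum(x * y for x, y in zip(a, b))


# Toy inputs (hypothetical values, not real model outputs).
cls_vec = [0.1, 0.2]
tokens = [[0.5, -1.0, 0.0], [0.2, 0.3, 0.9]]
passage_vec = build_passage_vector(cls_vec, tokens)
```

Under these assumptions a query would be encoded the same way, and `dot(query_vec, passage_vec)` would serve as the retrieval score.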

2024.02.26

Today is a gift!

Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval - TACL2023

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning - EMNLP2023 Best Paper

GPT-RE: In-context Learning for Relation Extraction using Large Language Models

Expected Calibration Error (ECE) of LLMs

Paper reading "Knowledge Rumination for Pre-trained Language Models" -- ACL2023

Paper reading "Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback" -- EMNLP2023 short paper

Paper reading "Data Curation Alone Can Stabilize In-context Learning" -- ACL2023

Paper reading "More Samples or More Prompt Inputs? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering" -- Arxiv2023

Paper reading "Instruct Me More! Random Prompting for Visual In-Context Learning" -- WACV2023

Paper reading "Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting" -- EMNLP2023 Findings

Paper reading "Diversify Question Generation with Retrieval-Augmented Style Transfer" -- EMNLP2023

Paper reading "Query-as-context Pre-training for Dense Passage Retrieval" -- EMNLP2023

Paper reading "Active Retrieval Augmented Generation" -- EMNLP2023

Paper reading "Questions Are All You Need to Train a Dense Passage Retriever" -- TACL2023

Paper reading "SOUL: Towards Sentiment and Opinion Understanding of Language" -- EMNLP2023-main

Paper reading "Human-like systematic generalization through a meta-learning neural network" -- Nature2023

Paper reading "MoT: Memory-of-Thought Enables ChatGPT to Self-Improve" -- EMNLP2023 main

Paper reading "Large Language Models Struggle to Learn Long-Tail Knowledge" -- ICML2022

Paper reading "Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!" -- EMNLP2023

Paper reading "FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation" -- SIGIR2023

Paper reading "RARR: Researching and Revising What Language Models Say, Using Language Models" -- ACL2023

Paper reading "Large Language Models Are Human-Level Prompt Engineers" -- ICLR2023

Combine images online.

Paper reading "Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration" - EMNLP2023 Findings

Lecture Notes: "Towards Generative Search and Recommendation".

Rough reading "Contrastive Decoding Improves Reasoning in Large Language Models - arxiv2023".

Paper reading "RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit - arxiv2023".

Paper reading "Augmented Large Language Models with Parametric Knowledge Guiding - arxiv2023".

Seaborn: statistical data visualization

A paper list about Large Language Models

Rough Reading "Query Rewriting for Retrieval-Augmented Large Language Models"

Rough Reading "Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy"

Rough Reading "In-Context Demonstration Selection with Cross Entropy Difference"

Rough Reading "RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models - Arxiv2023"

Rough Reading "Generate rather than Retrieve: Large Language Models are Strong Context Generators - ICLR2023"

Rough Reading "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"

Rough Reading "Z-ICL : Zero-Shot In-Context Learning with Pseudo-Demonstrations"

Rough Reading "When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories"

Paper Reading "Making pre-trained language models better few-shot learners"

Rough Reading "Understanding In-Context Learning via Supportive Pretraining Data"

Determinantal Point Process (DPP)

Paper Reading "Diverse Demonstrations Improve In-context Compositional Generalization" -- ACL2023

Paper Reading "Unified Demonstration Retriever for In-Context Learning" -- ACL2023 Oral

Paper Reading "Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering" -- ACL2023 Oral

Paper Reading "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In" -- ACL2023 Oral

Sweety and Tiny.

My first blog.

Coming soon.