Assignee Research: Index of Papers

[44]

Vendi-RAG: Adaptively Trading-Off Diversity And Quality Significantly Improves R

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20415648

Abstract: Retrieval-augmented generation (RAG) enhances large language models (LLMs) for domain-specific question-answering (QA) tasks by leveraging external knowledge sources. However, traditional RAG systems primarily focus on relevance-based retrieval and often struggle with redundancy, especially when reasoning requires…

[43]

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Model

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20415634

Abstract: Instruction-tuned language models (LM) are able to respond to imperative commands, providing a more natural user interface compared to their base counterparts. In this work, we present Promptriever, the first retrieval model able to be prompted like an LM. To train Promptriever, we curate and release a new…

[42]

Evaluating Multi-Hop Reasoning in RAG Systems: A Comparison of LLM-Based Retriev

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20415620

Abstract: Retrieval-augmented generation (RAG) enhances large language models (LLMs) with external knowledge to answer questions more accurately. However, research on evaluating RAG systems-particularly the retriever component-remains limited, as most existing work focuses on single-context retrieval rather than multi-hop…

[41]

DRAGON: Domain-specific Robust Automatic Data Generation for RAG Optimization

27 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims.

Abstract: Retrieval-augmented generation (RAG) can substantially enhance the performance of LLMs on knowledge-intensive tasks. Various RAG paradigms - including vanilla, planning-based, and iterative RAG - all depend on a robust retriever, yet existing retrievers rely heavily on public knowledge and often falter when faced…

[40]

Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-ho

27 May 2026. Score: 3.00/10. Verification: L1, Literature synthesis.

Abstract: Multi-hop question answering is a knowledge-intensive complex problem. Large Language Models (LLMs) use their Chain of Thoughts (CoT) capability to reason complex problems step by step, and retrieval-augmentation can effectively alleviate factual errors caused by outdated and unknown knowledge in LLMs. Recent works…

[39]

PRISM: Agentic Retrieval with LLMs for Multi-Hop Question Answering

27 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20413352

Abstract: Retrieval plays a central role in multi-hop question answering (QA), where answering complex questions requires gathering multiple pieces of evidence. We introduce an Agentic Retrieval System that leverages large language models (LLMs) in a structured loop to retrieve relevant evidence with high precision and recall.…

[38]

VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

27 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20413164

Abstract: Retrieval-Augmented Generation (RAG) has demonstrated remarkable success in enhancing Large Language Models (LLMs) through external knowledge integration, yet its application has primarily focused on textual content, leaving the rich domain of multi-modal video knowledge predominantly unexplored. This paper…

[37]

Evaluating Multi-Hop Reasoning in RAG Systems: A Comparison of LLM-Based Retriev

27 May 2026. Score: 7.33/10. Verification: L2, Source-grounded claims.

Abstract: Retrieval-augmented generation (RAG) enhances large language models (LLMs) with external knowledge to answer questions more accurately. However, research on evaluating RAG systems-particularly the retriever component-remains limited, as most existing work focuses on single-context retrieval rather than multi-hop…

[36]

SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs

27 May 2026. Score: 7.00/10. Verification: L2, Source-grounded claims.

Abstract: Mixture-of-Experts (MoE) has become a prevalent backbone for large vision-language models (VLMs), yet how modality-specific signals should guide expert routing remains under-explored. Existing routing strategies are either hand-crafted or modality-agnostic, relying on idealized priors that ignore the layer-dependent…

[35]

Context-Length Robustness in Question Answering Models: A Comparative Empirical

27 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims.

Abstract: Large language models are increasingly deployed in settings where relevant information is embedded within long and noisy contexts. Despite this, robustness to growing context length remains poorly understood across different question answering tasks. In this work, we present a controlled empirical study of…

[34]

LFM2 Technical Report

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20412586

Abstract: We present LFM2, a family of Liquid Foundation Models designed for efficient on-device deployment and strong task capabilities. Using hardware-in-the-loop architecture search under edge latency and memory constraints, we obtain a compact hybrid backbone that combines gated short convolutions with a small number of…

[33]

MambaFormer: Token-Level Guided Routing Mixture-of-Experts for Accurate and Effi

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20412328

Abstract: The deployment of large language models (LLMs) in real-world clinical applications is constrained by the fundamental trade-off between computational cost and the efficiency of linear-time models. To address this, we propose an LLM-based MambaFormer hybrid Mixture-of-Experts (MoE) framework for efficient medical…

[32]

Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-ho

27 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20412206

Abstract: Multi-hop question answering is a knowledge-intensive complex problem. Large Language Models (LLMs) use their Chain of Thoughts (CoT) capability to reason complex problems step by step, and retrieval-augmentation can effectively alleviate factual errors caused by outdated and unknown knowledge in LLMs. Recent works…

[31]

Learning From Failure: Integrating Negative Examples when Fine-tuning Large Lang

27 May 2026. Score: 8.00/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20412097

Abstract: Large language models (LLMs) have achieved success in acting as agents, which interact with environments through tools such as search engines. However, LLMs are optimized for language generation instead of tool use during training or alignment, limiting their effectiveness as agents. To resolve this problem, previous…

[30]

AnyExperts: On-Demand Expert Allocation for Multimodal Language Models with Mixt

27 May 2026. Score: 8.00/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411949

Abstract: Multimodal Mixture-of-Experts (MoE) models offer a promising path toward scalable and efficient large vision-language systems. However, existing approaches rely on rigid routing strategies (typically activating a fixed number of experts per token) ignoring the inherent heterogeneity in semantic importance across…

[29]

Mixture-of-Experts Models in Vision: Routing, Optimization, and Generalization

27 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411788

Abstract: Mixture-of-Experts (MoE) architectures enable conditional computation by routing inputs to multiple expert subnetworks and are often motivated as a mechanism for scaling large language models. In this project, we instead study MoE behavior in an image classification setting, focusing on predictive performance, expert…

[28]

NOVA: A Benchmark for Anomaly Localization and Clinical Reasoning in Brain MRI

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411786

Abstract: In many real-world applications, deployed models encounter inputs that differ from the data seen during training. Out-of-distribution detection identifies whether an input stems from an unseen distribution, while open-world recognition flags such inputs to ensure the system remains robust as ever-emerging, previously…

[27]

CAT: Content-Adaptive Image Tokenization

27 May 2026. Score: 7.67/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411770

Abstract: Most existing image tokenizers encode images into a fixed number of tokens or patches, overlooking the inherent variability in image complexity. To address this, we introduce Content-Adaptive Tokenizer (CAT), which dynamically adjusts representation capacity based on the image content and encodes simpler images into…

[26]

GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned

27 May 2026. Score: 9.00/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411590

Abstract: Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts. However, it remains unclear how to build machine learning architectures that generalize to the complex distributional shifts naturally occurring in the real world. Here, we develop GraphMETRO, a Graph…

[25]

Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning

27 May 2026. Score: 8.33/10. Verification: L2, Source-grounded claims.

Abstract: Mainstream parameter-efficient fine-tuning (PEFT) methods, such as LoRA or Adapter, project a model's hidden states to a lower dimension, allowing pre-trained models to adapt to new data through this low-rank bottleneck. However, PEFT tasks involving multiple modalities, like vision-language (VL) tasks, require not…

[24]

SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs

27 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411378

Abstract: Mixture-of-Experts (MoE) has become a prevalent backbone for large vision-language models (VLMs), yet how modality-specific signals should guide expert routing remains under-explored. Existing routing strategies are either hand-crafted or modality-agnostic, relying on idealized priors that ignore the layer-dependent…

[23]

ExpertFlow: Efficient Mixture-of-Experts Inference via Predictive Expert Caching

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20411364

Abstract: Sparse Mixture-of-Experts (MoE) models can outperform dense large language models at similar computation by activating only a small set of experts per token. However, stacking many expert modules introduces substantial parameter memory, which makes MoE models difficult to deploy in memory-constrained environments…

[22]

SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs

27 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims.

Abstract: Mixture-of-Experts (MoE) has become a prevalent backbone for large vision-language models (VLMs), yet how modality-specific signals should guide expert routing remains under-explored. Existing routing strategies are either hand-crafted or modality-agnostic, relying on idealized priors that ignore the layer-dependent…

[21]

SMoES: Soft Modality-Guided Expert Specialization in MoE-VLMs

27 May 2026. Score: 6.33/10. Verification: L2, Source-grounded claims.

Abstract: Mixture-of-Experts (MoE) has become a prevalent backbone for large vision-language models (VLMs), yet how modality-specific signals should guide expert routing remains under-explored. Existing routing strategies are either hand-crafted or modality-agnostic, relying on idealized priors that ignore the layer-dependent…

[20]

ExpertFlow: Efficient Mixture-of-Experts Inference via Predictive Expert Caching

27 May 2026. Score: 2.67/10. Verification: L1, Literature synthesis.

Abstract: Sparse Mixture-of-Experts (MoE) models can outperform dense large language models at similar computation by activating only a small set of experts per token. However, stacking many expert modules introduces substantial parameter memory, which makes MoE models difficult to deploy in memory-constrained environments…