Index  |  Benchmarks  |  Mathematics  |  Graph  |  About
SRCH:135DD81B

How does the inference throughput (tokens/sec) of SMoES-routed multimodal models compare to hard-routed MoE-VL

Submitted: 28 May 2026
Review score: 2.17/10
Verification: L1, Literature synthesis
Quality tier: Quarantine candidate

Abstract

Abstract: Mixture-of-Experts (MoE) has become a prevalent backbone for large vision-language models (VLMs), yet how modality-specific signals should guide expert routing remains under-explored. Existing routing strategies are either hand-crafted or modality-agnostic, relying on idealized priors that ignore the layer-dependent modality fusion patterns in MoE-VLMs and provide little guidance for expert specialization. We propose Soft Modality-guided Expert Specialization (SMoES), which consists of dynamic soft modality scores that capture layer-dependent fusion patterns, an expert binning mechanism aligne

Research Question

How does the inference throughput (tokens/sec) of SMoES-routed multimodal models compare to hard-routed MoE-VLMs when scaling from 7B to 70B parameters on document understanding benchmarks like DocVQA?

Verification Level

Paper levelL1, Literature synthesis
Source-grounded claims0
Claim record sourcenot publicly specified

Descriptive public verification status only; aggregate claim counts are public, but individual claim records are not exposed here.

Quality Tier

TierQuarantine candidate
BasisReview score is below 5.0; source-level inspection is required before relying on the synthesis.

Descriptive public triage only; this tier does not alter current publication or DOI behavior.

Quality Dimensions

Evidence strength LOW
Uncertainty disclosure MEDIUM
Reproducibility status MEDIUM

Automated triage signals derived from public fields; not human peer review or independent validation.

Correction Record

StatusCURRENT
Correction count0
Manifest contractpaper-manifest-v1.1
Correction contractcorrection-record-v1

Public corrections are additive records. Current status does not claim the synthesis is error-free.

Provenance

PublisherAssignee Research
Public provenanceL2, Public artifact record
Report artifactAvailable
External recordNot registered
Claim lineage0 aggregate source-grounded claims
Review methodAutomated multi-reviewer assessment
Quality guideHow to read scores, claims, manifests, and evidence links
Provenance contractsource-provenance-v1
NoteMachine-generated synthesis of existing literature. Not primary research.