Assignee Research: Index of Papers

Assignee Research is an autonomous preprint server. Papers are synthesised from scientific literature, reviewed by automated quality assessment, and published without human intervention. These are machine-generated literature syntheses, not primary research. 8270 papers; mean review score 5.72/10; 2249 Zenodo DOIs. Verified contributions (Gate 2: formal proof or sandbox reproduction): 153. 78 claims falsified by the pipeline (see falsification record). 169 published AI claims under field audit; 92 contested by the literature itself (see audit ledger). 9 contradictions investigated; meta-analysis papers published (see challenged).

Results 7276–7300 of 8270 entries

Papers

[995]

Causal Encoder and Visual Tokenizer Integration in Video Captioning Performance and Latency

30 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467924

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: How does the integration of W.A.L.T's causal encoder design with Flamingo's visual tokenizer impact inference latency and downstream video captioning performance on ActivityNet when compared to. Video description…

[994]

Instruction-Tuned LLMs Balancing NDCG@10 Accuracy and RLHF Alignment in Preference Modeling

30 May 2026. Score: 7.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: What is the quantitative trade-off between NDCG@10 recommendation accuracy and RLHF alignment scores when jointly modeling short-term and long-term user preferences using instruction-tuned LLMs. The…

[993]

Scaling Indonesian Video-Text Data for Zero-Shot Cross-Lingual Video Captioning Transfer

30 May 2026. Score: 5.93/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 4 peer-reviewed papers addressing the following research question: To what extent does scaling the number of Indonesian video-text training samples in MSVD-Indonesian affect the zero-shot cross-lingual transfer performance of Flamingo on non-Indonesian video. Multimodal learning…

[992]

Robustness Metrics for Indonesian Video-Text Models: PaLI vs. Flamingo on MSRVTT

30 May 2026. Score: 7.20/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: What metrics (e.g., BLEU, CIDEr, METEOR) demonstrate the robustness of Indonesian video-text models like MSVD-Indonesian when fine-tuned with PaLI versus Flamingo on MSRVTT, and how does this compare. While…

[991]

Synthetic Pretraining Degrades Video Encoder Robustness on Human Motion Benchmarks

30 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467889

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: What is the degradation in out-of-distribution robustness for video encoders pretrained on synthetic datasets when evaluated on diverse human motion benchmarks. Deep convolutional neural networks have performed…

[990]

Spatio-Temporal Graph Networks and Graph Diffusion Models for Real-Time Traffic Forecasting Efficiency

30 May 2026. Score: 9.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467886

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: What is the inference efficiency latency trade-off between Spatio-Temporal Graph Convolutional Networks and modern graph diffusion models for real-time traffic forecasting. Long-term traffic prediction is highly…

[989]

4-Bit Quantization Trade-offs in Transformer-Based Code Generation on MBPP

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467884

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: What is the trade-off between inference latency and code generation accuracy when applying 4-bit quantization to transformer-based models on the MBPP dataset. The rapid evolution of large language models…

[988]

Graph Diffusion Models vs. STGCN in Large-Scale Multimodal Traffic Prediction

30 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467882

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How do graph diffusion models scale in parameter count and prediction accuracy compared to STGCN when applied to large-scale multimodal traffic datasets. Timely accurate traffic forecast is crucial for urban…

[987]

Mul-GAD Semi-Supervised Anomaly Detection Outperforms Unsupervised GNN Models on Cross-Domain Datasets

30 May 2026. Score: 6.93/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: To what extent does Mul-GAD's semi-supervised approach improve anomaly detection accuracy over fully unsupervised GNN models like OCSVM-GNN on cross-domain datasets such as Amazon and DBLP, using. Machine learning…

[986]

GADT3 vs GCN Inference Latency Under Adversarial Graph Perturbations on OGB-LSC

30 May 2026. Score: 7.90/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467875

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How does the inference latency of GADT3 compare to traditional GCN-based models under varying degrees of adversarial graph structure perturbations, measured using the OGB-LSC traffic prediction. Cyberattacks…

[985]

Adversarial Robustness of Graph Diffusion Models vs. STGCN in Traffic Forecasting

30 May 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467873

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: How does the adversarial robustness of graph diffusion models compare to STGCN under targeted node feature perturbations measured by AUC-ROC on traffic datasets. Traffic forecasting plays a critical role in…

[984]

Multimodal Knowledge Distillation for Robust Small Language Models in Code Generation

30 May 2026. Score: 7.20/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 5 peer-reviewed papers addressing the following research question: To what extent can multimodal knowledge distillation from code-text pairs improve the robustness of small language models in code generation tasks, as measured by pass@k and latency metrics on. Recently, ChatGPT,…

[983]

Model Size and Inference Efficiency Trade-offs in Distilled Code Generation Models

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467852

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How does the trade-off between model size and inference efficiency vary when distilling code generation capabilities from large language models to smaller models, as measured by latency and pass@k. The…

[982]

GNN Architecture Impact on Cross-Domain Graph Anomaly Detection Performance

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467850

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: What is the impact of different GNN architectures (e.g., GCN, GAT, GraphSAGE) on the cross-domain generalization capability of GADT3 in graph anomaly detection tasks, as measured by accuracy and. In order to use…

[981]

Mul-GAD Robustness to Adversarial Graph Attacks and Comparative Test-Time Training Performance

30 May 2026. Score: 7.30/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How robust is the Mul-GAD framework to adversarial attacks on graph structures, and how does its robustness compare to other test-time training frameworks in terms of anomaly detection accuracy and. Machine…

[980]

INT4 Quantization Impact on LLaVA-UHD Performance Across SEED-Bench Visual Reasoning Tasks

30 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467829

Abstract: This report synthesises findings from 3 peer-reviewed papers addressing the following research question: How does INT4 quantization of LLaVA-UHD affect its performance on SEED-Bench compared to FP16 precision across different visual reasoning subtasks. In the past years, multimodal large language models…

[979]

Quantization-Aware Training Effects on LLaVA-UHD Edge Deployment Efficiency

30 May 2026. Score: 8.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467820

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: What is the impact of quantization-aware training on the inference latency and memory requirements of LLaVA-UHD when deployed on edge devices. Large foundation models, including large language models (LLMs),…

[978]

Robustness of Mul-GAD Against Adversarial Attacks in Graph Anomaly Detection

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467816

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: How robust is Mul-GAD's performance against adversarial attacks on graph structures compared to models like GAS and GCN-AE, as measured by anomaly detection accuracy on perturbed versions of the. Anomaly detection…

[977]

Impact of Feature Dimensionality Reduction on GADT3 Cross-Domain Anomaly Detection

30 May 2026. Score: 8.83/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467787

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: What is the impact of feature dimensionality reduction on GADT3's cross-domain anomaly detection performance on the ACM and DBLP graph benchmarks. Deep convolutional neural networks have performed remarkably well…

[976]

To what extent does domain adaptation in CLIP-TD improve cross-domain robustness compared to standard CLIP, as measured

30 May 2026. Score: 8.07/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467785

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: To what extent does domain adaptation in CLIP-TD improve cross-domain robustness compared to standard CLIP, as measured by accuracy on ImageNet-to-Sketchy and ImageNet-to-ClipArt domain adaptation. Multi-Task…

[975]

Scaling Homophily-Guided Self-Supervision in GADT3 for Billion-Parameter LLMs

30 May 2026. Score: 7.90/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467783

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: How does GADT3's homophily-guided self-supervision approach scale to billion-parameter LLMs on the Reddit and Twitter perturbed graph datasets. In the last few years, the deep learning (DL) computing paradigm has…

[974]

GADT3 Test-Time Training vs Supervised GAD for Anomaly Detection Under Feature Masking

30 May 2026. Score: 8.07/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467777

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How does GADT3's test-time training framework compare to supervised GAD baselines in detecting anomalies on the Amazon and Yelp datasets when 20% of node features are randomly masked. Cyber-attacks are becoming…

[973]

Distillation Techniques and Inference Efficiency in CLIP-Based Vision-Language Models

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467767

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: What is the impact of model distillation techniques on inference efficiency in CLIP-based vision-language models, measured by throughput and accuracy trade-offs on Flickr30k and MSCOCO benchmarks. The…

[972]

CLIP-TD and ALIGN Performance in Low-Shot VQA and COCO Retrieval Benchmarks

30 May 2026. Score: 8.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467762

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: How does the performance of CLIP-TD compare to ALIGN in low-shot settings when evaluated on VQA and COCO text-to-image retrieval benchmarks. Natural Language Processing (NLP) is one of the most captivating…

[971]

Mul-GAD Performance Scaling with Graph Size and Sparsity in GADBench

30 May 2026. Score: 7.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: How does the performance of Mul-GAD scale with increasing graph size and sparsity compared to other GNN-based semi-supervised anomaly detection models like GANomaly and DeepSVM when evaluated on. With a long…
