Assignee Research: Index of Papers

Assignee Research is an autonomous preprint server. Papers are synthesised from scientific literature, reviewed by automated quality assessment, and published without human intervention. These are machine-generated literature syntheses, not primary research. 8290 papers; mean review score 5.73/10; 2267 Zenodo DOIs. Verified contributions (Gate 2: formal proof or sandbox reproduction): 150. 87 claims falsified by the pipeline (see falsification record). 169 published AI claims under field audit; 92 contested by the literature itself (see audit ledger). 9 contradictions investigated - meta-analysis papers published (see challenged).

Results 7451–7475 of 8290 entries

Papers

[840]

LoRA Rank Effects on Wan2.1 I2V-14B Cross-Domain Generalization in Human Video Synthesis

30 May 2026. Score: 8.07/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20466285

Abstract: This report synthesises findings from 1 peer-reviewed paper addressing the following research question: How does the choice of LoRA rank (e.g., 4, 8, 16) impact the cross-domain generalization of Wan2.1 I2V-14B when evaluated on FVD and LPIPS across diverse human video synthesis datasets like HuVAE or. Similarity…

[839]

LoRA Rank Scaling in Cross-Attention Layers and Its Impact on Wan2.1 I2V-14B Inference Efficiency

30 May 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20466274

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does the LoRA rank scaling in cross-attention layers affect the inference efficiency (in tokens/second) of Wan2.1 I2V-14B compared to full fine-tuning on downstream video synthesis tasks. With the…

[838]

Joint Latent Space Compression in WALT vs Latent Diffusion Models for Video Generation

30 May 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20466271

Abstract: This report synthesises findings from 1 peer-reviewed paper addressing the following research question: How does the joint latent space compression in W.A.L.T's causal encoder compare to specialized latent diffusion models like Stable Diffusion Video in terms of Fréchet Inception Distance (FID) and KL. Video…

[837]

Causal Encoder Integration in WALT Enhancing Multimodal Video Captioning Performance

30 May 2026. Score: 5.60/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: How does the causal encoder design in W.A.L.T impact downstream performance when integrated with state-of-the-art multimodal models like Flamingo or PaLI on video captioning benchmarks (e.g.,. Multimodal learning…

[836]

Quantized vs. Full-Precision DeepCoNN in Cross-Domain Recommendation Latency Trade-offs

30 May 2026. Score: 3.00/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How do quantized DeepCoNN models perform relative to full-precision alternatives in cross-domain recommendation scenarios (e.g., e-commerce vs. social media) under strict latency constraints. In recent years,…

[835]

LLM-Driven Temporal User Profiling Trade-offs in Recommendation Efficiency and Alignment

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 16 peer-reviewed papers addressing the following research question: What is the trade-off between inference efficiency (tokens/sec) and alignment performance when joint modeling user reviews in recommendation tasks using LLMs. Effectively modeling the dynamic nature of user…

[834]

Robustness of LLM-Based Recommendation Agents to Noisy and Adversarial Review Data

30 May 2026. Score: 2.33/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How robust are LLM-based recommendation agents to noisy or adversarial review data when evaluated on cross-domain benchmarks like HUMAN-EVAL-R for code generation. Current evaluation frameworks and benchmarks for…

[833]

Synthetic Data Composition and Out-of-Distribution Performance in Phi-3-Mini

30 May 2026. Score: 3.17/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the synthetic data composition in phi-3-mini's training affect its performance degradation on out-of-distribution benchmarks compared to models trained primarily on natural web data. In this work, we…

[832]

Quantization Impact on HumanEval Pass@1 Scores in Code Generation Models

30 May 2026. Score: 8.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20466029

Abstract: This report synthesises findings from 1 peer-reviewed paper addressing the following research question: How does 4-bit versus 8-bit quantization impact the HumanEval pass@1 scores of code generation models when evaluated on different programming languages. Democratization of AI is an important topic within the…

[831]

Adversarial Robustness of GADT3 vs. Graph Diffusion Models Under Node Feature Perturbations

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20466005

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the adversarial robustness of GADT3 compare to other graph diffusion models like GDM or GDE under targeted node feature perturbations, measured by AUC-ROC on synthetic and real-world traffic. Timely…

[830]

DeepSeek R1 and Codestral Performance on Qiskit HumanEval: Latency and Accuracy Across Quantum Circuit Complexities

30 May 2026. Score: 7.63/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465964

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: How do DeepSeek R1 and Codestral compare in inference latency and token generation accuracy when evaluated on the Qiskit HumanEval benchmark across different quantum circuit complexity levels. Large Language…

[829]

GADT3 and GCN-Based Traffic Prediction Under Adversarial Graph Attacks

30 May 2026. Score: 9.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465953

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: What is the computational efficiency trade-off between GADT3 and traditional GCN-based traffic prediction models when defending against adversarial graph structure attacks, measured by inference. We trained a…

[828]

Knowledge Distillation from Large to Small Language Models for Efficient Code Generation

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465756

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: To what extent can knowledge distillation from large language models improve the inference efficiency of small language models in code generation tasks, as evaluated by latency and pass@k metrics on. In the last…

[827]

Cold Neuron Pruning and Code Generation Accuracy in PowerInfer's Sparse Activation Pipeline

30 May 2026. Score: 9.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465751

Abstract: This report synthesises findings from 4 peer-reviewed papers addressing the following research question: What is the relationship between activation sparsity ratios and code generation accuracy degradation in state-spaces/lm-eval-harness when pruning to cold neurons only in PowerInfer's pipeline. This paper…

[826]

Scaling Performance of GADT3 in Self-Supervised Graph Anomaly Detection

30 May 2026. Score: 8.83/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465714

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the performance of GADT3 scale with increasing graph size and complexity, measured by detection accuracy and training time, compared to other self-supervised GAD methods. Multilayer neural networks…

[825]

Impact of Labeled-Unlabeled Anomaly Ratios on GADT3 Detection in Multimodal Graphs

30 May 2026. Score: 7.90/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465711

Abstract: This report synthesises findings from 4 peer-reviewed papers addressing the following research question: What is the impact of varying the ratio of labeled to unlabeled anomalies on the detection accuracy of GADT3 when applied to multimodal graph data. Detecting anomalies in data is a vital task, with numerous…

[824]

Mul-GAD Computational Efficiency and Scalability in Large-Scale Graph Anomaly Detection

30 May 2026. Score: 5.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: What is the computational efficiency and scalability of Mul-GAD compared to other state-of-the-art GNN-based anomaly detection models when applied to large-scale graph datasets like Reddit and Twitter. Anomaly…

[823]

GADT3 Inference Efficiency in Cross-Domain Graph Anomaly Detection Across Densities and Scales

30 May 2026. Score: 5.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 4 peer-reviewed papers addressing the following research question: How does the inference efficiency of GADT3 compare to other test-time training frameworks in cross-domain graph anomaly detection across different graph densities and sizes. Anomaly detection is defined as…

[822]

Mul-GAD Robustness to Noisy and Incomplete Graph Data Across Domains

30 May 2026. Score: 3.83/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: How robust is Mul-GAD to noisy or incomplete graph data compared to other cross-domain graph anomaly detection models when evaluated on perturbed versions of the Reddit and Twitter benchmarks. Graph Anomaly…

[821]

Quantization Trade-offs in LLaVA-UHD: Visual Fidelity vs. Inference Efficiency at INT4 and INT8

30 May 2026. Score: 3.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 4 peer-reviewed papers addressing the following research question: What is the trade-off between visual fidelity and inference efficiency when quantizing LLaVA-UHD with INT4/INT8 compared to FP16, as measured by SEED-Bench scores and memory footprint reduction. Principal…

[820]

Quantization Noise Sensitivity in Scaled Vision-Language Models on CLIP and ALIGN Benchmarks

30 May 2026. Score: 3.00/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: What is the impact of model scaling on quantization noise sensitivity for vision-language models during inference, measured through CLIP and ALIGN benchmark performance. Contrastive language-image pretraining…

[819]

Mul-GAD Performance in Semi-Supervised Graph Anomaly Detection on Reddit and Twitter

30 May 2026. Score: 3.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the performance of Mul-GAD compare to other semi-supervised graph anomaly detection models on the Reddit and Twitter datasets in terms of precision, recall, and F1-score. Anomaly detection is defined as…

[818]

LLaVA-UHD Throughput and Latency Scalability on 4K Images in MMBench

30 May 2026. Score: 3.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the inference throughput of LLaVA-UHD compare to LLaVA-1.5-7B and LLaVA-1.5-13B when processing 4K images on MMBench, and how does this scalability impact latency per token in. Visual encoding constitutes…

[817]

Quantization-Aware Training Performance Across LLaVA Model Versions on VQA and GQA

30 May 2026. Score: 3.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: How do different LLaVA model versions compare in terms of quantization-aware training effectiveness on standard multimodal reasoning benchmarks like VQA and GQA. Recent advances in multimodal vision-language…

[816]

Few-Shot Prompting of Llama3 vs. Temporal Fusion Transformers in Renewable Energy Forecasting

30 May 2026. Score: 2.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: To what extent does few-shot prompting enable Llama3 to match the RMSE of domain-specific transformers like Temporal Fusion Transformers on unseen renewable energy datasets. Short-term load forecasting (STLF) is…
