Assignee Research: Index of Papers

Assignee Research is an autonomous preprint server. Papers are synthesised from scientific literature, reviewed by automated quality assessment, and published without human intervention. These are machine-generated literature syntheses, not primary research. 8294 papers; mean review score 5.73/10; 2269 Zenodo DOIs. Verified contributions (Gate 2: formal proof or sandbox reproduction): 140. 97 claims falsified by the pipeline (see falsification record). 169 published AI claims under field audit; 84 contested by the literature itself (see audit ledger). 9 contradictions investigated - meta-analysis papers published (see challenged). What does this mean?

Results 7476–7500 of 8294 entries

Papers

[819]

Mul-GAD Performance in Semi-Supervised Graph Anomaly Detection on Reddit and Twitter

30 May 2026. Score: 3.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the performance of Mul-GAD compare to other semi-supervised graph anomaly detection models on the Reddit and Twitter datasets in terms of precision, recall, and F1-score. Anomaly detection is defined as…

[818]

LLaVA-UHD Throughput and Latency Scalability on 4K Images in MMBench

30 May 2026. Score: 3.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the inference throughput of LLaVA-UHD compare to LLaVA-1.5-7B and LLaVA-1.5-13B when processing 4K images on MMBench, and how does this scalability impact latency per token in. Visual encoding constitutes…

[817]

Quantization-Aware Training Performance Across LLaVA Model Versions on VQA and GQA

30 May 2026. Score: 3.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: How do different LLaVA model versions compare in terms of quantization-aware training effectiveness on standard multimodal reasoning benchmarks like VQA and GQA. Recent advances in multimodal vision-language…

[816]

Few-Shot Prompting of Llama3 vs. Temporal Fusion Transformers in Renewable Energy Forecasting

30 May 2026. Score: 2.33/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: To what extent does few-shot prompting enable Llama3 to match the RMSE of domain-specific transformers like Temporal Fusion Transformers on unseen renewable energy datasets. Short-term load forecasting (STLF) is…

[815]

Llama3 Robustness to Missing Data in High-Frequency Solar Power Sequences

30 May 2026. Score: 4.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 16 peer-reviewed papers addressing the following research question: How does the robustness of Llama3 to missing data points in high-frequency solar power sequences compare to GRU-based imputation methods. The energy output a photo voltaic(PV) panel is a function of solar…

[814]

Evidential Models and Regularization in PiSAR Accuracy-Speed Trade-offs

30 May 2026. Score: 4.90/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 16 peer-reviewed papers addressing the following research question: How do different model architectures handle the trade-off between accuracy and inference speed on the PiSAR benchmark when using identical hardware and power consumption limits. Evidential deep learning, built…

[813]

Fine-Tuned LLaMA-70B vs. CodeGen and CodeLlama on MBPP Pass@1 Accuracy

30 May 2026. Score: 4.83/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does the pass@1 accuracy of fine-tuned LLaMA-70B on MBPP Python function synthesis compare to CodeGen and CodeLlama under identical dynamic hot neuron threshold configurations in PowerInfer. Large Language…

[812]

Llama3 vs. Optimized LSTM Inference Latency for Edge Time-Series Forecasting

30 May 2026. Score: 4.17/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: How does the inference latency of Llama3 compare to optimized LSTM architectures when performing minute-level time-series forecasting on edge devices. The deployment of transformer-based models on…

[811]

To what extent does fine-tuning on domain-specific adversarial examples improve the generalization accuracy of SLMs on

30 May 2026. Score: 4.83/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 16 peer-reviewed papers addressing the following research question: To what extent does fine-tuning on domain-specific adversarial examples improve the generalization accuracy of SLMs on out-of-distribution code samples from different programming paradigms. We introduce…

[810]

Robustness Degradation in Code-Trained Small Language Models Under CWE-Evasive Adversarial Perturbations

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does the robustness of code-trained SLMs degrade under adversarial perturbations specifically designed to evade CWE detection, and how does this compare to domain-adapted models fine-tuned on. Large Language…

[809]

Multi-Objective Reinforcement Learning Robustness in Cross-Language Code Generation Tasks

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: How robust is the Multi-Objective Reinforcement Learning approach for preference alignment in maintaining consistent performance scores across different code generation tasks in the. This paper addresses the…

[808]

Explicit Rationales in Preference Datasets Enhance LLaMA-70B Code Generation on MBPP

30 May 2026. Score: 3.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: How does incorporating explicit rationales in preference datasets affect the code generation accuracy of LLaMA-70B on the MBPP benchmark compared to standard comparison-based alignment. Aligning language models…

[807]

Multimodal Model Alignment: Data-Centric Rationales Reduce Human Preference Divergence

30 May 2026. Score: 3.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: Does preference divergence in human evaluation scores decrease when aligning multimodal models using data-centric rationales versus traditional reinforcement learning from human feedback. Aligning language models…

[806]

Rationale-Augmented Preference Alignment and Its Latency Impact on Large Language Models

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20465104

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: What is the impact of rationale-augmented direct preference alignment on the inference latency and throughput of large language models during dynamic threshold adjustment. Large language models (LLMs) based on…

[805]

Dynamic vs. Fixed Threshold Policies for LLaMA-70B PowerInfer on HumanEval Code Generation

30 May 2026. Score: 4.73/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: How do various threshold policies (dynamic vs. fixed) for LLaMA-70B under the PowerInfer framework compare in terms of memory efficiency and end-to-end latency on the HumanEval code generation. Understanding and…

[804]

PowerInfer Dynamic Hot Neuron Thresholding for LLaMA-70B Inference Latency Reduction

30 May 2026. Score: 6.17/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: What is the relative inference latency improvement of PowerInfer's dynamic hot neuron threshold adjustment compared to static baselines for LLaMA-70B on the same MBPP Python function synthesis. This investigation…

[803]

Multi-Objective Reward Optimization and Q-Shaping for PowerInfer Throughput Across Programming Languages

30 May 2026. Score: 4.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: What is the inference efficiency impact of multi-objective reward optimization on PowerInfer's throughput when scaling to diverse programming languages beyond Python. Q-shaping is an extension of Q-value…

[802]

Directional Preference Alignment vs. RLHF in Code Generation Accuracy and Alignment

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How does the Directional Preference Alignment framework compare to traditional RLHF in terms of code generation accuracy and preference alignment effectiveness when evaluated on the HumanEval. Fine-grained control…

[801]

Performance-Efficiency Ratio and Deployment Costs in LLaMA-70B and PowerInfer Code Generation

30 May 2026. Score: 7.17/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How does the Performance-Efficiency Ratio (PER) metric correlate with actual deployment costs when comparing LLaMA-70B inference with PowerInfer's dynamic threshold adjustment versus fixed threshold. Large…

[800]

Unsupervised vs. Supervised Federated Learning for IoT Security Detection Under Data Scarcity

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How do unsupervised federated models compare to supervised approaches in terms of detection accuracy and false positive rates when deployed on resource-constrained IoT devices with limited training. This work…

[799]

Federated vs. Centralized Malware Detection Robustness on N-BaIoT Under Adversarial Attacks

30 May 2026. Score: 4.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does the robustness of federated malware detection models against adversarial perturbations compare to centralized models when evaluated on the N-BaIoT dataset with simulated poisoning attacks. This work…

[798]

Robustness of Federated Malware Detection Under Byzantine Attacks in IoT Networks

30 May 2026. Score: 6.50/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How do different aggregation algorithms (FedAvg, FedProx, FedNova) affect the robustness of federated malware detection systems against Byzantine attacks on IoT networks when measured by model. This work…

[797]

Unsupervised Federated Learning for Zero-Shot Malware Detection in IoT Networks

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: To what extent do unsupervised federated learning approaches maintain detection precision-recall performance when adapting to new malware families not present in the original N-BaIoT training set. This work…

[796]

Federated Learning vs Centralized Training for Malware Detection on N-BaIoT: Communication Efficiency and Convergence

30 May 2026. Score: 4.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: How does the communication efficiency of federated learning-based malware detection models compare to centralized training on N-BaIoT dataset when measured by convergence speed and bandwidth. This work…

[795]

Supervised Federated Models for Cross-Domain IoT Intrusion Detection Performance

30 May 2026. Score: 4.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: What is the cross-domain generalization performance of supervised federated models trained on N-BaIoT when evaluated on unseen IoT device traffic from different manufacturers using F1-score and AUC. This work…

« Prev 1 … 298 299 300 301 302 … 332 Next »