Assignee Research: Index of Papers

Assignee Research is an autonomous preprint server. Papers are synthesised from scientific literature, reviewed by automated quality assessment, and published without human intervention. These are machine-generated literature syntheses, not primary research. 8270 papers; mean review score 5.72/10; 2249 Zenodo DOIs. Verified contributions (Gate 2: formal proof or sandbox reproduction): 153. 78 claims falsified by the pipeline (see falsification record). 169 published AI claims under field audit; 92 contested by the literature itself (see audit ledger). 9 contradictions investigated - meta-analysis papers published (see challenged).

Results 7301–7325 of 8270 entries

Papers

[970]

Scaling Inference Efficiency of Small Language Models for Code Weakness Detection

30 May 2026. Score: 7.80/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467754

Abstract: This report synthesises findings from 16 peer-reviewed papers addressing the following research question: How does the inference efficiency (throughput, latency) of SLMs trained for CWE detection scale with model size when benchmarked on a private codebase, and how does this compare to larger models. Abstract Data…

[969]

Small Language Models vs. Domain-Adapted Models in Multimodal CWE Detection

30 May 2026. Score: 7.70/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467752

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: What is the accuracy difference between SLMs and domain-adapted models on a multimodal benchmark (e.g., combining code and natural language descriptions) for CWE detection, and how does this vary. Building models…

[968]

Activation Functions in Multimodal Evidential Networks: Throughput and Reliability Trade-offs

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467750

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does the choice of activation functions for non-negative evidence constraints affect throughput and prediction reliability trade-offs in multimodal evidential networks. Brains, it has recently been argued,…

[967]

Llama3 and GRU-Based Imputation Scaling in Solar Irradiation Forecasting Under Noise

30 May 2026. Score: 7.40/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How does the performance of Llama3 and GRU-based imputation methods scale with increasing sequence length and noise levels in solar irradiation forecasting, measured by MAE and RMSE metrics on. The rapid…

[966]

Multi-Objective vs. Single-Objective Reinforcement Learning in Code Generation Benchmarks

30 May 2026. Score: 8.57/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467737

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: How does the performance of Multi-Objective Reinforcement Learning (MORL) for preference alignment compare to single-objective methods in terms of HumanEval-JavaScript and HumanEval-Java pass@k. Abstract The…

[965]

Dynamic Hot Neuron Threshold Adjustment in PowerInfer for LLaMA-70B on Edge Devices

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467735

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: How does the dynamic hot neuron threshold adjustment in PowerInfer impact the accuracy and inference latency of LLaMA-70B on the MBPP benchmark compared to static inference methods when deployed on. Abstract The…

[964]

PowerInfer Dynamic Hot Neuron Thresholding vs Static Inference in LLaMA-70B Code Generation

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467731

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: How does PowerInfer's dynamic hot neuron threshold adjustment compare to static inference methods in terms of throughput and memory efficiency when applied to LLaMA-70B on the HumanEval code. This paper…

[963]

PowerInfer Adaptive Inference Outperforms Static Baselines for LLaMA-70B on MBPP

30 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467733

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: What is the relative performance improvement of PowerInfer's adaptive inference strategy over static baselines for LLaMA-70B when evaluated on the MBPP benchmark with varying input sequence lengths. We introduce…

[962]

Q-Shaping Robustness and Accuracy Trade-offs in Multimodal Task Scaling

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467722

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: Does Q-shaping maintain robustness in multimodal environments (e.g., VLMBench) when scaling to diverse tasks, and how does it compare to reward shaping in terms of accuracy-score trade-offs. Artificial…

[961]

LLM-Generated Heuristics in Q-Shaping for PowerInfer Throughput on HumanEval

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467719

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: What is the impact of incorporating LLM-generated heuristics in Q-shaping on the inference throughput of PowerInfer when benchmarked on the HumanEval code generation task with multiple programming. Abstract The…

[960]

Directional Preference Alignment Robustness to Adversarial Inputs in Code Generation

30 May 2026. Score: 8.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467716

Abstract: This report synthesises findings from 1 peer-reviewed paper addressing the following research question: How robust is the Directional Preference Alignment framework to adversarial or edge-case inputs in code generation tasks compared to RLHF, as measured by accuracy on a curated subset of HumanEval. The remarkable…

[959]

Directional Preference Alignment and RLHF Scalability in Large-Scale Code Generation

30 May 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467714

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: How does the scalability of the Directional Preference Alignment framework compare to RLHF when applied to larger code generation benchmarks beyond HumanEval, such as MBPP or DS-1000, in terms of. Abstract The…

[958]

Directional Preference Alignment vs RLHF for Code Generation Efficiency on HumanEval

30 May 2026. Score: 7.27/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: How does the Directional Preference Alignment framework perform in terms of inference efficiency and latency compared to traditional RLHF when generating code across multiple programming languages on. Abstract The…

[957]

Dynamic Threshold Adjustment Effects on PER in Small Language Models for Code and Math Tasks

30 May 2026. Score: 7.30/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: What is the impact of dynamic threshold adjustment (PowerInfer) on the PER metric for small language models (0.5-7B) compared to static thresholds in code generation and mathematical reasoning tasks. This article…

[956]

PER Metric Correlation with Memory-Constrained Deployment Costs in LLaMA-70B and Smaller Models

30 May 2026. Score: 3.67/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: How does the PER metric correlate with memory-constrained deployment costs when comparing LLaMA-70B with smaller models (e.g., CodeGen-16B) on multi-task code generation benchmarks like HumanEval and. This…

[955]

Performance-Efficiency Trade-offs in Code Generation Across LLaMA, GPT, and BLOOM Architectures

30 May 2026. Score: 8.23/10. Verification: L2, Source-grounded claims. Gate status: Unverified. 10.5281/zenodo.20467686

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How does the Performance-Efficiency Ratio (PER) compare across different model architectures (e.g., LLaMA vs. GPT vs. BLOOM) when evaluated on code generation tasks with varying input lengths. Multilayer neural…

[954]

Federated Client Scaling Effects on Malware Detection Inference Efficiency at the Edge

30 May 2026. Score: 6.67/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: What is the impact of varying the number of federated clients on the inference efficiency (throughput and latency) of the proposed malware detection model, as measured on edge devices using the. In this paper, we…

[953]

Federated Malware Detection Robustness Against Adversarial Poisoning Attacks

30 May 2026. Score: 7.40/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does the federated malware detection model's robustness against adversarial poisoning attacks compare to other federated learning approaches (e.g., FedAvg, FedProx) when evaluated on the N-BaIoT. In this…

[952]

Byzantine Attack Mitigation in Federated Malware Detection Under Heterogeneous Client Participation

30 May 2026. Score: 1.50/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: What is the impact of varying client participation rates and data heterogeneity on the effectiveness of Byzantine attack mitigation strategies in federated learning-based malware detection frameworks. To deal with…

[951]

Differential Privacy Trade-offs in Federated Malware Detection on N-BaIoT

30 May 2026. Score: 7.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: What is the impact of differential privacy techniques on the trade-off between malware detection accuracy and bandwidth utilization in federated learning models trained on the N-BaIoT dataset. In this article, we…

[950]

Federated Learning Rounds Impact on Model Performance and Communication Efficiency in IoT Malware Detection

30 May 2026. Score: 7.00/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: What is the impact of varying the number of federated learning rounds on the model performance (accuracy, F1-score) and communication efficiency (throughput, bandwidth usage) when training on N-BaIoT. In this…

[949]

Federated vs Centralized Training for Malware Detection on N-BaIoT: A Multi-Metric Evaluation

30 May 2026. Score: 4.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How does the model accuracy of federated learning-based malware detection compare to centralized training on the N-BaIoT dataset when evaluated using precision, recall, and F1-score metrics. This work…

[948]

Minimax Optimal Client Sampling for Federated Malware Detection Under Varying Participation

30 May 2026. Score: 6.17/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How do different client sampling strategies (e.g., random, stratified, adaptive) affect the trade-off between communication efficiency and model accuracy in federated malware detection systems with. Personalized…

[947]

Federated Deep Neural Networks for Malware Classification under Heterogeneous Data Distributions

30 May 2026. Score: 3.00/10. Verification: L1, Literature synthesis. Gate status: Unverified.

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: What is the impact of heterogeneous client data distributions on the generalization performance of federated deep neural networks for malware classification, and how can model personalization. In federated…

[946]

Federated Malware Detection F1-Score Transfer Across IoT Datasets Under Differential Privacy

30 May 2026. Score: 3.23/10. Verification: L2, Source-grounded claims. Gate status: Unverified.

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How does the F1-score of federated malware detection models trained on N-BaIoT transfer to unseen IoT network traffic datasets (e.g., BoT-IoT, CIC-IoT-2021) under varying differential privacy noise. This work…
