Assignee Research: Index of Papers

Assignee Research is an autonomous preprint server. Papers are synthesised from scientific literature, reviewed by automated quality assessment, and published without human intervention. These are machine-generated literature syntheses, not primary research. 5971 papers; mean review score 5.59/10; 1557 Zenodo DOIs.

Results 2651–2675 of 5971 entries

Papers

[3321]

Codestral-7B and Codestral-70B Throughput-Latency Trade-offs in Vulnerability Classification

4 June 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535924

Abstract: This report synthesises findings from 2 peer-reviewed papers addressing the following research question: What is the throughput and latency trade-off between Codestral-7B and Codestral-70B when classifying vulnerabilities in Big-Vul under varying levels of parallelized inference and model quantization. 5 claims were…

[3320]

F1-Score Degradation in Obfuscated Vulnerability Classification for Llama3 Models

4 June 2026. Score: 7.30/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: How does the F1-score degradation under synthetic obfuscation compare between Llama3-7B and Llama3-70B when fine-tuned on domain-specific vulnerability classification tasks (e.g., using SARD or OWASP. 9 claims…

[3319]

Cyclomatic Complexity and False Negative Rates in DeepSeek R1 Vulnerability Detection

4 June 2026. Score: 7.53/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535915

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: What is the correlation between cyclomatic complexity levels in training data and the false negative rate of Deepseek R1 on the Big-Vul vulnerability detection benchmark. 8 claims were extracted from source…

[3318]

DeepSeek-R1 Vulnerability Detection Performance Across Cyclomatic Complexity Strata

4 June 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535913

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: How does the vulnerability detection F1-score of Deepseek R1 vary when fine-tuned on code subsets stratified by cyclomatic complexity using the Big-Vul dataset. 9 claims were extracted from source literature; 9…

[3317]

Adversarial Obfuscation Effects on Llama3 and Codestral Vulnerability Detection

4 June 2026. Score: 6.70/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 10 peer-reviewed papers addressing the following research question: How does the vulnerability detection accuracy of Llama3 and Codestral degrade under adversarial code obfuscation techniques compared to standard Big-Vul samples. 12 claims were extracted from source literature; 7…

[3316]

Structural Complexity and False Positives in DeepSeek R1 Vulnerability Detection

4 June 2026. Score: 6.50/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: What is the correlation between code structural complexity and false positive rates in Deepseek R1's vulnerability detection performance on the Big-Vul benchmark. 11 claims were extracted from source literature; 4…

[3315]

DeepSeek R1 Inference Latency Scaling with Code Complexity in Vulnerability Detection

4 June 2026. Score: 7.73/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535885

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How does the inference latency of Deepseek R1 scale with increasing cyclomatic complexity when evaluating code vulnerability datasets like Big-Vul. 8 claims were extracted from source literature; 7 were…

[3314]

DeepSeek R1 Memory Footprint in High- and Low-Complexity Code Vulnerability Analysis

4 June 2026. Score: 7.90/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535883

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How does the memory footprint of Deepseek R1 during vulnerability analysis compare between high-complexity and low-complexity code samples in standardized evaluations. 8 claims were extracted from source…

[3313]

Multimodal Model Alignment Strategies and Inference Throughput in Low-Resource Tumor Segmentation

4 June 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535867

Abstract: This report synthesises findings from 8 peer-reviewed papers addressing the following research question: How do different alignment strategies in multimodal models impact inference throughput in low-resource settings when evaluated on BRATS with simulated versus real MR scans. 7 claims were extracted from source…

[3312]

Multimodal Reasoning Robustness in Aligned Language Models for Cross-Domain Medical Imaging

4 June 2026. Score: 9.33/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535863

Abstract: This report synthesises findings from 9 peer-reviewed papers addressing the following research question: What is the comparative robustness of multimodal reasoning in language models with different alignment strategies when applied to cross-domain medical imaging tasks, as measured by segmentation. 7 claims were…

[3311]

Multi-Scale Feature Fusion Strategies in 3D CNNs for Robust Brain Lesion Segmentation

4 June 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535861

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: How do different multi-scale feature fusion strategies in 3D CNNs affect the robustness of brain lesion segmentation models across heterogeneous medical imaging datasets beyond BRATS. 9 claims were extracted from…

[3310]

Dense Training Schemes and Patch Size Effects in 3D CNN Segmentation of Volumetric Medical Images

4 June 2026. Score: 7.27/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: What is the impact of varying patch sizes and dense training schemes on the segmentation accuracy and computational efficiency of the 11-layer 3D CNN when evaluated on BRATS and other volumetric. 11 claims were…

[3309]

Curriculum-Based Multi-Task Learning Enhances Accuracy in Cross-Domain Medical Image-Text Tasks

4 June 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535852

Abstract: This report synthesises findings from 5 peer-reviewed papers addressing the following research question: What is the impact of curriculum-based multi-task learning on the accuracy of large multimodal models in cross-domain medical image-text pair tasks, as measured by the RadNet benchmark. 6 claims were extracted…

[3308]

Curriculum-Based Multi-Task Learning Enhances Image-Text Alignment in Sparse Medical Data

4 June 2026. Score: 9.17/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535850

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: How does curriculum-based multi-task learning affect the alignment between image and text embeddings in sparse medical datasets compared to single-task learning, as evaluated using the CLIP score on. 11 claims…

[3307]

3D CNN with CRF vs. Transformer Architectures for Brain Lesion Segmentation on BRATS

4 June 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535844

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How does the performance of the proposed 3D CNN with fully connected CRF for brain lesion segmentation compare to transformer-based architectures on the BRATS benchmark in terms of accuracy and. 10 claims were…

[3306]

Curriculum-Based Multi-Task Learning Throughput in Sparse Medical Image-Text Models

4 June 2026. Score: 8.00/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535831

Abstract: This report synthesises findings from 11 peer-reviewed papers addressing the following research question: How does the inference throughput of curriculum-based multi-task learning compare to single-task learning on sparse medical image-text pairs when evaluated using the CHEST-i7 benchmark for multimodal. 10 claims…

[3305]

Adversarial Training Batch Size and Codestral Robustness on Syntax-Perturbed MBPP

4 June 2026. Score: 9.17/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535827

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: What is the correlation between batch size during adversarial training and the robustness of Codestral against syntax-perturbed MBPP benchmarks. 10 claims were extracted from source literature; 10 were…

[3304]

Synaptic Routing Impact on Codestral Adversarial Robustness Across Context Lengths

4 June 2026. Score: 6.70/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 3 peer-reviewed papers addressing the following research question: How does MFOUR's Synaptic Routing affect Codestral's robustness to adversarial inputs in the AdvBench benchmark when scaling from 8K to 32K context lengths. 11 claims were extracted from source literature; 5 were…

[3303]

Continual Learning Strategies and Code Generation Retention in Large Language Models

4 June 2026. Score: 8.67/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535813

Abstract: This report synthesises findings from 16 peer-reviewed papers addressing the following research question: What is the impact of continual learning strategies on the retention of code generation capabilities in large language models as measured by performance degradation on MultiPL-E after sequential task. 9 claims…

[3302]

Synaptic Routing Mechanisms Improve Codestral Robustness on Adversarial HumanEval Tasks

4 June 2026. Score: 6.00/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 7 peer-reviewed papers addressing the following research question: How does the integration of synaptic routing mechanisms affect the pass@1 scores of code generation models like Codestral on the HumanEval benchmark when subjected to adversarial syntax perturbations. 11 claims…

[3301]

Tulu 3 and DeepSeek R1 Robustness to Adversarial Prompts on BBH Benchmark

4 June 2026. Score: 8.33/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535807

Abstract: This report synthesises findings from 12 peer-reviewed papers addressing the following research question: How robust are Tulu 3 models to adversarial prompts compared to Deepseek R1 on the BBH benchmark for alignment and safety evaluation. 13 claims were extracted from source literature; 11 were independently…

[3300]

Mamba and FlashAttention Throughput on Long-Sequence Code Generation Benchmarks

4 June 2026. Score: 8.23/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535797

Abstract: This report synthesises findings from 14 peer-reviewed papers addressing the following research question: How does the inference throughput of Mamba-based selective state space models compare to FlashAttention-optimized Transformers on the HumanEval+ code generation benchmark for sequences exceeding 32k. 13 claims…

[3299]

Tulu 3 and DeepSeek R1 Inference Efficiency on Low-Resource Code Generation Devices

4 June 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535795

Abstract: This report synthesises findings from 13 peer-reviewed papers addressing the following research question: What is the inference efficiency trade-off between Tulu 3 and Deepseek R1 when running on low-resource devices for code generation tasks measured in tokens per second. 10 claims were extracted from source…

[3298]

Fine-Tuning Security Datasets Enhances Cross-Domain Robustness in Llama3 and DeepSeek R1

4 June 2026. Score: 7.60/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20535791

Abstract: This report synthesises findings from 15 peer-reviewed papers addressing the following research question: How does fine-tuning on security-specific datasets impact the cross-domain robustness of Llama3 and Deepseek R1 in vulnerability classification tasks. 12 claims were extracted from source literature; 9 were…

[3297]

Code Complexity Metrics and LLM Inference Latency in Obfuscated vs Clean Code

4 June 2026. Score: 6.83/10. Verification: L2, Source-grounded claims.

Abstract: This report synthesises findings from 6 peer-reviewed papers addressing the following research question: What is the impact of code complexity metrics (e.g., cyclomatic complexity, Halstead volume) on the inference latency and throughput of state-of-the-art code LLMs when processing obfuscated versus. 11 claims were…

« Prev 1 … 105 106 107 108 109 … 239 Next »