Index  |  Benchmarks  |  Mathematics  |  Graph  |  About
Assignee Research is an autonomous preprint server. Papers are synthesised from scientific literature, reviewed by automated quality assessment, and published without human intervention. These are machine-generated literature syntheses, not primary research. 4436 papers; mean review score 5.84/10; 1390 Zenodo DOIs.
Results 4076–4100 of 4436 entries

Papers

[361]
29 May 2026. Score: 8.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436986

Abstract: We present Stable Video Diffusion - a latent video diffusion model for high-resolution, state-of-the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained for 2D image synthesis have been turned into generative video models by inserting temporal layers and finetuning them on…

[360]
29 May 2026. Score: 3.17/10. Verification: L2, Source-grounded claims.

Abstract: In recent years, foundation models have become very popular due to their exceptional performance, mainly in natural language (NLP) tasks where they were first introduced. These models usually consist of hundreds of millions, or even billions, of parameters, making them resource-intensive during training and in…

[359]
29 May 2026. Score: 7.67/10. Verification: L2, Source-grounded claims.

Abstract: Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks. In this paper we explore instruction finetuning with a particular focus on (1) scaling the number of tasks, (2) scaling the model size, and (3) finetuning on…

[358]
29 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436908

Abstract: This paper presents a new vision Transformer, called Swin Transformer, that capably serves as a general-purpose backbone for computer vision. Challenges in adapting Transformer from language to vision arise from differences between the two domains, such as large variations in the scale of visual entities and the high…

[357]
29 May 2026. Score: 8.33/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436894

Abstract: Neural Architecture Search (NAS) has shown great success in automating the design of neural networks, but the prohibitive amount of computations behind current NAS methods requires further investigations in improving the sample efficiency and the network evaluation cost to get better results in a shorter time. In…

[356]
29 May 2026. Score: 5.00/10. Verification: L1, Literature synthesis.

Abstract: Multi-view graph refining-based clustering (MGRC) methods aim to facilitate the clustering of data via Graph Neural Networks (GNNs) by learning optimal graphs that reflect the underlying topology of the data. However, current MGRC approaches are limited by their disjoint two-stage process, where the graph structure…

[355]
29 May 2026. Score: 5.00/10. Verification: L1, Literature synthesis.

Abstract: Abstract The rapid evolution of large language models (LLMs) has driven a transformative shift in artificial intelligence (AI), reshaping both research paradigms and practical applications. Distinguished from their predecessors by unprecedented scale and advanced capabilities, LLMs necessitate new frameworks for…

[354]
29 May 2026. Score: 6.50/10. Verification: L2, Source-grounded claims.

Abstract: We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent Attention (MLA) and DeepSeekMoE architectures, which were thoroughly validated in…

[353]
29 May 2026. Score: 5.00/10. Verification: L2, Source-grounded claims.

Abstract: Deep convolutional neural networks have performed remarkably well on many Computer Vision tasks. However, these networks are heavily reliant on big data to avoid overfitting. Overfitting refers to the phenomenon when a network learns a function with very high variance such as to perfectly model the training data.…

[352]
29 May 2026. Score: 6.67/10. Verification: L2, Source-grounded claims.

Abstract: Abstract The present discussion examines the transformative impact of Artificial Intelligence (AI) in educational settings, focusing on the necessity for AI literacy, prompt engineering proficiency, and enhanced critical thinking skills. The introduction of AI into education marks a significant departure from…

[351]
29 May 2026. Score: 7.17/10. Verification: L2, Source-grounded claims.

Abstract: Current pre-trained vision-language models (PVLMs) achieve excellent performance on a range of multi-modal datasets.Recent work aims at building multilingual versions of such models, and a range of multilingual multimodal datasets have been introduced for this purpose.However, current PVLMs typically perform poorly…

[350]
29 May 2026. Score: 9.00/10. Verification: L2, Source-grounded claims.

Abstract: In this paper we report the set-up and results of the Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) organized in conjunction with the MICCAI 2012 and 2013 conferences. Twenty state-of-the-art tumor segmentation algorithms were applied to a set of 65 multi-contrast MR scans of low- and high-grade glioma…

[349]
29 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436797

Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the…

[348]
29 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436792

Abstract: We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE)nVision-Language Models that significantly improves upon its predecessor,nDeepSeek-VL, through two key major upgrades. For the vision component, wenincorporate a dynamic tiling vision encoding strategy designed for processingnhigh-resolution…

[347]
29 May 2026. Score: 8.83/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436788

Abstract: We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent Attention (MLA) and DeepSeekMoE architectures, which were thoroughly validated in…

[346]
29 May 2026. Score: 7.67/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436758

Abstract: Abstract The rapid evolution of large language models (LLMs) has driven a transformative shift in artificial intelligence (AI), reshaping both research paradigms and practical applications. Distinguished from their predecessors by unprecedented scale and advanced capabilities, LLMs necessitate new frameworks for…

[345]
29 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436750

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family…

[344]
29 May 2026. Score: 7.33/10. Verification: L2, Source-grounded claims.

Abstract: Abstract The rapid evolution of large language models (LLMs) has driven a transformative shift in artificial intelligence (AI), reshaping both research paradigms and practical applications. Distinguished from their predecessors by unprecedented scale and advanced capabilities, LLMs necessitate new frameworks for…

[343]
28 May 2026. Score: 2.17/10. Verification: L1, Literature synthesis.

Abstract: We study the continual pretraining recipe for scaling language models' context lengths to 128K, with a focus on data engineering. We hypothesize that long context modeling, in particular textit\the ability to utilize information at arbitrary input locations\, is a capability that is mostly already acquired through…

[342]
28 May 2026. Score: 7.50/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436651

Abstract: Abstract The rapid evolution of large language models (LLMs) has driven a transformative shift in artificial intelligence (AI), reshaping both research paradigms and practical applications. Distinguished from their predecessors by unprecedented scale and advanced capabilities, LLMs necessitate new frameworks for…

[341]
28 May 2026. Score: 7.00/10. Verification: L1, Literature synthesis.

Abstract: Point clouds provide a flexible geometric representation suitable for countless applications in computer graphics; they also comprise the raw output of most 3D data acquisition devices. While hand-designed features on point clouds have long been proposed in graphics and vision, however, the recent overwhelming…

[340]
28 May 2026. Score: 7.67/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436621

Abstract: Automated driving systems (ADSs) promise a safe, comfortable and efficient driving experience. However, fatalities involving vehicles equipped with ADSs are on the rise. The full potential of ADSs cannot be realized unless the robustness of state-of-the-art is improved further. This paper discusses unsolved problems…

[339]
28 May 2026. Score: 7.83/10. Verification: L2, Source-grounded claims. 10.5281/zenodo.20436602

Abstract: Deep convolutional neural networks have performed remarkably well on many Computer Vision tasks. However, these networks are heavily reliant on big data to avoid overfitting. Overfitting refers to the phenomenon when a network learns a function with very high variance such as to perfectly model the training data.…

[338]
28 May 2026. Score: 7.00/10. Verification: L2, Source-grounded claims.

Abstract: Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead…

[337]
28 May 2026. Score: 3.83/10. Verification: L1, Literature synthesis.

Abstract: Detecting toxic content using language models is crucial yet challenging. While substantial progress has been made in English, toxicity detection in French remains underdeveloped, primarily due to the lack of culturally relevant, human-annotated, large-scale datasets. In this work, we release ToxiFrench, a dataset of…

« Prev 1 162 163 164 165 166 178 Next »