AI Research News Feeds for October 20th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

CuSfM: CUDA-Accelerated Structure-from-Motion
QCFace: Image Quality Control for boosting Face Representation & Recognition
Proto-Former: Unified Facial Landmark Detection by Prototype Transformer
SHARE: Scene-Human Aligned Reconstruction
Adaptive transfer learning for surgical tool presence detection in laparoscopic videos through gradual freezing fine-tuning
FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers
PFGS: Pose-Fused 3D Gaussian Splatting for Complete Multi-Pose Object Reconstruction
Rethinking Convergence in Deep Learning: The Predictive-Corrective Paradigm for Anatomy-Informed Brain MRI Segmentation
MAVR-Net: Robust Multi-View Learning for MAV Action Recognition with Cross-View Attention
DPTrack:Directional Kernel-Guided Prompt Learning for Robust Nighttime Aerial Tracking
Improving Micro-Expression Recognition with Phase-Aware Temporal Augmentation
MRASfM: Multi-Camera Reconstruction and Aggregation through Structure-from-Motion in Driving Scenes
MSAM: Multi-Semantic Adaptive Mining for Cross-Modal Drone Video-Text Retrieval
A Novel Combined Optical Flow Approach for Comprehensive Micro-Expression Recognition
Iterative Motion Compensation for Canonical 3D Reconstruction from UAV Plant Images Captured in Windy Conditions
Rethinking Efficient Hierarchical Mixing Architecture for Low-light RAW Image Enhancement
Exploring Conditions for Diffusion models in Robotic Control
Balanced Multi-Task Attention for Satellite Image Classification: A Systematic Approach to Achieving 97.23% Accuracy on EuroSAT Without Pre-Training
Diffusion Bridge Networks Simulate Clinical-grade PET from MRI for Dementia Diagnostics
Imaginarium: Vision-guided High-Quality 3D Scene Layout Generation
Unmasking Facial DeepFakes: A Robust Multiview Detection Framework for Natural Images
Standardization for improved Spatio-Temporal Image Fusion
FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection
Lightweight Data-Free Denoising for Detail-Preserving Biomedical Image Restoration
Deep Learning Based Domain Adaptation Methods in Remote Sensing: A Comprehensive Survey
Uncertainty-Aware Extreme Point Tracing for Weakly Supervised Ultrasound Image Segmentation
Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
SEGA: A Stepwise Evolution Paradigm for Content-Aware Layout Generation with Design Prior
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection
ERNet: Efficient Non-Rigid Registration Network for Point Sequences
VISTA: A Test-Time Self-Improving Video Generation Agent
Neuro-Symbolic Spatial Reasoning in Segmentation
3DPR: Single Image 3D Portrait Relight using Generative Priors
Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-Prompt
BLIP3o-NEXT: Next Frontier of Native Image Generation
BiomedXPro: Prompt Optimization for Explainable Diagnosis with Biomedical Vision Language Models
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Neural Posterior Estimation for Cataloging Astronomical Images from the Legacy Survey of Space and Time
Confidence-Weighted Semi-Supervised Learning for Skin Lesion Segmentation Using Hybrid CNN-Transformer Networks
Fix False Transparency by Noise Guided Splatting
SANR: Scene-Aware Neural Representation for Light Field Image Compression with Rate-Distortion Optimization
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Diffusion Models are Efficient Data Generators for Human Mesh Recovery
CLOVER: Context-aware Long-term Object Viewpoint- and Environment- Invariant Representation Learning
Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
Conformal Risk Control for Pulmonary Nodule Detection
Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification
YOLOE: Real-Time Seeing Anything
L2RSI: Cross-view LiDAR-based Place Recognition for Large-scale Urban Scenes via Remote Sensing Imagery
UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection
A Plug-and-Play Learning-based IMU Bias Factor for Robust Visual-Inertial Odometry
Bolt3D: Generating 3D Scenes in Seconds
CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image
X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation
CADE 2.5 - ZeResFDG: Frequency-Decoupled, Rescaled and Zero-Projected Guidance for SD/SDXL Latent Diffusion Models
Scope: Selective Cross-modal Orchestration of Visual Perception Experts
CoDS: Enhancing Collaborative Perception in Heterogeneous Scenarios via Domain Separation
LOPR: Latent Occupancy PRediction using Generative Models
TAS: A Transit-Aware Strategy for Embodied Navigation with Non-Stationary Targets
Universal Vessel Segmentation for Multi-Modality Retinal Images
MLFM: Multi-Layered Feature Maps for Richer Language Understanding in Zero-Shot Semantic Navigation
A Generalizable Rhetorical Strategy Annotation Model Using LLM-based Debate Simulation and Labelling
Measuring the Effect of Disfluency in Multilingual Knowledge Probing Benchmarks
Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding
Automatic essay scoring: leveraging Jaccard coefficient and Cosine similaritywith n-gram variation in vector space model approach
Accelerating Mobile Language Model Generation via Hybrid Context and Hardware Coordination
Capabilities and Evaluation Biases of Large Language Models in Classical Chinese Poetry Generation: A Case Study on Tang Poetry
AutoGraph-R1: End-to-End Reinforcement Learning for Knowledge Graph Construction
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing
VocalBench-DF: A Benchmark for Evaluating Speech LLM Robustness to Disfluency
Large-scale User Game Lifecycle Representation Learning
When Seeing Is not Enough: Revealing the Limits of Active Reasoning in MLLMs
Controllable Abstraction in Summary Generation for Large Language Models via Prompt Engineering
CORE: Reducing UI Exposure in Mobile Agents via Collaboration Between Cloud and Local LLMs
Temporal Referential Consistency: Do LLMs Favor Sequences Over Absolute Time References?
From Characters to Tokens: Dynamic Grouping with Hierarchical BPE
Latent Reasoning in LLMs as a Vocabulary-Space Superposition
Finetuning LLMs for EvaCun 2025 token prediction shared task
From Ghazals to Sonnets: Decoding the Polysemous Expressions of Love Across Languages
BiMax: Bidirectional MaxSim Score for Document-Level Alignment
The Elephant in the Coreference Room: Resolving Coreference in Full-Length French Fiction Works
HypoSpace: Evaluating LLM Creativity as Set-Valued Hypothesis Generators under Underdetermination
Leveraging LLMs for Context-Aware Implicit Textual and Multimodal Hate Speech Detection
Cost-Aware Retrieval-Augmentation Reasoning Models with Adaptive Retrieval Depth
Emergence of Linear Truth Encodings in Language Models
Paper2Web: Let's Make Your Paper Alive!
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI
Train a Unified Multimodal Data Quality Classifier with Synthetic Data
MAGPIE: A benchmark for Multi-AGent contextual PrIvacy Evaluation
Leveraging Test Driven Development with Large Language Models for Reliable and Verifiable Spreadsheet Code Generation: A Research Framework
SQuAI: Scientific Question-Answering with Multi-Agent Retrieval-Augmented Generation
GraphMind: Interactive Novelty Assessment System for Accelerating Scientific Discovery
Evaluating Large Language Models with Psychometrics
Cross-layer Attention Sharing for Pre-trained Large Language Models
To Err Is Human; To Annotate, SILICON? Reducing Measurement Error in LLM Annotation
Event Segmentation Applications in Large Language Model Enabled Automated Recall Assessments
Towards Human Cognition: Visual Context Guides Syntactic Priming in Fusion-Encoded Models
Generating patient cohorts from electronic health records using two-step retrieval-augmented text-to-SQL generation
Summarizing Speech: A Comprehensive Survey
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs
RAGRouter: Learning to Route Queries to Multiple Retrieval-Augmented Language Models
PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
MemeSense: An Adaptive In-Context Framework for Social Commonsense Driven Meme Moderation
Toward Safe and Human-Aligned Game Conversational Recommendation via Multi-Agent Decomposition
NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks
Constantly Improving Image Models Need Constantly Improving Benchmarks
LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning
Generalized Dynamics Generation towards Scannable Physical World Model
Directional Reasoning Injection for Fine-Tuning MLLMs
A solution to generalized learning from small training sets found in everyday infant experiences
SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images
TGT: Text-Grounded Trajectories for Locally Controlled Video Generation
Fourier Transform Multiple Instance Learning for Whole Slide Image Classification
Hyperparameter Optimization and Reproducibility in Deep Learning Model Training
Salient Concept-Aware Generative Data Augmentation
CARDIUM: Congenital Anomaly Recognition with Diagnostic Images and Unified Medical records
The Face of Persuasion: Analyzing Bias and Generating Culture-Aware Ads
DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion
RadioDiff-$k^2$: Helmholtz Equation Informed Generative Diffusion Model for Multi-Path Aware Radio Map Construction
Euclidean Distance Matrix Completion via Asymmetric Projected Gradient Descent
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production
msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML
Conditional Generative Modeling for Enhanced Credit Risk Management in Supply Chain Finance
Understanding Generalization in Node and Link Prediction
A Weakly Supervised Transformer for Rare Disease Diagnosis and Subphenotyping from EHRs with Pulmonary Case Studies
A Cycle-Consistency Constrained Framework for Dynamic Solution Space Reduction in Noninjective Regression
LeMat-Traj: A Scalable and Unified Dataset of Materials Trajectories for Atomistic Modeling
Learning Unified Representations from Heterogeneous Data for Robust Heart Rate Modeling
Traces Propagation: Memory-Efficient and Scalable Forward-Only Learning in Spiking Neural Networks
Machine Learning-Based Ultrasonic Weld Characterization Using Hierarchical Wave Modeling and Diffusion-Driven Distribution Alignment
When In Doubt, Abstain: The Impact of Abstention on Strategic Classification
How Sparse Can We Prune A Deep Network: A Fundamental Limit Perspective
Which exceptional low-dimensional projections of a Gaussian point cloud can be found in polynomial time?
Spatial Supply Repositioning with Censored Demand Data
End-to-End Learning Framework for Solving Non-Markovian Optimal Control
DeepRV: Accelerating spatiotemporal inference with pre-trained neural priors
Landmark-Based Node Representations for Shortest Path Distance Approximations in Random Graphs
SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow
Low-Rank Adaptation of Neural Fields
SYMI: Efficient Mixture-of-Experts Training via Model and Optimizer State Decoupling
Improving Inference-Time Optimisation for Vocal Effects Style Transfer with a Gaussian Prior
Explainable Machine Learning for Oxygen Diffusion in Perovskites and Pyrochlores
Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference
VLMLight: Safety-Critical Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning Architecture
Onboard Mission Replanning for Adaptive Cooperative Multi-Robot Systems
Implicit neural representations for accurate estimation of the standard model of white matter
Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox
Operationalizing Automated Essay Scoring: A Human-Aware Approach
Mind the Gap: Navigating Inference with Optimal Transport Maps
Meta-learning of Gibbs states for many-body Hamiltonians with applications to Quantum Boltzmann Machines
Migration as a Probe: A Generalizable Benchmark Framework for Specialist vs. Generalist Machine-Learned Force Fields
KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection
UNet with Self-Adaptive Mamba-Like Attention and Causal-Resonance Learning for Medical Image Segmentation
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech
EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification
Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems
Hyperbolic Dataset Distillation
FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning
When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective
Refer to Any Segmentation Mask Group With Vision-Language Prompts
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making
General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting
Clarifying the Ti-V Phase Diagram Using First-Principles Calculations and Bayesian Learning
Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound
Extending Load Forecasting from Zonal Aggregates to Individual Nodes for Transmission System Operators
ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm
AlignFlow: Improving Flow-based Generative Models with Semi-Discrete Optimal Transport
IQNN-CS: Interpretable Quantum Neural Network for Credit Scoring
Internalizing World Models via Self-Play Finetuning for Agentic RL
Learn to Change the World: Multi-level Reinforcement Learning with Model-Changing Actions
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models
Physics-informed data-driven machine health monitoring for two-photon lithography
Online Correlation Clustering: Simultaneously Optimizing All $\ell_p$-norms
Navigating the consequences of mechanical ventilation in clinical intensive care settings through an evolutionary game-theoretic framework
A Simple Method for PMF Estimation on Large Supports
Predicting the Unpredictable: Reproducible BiLSTM Forecasting of Incident Counts in the Global Terrorism Database (GTD)
Policy Transfer Ensures Fast Learning for Continuous-Time LQR with Entropy Regularization
A simple mean field model of feature learning
Finding geodesics with the Deep Ritz method
An Advanced Two-Stage Model with High Sensitivity and Generalizability for Prediction of Hip Fracture Risk Using Multiple Datasets
Dissecting Mahalanobis: How Feature Geometry and Normalization Shape OOD Detection
Soundness-Aware Level: A Microscopic Signature that Predicts LLM Reasoning Potential
Reflections from Research Roundtables at the Conference on Health, Inference, and Learning (CHIL) 2025
Machine Learning for Early Detection of Meningitis: Stacked Ensemble Learning with EHR data
Integrating Product Coefficients for Improved 3D LiDAR Data Classification (Part II)
Stress-Aware Learning under KL Drift via Trust-Decayed Mirror Descent
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain
Dual-Weighted Reinforcement Learning for Generative Preference Modeling
Spatiotemporal Transformers for Predicting Avian Disease Risk from Migration Trajectories
Causal Time Series Modeling of Supraglacial Lake Evolution in Greenland under Distribution Shift
Semi-Supervised Regression with Heteroscedastic Pseudo-Labels
Small Ensemble-based Data Assimilation: A Machine Learning-Enhanced Data Assimilation Method with Limited Ensemble Size
DFCA: Decentralized Federated Clustering Algorithm
On the Generalization Properties of Learning the Random Feature Models with Learnable Activation Functions
Backdoor or Manipulation? Graph Mixture of Experts Can Defend Against Various Graph Adversarial Attacks
Sequence Modeling with Spectral Mean Flows
Iterative Refinement of Flow Policies in Probability Space for Online Reinforcement Learning
Geometric Mixture Models for Electrolyte Conductivity Prediction
Online Kernel Dynamic Mode Decomposition for Streaming Time Series Forecasting with Adaptive Windowing
ParaFormer: Shallow Parallel Transformers with Progressive Approximation
Safe, Efficient, and Robust Reinforcement Learning for Ranking and Diffusion Models
Particle Dynamics for Latent-Variable Energy-Based Models
Adversary-Free Counterfactual Prediction via Information-Regularized Representations
Theoretical Refinement of CLIP by Utilizing Linear Structure of Optimal Similarity
Compressive Modeling and Visualization of Multivariate Scientific Data using Implicit Neural Representation
An Empirical Study on MC Dropout--Based Uncertainty--Error Correlation in 2D Brain Tumor Segmentation
Doubly Robust Estimation of Causal Effects in Strategic Equilibrium Systems
On the Neural Feature Ansatz for Deep Neural Networks
Attn-JGNN: Attention Enhanced Join-Graph Neural Networks
GRATING: Low-Latency and Memory-Efficient Semantic Selection on Device
Decentralized Parameter-Free Online Learning
Deep Neural ODE Operator Networks for PDEs
Fast and Compact Tsetlin Machine Inference on CPUs Using Instruction-Level Optimization
WARP-LUTs - Walsh-Assisted Relaxation for Probabilistic Look Up Tables
Constrained Adversarial Perturbation
A Comprehensive Evaluation of Graph Neural Networks and Physics Informed Learning for Surrogate Modelling of Finite Element Analysis
SAMix: Calibrated and Accurate Continual Learning via Sphere-Adaptive Mixup and Neural Collapse
Poultry Farm Intelligence: An Integrated Multi-Sensor AI Platform for Enhanced Welfare and Productivity
Cavity Duplexer Tuning with 1d Resnet-like Neural Networks
FIDDLE: Reinforcement Learning for Quantum Fidelity Enhancement
Transfer Orthology Networks
Learning Correlated Reward Models: Statistical Barriers and Opportunities
FIRE: Fact-checking with Iterative Retrieval and Verification
Estimand framework and intercurrent events handling for clinical trials with time-to-event outcomes
Reliable data clustering with Bayesian community detection
The Tree-SNE Tree Exists
Composition-Grounded Instruction Synthesis for Visual Reasoning
Comprehensive language-image pre-training for 3D medical image understanding
The Minimax Lower Bound of Kernel Stein Discrepancy Estimation
PoTS: Proof-of-Training-Steps for Backdoor Detection in Large Language Models
Polarization based direction of arrival estimation using a radio interferometric array
Deep generative priors for 3D brain analysis
Beyond PCA: Manifold Dimension Estimation via Local Graph Structure
OCR-APT: Reconstructing APT Stories from Audit Logs using Subgraph Anomaly Detection and LLMs
HyperAIRI: a plug-and-play algorithm for precise hyperspectral image reconstruction in radio interferometry
How to Sell High-Dimensional Data Optimally
HOB: A Holistically Optimized Bidding Strategy under Heterogeneous Auction Mechanisms with Organic Traffic
Minimisation of Submodular Functions Using Gaussian Zeroth-Order Random Oracles
Foresighted Online Policy Optimization with Interference
Hyperbolic Structured Classification for Robust Single Positive Multi-label Learning
Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation
Transfer Learning for Benign Overfitting in High-Dimensional Linear Regression
Singularity-free dynamical invariants-based quantum control
RankSEG-RMA: An Efficient Segmentation Algorithm via Reciprocal Moment Approximation
TranSimHub:A Unified Air-Ground Simulation Platform for Multi-Modal Perception and Decision-Making
Recursive Inference for Heterogeneous Multi-Output GP State-Space Models with Arbitrary Moment Matching
LILAC: Long-sequence Incremental Low-latency Arbitrary Motion Stylization via Streaming VAE-Diffusion with Causal Decoding
Information Theory in Open-world Machine Learning Foundations, Frameworks, and Future Direction
Semantic4Safety: Causal Insights from Zero-shot Street View Imagery Segmentation for Urban Road Safety
Nonlinear Dimensionality Reduction Techniques for Bayesian Optimization
Online Policy Learning via a Self-Normalized Maximal Inequality
AI and analytics in sports: Leveraging BERTopic to map the past and chart the future
Latent Feature Alignment: Discovering Biased and Interpretable Subpopulations in Face Recognition Models
VO-DP: Semantic-Geometric Adaptive Diffusion Policy for Vision-Only Robotic Manipulation
SpikeFit: Towards Optimal Deployment of Spiking Networks on Neuromorphic Hardware
Geometric Convergence Analysis of Variational Inference via Bregman Divergences
Kernel-Based Evaluation of Conditional Biological Sequence Models
Stochastic Optimization with Random Search
GOGH: Correlation-Guided Orchestration of GPUs in Heterogeneous Clusters
Bayesian Inference for PDE-based Inverse Problems using the Optimization of a Discrete Loss
Disentanglement of Sources in a Multi-Stream Variational Autoencoder
A Split-Client Approach to Second-Order Optimization
QSilk: Micrograin Stabilization and Adaptive Quantile Clipping for Detail-Friendly Latent Diffusion
On Non-interactive Evaluation of Animal Communication Translators
Towards more holistic interpretability: A lightweight disentangled Concept Bottleneck Model
Enhanced Renewable Energy Forecasting using Context-Aware Conformal Prediction
DexCanvas: Bridging Human Demonstrations and Robot Learning for Dexterous Manipulation
On Universality of Deep Equivariant Networks
Error analysis of a compositional score-based algorithm for simulation-based inference
Blackwell's Approachability for Sequential Conformal Inference
SpeechLLMs for Large-scale Contextualized Zero-shot Slot Filling
Personalized Semi-Supervised Federated Learning for Human Activity Recognition
Photovoltaic power forecasting using quantum machine learning
Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search
METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation
A Multimodal Lightweight Approach to Fault Diagnosis of Induction Motors in High-Dimensional Dataset
Scalable Multi-phase Word Embedding Using Conjunctive Propositional Clauses
Privacy-Preserving Dataset Combination
Closed-Form Training Dynamics Reveal Learned Features and Linear Structure in Word2Vec-like Models
Predicting gene essentiality and drug response from perturbation screens in preclinical cancer models with LEAP: Layered Ensemble of Autoencoders and Predictors
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
Rethinking Robustness in Machine Learning: A Posterior Agreement Approach
LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
Neural Mean-Field Games: Extending Mean-Field Game Theory with Neural Stochastic Differential Equations
PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold
Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models
End-to-End Multi-Modal Diffusion Mamba
Design and Analysis of Parallel Artificial Protozoa Optimizer (P-APO) using CUDA Architecture
DeepAries: Adaptive Rebalancing Interval Selection for Enhanced Portfolio Selection
RegimeFolio: A Regime Aware ML System for Sectoral Portfolio Optimization in Dynamic Markets
Constrained Diffusion for Protein Design with Hard Structural Constraints
The Role of Federated Learning in Improving Financial Security: A Survey
GAZE:Governance-Aware pre-annotation for Zero-shot World Model Environments
PC-UNet: An Enforcing Poisson Statistics U-Net for Positron Emission Tomography Denoising
Evaluation and Implementation of Machine Learning Algorithms to Predict Early Detection of Kidney and Heart Disease in Diabetic Patients
VaultGemma: A Differentially Private Gemma Model
Automated Snippet-Alignment Data Augmentation for Code Translation
TangledFeatures: Robust Feature Selection in Highly Correlated Spaces
Rethinking Toxicity Evaluation in Large Language Models: A Multi-Label Perspective
Can generative AI figure out figurative language? The influence of idioms on essay scoring by ChatGPT, Gemini, and Deepseek
Hybrid Autoencoder-Based Framework for Early Fault Detection in Wind Turbines
From Universal Approximation Theorem to Tropical Geometry of Multi-Layer Perceptrons
DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models
Active Honeypot Guardrail System: Probing and Confirming Multi-Turn LLM Jailbreaks
UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos
The Coverage Principle: How Pre-training Enables Post-Training
Sequential Comics for Jailbreaking Multimodal Large Language Models via Structured Visual Storytelling
DMRetriever: A Family of Models for Improved Text Retrieval in Disaster Management
Beyond Outcome-Based Imperfect-Recall: Higher-Resolution Abstractions for Imperfect-Information Games
Operator Flow Matching for Timeseries Forecasting
Continual Learning via Sparse Memory Finetuning
Targeted Attacks and Defenses for Distributed Federated Learning in Vehicular Networks
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Latent Topic Synthesis: Leveraging LLMs for Electoral Ad Analysis
FarsiMCQGen: a Persian Multiple-choice Question Generation Framework
XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models
Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning
The Economics of AI Foundation Models: Openness, Competition, and Governance
Automotive Crash Dynamics Modeling Accelerated with Machine Learning
ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning
Extending Audio Context for Long-Form Understanding in Large Audio-Language Models
Adaptive Individual Uncertainty under Out-Of-Distribution Shift with Expert-Routed Conformal Prediction
Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning
DRO-InstructZero: Distributionally Robust Prompt Optimization for Large Language Models
Robust Layerwise Scaling Rules by Proper Weight Decay Tuning
TraceCoder: Towards Traceable ICD Coding via Multi-Source Knowledge Integration
TACL: Threshold-Adaptive Curriculum Learning Strategy for Enhancing Medical Text Understanding
Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition
Post-Processing Methods for Improving Accuracy in MRI Inpainting
Exemplar-Guided Planing: Enhanced LLM Agent for KGQA
MTmixAtt: Integrating Mixture-of-Experts with Multi-Mix Attention for Large-Scale Recommendation
Identifying internal patterns in (1+1)-dimensional directed percolation using neural networks
VERA-MH Concept Paper
Latent Diffusion Model without Variational Autoencoder
DSSmoothing: Toward Certified Dataset Ownership Verification for Pre-trained Language Models via Dual-Space Smoothing
BeLLMan: Controlling LLM Congestion
ASBI: Leveraging Informative Real-World Data for Active Black-Box Simulator Tuning
Readability Reconsidered: A Cross-Dataset Analysis of Reference-Free Metrics
When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling
GaussGym: An open-source real-to-sim framework for learning locomotion from pixels
Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning
Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding
Towards Robust Zero-Shot Reinforcement Learning
DroneAudioset: An Audio Dataset for Drone-based Search and Rescue
MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment
Robust High-Resolution Multi-Organ Diffusion MRI Using Synthetic-Data-Tuned Prompt Learning
Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs
Learning to Detect Unknown Jailbreak Attacks in Large Vision-Language Models
Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning
Expediting Reinforcement Learning by Incorporating Knowledge About Temporal Causality in the Environment
Robust Optimization in Causal Models and G-Causal Normalizing Flows
Learning to Answer from Correct Demonstrations
SoK: Taxonomy and Evaluation of Prompt Security in Large Language Models
Selecting and Combining Large Language Models for Scalable Code Clone Detection
An Experimental Study of Real-Life LLM-Proposed Performance Improvements
OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
DeceptionBench: A Comprehensive Benchmark for AI Deception Behaviors in Real-world Scenarios
The Road Less Traveled: Enhancing Exploration in LLMs via Sequential Sampling
AI Adoption in NGOs: A Systematic Literature Review
Language Models are Injective and Hence Invertible
Revisiting Knowledge Distillation: The Hidden Role of Dataset Size
MCA: Modality Composition Awareness for Robust Composed Multimodal Retrieval
TokenTiming: A Dynamic Alignment Method for Universal Speculative Decoding Model Pairs
Rethinking Cross-lingual Gaps from a Statistical Viewpoint
Think Parallax: Solving Multi-Hop Problems via Multi-View Knowledge-Graph-Based Retrieval-Augmented Generation
ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents
KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models
SpikeVox: Towards Energy-Efficient Speech Therapy Framework with Spike-driven Generative Language Models
The Spark Effect: On Engineering Creative Diversity in Multi-Agent AI Systems
Lightweight CycleGAN Models for Cross-Modality Image Transformation and Experimental Quality Assessment in Fluorescence Microscopy
CQD-SHAP: Explainable Complex Query Answering via Shapley Values
Enhance Large Language Models as Recommendation Systems with Collaborative Filtering
Valeo Near-Field: a novel dataset for pedestrian intent detection
CarBoN: Calibrated Best-of-N Sampling Improves Test-time Reasoning
ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings
Mixture of Experts Approaches in Dense Retrieval Tasks
Towards Label-Free Brain Tumor Segmentation: Unsupervised Learning with Multimodal MRI
KS-Net: Multi-layer network model for determining the rotor type from motor parameters in interior PMSMs
Exploring the Synergy of Quantitative Factors and Newsflow Representations from Large Language Models for Stock Return Prediction
ProofOptimizer: Training Language Models to Simplify Proofs without Human Demonstrations
Beyond-Diagonal RIS Under Non-Idealities: Learning-Based Architecture Discovery and Optimization
ProSh: Probabilistic Shielding for Model-free Reinforcement Learning
DGME-T: Directional Grid Motion Encoding for Transformer-Based Historical Camera Movement Classification
RLAF: Reinforcement Learning from Automaton Feedback
Attention Sinks in Diffusion Language Models
LLMs Judge Themselves: A Game-Theoretic Framework for Human-Aligned Evaluation
NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation
Semantic segmentation with coarse annotations
Controlling the image generation process with parametric activation functions
AB-UPT for Automotive and Aerospace Applications
Chronos-2: From Univariate to Universal Forecasting
GENESIS: A Generative Model of Episodic-Semantic Interaction
SNOO: Step-K Nesterov Outer Optimizer - The Surprising Effectiveness of Nesterov Momentum Applied to Pseudo-Gradients
Enhanced Sentiment Interpretation via a Lexicon-Fuzzy-Transformer Framework
Self-Certifying Primal-Dual Optimization Proxies for Large-Scale Batch Economic Dispatch
InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training
PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Flexora: Flexible Low Rank Adaptation for Large Language Models
Beyond Static Assumptions: the Predictive Justified Perspective Model for Epistemic Planning
Where Common Knowledge Cannot Be Formed, Common Belief Can -- Planning with Multi-Agent Belief Using Group Justified Perspectives
FERA: Foil Fencing Referee Assistant Using Pose-Based Multi-Label Move Recognition and Rule Reasoning
Establishing trust in automated reasoning
MotionScript: Natural Language Descriptions for Expressive 3D Human Motions
BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features
Retrieval-Augmented Test Generation: How Far Are We?
Memory-Efficient Large Language Models for Program Repair with Semantic-Guided Patch Generation
Variational Autoencoders for Efficient Simulation-Based Inference
Competition and Diversity in Generative AI
Towards smart and adaptive agents for active sensing on edge devices
Retro3D: A 3D-aware Template-free Method for Enhancing Retrosynthesis via Molecular Conformer Information
GuardReasoner: Towards Reasoning-based LLM Safeguards
FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model
Methods and Trends in Detecting AI-Generated Images: A Comprehensive Review
Finetuning and Quantization of EEG-Based Foundational BioSignal Models on ECG and PPG Data for Blood Pressure Estimation
EMCee: Improving Multilingual Capability of LLMs via Bridging Knowledge and Reasoning with Extracted Synthetic Multilingual Context
NFIG: Autoregressive Image Generation with Next-Frequency Prediction
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
Beyond Final Code: A Process-Oriented Error Analysis of Software Development Agents in Real-World GitHub Scenarios
Text2Schema: Filling the Gap in Designing Database Table Structures based on Natural Language
Unfair Learning: GenAI Exceptionalism and Copyright Law
Multi-identity Human Image Animation with Structural Video Diffusion
Interpretable Hybrid-Rule Temporal Point Processes
SHeaP: Self-Supervised Head Geometry Predictor Learned via 2D Gaussians
MAYA: Addressing Inconsistencies in Generative Password Guessing through a Unified Benchmark
FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
PAD: Phase-Amplitude Decoupling Fusion for Multi-Modal Land Cover Classification
Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles
Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors
OpenEstimate: Evaluating LLMs on Reasoning Under Uncertainty with Real-World Data
Procedural Game Level Design with Deep Reinforcement Learning
Towards Error Centric Intelligence I, Beyond Observational Learning
HugAgent: Evaluating LLMs in Simulating Human-Like Individual Reasoning on Open-Ended Tasks
WELD: A Large-Scale Longitudinal Dataset of Emotional Dynamics for Ubiquitous Affective Computing
From Checklists to Clusters: A Homeostatic Account of AGI Evaluation
Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions
Experience-Driven Exploration for Efficient API-Free AI Agents
AUGUSTUS: An LLM-Driven Multimodal Agent System with Contextualized User Memory
WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation
VERITAS: Leveraging Vision Priors and Expert Fusion to Improve Multimodal Data
Towards Flash Thinking via Decoupled Advantage Policy Optimization
Advancing Routing-Awareness in Analog ICs Floorplanning
Corrigibility Transformation: Constructing Goals That Accept Updates
MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Adaptive Minds: Empowering Agents with LoRA-as-Tools
Taming the Judge: Deconflicting AI Feedback for Stable Reinforcement Learning
Hypergraph Contrastive Sensor Fusion for Multimodal Fault Diagnosis in Induction Motors
JudgeSQL: Reasoning over SQL Candidates with Weighted Consensus Tournament
Context-aware deep learning using individualized prior information reduces false positives in disease risk prediction and longitudinal health assessment
Unleashing Scientific Reasoning for Bio-experimental Protocol Generation via Structured Component-based Reward Mechanism
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation
Direct Preference Optimization with Unobserved Preference Heterogeneity: The Necessity of Ternary Preferences
Invoice Information Extraction: Methods and Performance Evaluation
AURA: An Agent Autonomy Risk Assessment Framework
Towards Relaxed Multimodal Inputs for Gait-based Parkinson's Disease Assessment
Preliminary Quantitative Study on Explainability and Trust in AI Systems
Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL
Demo: Guide-RAG: Evidence-Driven Corpus Curation for Retrieval-Augmented Generation in Long COVID

Research Sources: 470 | Generated: 10/20/2025