AI Research News Feeds for September 16th, 2025

AI RESEARCH PAPERS & ACADEMIC SOURCES

Preconditioned subgradient method for composite optimization: overparameterization and fast convergence
High Effort, Low Gain: Fundamental Limits of Active Learning for Linear Dynamical Systems
Contractive kinetic Langevin samplers beyond global Lipschitz continuity
A comparison between geostatistical and machine learning models for spatio-temporal prediction of PM2.5 data
Generalized Dirichlet Energy and Graph Laplacians for Clustering Directed and Undirected Graphs
Piecewise Deterministic Markov Processes for Bayesian Neural Networks
Adapting Projection-Based Reduced-Order Models using Projected Gaussian Process
Deep learning joint extremes of metocean variables using the SPAR model
Kernel Embeddings and the Separation of Measure Phenomenon
Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions
Eigen-convergence of Gaussian kernelized graph Laplacian by manifold heat interpolation
A Permutation-free Kernel Two-Sample Test
Early alignment in two-layer networks training is a two-edged sword
Robustness in the Face of Partial Identifiability in Reward Learning
Understanding Model Calibration -- A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)
Weak instrumental variables due to nonlinearities in panel data: A Super Learner Control Function estimator
All Optical Echo State Network Reservoir Computing
Scalp Diagnostic System With Label-Free Segmentation and Training-Free Image Translation
Social Perception of Faces in a Vision-Language Model
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction
HD-OOD3D: Supervised and Unsupervised Out-of-Distribution object detection in LiDAR data
Kernel-based Stochastic Approximation Framework for Nonlinear Operator Learning
Maximum diversity, weighting and invariants of time series
Predictable Compression Failures: Why Language Models Actually Hallucinate
Contrastive Network Representation Learning
Next-Generation Reservoir Computing for Dynamical Inference
Some Robustness Properties of Label Cleaning
A Particle-Flow Algorithm for Free-Support Wasserstein Barycenters
Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification
E-ROBOT: a dimension-free method for robust statistics and machine learning via Schr\"odinger bridge
SpaPool: Soft Partition Assignment Pooling for__Graph Neural Networks
Identifiable Autoregressive Variational Autoencoders for Nonlinear and Nonstationary Spatio-Temporal Blind Source Separation
MMM: Clustering Multivariate Longitudinal Mixed-type Data
The Morgan-Pitman Test of Equality of Variances and its Application to Machine Learning Model Evaluation and Selection
What is in a Price? Estimating Willingness-to-Pay with Bayesian Hierarchical Models
The Honest Truth About Causal Trees: Accuracy Limits for Heterogeneous Treatment Effect Estimation
Solving ill-conditioned polynomial equations using score-based priors with application to multi-target detection
Rate-Distortion Limits for Multimodal Retrieval: Theory, Optimal Codes, and Finite-Sample Guarantees
SH-SAS: An Implicit Neural Representation for Complex Spherical-Harmonic Scattering Fields for 3D Synthetic Aperture Sonar
UltraUPConvNet: A UPerNet- and ConvNeXt-Based Multi-Task Network for Ultrasound Tissue Segmentation and Disease Prediction
ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
Realistic Environmental Injection Attacks on GUI Agents
Introduction to a Low-Cost AI-Powered GUI for Unstained Cell Culture Analysis
Geometric Analysis of Magnetic Labyrinthine Stripe Evolution via U-Net Segmentation
ParaEQsA: Parallel and Asynchronous Embodied Questions Scheduling and Answering
TrajBooster: Boosting Humanoid Whole-Body Manipulation via Trajectory-Centric Learning
Data-driven Smile Design: Personalized Dental Aesthetics Outcomes Using Deep Learning
Video-based Sign Language Recognition without Temporal Segmentation
SAIF: Sparse Adversarial and Imperceptible Attack Framework
SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution
Long-Tailed 3D Detection via Multi-Modal Fusion
Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration
SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis
InstructHumans: Editing Animated 3D Human Textures with Instructions
HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising
Multilingual Diversity Improves Vision-Language Representations
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective
Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation
End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data
U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT
Progressive Flow-inspired Unfolding for Spectral Compressive Imaging
End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI
FS-SAM2: Adapting Segment Anything Model 2 for Few-Shot Semantic Segmentation via Low-Rank Adaptation
RailSafeNet: Visual Scene Understanding for Tram Safety
3DViT-GAT: A Unified Atlas-Based 3D Vision Transformer and Graph Learning Framework for Major Depressive Disorder Detection Using Structural MRI Data
Open-ended Hierarchical Streaming Video Understanding with Vision Language Models
Multi Anatomy X-Ray Foundation Model
LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury
HoloGarment: 360{\deg} Novel View Synthesis of In-the-Wild Garments
Domain-Adaptive Pretraining Improves Primate Behavior Recognition
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence
Character-Centric Understanding of Animated Movies
MIDOG 2025 Track 2: A Deep Learning Model for Classification of Atypical and Normal Mitotic Figures under Class and Hardness Imbalances
Automated Cervical Os Segmentation for Camera-Guided, Speculum-Free Screening
Adapting Medical Vision Foundation Models for Volumetric Medical Image Segmentation via Active Learning and Selective Semi-supervised Fine-tuning
Nav-R1: Reasoning and Navigation in Embodied Scenes
AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting
Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network
Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation
Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting
Bridging Vision Language Models and Symbolic Grounding for Video Question Answering
Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding
Multi-animal tracking in Transition: Comparative Insights into Established and Emerging Methods
Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation
SAM-TTT: Segment Anything Model via Reverse Parameter Configuration and Test-Time Training for Camouflaged Object Detection
BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation
Logit Mixture Outlier Exposure for Fine-grained Out-of-Distribution Detection
Integrating Prior Observations for Incremental 3D Scene Graph Prediction
NeuroGaze-Distill: Brain-informed Distillation and Depression-Inspired Geometric Priors for Robust Facial Emotion Recognition
Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI
Graph Algorithm Unrolling with Douglas-Rachford Iterations for Image Interpolation with Guaranteed Initialization
Sphere-GAN: a GAN-based Approach for Saliency Estimation in 360{\deg} Videos
CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation
Learning to Generate 4D LiDAR Sequences
Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness
RAM++: Robust Representation Learning via Adaptive Mask for All-in-One Image Restoration
Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
Layout-Conditioned Autoregressive Text-to-Image Generation via Structured Masking
A Computer Vision Pipeline for Individual-Level Behavior Analysis: Benchmarking on the Edinburgh Pig Dataset
DUAL-VAD: Dual Benchmarks and Anomaly-Focused Sampling for Video Anomaly Detection
A Controllable 3D Deepfake Generation Framework with Gaussian Splatting
IS-Diff: Improving Diffusion-Based Inpainting with Better Initial Seed
WeatherBench: A Real-World Benchmark Dataset for All-in-One Adverse Weather Image Restoration
Joint-octamamba:an octa joint segmentation network based on feature enhanced mamba
DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition
RouteExtract: A Modular Pipeline for Extracting Routes from Paper Maps
IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects
Uncertainty-Aware Retinal Vessel Segmentation via Ensemble Distillation
The Quest for Universal Master Key Filters in DS-CNNs
Advanced Layout Analysis Models for Docling
Microsurgical Instrument Segmentation for Robot-Assisted Surgery
Bridging the Gap Between Sparsity and Redundancy: A Dual-Decoding Framework with Global Context for Map Inference
A Fully Open and Generalizable Foundation Model for Ultrasound Clinical Applications
MSMA: Multi-Scale Feature Fusion For Multi-Attribute 3D Face Reconstruction From Unconstrained Images
Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization
SA-UNetv2: Rethinking Spatial Attention U-Net for Retinal Vessel Segmentation
FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts Reasoning
Pseudo-D: Informing Multi-View Uncertainty Estimation with Calibrated Neural Training Dynamics
LFRA-Net: A Lightweight Focal and Region-Aware Attention Network for Retinal Vessel Segmentatio
SpecVLM: Fast Speculative Decoding in Vision-Language Models
MAFS: Masked Autoencoder for Infrared-Visible Image Fusion and Semantic Segmentation
ROSGS: Relightable Outdoor Scenes With Gaussian Splatting
Leveraging Geometric Priors for Unaligned Scene Change Detection
UnLoc: Leveraging Depth Uncertainties for Floorplan Localization
Toward Next-generation Medical Vision Backbones: Modeling Finer-grained Long-range Visual Dependency
Dual Band Video Thermography Near Ambient Conditions
Beyond Instance Consistency: Investigating View Diversity in Self-supervised Learning
GLaVE-Cap: Global-Local Aligned Video Captioning with Vision Expert Integration
In-Vivo Skin 3-D Surface Reconstruction and Wrinkle Depth Estimation using Handheld High Resolution Tactile Sensing
MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation
No Modality Left Behind: Dynamic Model Generation for Incomplete Medical Data
On the Skinning of Gaussian Avatars
Disentanglement of Biological and Technical Factors via Latent Space Rotation in Clinical Imaging Improves Disease Pattern Discovery
MultiMAE for Brain MRIs: Robustness to Missing Inputs Using Multi-Modal Masked Autoencoder
Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision
Multiple Instance Learning Framework with Masked Hard Instance Mining for Gigapixel Histopathology Image Analysis
SFGNet: Semantic and Frequency Guided Network for Camouflaged Object Detection
How Auxiliary Reasoning Unleashes GUI Grounding in VLMs
Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps
Hierarchical Identity Learning for Unsupervised Visible-Infrared Person Re-Identification
Optimizing Class Distributions for Bias-Aware Multi-Class Learning
MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment
Disentangling Content from Style to Overcome Shortcut Learning: A Hybrid Generative-Discriminative Learning Framework
Action Hints: Semantic Typicality and Context Uniqueness for Generalizable Skeleton-based Video Anomaly Detection
Organoid Tracker: A SAM2-Powered Platform for Zero-shot Cyst Analysis in Human Kidney Organoid Videos
Mars Traversability Prediction: A Multi-modal Self-supervised Approach for Costmap Generation
End-to-End Visual Autonomous Parking via Control-Aided Attention
SMILE: A Super-resolution Guided Multi-task Learning Method for Hyperspectral Unmixing
A Copula-Guided Temporal Dependency Method for Multitemporal Hyperspectral Images Unmixing
3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment
Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation
WildSmoke: Ready-to-Use Dynamic 3D Smoke Assets from a Single Video in the Wild
SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting
No Mesh, No Problem: Estimating Coral Volume and Surface from Sparse Multi-View Images
Traffic-MLLM: A Spatio-Temporal MLLM with Retrieval-Augmented Generation for Causal Inference in Traffic
Multispectral-NeRF:a multispectral modeling approach based on neural radiance fields
SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion
The Impact of Skin Tone Label Granularity on the Performance and Fairness of AI Based Dermatology Image Classification Models
Scaling Up Forest Vision with Synthetic Data
Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation
CCoMAML: Efficient Cattle Identification Using Cooperative Model-Agnostic Meta-Learning
ANROT-HELANet: Adverserially and Naturally Robust Attention-Based Aggregation Network via The Hellinger Distance for Few-Shot Classification
Contextualized Multimodal Lifelong Person Re-Identification in Hybrid Clothing States
Cross-Domain Attribute Alignment with CLIP: A Rehearsal-Free Approach for Class-Incremental Unsupervised Domain Adaptation
Synthetic Dataset Evaluation Based on Generalized Cross Validation
Enhancement Without Contrast: Stability-Aware Multicenter Machine Learning for Glioma MRI Imaging
Group Evidence Matters: Tiling-based Semantic Gating for Dense Object Detection
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
Well-Conditioned Polynomial Representations for Mathematical Handwriting Recognition
Multi-Task Diffusion Approach For Prediction of Glioma Tumor Progression
Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios
OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds
AutoOEP -- A Multi-modal Framework for Online Exam Proctoring
Total Variation Subgradient Guided Image Fusion for Dual-Camera CASSI System
Simulating Sinogram-Domain Motion and Correcting Image-Domain Artifacts Using Deep Learning in HR-pQCT Bone Imaging
Gaze Authentication: Factors Influencing Authentication Performance
TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation
Policy-Driven Transfer Learning in Resource-Limited Animal Monitoring
Improving Fungi Prototype Representations for Few-Shot Classification
Cluster-Level Sparse Multi-Instance Learning for Whole-Slide Images
SurgLaVi: Large-Scale Hierarchical Dataset for Surgical Vision-Language Representation Learning
USCTNet: A deep unfolding nuclear-norm optimization solver for physically consistent HSI reconstruction
Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation
SegSLR: Promptable Video Segmentation for Isolated Sign Language Recognition
SCOPE: Speech-guided COllaborative PErception Framework for Surgical Scene Segmentation
Every Camera Effect, Every Time, All at Once: 4D Gaussian Ray Tracing for Physics-based Camera Effect Data Generation
EditDuet: A Multi-Agent System for Video Non-Linear Editing
LastingBench: Defend Benchmarks Against Knowledge Leakage
PDFMathTranslate: Scientific Document Translation Preserving Layouts
Persona-Based Synthetic Data Generation Using Multi-Stage Conditioning with Large Language Models for Emotion Recognition
Is In-Context Learning Learning?
Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
A Survey on Large Language Model-based Agents for Statistics and Data Science
Evaluating and Aligning Human Economic Risk Preferences in LLMs
One Goal, Many Challenges: Robust Preference Optimization Amid Content-Aware and Multi-Source Noise
Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks
Lean Formalization of Generalization Error Bound by Rademacher Complexity
Rethinking LLM-Based Recommendations: A Personalized Query-Driven Parallel Integration
SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation
A Real-Time Diminished Reality Approach to Privacy in MR Collaboration
Hallucinated Span Detection with Multi-View Attention Features
Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
LogicTree: Structured Proof Exploration for Coherent and Rigorous Logical Reasoning with Large Language Models
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages
Improving Informally Romanized Language Identification
Base Models Beat Aligned Models at Randomness and Creativity
Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models
Multilingual Collaborative Defense for Large Language Models
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation
ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments
Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation
Hopscotch: Discovering and Skipping Redundancies in Language Models
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
Mirage of Mastery: Memorization Tricks LLMs into Artificially Inflated Self-Knowledge
Time is On My Side: Dynamics of Talk-Time Sharing in Video-chat Conversations
A Cross-Cultural Comparison of LLM-based Public Opinion Simulation: Evaluating Chinese and U.S. Models on Diverse Societies
LML: A Novel Lexicon for the Moral Foundation of Liberty
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Can Advanced LLMs Coach Smaller LLMs? Knowledge Distillation for Goal-Oriented Dialogs
GP-GPT: Large Language Model for Gene-Phenotype Mapping
Revealing the Inherent Instructability of Pre-Trained Language Models
Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
Artificial intelligence contribution to translation industry: looking back and forward
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering
Speak-to-Structure: Evaluating LLMs in Open-domain Natural Language-Driven Molecule Generation
IOLBENCH: Benchmarking LLMs on Linguistic Reasoning
Transformer-Based Multimodal Knowledge Graph Completion with Link-Aware Contexts
From Personas to Talks: Revisiting the Impact of Personas on LLM-Synthesized Emotional Support Conversations
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs
Efficient Environmental Claim Detection with Hyperbolic Graph Neural Networks
Rumor Detection by Multi-task Suffix Learning based on Time-series Dual Sentiments
Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology
LLM as a Broken Telephone: Iterative Generation Distorts Information
LinguaLens: Towards Interpreting Linguistic Mechanisms of Large Language Models via Sparse Auto-Encoder
Monitoring Decoding: Mitigating Hallucination via Evaluating the Factuality of Partial Response during Generation
Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter
Is 'Hope' a person or an idea? A pilot benchmark for NER: comparing traditional NLP tools and large language models on ambiguous entities
In-domain SSL pre-training and streaming ASR
GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
CBP-Tuning: Efficient Local Customization for Black-box Large Language Models
XplaiNLP at CheckThat! 2025: Multilingual Subjectivity Detection with Finetuned Transformers and Prompt-Based Inference with Large Language Models
Pun Unintended: LLMs and the Illusion of Humor Understanding
RAGs to Riches: RAG-like Few-shot Learning for Large Language Model Role-playing
Preservation of Language Understanding Capabilities in Speech-aware Large Language Models
ReFineG: Synergizing Small Supervised Models and LLMs for Low-Resource Grounded Multimodal NER
Mitigating Hallucinations in Large Vision-Language Models by Self-Injecting Hallucinations
MALLM: Multi-Agent Large Language Models Framework
MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs
Collaborative Document Editing with Multiple Users and AI Agents
The AI Memory Gap: Users Misremember What They Created With AI or Without
Lost in Embeddings: Information Loss in Vision-Language Models
FinGEAR: Financial Mapping-Guided Enhanced Answer Retrieval
RadarLLM: Adapting Pretrained Large Language Models for Marine Radar Target Detection with Preference-aware Loss
When marine radar target detection meets pretrained large language models
Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models
Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm
Understanding Emergent In-Context Learning from a Kernel Regression Perspective
Tackling Fake News in Bengali: Unraveling the Impact of Summarization vs. Augmentation on Pre-trained Language Models
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
HalluDetect: Detecting, Mitigating, and Benchmarking Hallucinations in Conversational Systems
AesBiasBench: Evaluating Bias and Alignment in Multimodal Language Models for Personalized Image Aesthetic Assessment
EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI
A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News Detection
CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model
Room acoustics affect communicative success in hybrid meeting spaces: a pilot study
An Agentic Toolkit for Adaptive Information Extraction from Regulatory Documents
User eXperience Perception Insights Dataset (UXPID): Synthetic User Feedback from Public Industrial Forums
When Curiosity Signals Danger: Predicting Health Crises Through Online Medication Inquiries
From Fuzzy Speech to Medical Insight: Benchmarking LLMs on Noisy Patient Narratives
PledgeTracker: A System for Monitoring the Fulfilment of Pledges
SCDTour: Embedding Axis Ordering and Merging for Interpretable Semantic Change Detection
MOOM: Maintenance, Organization and Optimization of Memory in Ultra-Long Role-Playing Dialogues
Growing Perspectives: Modelling Embodied Perspective Taking and Inner Narrative Development Using Large Language Models
Uncertainty in Authorship: Why Perfect AI Detection Is Mathematically Impossible
Designing LLMs for cultural sensitivity: Evidence from English-Japanese translation
Spec-LLaVA: Accelerating Vision-Language Models with Dynamic Tree-Based Speculative Decoding
ToolRM: Outcome Reward Models for Tool-Calling Large Language Models
Query-Focused Extractive Summarization for Sentiment Explanation
Text Adaptation to Plain Language and Easy Read via Automatic Post-Editing Cycles
Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect
SENSE models: an open source solution for multilingual and multimodal semantic-based tasks
Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs
RanAT4BIE: Random Adversarial Training for Biomedical Information Extraction
The Prompt Engineering Report Distilled: Quick Start Guide for Life Sciences
Ko-PIQA: A Korean Physical Commonsense Reasoning Dataset with Cultural Context
!MSA at AraHealthQA 2025 Shared Task: Enhancing LLM Performance for Arabic Clinical Question Answering through Prompt Engineering and Ensemble Learning
Continually Adding New Languages to Multilingual Language Models
A Transformer-Based Cross-Platform Analysis of Public Discourse on the 15-Minute City Paradigm
CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media
CEMTM: Contextual Embedding-based Multimodal Topic Modeling
Improving LLMs' Learning for Coreference Resolution
AKCIT-FN at CheckThat! 2025: Switching Fine-Tuned SLMs and LLM Prompting for Multilingual Claim Normalization
DeDisCo at the DISRPT 2025 Shared Task: A System for Discourse Relation Classification
Unsupervised Candidate Ranking for Lexical Substitution via Holistic Sentence Semantics
LVLMs are Bad at Overhearing Human Referential Communication
PeruMedQA: Benchmarking Large Language Models (LLMs) on Peruvian Medical Exams -- Dataset Construction and Evaluation
On the Distinctive Co-occurrence Characteristics of Antonymy
HARP: Hallucination Detection via Reasoning Subspace Projection
HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking
D$^2$HScore: Reasoning-Aware Hallucination Detection via Semantic Breadth and Depth Analysis in LLMs
Bhaasha, Bhasa, Zaban: A Survey for Low-Resourced Languages in South Asia -- Current Stage and Challenges
Analyzing Information-Seeking Behaviors in a Hakka AI Chatbot: A Cognitive-Pragmatic Study
Dynamic Span Interaction and Graph-Aware Memory for Entity-Level Sentiment Classification
Interdisciplinary Research in Conversation: A Case Study in Computational Morphology for Language Documentation
Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts
A Survey on Retrieval And Structuring Augmented Generation with Large Language Models
SearchInstruct: Enhancing Domain Adaptation via Retrieval-Based Instruction Dataset Creation
Reasoning Under Uncertainty: Exploring Probabilistic Reasoning Capabilities of LLMs
RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems
Evaluating Large Language Models for Evidence-Based Clinical Question Answering
GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings
Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production
Quantifier Scope Interpretation in Language Learners and LLMs
Term2Note: Synthesising Differentially Private Clinical Notes from Medical Terms
Aligning ESG Controversy Data with International Guidelines through Semi-Automatic Ontology Construction
Introducing Spotlight: A Novel Approach for Generating Captivating Key Information from Documents
An Interpretable Benchmark for Clickbait Detection and Tactic Attribution
EmoBench-Reddit: A Hierarchical Benchmark for Evaluating the Emotional Intelligence of Multimodal Large Language Models
Joint Effects of Argumentation Theory, Audio Modality and Data Enrichment on LLM-Based Fallacy Classification
When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity
Text2Mem: A Unified Memory Operation Language for Memory Operating System
MinatoLoader: Accelerating Machine Learning Training Through Efficient Data Preprocessing
Coordinated Reinforcement Learning Prefetching Architecture for Multicore Systems
PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models
Parameter estimation with uncertainty quantification from continuous measurement data using neural network ensembles
Why Bonds Fail Differently? Explainable Multimodal Learning for Multi-Class Default Prediction
Do machine learning climate models work in changing climate dynamics?
Learning Neural Networks by Neuron Pursuit
From Autoencoders to CycleGAN: Robust Unpaired Face Manipulation via Adversarial Learning
Dynamic Relational Priming Improves Transformer in Multivariate Time Series
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
Spectral Bottleneck in Deep Neural Networks: Noise is All You Need
The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access Network
DeepSeasons: a Deep Learning scale-selecting approach to Seasonal Forecasts
Crystal Systems Classification of Phosphate-Based Cathode Materials Using Machine Learning for Lithium-Ion Battery
Adaptive Temporal Fusion Transformers for Cryptocurrency Price Prediction
Trial-Level Time-frequency EEG Desynchronization as a Neural Marker of Pain
Assessing the Limits of Graph Neural Networks for Vapor-Liquid Equilibrium Prediction: A Cryogenic Mixture Case Study
Building a General SimCLR Self-Supervised Foundation Model Across Neurological Diseases to Advance 3D Brain MRI Diagnoses
On a Geometry of Interbrain Networks
Multimodal Regression for Enzyme Turnover Rates Prediction
Visualization and Analysis of the Loss Landscape in Graph Neural Networks
Collapse of Irrelevant Representations (CIR) Ensures Robust and Non-Disruptive LLM Unlearning
FedDAF: Federated Domain Adaptation Using Model Functional Distance
Transparent and Fair Profiling in Employment Services: Evidence from Switzerland
MillStone: How Open-Minded Are LLMs?
Examining the Relationship between Scientific Publishing Activity and Hype-Driven Financial Bubbles: A Comparison of the Dot-Com and AI Eras
Low-rank Orthogonalization for Large-scale Matrix Optimization with Applications to Foundation Model Training
Learning from Uncertain Similarity and Unlabeled Data
Generalizing Behavior via Inverse Reinforcement Learning with Closed-Form Reward Centroids
AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models
Travel Time and Weather-Aware Traffic Forecasting in a Conformal Graph Neural Network Framework
Early Detection of Branched Broomrape (Phelipanche ramosa) Infestation in Tomato Crops Using Leaf Spectral Analysis and Machine Learning
A Time-Series Foundation Model by Universal Delay Embedding
Draw a Portrait of Your Graph Data: An Instance-Level Profiling Framework for Graph-Structured Data
$K$-Level Policy Gradients for Multi-Agent Reinforcement Learning
Online Omniprediction with Long-Term Constraints
PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits
Decoding Musical Origins: Distinguishing Human and AI Composers
Enhancing ML Models Interpretability for Credit Scoring
Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting
Drug Repurposing Using Deep Embedded Clustering and Graph Neural Networks
OASIS: A Deep Learning Framework for Universal Spectroscopic Analysis Driven by Novel Loss Functions
DARD: Dice Adversarial Robustness Distillation against Adversarial Attacks
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Inducing Uncertainty for Test-Time Privacy
SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching
Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check
Measuring Visual Understanding in Telecom domain: Performance Metrics for Image-to-UML conversion using VLMs
DRAG: Data Reconstruction Attack using Guided Diffusion
Fast and Interpretable Machine Learning Modelling of Atmospheric Molecular Clusters
Data Fusion and Machine Learning for Ship Fuel Consumption Modelling -- A Case of Bulk Carrier Vessel
Stabilizing PINNs: A regularization scheme for PINN training to avoid unstable fixed points of dynamical systems
Verifying Computational Graphs in Production-Grade Distributed Machine Learning Frameworks
CrunchLLM: Multitask LLMs for Structured Business Reasoning and Outcome Prediction
Using LLMs for Late Multimodal Sensor Fusion for Activity Recognition
Matched-Pair Experimental Design with Active Learning
Neurosymbolic AI Transfer Learning Improves Network Intrusion Detection
CogGNN: Cognitive Graph Neural Networks in Generative Connectomics
Robustifying Diffusion-Denoised Smoothing Against Covariate Shift
California Wildfire Inventory (CAWFI): An Extensive Dataset for Predictive Techniques based on Artificial Intelligence
Data-Efficient Ensemble Weather Forecasting with Diffusion Models
Machine Learning Framework for Audio-Based Equipment Condition Monitoring: A Comparative Study of Classification Algorithms
GCN-TULHOR: Trajectory-User Linking Leveraging GCNs and Higher-Order Spatial Representations
BIGNet: Pretrained Graph Neural Network for Embedding Semantic, Spatial, and Topological Data in BIM Models
PINGS: Physics-Informed Neural Network for Fast Generative Sampling
MatQnA: A Benchmark Dataset for Multi-modal Large Language Models in Materials Characterization and Analysis
On the Escaping Efficiency of Distributed Adversarial Training Algorithms
ClaimIQ at CheckThat! 2025: Comparing Prompted and Fine-Tuned Language Models for Verifying Numerical Claims
Machine Learning-Driven Predictive Resource Management in Complex Science Workflows
Moment Estimates and DeepRitz Methods on Learning Diffusion Systems with Non-gradient Drifts
AttnBoost: Retail Supply Chain Sales Insights via Gradient Boosting Perspective
A Differential Manifold Perspective and Universality Analysis of Continuous Attractors in Artificial Neural Networks
Adaptive Preference Optimization with Uncertainty-aware Utility Anchor
Holographic Knowledge Manifolds: A Novel Pipeline for Continual Learning Without Catastrophic Forgetting in Large Language Models
Gradient Estimation Methods of Approximate Multipliers for High-Accuracy Retraining of Deep Learning Models
GTS_Forecaster: a novel deep learning based geodetic time series forecasting toolbox with python
pySigLib -- Fast Signature-Based Computations on CPU and GPU
Interpretable neural network system identification method for two families of second-order systems based on characteristic curves
Accurate and Private Diagnosis of Rare Genetic Syndromes from Facial Images with Federated Deep Learning
Quantum Architecture Search for Solving Quantum Machine Learning Tasks
Evalet: Evaluating Large Language Models by Fragmenting Outputs into Functions
Geometrically Constrained and Token-Based Probabilistic Spatial Transformers
TransZero: Parallel Tree Expansion in MuZero using Transformer Networks
Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation
Gradient Free Deep Reinforcement Learning With TabPFN
Embodied Intelligence in Disassembly: Multimodal Perception Cross-validation and Continual Learning in Neuro-Symbolic TAMP
Efficient Single-Step Framework for Incremental Class Learning in Neural Networks
A five-layer framework for AI governance: integrating regulation, standards, and certification
Transformer Enhanced Relation Classification: A Comparative Analysis of Contextuality, Data Efficiency and Sequence Complexity
Intelligent Reservoir Decision Support: An Integrated Framework Combining Large Language Models, Advanced Prompt Engineering, and Multimodal Data Fusion for Real-Time Petroleum Operations
From Firewalls to Frontiers: AI Red-Teaming is a Domain-Specific Evolution of Cyber Red-Teaming
Framing AI System Benchmarking as a Learning Task: FlexBench and the Open MLPerf Dataset
Enhancing Generalization in Vision-Language-Action Models by Preserving Pretrained Representations
Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning
Tabular Data with Class Imbalance: Predicting Electric Vehicle Crash Severity with Pretrained Transformers (TabPFN) and Mamba-Based Models
Beyond Frame-wise Tracking: A Trajectory-based Paradigm for Efficient Point Cloud Tracking
CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration
PanoLora: Bridging Perspective and Panoramic Video Generation with LoRA Adaptation
Multi-Modal Sensing Aided mmWave Beamforming for V2V Communications with Transformers
Application of Machine Learning for Correcting Defect-induced Neuromorphic Circuit Inference Errors
ENJ: Optimizing Noise with Genetic Algorithms to Jailbreak LSMs
Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift
Your Compiler is Backdooring Your Model: Understanding and Exploiting Compilation Inconsistency Vulnerabilities in Deep Learning Compilers
Differentially-private text generation degrades output language quality
The Psychogenic Machine: Simulating AI Psychosis, Delusion Reinforcement and Harm Enablement in Large Language Models
PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint
Decoupling Search and Learning in Neural Net Training
FragmentGPT: A Unified GPT Model for Fragment Growing, Linking, and Merging in Molecular Design
An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data
Length-Aware Rotary Position Embedding for Text-Speech Alignment
A Comparison and Evaluation of Fine-tuned Convolutional Neural Networks to Large Language Models for Image Classification and Segmentation of Brain Tumors on MRI
Pluralistic Alignment for Healthcare: A Role-Driven Framework
Privacy-Preserving Decentralized Federated Learning via Explainable Adaptive Differential Privacy
Kalman Bayesian Transformer
Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight
Automated MCQA Benchmarking at Scale: Evaluating Reasoning Traces as Retrieval Sources for Domain Adaptation of Small Language Models
HalluField: Detecting LLM Hallucinations via Field-Theoretic Modeling
Bridging Cultural Distance Between Models Default and Local Classroom Demands: How Global Teachers Adopt GenAI to Support Everyday Teaching Practices
GoldenTransformer: A Modular Fault Injection Framework for Transformer Robustness Research
Judge Q: Trainable Queries for Optimized Information Retention in KV Cache Eviction
Rethinking Sparse Autoencoders: Select-and-Project for Fairness and Control from Encoder Features Alone
Towards Automated Error Discovery: A Study in Conversational AI
A funny companion: Distinct neural responses to perceived AI- versus humangenerated humor
Pre-Storage Reasoning for Episodic Memory: Shifting Inference Burden to Memory for Personalized Dialogue
Physics-informed neural network solves minimal surfaces in curved spacetime
GTHNA: Local-global Graph Transformer with Memory Reconstruction for Holistic Node Anomaly Evaluation
ToMA: Token Merge with Attention for Image Generation with Diffusion Models
Clarifying Model Transparency: Interpretability versus Explainability in Deep Learning with MNIST and IMDB Examples
When the Code Autopilot Breaks: Why LLMs Falter in Embedded Machine Learning
Testing for LLM response differences: the case of a composite null consisting of semantically irrelevant query perturbations
Robust DDoS-Attack Classification with 3D CNNs Against Adversarial Methods
ASL360: AI-Enabled Adaptive Streaming of Layered 360{\deg} Video over UAV-assisted Wireless Networks
Uncovering the Vulnerability of Large Language Models in the Financial Domain via Risk Concealment
Biomarkers of brain diseases
AVEC: Bootstrapping Privacy for Local LLMs
MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models
Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey
Quality Assessment of Tabular Data using Large Language Models and Code Generation
Gene-R1: Reasoning with Data-Augmented Lightweight LLMs for Gene Set Analysis
Aesthetic Experience and Educational Value in Co-creating Art with Generative AI: Evidence from a Survey of Young Learners
The Coding Limits of Robust Watermarking for Generative Models
LearnLens: An AI-Enhanced Dashboard to Support Teachers in Open-Ended Classrooms
Smart Trial: Evaluating the Use of Large Language Models for Recruiting Clinical Trial Participants via Social Media
Machine Unlearning for Responsible and Adaptive AI in Education
Assisting the Grading of a Handwritten General Chemistry Exam with Artificial Intelligence
SME-TEAM: Leveraging Trust and Ethics for Secure and Responsible Use of AI and LLMs in SMEs
GenAI Voice Mode in Programming Education
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
Test-Time Warmup for Multimodal Large Language Models
Vibe Coding for UX Design: Understanding UX Professionals' Perceptions of AI-Assisted Design and Development
SCOR: A Framework for Responsible AI Innovation in Digital Ecosystems
Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
LLM in the Middle: A Systematic Review of Threats and Mitigations to Real-World LLM-based Systems
SABR: A Stable Adaptive Bitrate Framework Using Behavior Cloning Pretraining and Reinforcement Learning Fine-Tuning
Distributed Gossip-GAN for Low-overhead CSI Feedback Training in FDD mMIMO-OFDM Systems
Online Learning Based Efficient Resource Allocation for LoRaWAN Network
From Noise to Precision: A Diffusion-Driven Approach to Zero-Inflated Precipitation Prediction
FEDEXCHANGE: Bridging the Domain Gap in Federated Object Detection for Free
CAR-BRAINet: Sub-6GHz Aided Spatial Adaptive Beam Prediction with Multi Head Attention for Heterogeneous Vehicular Networks
The Anti-Ouroboros Effect: Emergent Resilience in Large Language Models from Recursive Selective Feedback
FireGNN: Neuro-Symbolic Graph Neural Networks with Trainable Fuzzy Rules for Interpretable Medical Image Classification
LogGuardQ: A Cognitive-Enhanced Reinforcement Learning Framework for Cybersecurity Anomaly Detection in Security Logs
Multimodal Deep Learning for ATCO Command Lifecycle Modeling and Workload Prediction
From Predictions to Explanations: Explainable AI for Autism Diagnosis and Identification of Critical Brain Regions
Data-Efficient Psychiatric Disorder Detection via Self-supervised Learning on Frequency-enhanced Brain Networks
Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning
STM-Graph: A Python Framework for Spatio-Temporal Mapping and Graph Neural Network Predictions
Mitigating Catastrophic Forgetting and Mode Collapse in Text-to-Image Diffusion via Latent Replay
FinXplore: An Adaptive Deep Reinforcement Learning Framework for Balancing and Discovering Investment Opportunities
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
Semantic-guided LoRA Parameters Generation
EchoLeak: The First Real-World Zero-Click Prompt Injection Exploit in a Production LLM System
A Survey of Reasoning and Agentic Systems in Time Series with Large Language Models
AMLNet: A Knowledge-Based Multi-Agent Framework to Generate and Detect Realistic Money Laundering Transactions
Adapting and Evaluating Multimodal Large Language Models for Adolescent Idiopathic Scoliosis Self-Management: A Divide and Conquer Framework
Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning
BuildingGym: An open-source toolbox for AI-based building energy management using reinforcement learning
How to Evaluate Medical AI
Neuro-Symbolic Agents with Modal Logic for Autonomous Diagnostics
Agentic Temporal Graph of Reasoning with Multimodal Language Models: A Potential AI Aid to Healthcare
When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models
Bridging Engineering and AI Planning through Model-Based Knowledge Transformation for the Validation of Automated Production System Variants
JustEva: A Toolkit to Evaluate LLM Fairness in Legal Knowledge Inference
Co-Alignment: Rethinking Alignment as Bidirectional Human-AI Cognitive Adaptation
Advancing Medical Artificial Intelligence Using a Century of Cases
Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks
DSRAG: A Domain-Specific Retrieval Framework Based on Document-derived Multimodal Knowledge Graph
Learning Decomposed Contextual Token Representations from Pretrained and Collaborative Signals for Generative Recommendation
Real-Time RAG for the Identification of Supply Chain Vulnerabilities
AegisShield: Democratizing Cyber Threat Modeling with Generative AI
AI Answer Engine Citation Behavior An Empirical Analysis of the GEO16 Framework
LLM Enhancement with Domain Expert Mental Model to Reduce LLM Hallucination with Causal Prompt Engineering
From Grounding to Skolemization: A Logic-Constrained Vector Symbolic Architecture for Complex Query Answering
Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding
Enhancing Computational Cognitive Architectures with LLMs: A Case Study
Rethinking Human Preference Evaluation of LLM Rationales
Tractable Asymmetric Verification for Large Language Models via Deterministic Replicability
Difficulty-Aware Agent Orchestration in LLM-Powered Workflows
Neural cellular automata: applications to biology and beyond classical AI
AlignKT: Explicitly Modeling Knowledge State for Knowledge Tracing with Ideal State Alignment
AI-Generated Content in Cross-Domain Applications: Research Trends, Challenges and Propositions
Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble
MAPGD: Multi-Agent Prompt Gradient Descent for Collaborative Prompt Optimization
Securing AI Agents: Implementing Role-Based Access Control for Industrial Applications
Cross-Platform Scaling of Vision-Language-Action Models from Edge to Cloud GPUs
MedicalOS: An LLM Agent based Operating System for Digital Healthcare
Formal Reasoning for Intelligent QA Systems: A Case Study in the Educational Domain
ZapGPT: Free-form Language Prompting for Simulated Cellular Control
Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions

Research Sources: 539 | Generated: 9/16/2025