AI RESEARCH PAPERS & ACADEMIC SOURCES
- Preconditioned subgradient method for composite optimization: overparameterization and fast convergence
- High Effort, Low Gain: Fundamental Limits of Active Learning for Linear Dynamical Systems
- Contractive kinetic Langevin samplers beyond global Lipschitz continuity
- A comparison between geostatistical and machine learning models for spatio-temporal prediction of PM2.5 data
- Generalized Dirichlet Energy and Graph Laplacians for Clustering Directed and Undirected Graphs
- Piecewise Deterministic Markov Processes for Bayesian Neural Networks
- Adapting Projection-Based Reduced-Order Models using Projected Gaussian Process
- Deep learning joint extremes of metocean variables using the SPAR model
- Kernel Embeddings and the Separation of Measure Phenomenon
- Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions
- Eigen-convergence of Gaussian kernelized graph Laplacian by manifold heat interpolation
- A Permutation-free Kernel Two-Sample Test
- Early alignment in two-layer networks training is a two-edged sword
- Robustness in the Face of Partial Identifiability in Reward Learning
- Understanding Model Calibration -- A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)
- Weak instrumental variables due to nonlinearities in panel data: A Super Learner Control Function estimator
- All Optical Echo State Network Reservoir Computing
- Scalp Diagnostic System With Label-Free Segmentation and Training-Free Image Translation
- Social Perception of Faces in a Vision-Language Model
- DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction
- HD-OOD3D: Supervised and Unsupervised Out-of-Distribution object detection in LiDAR data
- Kernel-based Stochastic Approximation Framework for Nonlinear Operator Learning
- Maximum diversity, weighting and invariants of time series
- Predictable Compression Failures: Why Language Models Actually Hallucinate
- Contrastive Network Representation Learning
- Next-Generation Reservoir Computing for Dynamical Inference
- Some Robustness Properties of Label Cleaning
- A Particle-Flow Algorithm for Free-Support Wasserstein Barycenters
- Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification
- E-ROBOT: a dimension-free method for robust statistics and machine learning via Schr\"odinger bridge
- SpaPool: Soft Partition Assignment Pooling for__Graph Neural Networks
- Identifiable Autoregressive Variational Autoencoders for Nonlinear and Nonstationary Spatio-Temporal Blind Source Separation
- MMM: Clustering Multivariate Longitudinal Mixed-type Data
- The Morgan-Pitman Test of Equality of Variances and its Application to Machine Learning Model Evaluation and Selection
- What is in a Price? Estimating Willingness-to-Pay with Bayesian Hierarchical Models
- The Honest Truth About Causal Trees: Accuracy Limits for Heterogeneous Treatment Effect Estimation
- Solving ill-conditioned polynomial equations using score-based priors with application to multi-target detection
- Rate-Distortion Limits for Multimodal Retrieval: Theory, Optimal Codes, and Finite-Sample Guarantees
- SH-SAS: An Implicit Neural Representation for Complex Spherical-Harmonic Scattering Fields for 3D Synthetic Aperture Sonar
- UltraUPConvNet: A UPerNet- and ConvNeXt-Based Multi-Task Network for Ultrasound Tissue Segmentation and Disease Prediction
- ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
- Realistic Environmental Injection Attacks on GUI Agents
- Introduction to a Low-Cost AI-Powered GUI for Unstained Cell Culture Analysis
- Geometric Analysis of Magnetic Labyrinthine Stripe Evolution via U-Net Segmentation
- ParaEQsA: Parallel and Asynchronous Embodied Questions Scheduling and Answering
- TrajBooster: Boosting Humanoid Whole-Body Manipulation via Trajectory-Centric Learning
- Data-driven Smile Design: Personalized Dental Aesthetics Outcomes Using Deep Learning
- Video-based Sign Language Recognition without Temporal Segmentation
- SAIF: Sparse Adversarial and Imperceptible Attack Framework
- SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution
- Long-Tailed 3D Detection via Multi-Modal Fusion
- Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration
- SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis
- InstructHumans: Editing Animated 3D Human Textures with Instructions
- HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising
- Multilingual Diversity Improves Vision-Language Representations
- What is the Visual Cognition Gap between Humans and Multimodal LLMs?
- Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
- AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective
- Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation
- End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data
- U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT
- Progressive Flow-inspired Unfolding for Spectral Compressive Imaging
- End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI
- FS-SAM2: Adapting Segment Anything Model 2 for Few-Shot Semantic Segmentation via Low-Rank Adaptation
- RailSafeNet: Visual Scene Understanding for Tram Safety
- 3DViT-GAT: A Unified Atlas-Based 3D Vision Transformer and Graph Learning Framework for Major Depressive Disorder Detection Using Structural MRI Data
- Open-ended Hierarchical Streaming Video Understanding with Vision Language Models
- Multi Anatomy X-Ray Foundation Model
- LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury
- HoloGarment: 360{\deg} Novel View Synthesis of In-the-Wild Garments
- Domain-Adaptive Pretraining Improves Primate Behavior Recognition
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
- LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence
- Character-Centric Understanding of Animated Movies
- MIDOG 2025 Track 2: A Deep Learning Model for Classification of Atypical and Normal Mitotic Figures under Class and Hardness Imbalances
- Automated Cervical Os Segmentation for Camera-Guided, Speculum-Free Screening
- Adapting Medical Vision Foundation Models for Volumetric Medical Image Segmentation via Active Learning and Selective Semi-supervised Fine-tuning
- Nav-R1: Reasoning and Navigation in Embodied Scenes
- AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting
- Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network
- Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation
- Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting
- Bridging Vision Language Models and Symbolic Grounding for Video Question Answering
- Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding
- Multi-animal tracking in Transition: Comparative Insights into Established and Emerging Methods
- Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation
- SAM-TTT: Segment Anything Model via Reverse Parameter Configuration and Test-Time Training for Camouflaged Object Detection
- BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation
- Logit Mixture Outlier Exposure for Fine-grained Out-of-Distribution Detection
- Integrating Prior Observations for Incremental 3D Scene Graph Prediction
- NeuroGaze-Distill: Brain-informed Distillation and Depression-Inspired Geometric Priors for Robust Facial Emotion Recognition
- Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI
- Graph Algorithm Unrolling with Douglas-Rachford Iterations for Image Interpolation with Guaranteed Initialization
- Sphere-GAN: a GAN-based Approach for Saliency Estimation in 360{\deg} Videos
- CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation
- Learning to Generate 4D LiDAR Sequences
- Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness
- RAM++: Robust Representation Learning via Adaptive Mask for All-in-One Image Restoration
- Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
- Layout-Conditioned Autoregressive Text-to-Image Generation via Structured Masking
- A Computer Vision Pipeline for Individual-Level Behavior Analysis: Benchmarking on the Edinburgh Pig Dataset
- DUAL-VAD: Dual Benchmarks and Anomaly-Focused Sampling for Video Anomaly Detection
- A Controllable 3D Deepfake Generation Framework with Gaussian Splatting
- IS-Diff: Improving Diffusion-Based Inpainting with Better Initial Seed
- WeatherBench: A Real-World Benchmark Dataset for All-in-One Adverse Weather Image Restoration
- Joint-octamamba:an octa joint segmentation network based on feature enhanced mamba
- DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition
- RouteExtract: A Modular Pipeline for Extracting Routes from Paper Maps
- IMD: A 6-DoF Pose Estimation Benchmark for Industrial Metallic Objects
- Uncertainty-Aware Retinal Vessel Segmentation via Ensemble Distillation
- The Quest for Universal Master Key Filters in DS-CNNs
- Advanced Layout Analysis Models for Docling
- Microsurgical Instrument Segmentation for Robot-Assisted Surgery
- Bridging the Gap Between Sparsity and Redundancy: A Dual-Decoding Framework with Global Context for Map Inference
- A Fully Open and Generalizable Foundation Model for Ultrasound Clinical Applications
- MSMA: Multi-Scale Feature Fusion For Multi-Attribute 3D Face Reconstruction From Unconstrained Images
- Seg2Track-SAM2: SAM2-based Multi-object Tracking and Segmentation for Zero-shot Generalization
- SA-UNetv2: Rethinking Spatial Attention U-Net for Retinal Vessel Segmentation
- FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts Reasoning
- Pseudo-D: Informing Multi-View Uncertainty Estimation with Calibrated Neural Training Dynamics
- LFRA-Net: A Lightweight Focal and Region-Aware Attention Network for Retinal Vessel Segmentatio
- SpecVLM: Fast Speculative Decoding in Vision-Language Models
- MAFS: Masked Autoencoder for Infrared-Visible Image Fusion and Semantic Segmentation
- ROSGS: Relightable Outdoor Scenes With Gaussian Splatting
- Leveraging Geometric Priors for Unaligned Scene Change Detection
- UnLoc: Leveraging Depth Uncertainties for Floorplan Localization
- Toward Next-generation Medical Vision Backbones: Modeling Finer-grained Long-range Visual Dependency
- Dual Band Video Thermography Near Ambient Conditions
- Beyond Instance Consistency: Investigating View Diversity in Self-supervised Learning
- GLaVE-Cap: Global-Local Aligned Video Captioning with Vision Expert Integration
- In-Vivo Skin 3-D Surface Reconstruction and Wrinkle Depth Estimation using Handheld High Resolution Tactile Sensing
- MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation
- No Modality Left Behind: Dynamic Model Generation for Incomplete Medical Data
- On the Skinning of Gaussian Avatars
- Disentanglement of Biological and Technical Factors via Latent Space Rotation in Clinical Imaging Improves Disease Pattern Discovery
- MultiMAE for Brain MRIs: Robustness to Missing Inputs Using Multi-Modal Masked Autoencoder
- Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision
- Multiple Instance Learning Framework with Masked Hard Instance Mining for Gigapixel Histopathology Image Analysis
- SFGNet: Semantic and Frequency Guided Network for Camouflaged Object Detection
- How Auxiliary Reasoning Unleashes GUI Grounding in VLMs
- Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps
- Hierarchical Identity Learning for Unsupervised Visible-Infrared Person Re-Identification
- Optimizing Class Distributions for Bias-Aware Multi-Class Learning
- MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment
- Disentangling Content from Style to Overcome Shortcut Learning: A Hybrid Generative-Discriminative Learning Framework
- Action Hints: Semantic Typicality and Context Uniqueness for Generalizable Skeleton-based Video Anomaly Detection
- Organoid Tracker: A SAM2-Powered Platform for Zero-shot Cyst Analysis in Human Kidney Organoid Videos
- Mars Traversability Prediction: A Multi-modal Self-supervised Approach for Costmap Generation
- End-to-End Visual Autonomous Parking via Control-Aided Attention
- SMILE: A Super-resolution Guided Multi-task Learning Method for Hyperspectral Unmixing
- A Copula-Guided Temporal Dependency Method for Multitemporal Hyperspectral Images Unmixing
- 3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment
- Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation
- WildSmoke: Ready-to-Use Dynamic 3D Smoke Assets from a Single Video in the Wild
- SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting
- No Mesh, No Problem: Estimating Coral Volume and Surface from Sparse Multi-View Images
- Traffic-MLLM: A Spatio-Temporal MLLM with Retrieval-Augmented Generation for Causal Inference in Traffic
- Multispectral-NeRF:a multispectral modeling approach based on neural radiance fields
- SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion
- The Impact of Skin Tone Label Granularity on the Performance and Fairness of AI Based Dermatology Image Classification Models
- Scaling Up Forest Vision with Synthetic Data
- Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation
- CCoMAML: Efficient Cattle Identification Using Cooperative Model-Agnostic Meta-Learning
- ANROT-HELANet: Adverserially and Naturally Robust Attention-Based Aggregation Network via The Hellinger Distance for Few-Shot Classification
- Contextualized Multimodal Lifelong Person Re-Identification in Hybrid Clothing States
- Cross-Domain Attribute Alignment with CLIP: A Rehearsal-Free Approach for Class-Incremental Unsupervised Domain Adaptation
- Synthetic Dataset Evaluation Based on Generalized Cross Validation
- Enhancement Without Contrast: Stability-Aware Multicenter Machine Learning for Glioma MRI Imaging
- Group Evidence Matters: Tiling-based Semantic Gating for Dense Object Detection
- InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
- Well-Conditioned Polynomial Representations for Mathematical Handwriting Recognition
- Multi-Task Diffusion Approach For Prediction of Glioma Tumor Progression
- Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios
- OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds
- AutoOEP -- A Multi-modal Framework for Online Exam Proctoring
- Total Variation Subgradient Guided Image Fusion for Dual-Camera CASSI System
- Simulating Sinogram-Domain Motion and Correcting Image-Domain Artifacts Using Deep Learning in HR-pQCT Bone Imaging
- Gaze Authentication: Factors Influencing Authentication Performance
- TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation
- Policy-Driven Transfer Learning in Resource-Limited Animal Monitoring
- Improving Fungi Prototype Representations for Few-Shot Classification
- Cluster-Level Sparse Multi-Instance Learning for Whole-Slide Images
- SurgLaVi: Large-Scale Hierarchical Dataset for Surgical Vision-Language Representation Learning
- USCTNet: A deep unfolding nuclear-norm optimization solver for physically consistent HSI reconstruction
- Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation
- SegSLR: Promptable Video Segmentation for Isolated Sign Language Recognition
- SCOPE: Speech-guided COllaborative PErception Framework for Surgical Scene Segmentation
- Every Camera Effect, Every Time, All at Once: 4D Gaussian Ray Tracing for Physics-based Camera Effect Data Generation
- EditDuet: A Multi-Agent System for Video Non-Linear Editing
- LastingBench: Defend Benchmarks Against Knowledge Leakage
- PDFMathTranslate: Scientific Document Translation Preserving Layouts
- Persona-Based Synthetic Data Generation Using Multi-Stage Conditioning with Large Language Models for Emotion Recognition
- Is In-Context Learning Learning?
- Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
- A Survey on Large Language Model-based Agents for Statistics and Data Science
- Evaluating and Aligning Human Economic Risk Preferences in LLMs
- One Goal, Many Challenges: Robust Preference Optimization Amid Content-Aware and Multi-Source Noise
- Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks
- Lean Formalization of Generalization Error Bound by Rademacher Complexity
- Rethinking LLM-Based Recommendations: A Personalized Query-Driven Parallel Integration
- SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation
- A Real-Time Diminished Reality Approach to Privacy in MR Collaboration
- Hallucinated Span Detection with Multi-View Attention Features
- Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
- LogicTree: Structured Proof Exploration for Coherent and Rigorous Logical Reasoning with Large Language Models
- EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models
- Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages
- Improving Informally Romanized Language Identification
- Base Models Beat Aligned Models at Randomness and Creativity
- Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models
- Multilingual Collaborative Defense for Large Language Models
- ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
- HiMATE: A Hierarchical Multi-Agent Framework for Machine Translation Evaluation
- ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments
- Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation
- MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation
- Hopscotch: Discovering and Skipping Redundancies in Language Models
- Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
- GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
- Mirage of Mastery: Memorization Tricks LLMs into Artificially Inflated Self-Knowledge
- Time is On My Side: Dynamics of Talk-Time Sharing in Video-chat Conversations
- A Cross-Cultural Comparison of LLM-based Public Opinion Simulation: Evaluating Chinese and U.S. Models on Diverse Societies
- LML: A Novel Lexicon for the Moral Foundation of Liberty
- Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
- Can Advanced LLMs Coach Smaller LLMs? Knowledge Distillation for Goal-Oriented Dialogs
- GP-GPT: Large Language Model for Gene-Phenotype Mapping
- Revealing the Inherent Instructability of Pre-Trained Language Models
- Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
- Artificial intelligence contribution to translation industry: looking back and forward
- FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering
- Speak-to-Structure: Evaluating LLMs in Open-domain Natural Language-Driven Molecule Generation
- IOLBENCH: Benchmarking LLMs on Linguistic Reasoning
- Transformer-Based Multimodal Knowledge Graph Completion with Link-Aware Contexts
- From Personas to Talks: Revisiting the Impact of Personas on LLM-Synthesized Emotional Support Conversations
- DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs
- Efficient Environmental Claim Detection with Hyperbolic Graph Neural Networks
- Rumor Detection by Multi-task Suffix Learning based on Time-series Dual Sentiments
- Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology
- LLM as a Broken Telephone: Iterative Generation Distorts Information
- LinguaLens: Towards Interpreting Linguistic Mechanisms of Large Language Models via Sparse Auto-Encoder
- Monitoring Decoding: Mitigating Hallucination via Evaluating the Factuality of Partial Response during Generation
- Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter
- Is 'Hope' a person or an idea? A pilot benchmark for NER: comparing traditional NLP tools and large language models on ambiguous entities
- In-domain SSL pre-training and streaming ASR
- GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models
- CBP-Tuning: Efficient Local Customization for Black-box Large Language Models
- XplaiNLP at CheckThat! 2025: Multilingual Subjectivity Detection with Finetuned Transformers and Prompt-Based Inference with Large Language Models
- Pun Unintended: LLMs and the Illusion of Humor Understanding
- RAGs to Riches: RAG-like Few-shot Learning for Large Language Model Role-playing
- Preservation of Language Understanding Capabilities in Speech-aware Large Language Models
- ReFineG: Synergizing Small Supervised Models and LLMs for Low-Resource Grounded Multimodal NER
- Mitigating Hallucinations in Large Vision-Language Models by Self-Injecting Hallucinations
- MALLM: Multi-Agent Large Language Models Framework
- MindVL: Towards Efficient and Effective Training of Multimodal Large Language Models on Ascend NPUs
- Collaborative Document Editing with Multiple Users and AI Agents
- The AI Memory Gap: Users Misremember What They Created With AI or Without
- Lost in Embeddings: Information Loss in Vision-Language Models
- FinGEAR: Financial Mapping-Guided Enhanced Answer Retrieval
- RadarLLM: Adapting Pretrained Large Language Models for Marine Radar Target Detection with Preference-aware Loss
- When marine radar target detection meets pretrained large language models
- Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models
- Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm
- Understanding Emergent In-Context Learning from a Kernel Regression Perspective
- Tackling Fake News in Bengali: Unraveling the Impact of Summarization vs. Augmentation on Pre-trained Language Models
- Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
- HalluDetect: Detecting, Mitigating, and Benchmarking Hallucinations in Conversational Systems
- AesBiasBench: Evaluating Bias and Alignment in Multimodal Language Models for Personalized Image Aesthetic Assessment
- EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI
- A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News Detection
- CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model
- Room acoustics affect communicative success in hybrid meeting spaces: a pilot study
- An Agentic Toolkit for Adaptive Information Extraction from Regulatory Documents
- User eXperience Perception Insights Dataset (UXPID): Synthetic User Feedback from Public Industrial Forums
- When Curiosity Signals Danger: Predicting Health Crises Through Online Medication Inquiries
- From Fuzzy Speech to Medical Insight: Benchmarking LLMs on Noisy Patient Narratives
- PledgeTracker: A System for Monitoring the Fulfilment of Pledges
- SCDTour: Embedding Axis Ordering and Merging for Interpretable Semantic Change Detection
- MOOM: Maintenance, Organization and Optimization of Memory in Ultra-Long Role-Playing Dialogues
- Growing Perspectives: Modelling Embodied Perspective Taking and Inner Narrative Development Using Large Language Models
- Uncertainty in Authorship: Why Perfect AI Detection Is Mathematically Impossible
- Designing LLMs for cultural sensitivity: Evidence from English-Japanese translation
- Spec-LLaVA: Accelerating Vision-Language Models with Dynamic Tree-Based Speculative Decoding
- ToolRM: Outcome Reward Models for Tool-Calling Large Language Models
- Query-Focused Extractive Summarization for Sentiment Explanation
- Text Adaptation to Plain Language and Easy Read via Automatic Post-Editing Cycles
- Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect
- SENSE models: an open source solution for multilingual and multimodal semantic-based tasks
- Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs
- RanAT4BIE: Random Adversarial Training for Biomedical Information Extraction
- The Prompt Engineering Report Distilled: Quick Start Guide for Life Sciences
- Ko-PIQA: A Korean Physical Commonsense Reasoning Dataset with Cultural Context
- !MSA at AraHealthQA 2025 Shared Task: Enhancing LLM Performance for Arabic Clinical Question Answering through Prompt Engineering and Ensemble Learning
- Continually Adding New Languages to Multilingual Language Models
- A Transformer-Based Cross-Platform Analysis of Public Discourse on the 15-Minute City Paradigm
- CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media
- CEMTM: Contextual Embedding-based Multimodal Topic Modeling
- Improving LLMs' Learning for Coreference Resolution
- AKCIT-FN at CheckThat! 2025: Switching Fine-Tuned SLMs and LLM Prompting for Multilingual Claim Normalization
- DeDisCo at the DISRPT 2025 Shared Task: A System for Discourse Relation Classification
- Unsupervised Candidate Ranking for Lexical Substitution via Holistic Sentence Semantics
- LVLMs are Bad at Overhearing Human Referential Communication
- PeruMedQA: Benchmarking Large Language Models (LLMs) on Peruvian Medical Exams -- Dataset Construction and Evaluation
- On the Distinctive Co-occurrence Characteristics of Antonymy
- HARP: Hallucination Detection via Reasoning Subspace Projection
- HiChunk: Evaluating and Enhancing Retrieval-Augmented Generation with Hierarchical Chunking
- D$^2$HScore: Reasoning-Aware Hallucination Detection via Semantic Breadth and Depth Analysis in LLMs
- Bhaasha, Bhasa, Zaban: A Survey for Low-Resourced Languages in South Asia -- Current Stage and Challenges
- Analyzing Information-Seeking Behaviors in a Hakka AI Chatbot: A Cognitive-Pragmatic Study
- Dynamic Span Interaction and Graph-Aware Memory for Entity-Level Sentiment Classification
- Interdisciplinary Research in Conversation: A Case Study in Computational Morphology for Language Documentation
- Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts
- A Survey on Retrieval And Structuring Augmented Generation with Large Language Models
- SearchInstruct: Enhancing Domain Adaptation via Retrieval-Based Instruction Dataset Creation
- Reasoning Under Uncertainty: Exploring Probabilistic Reasoning Capabilities of LLMs
- RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems
- Evaluating Large Language Models for Evidence-Based Clinical Question Answering
- GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings
- Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production
- Quantifier Scope Interpretation in Language Learners and LLMs
- Term2Note: Synthesising Differentially Private Clinical Notes from Medical Terms
- Aligning ESG Controversy Data with International Guidelines through Semi-Automatic Ontology Construction
- Introducing Spotlight: A Novel Approach for Generating Captivating Key Information from Documents
- An Interpretable Benchmark for Clickbait Detection and Tactic Attribution
- EmoBench-Reddit: A Hierarchical Benchmark for Evaluating the Emotional Intelligence of Multimodal Large Language Models
- Joint Effects of Argumentation Theory, Audio Modality and Data Enrichment on LLM-Based Fallacy Classification
- When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs' Toxicity
- Text2Mem: A Unified Memory Operation Language for Memory Operating System
- MinatoLoader: Accelerating Machine Learning Training Through Efficient Data Preprocessing
- Coordinated Reinforcement Learning Prefetching Architecture for Multicore Systems
- PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models
- Parameter estimation with uncertainty quantification from continuous measurement data using neural network ensembles
- Why Bonds Fail Differently? Explainable Multimodal Learning for Multi-Class Default Prediction
- Do machine learning climate models work in changing climate dynamics?
- Learning Neural Networks by Neuron Pursuit
- From Autoencoders to CycleGAN: Robust Unpaired Face Manipulation via Adversarial Learning
- Dynamic Relational Priming Improves Transformer in Multivariate Time Series
- Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
- Spectral Bottleneck in Deep Neural Networks: Noise is All You Need
- The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access Network
- DeepSeasons: a Deep Learning scale-selecting approach to Seasonal Forecasts
- Crystal Systems Classification of Phosphate-Based Cathode Materials Using Machine Learning for Lithium-Ion Battery
- Adaptive Temporal Fusion Transformers for Cryptocurrency Price Prediction
- Trial-Level Time-frequency EEG Desynchronization as a Neural Marker of Pain
- Assessing the Limits of Graph Neural Networks for Vapor-Liquid Equilibrium Prediction: A Cryogenic Mixture Case Study
- Building a General SimCLR Self-Supervised Foundation Model Across Neurological Diseases to Advance 3D Brain MRI Diagnoses
- On a Geometry of Interbrain Networks
- Multimodal Regression for Enzyme Turnover Rates Prediction
- Visualization and Analysis of the Loss Landscape in Graph Neural Networks
- Collapse of Irrelevant Representations (CIR) Ensures Robust and Non-Disruptive LLM Unlearning
- FedDAF: Federated Domain Adaptation Using Model Functional Distance
- Transparent and Fair Profiling in Employment Services: Evidence from Switzerland
- MillStone: How Open-Minded Are LLMs?
- Examining the Relationship between Scientific Publishing Activity and Hype-Driven Financial Bubbles: A Comparison of the Dot-Com and AI Eras
- Low-rank Orthogonalization for Large-scale Matrix Optimization with Applications to Foundation Model Training
- Learning from Uncertain Similarity and Unlabeled Data
- Generalizing Behavior via Inverse Reinforcement Learning with Closed-Form Reward Centroids
- AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models
- Travel Time and Weather-Aware Traffic Forecasting in a Conformal Graph Neural Network Framework
- Early Detection of Branched Broomrape (Phelipanche ramosa) Infestation in Tomato Crops Using Leaf Spectral Analysis and Machine Learning
- A Time-Series Foundation Model by Universal Delay Embedding
- Draw a Portrait of Your Graph Data: An Instance-Level Profiling Framework for Graph-Structured Data
- $K$-Level Policy Gradients for Multi-Agent Reinforcement Learning
- Online Omniprediction with Long-Term Constraints
- PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits
- Decoding Musical Origins: Distinguishing Human and AI Composers
- Enhancing ML Models Interpretability for Credit Scoring
- Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting
- Drug Repurposing Using Deep Embedded Clustering and Graph Neural Networks
- OASIS: A Deep Learning Framework for Universal Spectroscopic Analysis Driven by Novel Loss Functions
- DARD: Dice Adversarial Robustness Distillation against Adversarial Attacks
- UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
- Inducing Uncertainty for Test-Time Privacy
- SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching
- Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check
- Measuring Visual Understanding in Telecom domain: Performance Metrics for Image-to-UML conversion using VLMs
- DRAG: Data Reconstruction Attack using Guided Diffusion
- Fast and Interpretable Machine Learning Modelling of Atmospheric Molecular Clusters
- Data Fusion and Machine Learning for Ship Fuel Consumption Modelling -- A Case of Bulk Carrier Vessel
- Stabilizing PINNs: A regularization scheme for PINN training to avoid unstable fixed points of dynamical systems
- Verifying Computational Graphs in Production-Grade Distributed Machine Learning Frameworks
- CrunchLLM: Multitask LLMs for Structured Business Reasoning and Outcome Prediction
- Using LLMs for Late Multimodal Sensor Fusion for Activity Recognition
- Matched-Pair Experimental Design with Active Learning
- Neurosymbolic AI Transfer Learning Improves Network Intrusion Detection
- CogGNN: Cognitive Graph Neural Networks in Generative Connectomics
- Robustifying Diffusion-Denoised Smoothing Against Covariate Shift
- California Wildfire Inventory (CAWFI): An Extensive Dataset for Predictive Techniques based on Artificial Intelligence
- Data-Efficient Ensemble Weather Forecasting with Diffusion Models
- Machine Learning Framework for Audio-Based Equipment Condition Monitoring: A Comparative Study of Classification Algorithms
- GCN-TULHOR: Trajectory-User Linking Leveraging GCNs and Higher-Order Spatial Representations
- BIGNet: Pretrained Graph Neural Network for Embedding Semantic, Spatial, and Topological Data in BIM Models
- PINGS: Physics-Informed Neural Network for Fast Generative Sampling
- MatQnA: A Benchmark Dataset for Multi-modal Large Language Models in Materials Characterization and Analysis
- On the Escaping Efficiency of Distributed Adversarial Training Algorithms
- ClaimIQ at CheckThat! 2025: Comparing Prompted and Fine-Tuned Language Models for Verifying Numerical Claims
- Machine Learning-Driven Predictive Resource Management in Complex Science Workflows
- Moment Estimates and DeepRitz Methods on Learning Diffusion Systems with Non-gradient Drifts
- AttnBoost: Retail Supply Chain Sales Insights via Gradient Boosting Perspective
- A Differential Manifold Perspective and Universality Analysis of Continuous Attractors in Artificial Neural Networks
- Adaptive Preference Optimization with Uncertainty-aware Utility Anchor
- Holographic Knowledge Manifolds: A Novel Pipeline for Continual Learning Without Catastrophic Forgetting in Large Language Models
- Gradient Estimation Methods of Approximate Multipliers for High-Accuracy Retraining of Deep Learning Models
- GTS_Forecaster: a novel deep learning based geodetic time series forecasting toolbox with python
- pySigLib -- Fast Signature-Based Computations on CPU and GPU
- Interpretable neural network system identification method for two families of second-order systems based on characteristic curves
- Accurate and Private Diagnosis of Rare Genetic Syndromes from Facial Images with Federated Deep Learning
- Quantum Architecture Search for Solving Quantum Machine Learning Tasks
- Evalet: Evaluating Large Language Models by Fragmenting Outputs into Functions
- Geometrically Constrained and Token-Based Probabilistic Spatial Transformers
- TransZero: Parallel Tree Expansion in MuZero using Transformer Networks
- Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation
- Gradient Free Deep Reinforcement Learning With TabPFN
- Embodied Intelligence in Disassembly: Multimodal Perception Cross-validation and Continual Learning in Neuro-Symbolic TAMP
- Efficient Single-Step Framework for Incremental Class Learning in Neural Networks
- A five-layer framework for AI governance: integrating regulation, standards, and certification
- Transformer Enhanced Relation Classification: A Comparative Analysis of Contextuality, Data Efficiency and Sequence Complexity
- Intelligent Reservoir Decision Support: An Integrated Framework Combining Large Language Models, Advanced Prompt Engineering, and Multimodal Data Fusion for Real-Time Petroleum Operations
- From Firewalls to Frontiers: AI Red-Teaming is a Domain-Specific Evolution of Cyber Red-Teaming
- Framing AI System Benchmarking as a Learning Task: FlexBench and the Open MLPerf Dataset
- Enhancing Generalization in Vision-Language-Action Models by Preserving Pretrained Representations
- Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning
- Tabular Data with Class Imbalance: Predicting Electric Vehicle Crash Severity with Pretrained Transformers (TabPFN) and Mamba-Based Models
- Beyond Frame-wise Tracking: A Trajectory-based Paradigm for Efficient Point Cloud Tracking
- CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration
- PanoLora: Bridging Perspective and Panoramic Video Generation with LoRA Adaptation
- Multi-Modal Sensing Aided mmWave Beamforming for V2V Communications with Transformers
- Application of Machine Learning for Correcting Defect-induced Neuromorphic Circuit Inference Errors
- ENJ: Optimizing Noise with Genetic Algorithms to Jailbreak LSMs
- Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
- AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
- An Entropy-Guided Curriculum Learning Strategy for Data-Efficient Acoustic Scene Classification under Domain Shift
- Your Compiler is Backdooring Your Model: Understanding and Exploiting Compilation Inconsistency Vulnerabilities in Deep Learning Compilers
- Differentially-private text generation degrades output language quality
- The Psychogenic Machine: Simulating AI Psychosis, Delusion Reinforcement and Harm Enablement in Large Language Models
- PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint
- Decoupling Search and Learning in Neural Net Training
- FragmentGPT: A Unified GPT Model for Fragment Growing, Linking, and Merging in Molecular Design
- An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data
- Length-Aware Rotary Position Embedding for Text-Speech Alignment
- A Comparison and Evaluation of Fine-tuned Convolutional Neural Networks to Large Language Models for Image Classification and Segmentation of Brain Tumors on MRI
- Pluralistic Alignment for Healthcare: A Role-Driven Framework
- Privacy-Preserving Decentralized Federated Learning via Explainable Adaptive Differential Privacy
- Kalman Bayesian Transformer
- Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight
- Automated MCQA Benchmarking at Scale: Evaluating Reasoning Traces as Retrieval Sources for Domain Adaptation of Small Language Models
- HalluField: Detecting LLM Hallucinations via Field-Theoretic Modeling
- Bridging Cultural Distance Between Models Default and Local Classroom Demands: How Global Teachers Adopt GenAI to Support Everyday Teaching Practices
- GoldenTransformer: A Modular Fault Injection Framework for Transformer Robustness Research
- Judge Q: Trainable Queries for Optimized Information Retention in KV Cache Eviction
- Rethinking Sparse Autoencoders: Select-and-Project for Fairness and Control from Encoder Features Alone
- Towards Automated Error Discovery: A Study in Conversational AI
- A funny companion: Distinct neural responses to perceived AI- versus humangenerated humor
- Pre-Storage Reasoning for Episodic Memory: Shifting Inference Burden to Memory for Personalized Dialogue
- Physics-informed neural network solves minimal surfaces in curved spacetime
- GTHNA: Local-global Graph Transformer with Memory Reconstruction for Holistic Node Anomaly Evaluation
- ToMA: Token Merge with Attention for Image Generation with Diffusion Models
- Clarifying Model Transparency: Interpretability versus Explainability in Deep Learning with MNIST and IMDB Examples
- When the Code Autopilot Breaks: Why LLMs Falter in Embedded Machine Learning
- Testing for LLM response differences: the case of a composite null consisting of semantically irrelevant query perturbations
- Robust DDoS-Attack Classification with 3D CNNs Against Adversarial Methods
- ASL360: AI-Enabled Adaptive Streaming of Layered 360{\deg} Video over UAV-assisted Wireless Networks
- Uncovering the Vulnerability of Large Language Models in the Financial Domain via Risk Concealment
- Biomarkers of brain diseases
- AVEC: Bootstrapping Privacy for Local LLMs
- MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models
- Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey
- Quality Assessment of Tabular Data using Large Language Models and Code Generation
- Gene-R1: Reasoning with Data-Augmented Lightweight LLMs for Gene Set Analysis
- Aesthetic Experience and Educational Value in Co-creating Art with Generative AI: Evidence from a Survey of Young Learners
- The Coding Limits of Robust Watermarking for Generative Models
- LearnLens: An AI-Enhanced Dashboard to Support Teachers in Open-Ended Classrooms
- Smart Trial: Evaluating the Use of Large Language Models for Recruiting Clinical Trial Participants via Social Media
- Machine Unlearning for Responsible and Adaptive AI in Education
- Assisting the Grading of a Handwritten General Chemistry Exam with Artificial Intelligence
- SME-TEAM: Leveraging Trust and Ethics for Secure and Responsible Use of AI and LLMs in SMEs
- GenAI Voice Mode in Programming Education
- No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
- Test-Time Warmup for Multimodal Large Language Models
- Vibe Coding for UX Design: Understanding UX Professionals' Perceptions of AI-Assisted Design and Development
- SCOR: A Framework for Responsible AI Innovation in Digital Ecosystems
- Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration
- LLM in the Middle: A Systematic Review of Threats and Mitigations to Real-World LLM-based Systems
- SABR: A Stable Adaptive Bitrate Framework Using Behavior Cloning Pretraining and Reinforcement Learning Fine-Tuning
- Distributed Gossip-GAN for Low-overhead CSI Feedback Training in FDD mMIMO-OFDM Systems
- Online Learning Based Efficient Resource Allocation for LoRaWAN Network
- From Noise to Precision: A Diffusion-Driven Approach to Zero-Inflated Precipitation Prediction
- FEDEXCHANGE: Bridging the Domain Gap in Federated Object Detection for Free
- CAR-BRAINet: Sub-6GHz Aided Spatial Adaptive Beam Prediction with Multi Head Attention for Heterogeneous Vehicular Networks
- The Anti-Ouroboros Effect: Emergent Resilience in Large Language Models from Recursive Selective Feedback
- FireGNN: Neuro-Symbolic Graph Neural Networks with Trainable Fuzzy Rules for Interpretable Medical Image Classification
- LogGuardQ: A Cognitive-Enhanced Reinforcement Learning Framework for Cybersecurity Anomaly Detection in Security Logs
- Multimodal Deep Learning for ATCO Command Lifecycle Modeling and Workload Prediction
- From Predictions to Explanations: Explainable AI for Autism Diagnosis and Identification of Critical Brain Regions
- Data-Efficient Psychiatric Disorder Detection via Self-supervised Learning on Frequency-enhanced Brain Networks
- Resource-Aware Neural Network Pruning Using Graph-based Reinforcement Learning
- STM-Graph: A Python Framework for Spatio-Temporal Mapping and Graph Neural Network Predictions
- Mitigating Catastrophic Forgetting and Mode Collapse in Text-to-Image Diffusion via Latent Replay
- FinXplore: An Adaptive Deep Reinforcement Learning Framework for Balancing and Discovering Investment Opportunities
- Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings
- Semantic-guided LoRA Parameters Generation
- EchoLeak: The First Real-World Zero-Click Prompt Injection Exploit in a Production LLM System
- A Survey of Reasoning and Agentic Systems in Time Series with Large Language Models
- AMLNet: A Knowledge-Based Multi-Agent Framework to Generate and Detect Realistic Money Laundering Transactions
- Adapting and Evaluating Multimodal Large Language Models for Adolescent Idiopathic Scoliosis Self-Management: A Divide and Conquer Framework
- Learning Representations in Video Game Agents with Supervised Contrastive Imitation Learning
- BuildingGym: An open-source toolbox for AI-based building energy management using reinforcement learning
- How to Evaluate Medical AI
- Neuro-Symbolic Agents with Modal Logic for Autonomous Diagnostics
- Agentic Temporal Graph of Reasoning with Multimodal Language Models: A Potential AI Aid to Healthcare
- When Safe Unimodal Inputs Collide: Optimizing Reasoning Chains for Cross-Modal Safety in Multimodal Large Language Models
- Bridging Engineering and AI Planning through Model-Based Knowledge Transformation for the Validation of Automated Production System Variants
- JustEva: A Toolkit to Evaluate LLM Fairness in Legal Knowledge Inference
- Co-Alignment: Rethinking Alignment as Bidirectional Human-AI Cognitive Adaptation
- Advancing Medical Artificial Intelligence Using a Century of Cases
- Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks
- DSRAG: A Domain-Specific Retrieval Framework Based on Document-derived Multimodal Knowledge Graph
- Learning Decomposed Contextual Token Representations from Pretrained and Collaborative Signals for Generative Recommendation
- Real-Time RAG for the Identification of Supply Chain Vulnerabilities
- AegisShield: Democratizing Cyber Threat Modeling with Generative AI
- AI Answer Engine Citation Behavior An Empirical Analysis of the GEO16 Framework
- LLM Enhancement with Domain Expert Mental Model to Reduce LLM Hallucination with Causal Prompt Engineering
- From Grounding to Skolemization: A Logic-Constrained Vector Symbolic Architecture for Complex Query Answering
- Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding
- Enhancing Computational Cognitive Architectures with LLMs: A Case Study
- Rethinking Human Preference Evaluation of LLM Rationales
- Tractable Asymmetric Verification for Large Language Models via Deterministic Replicability
- Difficulty-Aware Agent Orchestration in LLM-Powered Workflows
- Neural cellular automata: applications to biology and beyond classical AI
- AlignKT: Explicitly Modeling Knowledge State for Knowledge Tracing with Ideal State Alignment
- AI-Generated Content in Cross-Domain Applications: Research Trends, Challenges and Propositions
- Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble
- MAPGD: Multi-Agent Prompt Gradient Descent for Collaborative Prompt Optimization
- Securing AI Agents: Implementing Role-Based Access Control for Industrial Applications
- Cross-Platform Scaling of Vision-Language-Action Models from Edge to Cloud GPUs
- MedicalOS: An LLM Agent based Operating System for Digital Healthcare
- Formal Reasoning for Intelligent QA Systems: A Case Study in the Educational Domain
- ZapGPT: Free-form Language Prompting for Simulated Cellular Control
- Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions
Research Sources: 539 | Generated: 9/16/2025