AI RESEARCH PAPERS & ACADEMIC SOURCES
- MASRAD: Arabic Terminology Management Corpora with Semi-Automatic Construction
- Self-Routing RAG: Binding Selective Retrieval with Knowledge Verbalization
- SAFER: Advancing Safety Alignment via Efficient Ex-Ante Reasoning
- Measuring LLM Novelty As The Frontier Of Original And High-Quality Output
- What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts
- FAID: Fine-Grained AI-Generated Text Detection Using Multi-Task Auxiliary and Multi-Level Contrastive Learning
- Teaching Small Language Models to Learn Logic through Meta-Learning
- Unifying Inference-Time Planning Language Generation
- Tracing Multilingual Factual Knowledge Acquisition in Pretraining
- AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web
- ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists
- When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation
- Language Models Surface the Unwritten Code of Science and Society
- Collaborative and Proactive Management of Task-Oriented Conversations
- Submodular Context Partitioning and Compression for In-Context Learning-short paper
- Characterizing Model Behavior Under Synthetic Data Training: An Empirical Study Across Scales and Mixing Ratios
- Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics
- LiRA: A Multi-Agent Framework for Reliable and Readable Literature Review Generation
- NLD-LLM: A systematic framework for evaluating small language transformer models on natural language description
- To model human linguistic prediction, make LLMs less superhuman
- Reliable End-to-End Material Information Extraction from the Literature with Source-Tracked Multi-Stage Large Language Models
- Can AI Truly Represent Your Voice in Deliberations? A Comprehensive Study of Large-Scale Opinion Aggregation with LLMs
- Camellia: Benchmarking Cultural Biases in LLMs for Asian Languages
- WeatherArchive-Bench: Benchmarking Retrieval-Augmented Reasoning for Historical Weather Archives
- Residualized Similarity for Faithfully Explainable Authorship Verification
- The End of Transformers? On Challenging Attention and the Rise of Sub-Quadratic Architectures
- Cross-Lingual Mental Health Ontologies for Indian Languages: Bridging Patient Expression and Clinical Understanding through Explainable AI and Human-in-the-Loop Validation
- A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis
- Self-Filtered Distillation with LLMs-generated Trust Indicators for Reliable Patent Classification
- SimulatorArena: Are User Simulators Reliable Proxies for Multi-Turn Evaluation of AI Assistants?
- AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering
- SocialNLI: A Dialogue-Centric Social Inference Dataset
- Language Model as Planner and Formalizer under Constraints
- Prototype-Based Dynamic Steering for Large Language Models
- On the Role of Difficult Prompts in Self-Play Preference Optimization
- Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations
- Mission Impossible: Feedback-Guided Dynamic Interactive Planning for Improving Reasoning on LLMs
- A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks
- DecEx-RAG: Boosting Agentic Retrieval-Augmented Generation with Decision and Execution Optimization via Process Supervision
- Adaptive and Multi-Source Entity Matching for Name Standardization of Astronomical Observation Facilities
- Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes
- Mixture of Neuron Experts
- EEPO: Exploration-Enhanced Policy Optimization via Sample-Then-Forget
- Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer
- Automated Boilerplate: Prevalence and Quality of Contract Generators in the Context of Swiss Privacy Policies
- Evaluating the Sensitivity of LLMs to Harmful Contents in Long Input
- The fragility of "cultural tendencies" in LLMs
- Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens
- Exploring Gaps in the APS: Direct Minimal Pair Analysis in LLM Syntactic Assessments
- MASA: Rethinking the Representational Bottleneck in LoRA with Multi-A Shared Adaptation
- Evaluating The Impact of Stimulus Quality in Investigations of LLM Language Performance
- ASPO: Asymmetric Importance Sampling Policy Optimization
- The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models
- Parallel Tokenizers: Rethinking Vocabulary Design for Cross-Lingual Transfer
- RoSE: Round-robin Synthetic Data Evaluation for Selecting LLM Generators without Human Test Sets
- VecInfer: Efficient LLM Inference with Low-Bit KV Cache via Outlier-Suppressed Vector Quantization
- Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context
- WaveSP-Net: Learnable Wavelet-Domain Sparse Prompt Tuning for Speech Deepfake Detection
- Quantum Concept Music Score from Quantum Picturalism: Musical Incarnation of a Bell-Pair under Measurements
- Sci-Phi: A Large Language Model Spatial Audio Descriptor
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models
- HEALTH-PARIKSHA: Assessing RAG Models for Health Chatbots in Real-World Multilingual Settings
- Explaining GPTs' Schema of Depression: A Machine Behavior Analysis
- Evaluating and Mitigating Social Bias for Large Language Models in Open-ended Settings
- On Relation-Specific Neurons in Large Language Models
- Evaluating the Effect of Retrieval Augmentation on Social Biases
- Deforming Videos to Masks: Flow Matching for Referring Video Segmentation
- ShapeGen4D: Towards High Quality 4D Shape Generation from Videos
- Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models
- Fine-grained Defocus Blur Control for Generative Image Models
- Human3R: Everyone Everywhere All at Once
- Advancing Automated Spatio-Semantic Analysis in Picture Description Using Language Models
- nnSAM2: nnUNet-Enhanced One-Prompt SAM2 for Few-shot Multi-Modality Segmentation and Composition Analysis of Lumbar Paraspinal Muscles
- Leveraging Vision Transformers for Enhanced Classification of Emotions using ECG Signals
- Towards Robust and Realible Multimodal Fake News Detection with Incomplete Modality
- A Warm-basis Method for Bridging Learning and Iteration: a Case Study in Fluorescence Molecular Tomography
- Overlap-aware segmentation for topological reconstruction of obscured objects
- A discussion about violin reduction: geometric analysis of contour lines and channel of minima
- Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection
- Imagining the Unseen: Generative Location Modeling for Object Placement
- LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation
- Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search
- Noise2Score3D: Tweedie's Approach for Unsupervised Point Cloud Denoising
- Tables Guide Vision: Learning to See the Heart through Tabular Data
- LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders
- AuxDet: Auxiliary Metadata Matters for Omni-Domain Infrared Small Target Detection
- Leveraging Foundation Models for Multimodal Graph-Based Action Recognition
- VisRet: Visualization Improves Knowledge-Intensive Text-to-Image Retrieval
- Robust Neural Rendering in the Wild with Asymmetric Dual 3D Gaussian Splatting
- When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
- Low-Rank Tensor Recovery via Variational Schatten-p Quasi-Norm and Jacobian Regularization
- Enhancing Fitness Movement Recognition with Attention Mechanism and Pre-Trained Feature Extractors
- High-pass filtered fidelity-imposed network edit (HP-FINE) for robust quantitative susceptibility mapping from high-pass filtered phase
- RimSet: Quantitatively Identifying and Characterizing Chronic Active Multiple Sclerosis Lesion on Quantitative Susceptibility Maps
- SAMCIRT: A Simultaneous Reconstruction and Affine Motion Compensation Technique for Four Dimensional Computed Tomography (4DCT)
- Trajectory Prediction Meets Large Language Models: A Survey
- ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge
- Attention-Enhanced Prototypical Learning for Few-Shot Infrastructure Defect Segmentation
- SkinMap: Weighted Full-Body Skin Segmentation for Robust Remote Photoplethysmography
- Fine-Tuned CNN-Based Approach for Multi-Class Mango Leaf Disease Detection
- Personalizing Retrieval using Joint Embeddings or "the Return of Fluffy"
- ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars
- Human Action Recognition from Point Clouds over Time
- Be Tangential to Manifold: Discovering Riemannian Metric for Diffusion Models
- HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
- CalibCLIP: Contextual Calibration of Dominant Semantics for Text-Driven Image Retrieval
- Efficient Conditional Generation on Scale-based Visual Autoregressive Models
- TFM Dataset: A Novel Multi-task Dataset and Integrated Pipeline for Automated Tear Film Break-Up Segmentation
- Combined Hyperbolic and Euclidean Soft Triple Loss Beyond the Single Space Deep Metric Learning
- EduVerse: A User-Defined Multi-Agent Simulation Space for Education Scenario
- SD-MVSum: Script-Driven Multimodal Video Summarization Method and Datasets
- A Hierarchical Geometry-guided Transformer for Histological Subtyping of Primary Liver Cancer
- Teleportraits: Training-Free People Insertion into Any Scene
- When and How to Cut Classical Concerts? A Multimodal Automated Video Editing Approach
- Development and Validation of a Low-Cost Imaging System for Seedling Germination Kinetics through Time-Cumulative Analysis
- Context Matters: Learning Global Semantics for Visual Reasoning and Comprehension
- AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models
- Data Factory with Minimal Human Effort Using VLMs
- ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving
- OneVision: An End-to-End Generative Framework for Multi-view E-commerce Vision Search
- A Novel Technique for Robust Training of Deep Networks With Multisource Weak Labeled Remote Sensing Data
- Mysteries of the Deep: Role of Intermediate Representations in Out of Distribution Detection
- Rasterized Steered Mixture of Experts for Efficient 2D Image Regression
- Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow
- acia-workflows: Automated Single-cell Imaging Analysis for Scalable and Deep Learning-based Live-cell Imaging Analysis Workflows
- BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data
- Efficient Universal Models for Medical Image Segmentation via Weakly Supervised In-Context Learning
- Shaken or Stirred? An Analysis of MetaFormer's Token Mixing for Medical Imaging
- A Dynamic Mode Decomposition Approach to Morphological Component Analysis
- Diffusion-Based Image Editing for Breaking Robust Watermarks
- Continual Learning for Image Captioning through Improved Image-Text Alignment
- Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between
- There is More to Attention: Statistical Filtering Enhances Explanations in Vision Transformers
- Compact Multi-level-prior Tensor Representation for Hyperspectral Image Super-resolution
- Multimodal Feature Prototype Learning for Interpretable and Discriminative Cancer Survival Prediction
- Towards Data-Efficient Medical Imaging: A Generative and Semi-Supervised Framework
- ATOM: A Pretrained Neural Operator for Multitask Molecular Dynamics
- The Method of Infinite Descent
- NorMuon: Making Muon more efficient and scalable
- Fundamental Limits of Crystalline Equivariant Graph Neural Networks: A Circuit Complexity Perspective
- EEG-Based Acute Pain Classification: Machine Learning Model Comparison and Real-Time Clinical Feasibility
- NeST-BO: Fast Local Bayesian Optimization via Newton-Step Targeting of Gradient and Hessian Information
- Transfer Learning on Edge Connecting Probability Estimation under Graphon Model
- ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization
- LATTA: Langevin-Anchored Test-Time Adaptation for Enhanced Robustness and Stability
- Efficient Learning-based Graph Simulation for Temporal Graphs
- (Token-Level) \textbf{InfoRMIA}: Stronger Membership Inference and Memorization Assessment for LLMs
- When Does Global Attention Help? A Unified Empirical Study on Atomistic Graph Learning
- Riddled basin geometry sets fundamental limits to predictability and reproducibility in deep learning
- NEO: No-Optimization Test-Time Adaptation through Latent Re-Centering
- Inductive inference of gradient-boosted decision trees on graphs for insurance fraud detection
- Primal-Dual Direct Preference Optimization for Constrained LLM Alignment
- DiffSDA: Unsupervised Diffusion Sequential Disentanglement Across Modalities
- Neighborhood-Adaptive Generalized Linear Graph Embedding with Latent Pattern Mining
- Communication Enables Cooperation in LLM Agents: A Comparison with Curriculum-Based Approaches
- Improving Clinical Dataset Condensation with Mode Connectivity-based Trajectory Surrogates
- Multimodal Trajectory Representation Learning for Travel Time Estimation
- How to model Human Actions distribution with Event Sequence Data
- MaNGO - Adaptable Graph Network Simulators via Meta-Learning
- OBSR: Open Benchmark for Spatial Representations
- Sample Smart, Not Hard: Correctness-First Decoding for Better Reasoning in LLMs
- Uncertainty in Machine Learning
- RamPINN: Recovering Raman Spectra From Coherent Anti-Stokes Spectra Using Embedded Physics
- BLISS: A Lightweight Bilevel Influence Scoring Method for Data Selection in Language Model Pretraining
- Edit-Based Flow Matching for Temporal Point Processes
- Analyzing the Effect of Embedding Norms and Singular Values to Oversmoothing in Graph Neural Networks
- Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL
- The Alignment Auditor: A Bayesian Framework for Verifying and Refining LLM Objectives
- Influence Functions for Efficient Data Selection in Reasoning
- Downsized and Compromised?: Assessing the Faithfulness of Model Compression
- lm-Meter: Unveiling Runtime Inference Latency for On-Device Language Models
- Improved High-probability Convergence Guarantees of Decentralized SGD
- TabPFN-Wide: Continued Pre-Training for Extreme Feature Counts
- Thermodynamic Performance Limits for Score-Based Diffusion Models
- On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond
- Training Dynamics Impact Post-Training Quantization Robustness
- Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models
- Catalog-Native LLM: Speaking Item-ID Dialect with Less Entanglement for Recommendation
- Automated Alignment of Math Items to Content Standards in Large-Scale Assessments Using Language Models
- Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment
- Exploring Large Language Models for Financial Applications: Techniques, Performance, and Challenges with FinMA
- Adapting HFMCA to Graph Data: Self-Supervised Learning for Generalizable fMRI Representations
- Stratum: System-Hardware Co-Design with Tiered Monolithic 3D-Stackable DRAM for Efficient MoE Serving
- Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning
- Mitigating Diffusion Model Hallucinations with Dynamic Guidance
- LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation
- Aligning Language Models with Clinical Expertise: DPO for Heart Failure Nursing Documentation in Critical Care
- TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation
- H1B-KV: Hybrid One-Bit Caches for Memory-Efficient Large Language Model Inference
- Efficient learning of bosonic Gaussian unitaries
- Activation-Informed Pareto-Guided Low-Rank Compression for Efficient LLM/VLM
- Channel Simulation and Distributed Compression with Ensemble Rejection Sampling
- Midway Network: Learning Representations for Recognition and Motion from Latent Dynamics
- InstaGeo: Compute-Efficient Geospatial Machine Learning from Data to Deployment
- From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
- Transcribing Rhythmic Patterns of the Guitar Track in Polyphonic Music
- M\"obius transforms and Shapley values for vector-valued functions on weighted directed acyclic multigraphs
- StereoSync: Spatially-Aware Stereo Audio Generation from Video
- FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders
- Prompt reinforcing for long-term planning of large language models
- EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models
- Adaptive Pruning for Increased Robustness and Reduced Computational Overhead in Gaussian Process Accelerated Saddle Point Searches
- Medical Vision Language Models as Policies for Robotic Surgery
- EmoHRNet: High-Resolution Neural Network Based Speech Emotion Recognition
- Non-iid hypothesis testing: from classical to quantum
- Differentiable Model Predictive Control on the GPU
- Climate Model Tuning with Online Synchronization-Based Parameter Estimation
- Modulation Discovery with Differentiable Digital Signal Processing
- Solar Irradiation Forecasting using Genetic Algorithms
- Nonlinear Filtering with Brenier Optimal Transport Maps
- A Universal Metric of Dataset Similarity for Cross-silo Federated Learning
- Mutatis Mutandis: Revisiting the Comparator in Discrimination Testing
- Cross-Domain Graph Data Scaling: A Showcase with Diffusion Models
- Spatiotemporal Graph Learning with Direct Volumetric Information Passing and Feature Enhancement
- EntryPrune: Neural Network Feature Selection using First Impressions
- DUA-D2C: Dynamic Uncertainty Aware Method for Overfitting Remediation in Deep Learning
- The Logical Implication Steering Method for Conditional Interventions on Transformer Generation
- Unifying Autoregressive and Diffusion-Based Sequence Generation
- A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
- Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction
- DeepBoost-AF: A Novel Unsupervised Feature Learning and Gradient Boosting Fusion for Robust Atrial Fibrillation Detection in Raw ECG Signals
- Generalizing Supervised Contrastive learning: A Projection Perspective
- Robust-Multi-Task Gradient Boosting
- Attribute Fusion-based Classifier on Framework of Belief Structure
- Fast Policy Learning for Linear Quadratic Control with Entropy Regularization
- Geometry-Preserving Encoder/Decoder in Latent Generative Models
- Strong bounds for large-scale Minimum Sum-of-Squares Clustering
- MatLLMSearch: Crystal Structure Discovery with Evolution-Guided Large Language Models
- A weakly-supervised deep learning model for fast localisation and delineation of the skeleton, internal organs, and spinal canal on Whole-Body Diffusion-Weighted MRI (WB-DWI)
- SAE-FiRE: Enhancing Earnings Surprise Predictions Through Sparse Autoencoder Feature Selection
- Conditional Local Independence Testing for It\^o processes with Applications to Dynamic Causal Discovery
- Context Biasing for Pronunciations-Orthography Mismatch in Automatic Speech Recognition
- Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance
- Investigating Forecasting Models for Pandemic Infections Using Heterogeneous Data Sources: A 2-year Study with COVID-19
- Multilingual Dataset Integration Strategies for Robust Audio Deepfake Detection: A SAFE Challenge System
- CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs
- VideoMiner: Iteratively Grounding Key Frames of Hour-Long Videos via Tree-based Group Relative Policy Optimization
- GLVD: Guided Learned Vertex Descent
- Controllable Audio-Visual Viewpoint Generation from 360{\deg} Spatial Information
- Reasoning under Vision: Understanding Visual-Spatial Cognition in Vision-Language Models for CAPTCHA
- When Thinking Drifts: Evidential Grounding for Robust Video Reasoning
- Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability
- A public cardiac CT dataset featuring the left atrial appendage
- Distributional Semantics Tracing: A Framework for Explaining Hallucinations in Large Language Models
- Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation
- CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits
- Multi-Task Reinforcement Learning with Language-Encoded Gated Policy Networks
- Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images
- Smartphone-based iris recognition through high-quality visible-spectrum iris image capture.V2
- RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback
- Automated Program Repair of Uncompilable Student Code
- BanglaTalk: Towards Real-Time Speech Assistance for Bengali Regional Dialects
- Latent Speech-Text Transformer
- StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars
- TokenChain: A Discrete Speech Chain via Semantic Token Modeling
- Reference Grounded Skill Discovery
- Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents
- EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark
- Fine-Grained and Thematic Evaluation of LLMs in Social Deduction Game
- Hallucination Detox: Sensitivity Dropout (SenD) for Large Language Model Training
- Extracting PAC Decision Trees from Black Box Binary Classifiers: The Gender Bias Case Study on BERT-based Language Models
- Applications of Large Models in Medicine
- Learning Exposure Mapping Functions for Inferring Heterogeneous Peer Effects
- SciSciGPT: Advancing Human-AI Collaboration in the Science of Science
- FLEx: Personalized Federated Learning for Mixture-of-Experts LLMs via Expert Grafting
- RepIt: Representing Isolated Targets to Steer Language Models
- From paintbrush to pixel: A review of deep neural networks in AI-generated art
- Generative transformations and patterns in LLM-native approaches for software verification and falsification
- A Generative Approach to Credit Prediction with Learnable Prompts for Multi-scale Temporal Representation Learning
- Artificial intelligence for context-aware visual change detection in software test automation
- An Investigation of Incorporating Mamba for Speech Enhancement
- Robustness of Large Language Models to Perturbations in Text
- How Reliable are Causal Probing Interventions?
- Interpretable Clustering: A Survey
- A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond
- BanglaLlama: LLaMA for Bangla Language
- BenchAgents: Multi-Agent Systems for Structured Benchmark Creation
- QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization
- HOG-Diff: Higher-Order Guided Diffusion for Graph Generation
- PartSDF: Part-Based Implicit Neural Representation for Composite 3D Shape Parametrization and Optimization
- Geometry-Guided Adversarial Prompt Detection via Curvature and Local Intrinsic Dimension
- WildIFEval: Instruction Following in the Wild
- Optimal Transport for Brain-Image Alignment: Unveiling Redundancy and Synergy in Neural Information Processing
- A Graph-Based Framework for Interpretable Whole Slide Image Analysis
- Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models
- Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information
- Decentralized Collective World Model for Emergent Communication and Coordination
- MedHal: An Evaluation Dataset for Medical Hallucination Detection
- The Mirage of Performance Gains: Why Contrastive Decoding Fails to Mitigate Object Hallucinations in MLLMs?
- Cross-Document Cross-Lingual NLI via RST-Enhanced Graph Fusion and Interpretability Prediction
- QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
- Can We Ignore Labels In Out of Distribution Detection?
- Deep Reinforcement Learning for Urban Air Quality Management: Multi-Objective Optimization of Pollution Mitigation Booth Placement in Metropolitan Environments
- Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning
- ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart Understanding
- Optimal Policy Minimum Bayesian Risk
- An Embarrassingly Simple Defense Against LLM Abliteration Attacks
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning
- OWL: Probing Cross-Lingual Recall of Memorized Texts via World Literature
- How Malicious AI Swarms Can Threaten Democracy: The Fusion of Agentic AI and LLMs Marks a New Frontier in Information Warfare
- A Fairness-Aware Strategy for B5G Physical-layer Security Leveraging Reconfigurable Intelligent Surfaces
- Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions
- Learning The Minimum Action Distance
- Persona Features Control Emergent Misalignment
- Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment
- Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models
- Scattering Transformer: A Training-Free Transformer Architecture for Heart Murmur Detection
- A Fuzzy Logic-Based Framework for Explainable Machine Learning in Big Data Analytics
- Auditing Algorithmic Bias in Transformer-Based Trading
- Machine learning for fraud detection in digital banking: a systematic literature review REVIEW
- Discretized Quadratic Integrate-and-Fire Neuron Model for Deep Spiking Neural Networks
- Carbon Emission Prediction in China Considering New Quality Productive Forces Using a Deep & Corss Learning Modeling Framework
- Learning More with Less: A Generalizable, Self-Supervised Framework for Privacy-Preserving Capacity Estimation with EV Charging Data
- Exact Causal Attention with 10% Fewer Operations
- A Data-Driven Prism: Multi-View Source Separation with Diffusion Model Priors
- Simultaneous Learning and Optimization via Misspecified Saddle Point Problems
- ECLipsE-Gen-Local: Efficient Compositional Local Lipschitz Estimates for Deep Neural Networks
- Decoding Partial Differential Equations: Cross-Modal Adaptation of Decoder-only Models to PDEs
- Gamma Mixture Modeling for Cosine Similarity in Small Language Models
- RegMix: Adversarial Mutual and Generalization Regularization for Enhancing DNN Robustness
- KVLinC : KV Cache Quantization with Hadamard Rotation and Linear Correction
- Physics-Informed Neural Networks with Fourier Features and Attention-Driven Decoding
- A Neural Network Algorithm for KL Divergence Estimation with Quantitative Error Bounds
- Correlating Cross-Iteration Noise for DP-SGD using Model Curvature
- Draft, Verify, and Improve: Toward Training-Aware Speculative Decoding
- MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization
- ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models
- MixReasoning: Switching Modes to Think
- Scientific Algorithm Discovery by Augmenting AlphaEvolve with Deep Research
- TelecomTS: A Multi-Modal Observability Dataset for Time Series and Language Analysis
- Constraint-Aware Route Recommendation from Natural Language via Hierarchical LLM Agents
- Classical AI vs. LLMs for Decision-Maker Alignment in Health Insurance Choices
- Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
- Barbarians at the Gate: How AI is Upending Systems Research
- TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
- Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices
- Trainable Reference-Based Evaluation Metric for Identifying Quality of English-Gujarati Machine Translation System
- Hallucination is Inevitable for LLMs with the Open World Assumption
- CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation
- A Scalable AI Driven, IoT Integrated Cognitive Digital Twin for Multi-Modal Neuro-Oncological Prognostics and Tumor Kinetics Prediction using Enhanced Vision Transformer and XAI
- Improving Metacognition and Uncertainty Communication in Language Models
- Artificial Intelligence for Cost-Aware Resource Prediction in Big Data Pipelines
- Rationale-Augmented Retrieval with Constrained LLM Re-Ranking for Task Discovery
- Training Large Language Models To Reason In Parallel With Global Forking Tokens
- Linguistic Characteristics of AI-Generated Text: A Survey
- SynCED-EnDe 2025: A Synthetic and Curated English - German Dataset for Critical Error Detection in Machine Translation
- FlashResearch: Real-time Agent Orchestration for Efficient Deep Research
- Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
- Percepta: High Performance Stream Processing at the Edge
- Chronological Thinking in Full-Duplex Spoken Dialogue Language Models
- A Single Character can Make or Break Your LLM Evals
- Generative Inverse Design: From Single Point Optimization to a Diverse Design Portfolio via Conditional Variational Autoencoders
- Artificial-Intelligence Grading Assistance for Handwritten Components of a Calculus Exam
- SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading
- Emergent Coordination in Multi-Agent Language Models
- PatternKV: Flattening KV Representation Expands Quantization Headroom
- Logistic-Gated Operators Enable Auditable Unit-Aware Thresholds in Symbolic Regression
- OptPipe: Memory- and Scheduling-Optimized Pipeline Parallelism for LLM Training
- A novel hallucination classification framework
- Provable Speech Attributes Conversion via Latent Independence
- Approximate Gaussianity Beyond Initialisation in Neural Networks
- CMT-Benchmark: A Benchmark for Condensed Matter Theory Built by Expert Researchers
- Adjusting the Output of Decision Transformer with Action Gradient
- AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement
- RAG Makes Guardrails Unsafe? Investigating Robustness of Guardrails under RAG-style Contexts
- DeepAf: One-Shot Spatiospectral Auto-Focus Model for Digital Pathology
- Dynamic Functional Connectivity Features for Brain State Classification: Insights from the Human Connectome Project
- DeepV: A Model-Agnostic Retrieval-Augmented Framework for Verilog Code Generation with a High-Quality Knowledge Base
- Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization
- Physics-informed Attention-enhanced Fourier Neural Operator for Solar Magnetic Field Extrapolations
- MT-DAO: Multi-Timescale Distributed Adaptive Optimizers with Local Updates
- Context Length Alone Hurts LLM Performance Despite Perfect Retrieval
- Fusion-Based Neural Generalization for Predicting Temperature Fields in Industrial PET Preform Heating
- Comparing LSTM-Based Sequence-to-Sequence Forecasting Strategies for 24-Hour Solar Proton Flux Profiles Using GOES Data
- See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models
- Physics-Informed Machine Learning in Biomedical Science and Engineering
- UnitTenX: Generating Tests for Legacy Packages with AI Agents Powered by Formal Verification
- Adversarial Reinforcement Learning for Large Language Model Agent Safety
- QDeepGR4J: Quantile-based ensemble of deep learning and GR4J hybrid rainfall-runoff models for extreme flow prediction with uncertainty quantification
- AMAQ: Adaptive Mixed-bit Activation Quantization for Collaborative Parameter Efficient Fine-tuning
- LANTERN: Scalable Distillation of Large Language Models for Job-Person Fit and Explanation
- High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training
- Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting
- CAM: A Constructivist View of Agentic Memory for LLM-Based Reading Comprehension
- Provably Mitigating Corruption, Overoptimization, and Verbosity Simultaneously in Offline and Online RLHF/DPO Alignment
- Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection
- Seeing the Big Picture: Evaluating Multimodal LLMs' Ability to Interpret and Grade Handwritten Student Work
- Critical attention scaling in long-context transformers
- Generative Dynamic Graph Representation Learning for Conspiracy Spoofing Detection
- Deciphering Invariant Feature Decoupling in Source-free Time Series Forecasting with Proxy Denoising
- Improving Chain-of-Thought Efficiency for Autoregressive Image Generation
- HOI-R1: Exploring the Potential of Multimodal Large Language Models for Human-Object Interaction Detection
- MADIAVE: Multi-Agent Debate for Implicit Attribute Value Extraction
- PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction
- Generative AI-Driven Hierarchical Multi-Agent Framework for Zero-Touch Optical Networks
- From Neural Activity to Computation: Biological Reservoirs for Pattern Recognition in Digit Classification
- The African Languages Lab: A Collaborative Approach to Advancing Low-Resource African NLP
- Ocular-Induced Abnormal Head Posture: Diagnosis and Missing Data Imputation
- Quantifying the Accuracy-Interpretability Trade-Off in Concept-Based Sidechannel Models
- Code-Switching In-Context Learning for Cross-Lingual Transfer of Large Language Models
- QGraphLIME - Explaining Quantum Graph Neural Networks
- vAttention: Verified Sparse Attention
- Sparse deepfake detection promotes better disentanglement
- Uncovering Representation Bias for Investment Decisions in Open-Source Large Language Models
- FinReflectKG - EvalBench: Benchmarking Financial KG with Multi-Dimensional Evaluation
- Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies
- Redefining Generalization in Visual Domains: A Two-Axis Framework for Fake Image Detection with FusionDetect
- Are Heterogeneous Graph Neural Networks Truly Effective? A Causal Perspective
- InforME: Improving Informativeness of Abstractive Text Summarization With Informative Attention Guided by Named Entity Salience
- Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding
- Data-efficient Targeted Token-level Preference Optimization for LLM-based Text-to-Speech
- Risk level dependent Minimax Quantile lower bounds for Interactive Statistical Decision Making
- Deformable Image Registration for Self-supervised Cardiac Phase Detection in Multi-View Multi-Disease Cardiac Magnetic Resonance Images
- DACP: Domain-Adaptive Continual Pre-Training of Large Language Models for Phone Conversation Summarization
- Revisiting Long-context Modeling from Context Denoising Perspective
- Segment-Factorized Full-Song Generation on Symbolic Piano Music
- $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
- Paying Attention to Hybrid Attention: Untangling the Issues with Conversion Methods
- Kaputt: A Large-Scale Dataset for Visual Defect Detection
- An Attention-Augmented VAE-BiLSTM Framework for Anomaly Detection in 12-Lead ECG Signals
- Carr\'e du champ flow matching: better quality-generalisation tradeoff in generative models
- LLM-FS-Agent: A Deliberative Role-based Large Language Model Architecture for Transparent Feature Selection
- EvalMORAAL: Interpretable Chain-of-Thought and LLM-as-Judge Evaluation for Moral Alignment in Large Language Models
- Probing the Difficulty Perception Mechanism of Large Language Models
- LexiCon: a Benchmark for Planning under Temporal Constraints in Natural Language
- Diffusion Models for Low-Light Image Enhancement: A Multi-Perspective Taxonomy and Performance Analysis
- ECTSpeech: Enhancing Efficient Speech Synthesis via Easy Consistency Tuning
- Detection and Measurement of Hailstones with Multimodal Large Language Models
- Emergent AI Surveillance: Overlearned Person Re-Identification and Its Mitigation in Law Enforcement Context
- Fast Leave-One-Out Approximation from Fragment-Target Prevalence Vectors (molFTP) : From Dummy Masking to Key-LOO for Leakage-Free Feature Construction
- From Learning to Mastery: Achieving Safe and Efficient Real-World Autonomous Driving with Human-In-The-Loop Reinforcement Learning
- Rule Encoding and Compliance in Large Language Models: An Information-Theoretic Analysis
- Structured Cognition for Behavioral Intelligence in Large Language Model Agents: Preliminary Study
- Optimization Modeling via Semantic Anchored Alignment
- Structuring Reasoning for Complex Rules Beyond Flat Representations
- An Algorithmic Information-Theoretic Perspective on the Symbol Grounding Problem
- Lang-PINN: From Language to Physics-Informed Neural Networks via a Multi-Agent Framework
- Representation Potentials of Foundation Models for Multimodal Alignment: A Survey
- Real-time Framework for Interoperable Semantic-driven Internet-of-Things in Smart Agriculture
- Plug-and-Play Dramaturge: A Divide-and-Conquer Approach for Iterative Narrative Script Refinement via Collaborative LLM Agents
- Graph-based LLM over Semi-Structured Population Data for Dynamic Policy Response
- Beyond Monolithic Rewards: A Hybrid and Multi-Aspect Reward Optimization for MLLM Alignment
- BIRD-INTERACT: Re-imagining Text-to-SQL Evaluation for Large Language Models via Lens of Dynamic Interactions
- Biomedical reasoning in action: Multi-agent System for Auditable Biomedical Evidence Synthesis
- MHA-RAG: Improving Efficiency, Accuracy, and Consistency by Encoding Exemplars as Soft Prompts
- Teacher-Student Guided Inverse Modeling for Steel Final Hardness Estimation
- AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems
- NASP-T: A Fuzzy Neuro-Symbolic Transformer for Logic-Constrained Aviation Safety Report Classification
- Do Code Models Suffer from the Dunning-Kruger Effect?
- VAL-Bench: Measuring Value Alignment in Language Models
- Vul-R2: A Reasoning LLM for Automated Vulnerability Repair
- Decade-long Emission Forecasting with an Ensemble Model in Taiwan
- In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
- From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions
- Large Language Model-Based Uncertainty-Adjusted Label Extraction for Artificial Intelligence Model Development in Upper Extremity Radiography
- Joint Communication Scheduling and Velocity Control for Multi-UAV-Assisted Post-Disaster Monitoring: An Attention-Based In-Context Learning Approach
- Syn-Diag: An LLM-based Synergistic Framework for Generalizable Few-shot Fault Diagnosis on the Edge
- Artificially intelligent agents in the social and behavioral sciences: A history and outlook
- ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
- Uncertainty assessment in satellite-based greenhouse gas emissions estimates using emulated atmospheric transport
- Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis
- RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases
- ConstraintLLM: A Neuro-Symbolic Framework for Industrial-Level Constraint Programming
- Towards Label-Free Biological Reasoning Synthetic Dataset Creation via Uncertainty Filtering
- Optimizing for Persuasion Improves LLM Generalization: Evidence from Quality-Diversity Evolution of Debate Strategies
- Training-Free Time Series Classification via In-Context Reasoning with LLM Agents
- Back to Square Roots: An Optimal Bound on the Matrix Factorization Error for Multi-Epoch Differentially Private SGD
- Layered, Overlapping, and Inconsistent: A Large-Scale Analysis of the Multiple Privacy Policies and Controls of U.S. Banks
- CAI Fluency: A Framework for Cybersecurity AI Fluency
- A Set of Generalized Components to Achieve Effective Poison-only Clean-label Backdoor Attacks with Collaborative Sample Selection and Triggers
- Shortcuts Everywhere and Nowhere: Exploring Multi-Trigger Backdoor Attacks
- Differentially Private Online Community Detection for Censored Block Models: Algorithms and Fundamental Limits
- A Middle Path for On-Premises LLM Deployment: Preserving Privacy Without Sacrificing Model Confidentiality
- R\'enyi divergence-based uniformity guarantees for $k$-universal hash functions
- A Generative Approach to LLM Harmfulness Mitigation with Red Flag Tokens
- FlowVLA: Visual Chain of Thought-based Motion Reasoning for Vision-Language-Action Models
- mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies
- BC-ADMM: An Efficient Non-convex Constrained Optimizer with Robotic Applications
- Equivariant Filter for Relative Attitude and Target's Angular Velocity Estimation
- Emergent interactions lead to collective frustration in robotic matter
- CLAd-VR: Cognitive Load-based Adaptive Training for Machining Tasks in Virtual Reality
- Chrysalis: A Unified System for Comparing Active Teaching and Passive Learning with AI Agents in Education
- When Should Users Check? A Decision-Theoretic Model of Confirmation Frequency in Multi-Step AI Agent Tasks
- Exploring Student Choice and the Use of Multimodal Generative AI in Programming Learning
- Bloom: Designing for LLM-Augmented Behavior Change Interactions
- Two Modes of Reflection: How Temporal, Spatial, and Social Distances Affect Reflective Writing in Family Caregiving
- Locability: An Ability-Based Ranking Model for Virtual Reality Locomotion Techniques
- Vipera: Blending Visual and LLM-Driven Guidance for Systematic Auditing of Text-to-Image Generative AI
- The Interplay of Attention and Memory in Visual Enumeration
- From "Arbitrary Timberland" To "Skyline Charts": Is Visualization At Risk From The Pollution of Scientific Literature?
- Taxonomy of User Needs and Actions
- Observing Interaction Rather Than Interfaces
- MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation
- What Do You Mean? Exploring How Humans and AI Interact with Symbols and Meanings in Their Interactions
- Evidence of Cognitive Biases in Capture-the-Flag Cybersecurity Competitions
- "Your Doctor is Spying on You": An Analysis of Data Practices in Mobile Healthcare Applications
- Benchmark It Yourself (BIY): Preparing a Dataset and Benchmarking AI Models for Scatterplot-Related Tasks
- Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences
- LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams
- Exploring the Potential of Conversational AI Support for Agent-Based Social Simulation Model Design
- MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
- Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation
- Submillimeter-Accurate 3D Lumbar Spine Reconstruction from Biplanar X-Ray Images: Incorporating a Multi-Task Network and Landmark-Weighted Loss
- SDFs from Unoriented Point Clouds using Neural Variational Heat Distances
- Scalable In-context Ranking with Generative Models
- Automated Research Article Classification and Recommendation Using NLP and ML
- AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents
- Limitations of Current Evaluation Practices for Conversational Recommender Systems and the Potential of User Simulation
- How public datasets constrain the development of diversity-aware news recommender systems, and what law could do about it
- Towards Structured Knowledge: Advancing Triple Extraction from Regional Trade Agreements using Large Language Models
- KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance
- Deterministic Legal Retrieval: An Action API for Querying the SAT-Graph RAG
- Peeking inside the Black-Box: Reinforcement Learning for Explainable and Accurate Relation Extraction
- Soft Reasoning Paths for Knowledge Graph Completion
- Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights
- Text Clustering as Classification with LLMs
- Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
- Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
- Deep Learning-Based Multi-Factor Authentication: A Survey of Biometric and Smart Card Integration Approaches
- Domain-Adapted Granger Causality for Real-Time Cross-Slice Attack Attribution in 6G Networks
- From Poisoned to Aware: Fostering Backdoor Self-Awareness in LLMs
- SafeGuider: Robust and Practical Content Safety Control for Text-to-Image Models
- Agentic Misalignment: How LLMs Could Be Insider Threats
- Auditing Pay-Per-Token in Large Language Models
- Adapting Insider Risk mitigations for Agentic Misalignment: an empirical study
- Indirect Prompt Injections: Are Firewalls All You Need, or Stronger Benchmarks?
- Constraint-Level Design of zkEVMs: Architectures, Trade-offs, and Evolution
- AutoDAN-Reasoning: Enhancing Strategies Exploration based Jailbreak Attacks with Test-Time Scaling
- A Brief Note on Cryptographic Pseudonyms for Anonymous Credentials
- AutoPentester: An LLM Agent-based Framework for Automated Pentesting
- Membership Inference Attacks on Tokenizers of Large Language Models
- Towards Reliable and Practical LLM Security Evaluations via Bayesian Modelling
- New Insights into Involutory and Orthogonal MDS Matrices
- SBOMproof: Beyond Alleged SBOM Compliance for Supply Chain Security of Container Images
- The Five Safes as a Privacy Context
- Privacy-Preserving On-chain Permissioning for KYC-Compliant Decentralized Applications
- Enhancing Automotive Security with a Hybrid Approach towards Universal Intrusion Detection System
- Fairness in Token Delegation: Mitigating Voting Power Concentration in DAOs
- PhishSSL: Self-Supervised Contrastive Learning for Phishing Website Detection
- AdProv: A Method for Provenance of Process Adaptations
- N-Parties Private Structure and Parameter Learning for Sum-Product Networks
- Optimal Good-Case Latency for Sleepy Consensus
- VeriGuard: Enhancing LLM Agent Safety via Verified Code Generation
- Adversarial Reinforcement Learning for Offensive and Defensive Agents in a Simulated Zero-Sum Network Environment
- OptiFLIDS: Optimized Federated Learning for Energy-Efficient Intrusion Detection in IoT
- Randomness from causally independent processes
- DP-Adam-AC: Privacy-preserving Fine-Tuning of Localizable Language Models Using Adam Optimization with Adaptive Clipping
- On Limits on the Provable Consequences of Quantum Pseudorandomness
- Power Mechanism: Private Tabular Representation Release for Model Agnostic Consumption
- Beyond Spectral Peaks: Interpreting the Cues Behind Synthetic Image Detection
- Empirical Comparison of Membership Inference Attacks in Deep Transfer Learning
- DP-SNP-TIHMM: Differentially Private, Time-Inhomogeneous Hidden Markov Models for Synthesizing Genome-Wide Association Datasets
- Classification of small binary bibraces via bilinear maps
- Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
- On the Quantum Equivalence between $S|LWE\rangle$ and $ISIS$
- Anonymous Quantum Tokens with Classical Verification
- When Should Selfish Miners Double-Spend?
- Practical Secure Delegated Linear Algebra with Trapdoored Matrices
- Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code
- Federated Distributed Key Generation
- A Study on Malicious Browser Extensions in 2025
- Model Context Protocol (MCP): Landscape, Security Threats, and Future Research Directions
- DoomArena: A framework for Testing AI Agents Against Evolving Security Threats
- From Concept to Measurement: A Survey of How the Blockchain Trilemma Is Analyzed
- Refereed Learning
- A Probabilistic Basis for Low-Rank Matrix Learning
- Domain-Shift-Aware Conformal Prediction for Large Language Models
- Bilevel optimization for learning hyperparameters: Application to solving PDEs and inverse problems with Gaussian processes
- On the Theory of Continual Learning with Gradient Descent for Neural Networks
- Implicit Updates for Average-Reward Temporal Difference Learning
- Adaptive Reinforcement Learning for Dynamic Configuration Allocation in Pre-Production Testing
- Aneurysm Growth Time Series Reconstruction Using Physics-informed Autoencoder
- Efficient Prediction of Pass@k Scaling in Large Language Models
- Computing frustration and near-monotonicity in deep neural networks
- Tensor-on-tensor Regression Neural Networks for Process Modeling with High-dimensional Data
- Integrating Bayesian methods with neural network--based model predictive control: a review
- Prior-Aligned Meta-RL: Thompson Sampling with Learned Priors and Guarantees in Finite-Horizon MDPs
- Smart Contract Adoption under Discrete Overdispersed Demand: A Negative Binomial Optimization Perspective
- Monte Carlo-Type Neural Operator for Differential Equations
- Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
- ESS-Flow: Training-free guidance of flow-based models as inference in source space
- Gaussian Embeddings: How JEPAs Secretly Learn Your Data Density
- Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers
- Generalization of Gibbs and Langevin Monte Carlo Algorithms in the Interpolation Regime
- Learning Mixtures of Linear Dynamical Systems (MoLDS) via Hybrid Tensor-EM Method
- The Physics of Data and Tasks: Theories of Locality and Compositionality in Deep Learning
- PolyGraph Discrepancy: a classifier-based metric for graph generation
- Higher-Order Feature Attribution: Bridging Statistics, Explainable AI, and Topological Signal Processing
- Conformalized Gaussian processes for online uncertainty quantification over graphs
- Model-free generalized fiducial inference
- Conformal Prediction in Hierarchical Classification with Constrained Representation Complexity
- An inexact LPA for DC composite optimization and application to matrix completions with outliers
- Oblivious Stochastic Composite Optimization
- Convergence of the majorized PAM method with subspace correction for low-rank composite factorization model
- Information-Theoretic Thresholds for the Alignments of Partially Correlated Graphs
- SKADA-Bench: Benchmarking Unsupervised Domain Adaptation Methods with Realistic Validation On Diverse Modalities
- AuToMATo: An Out-Of-The-Box Persistence-Based Clustering Algorithm
- Binding Affinity Prediction: From Conventional to Machine Learning-Based Approaches
- Anchors Aweigh! Sail for Optimal Unified Multi-Modal Representations
- Conjugate gradient methods for high-dimensional GLMMs
- Can foundation models actively gather information in interactive environments to test hypotheses?
- Probabilistic Variational Contrastive Learning
- Gradient Methods with Online Scaling Part II. Practical Aspects
- VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing
- Adaptive Dynamics Planning for Robot Navigation
- A multi-modal tactile fingertip design for robotic hands to enhance dexterous manipulation
- Towards Online Robot Interaction Adaptation to Human Upper-limb Mobility Impairments in Return-to-Work Scenarios
- Active Semantic Perception
- AD-NODE: Adaptive Dynamics Learning with Neural ODEs for Mobile Robots Control
- Correlation-Aware Dual-View Pose and Velocity Estimation for Dynamic Robotic Manipulation
- ARRC: Advanced Reasoning Robot Control - Knowledge-Driven Autonomous Manipulation Using Retrieval-Augmented Generation
- GO-Flock: Goal-Oriented Flocking in 3D Unknown Environments with Depth Maps
- DeLTa: Demonstration and Language-Guided Novel Transparent Object Manipulation
- Verifier-free Test-Time Sampling for Vision Language Action Models
- Oracle-Guided Masked Contrastive Reinforcement Learning for Visuomotor Policies
- Stable Robot Motions on Manifolds: Learning Lyapunov-Constrained Neural Manifold ODEs
- Federated Split Learning for Resource-Constrained Robots in Industrial IoT: Framework Comparison, Optimization Strategies, and Future Directions
- Precise and Efficient Collision Prediction under Uncertainty in Autonomous Driving
- Human-in-the-loop Optimisation in Robot-assisted Gait Training
- VCoT-Grasp: Grasp Foundation Models with Visual Chain-of-Thought Reasoning for Language-driven Grasp Generation
- A Co-Design Framework for Energy-Aware Monoped Jumping with Detailed Actuator Modeling
- Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion
- The DISTANT Design for Remote Transmission and Steering Systems for Planetary Robotics
- AI-Enabled Capabilities to Facilitate Next-Generation Rover Surface Operations
- Coordinate-Consistent Localization via Continuous-Time Calibration and Fusion of UWB and SLAM Observations
- Cross-Embodiment Dexterous Hand Articulation Generation via Morphology-Aware Learning
- Multi-Robot Distributed Optimization for Exploration and Mapping of Unknown Environments using Bioinspired Tactile-Sensor
- Towards Autonomous Tape Handling for Robotic Wound Redressing
- Vision-Guided Targeted Grasping and Vibration for Robotic Pollination in Controlled Environments
- A Preview of HoloOcean 2.0
- DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation
- EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model
- Safety-Critical Control with Bounded Inputs: A Closed-Form Solution for Backup Control Barrier Functions
- MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
- The Safety Challenge of World Models for Embodied AI Agents: A Review
- Information-Theoretic Policy Pre-Training with Empowerment
- Hybrid Quantum-Classical Policy Gradient for Adaptive Control of Cyber-Physical Systems: A Comparative Study of VQC vs. MLP
- Dropping the D: RGB-D SLAM Without the Depth Sensor
- Interpreting Behaviors and Geometric Constraints as Knowledge Graphs for Robot Manipulation Control
- Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation
- Image-Based Visual Servoing for Enhanced Cooperation of Dual-Arm Manipulation
- Self-Supervised Representation Learning with Joint Embedding Predictive Architecture for Automotive LiDAR Object Detection
- pRRTC: GPU-Parallel RRT-Connect for Fast, Consistent, and Low-Cost Motion Planning
- IMPACT: Intelligent Motion Planning with Acceptable Contact Trajectories via Vision-Language Models
- Capturing a Moving Target by Two Robots in the F2F Model
- Decremental Dynamics Planning for Robot Navigation
- Toward Dynamic Control of Tendon-driven Continuum Robots using Clarke Transform
- Identifying Uncertainty in Self-Adaptive Robotics with Large Language Models
- CottonSim: A vision-guided autonomous robotic system for cotton harvesting in Gazebo simulation
- Distilling On-device Language Models for Robot Planning with Minimal Human Intervention
- Minima and Critical Points of the Bethe Free Energy Are Invariant Under Deformation Retractions of Factor Graphs
Research Sources: 652 | Generated: 10/8/2025