AI RESEARCH PAPERS & ACADEMIC SOURCES
- Machine learning for accuracy in density functional approximations
- Multi-Scale Node Embeddings for Graph Modeling and Generation
- Learning Low-Dimensional Embeddings for Black-Box Optimization
- GARG-AML against Smurfing: A Scalable and Interpretable Graph-Based Framework for Anti-Money Laundering
- Template-Guided 3D Molecular Pose Generation via Flow Matching and Differentiable Optimization
- Interpretable Machine Learning for Urban Heat Mitigation: Attribution and Weighting of Multi-Scale Drivers
- Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
- Enhancing Electricity-System Resilience with Adaptive Robust Optimization and Conformal Uncertainty Characterization
- A Few Large Shifts: Layer-Inconsistency Based Minimal Overhead Adversarial Example Detection
- Dynamic Bundling with Large Language Models for Zero-Shot Inference on Text-Attributed Graphs
- Learning Equivariant Models by Discovering Symmetries with Learnable Augmentations
- Learning Beyond Experience: Generalizing to Unseen State Space with Reservoir Computing
- Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior
- Scaling Laws for Optimal Data Mixtures
- Synthetic Blips: Generalizing Synthetic Controls for Dynamic Treatment Effects
- CardioRAG: A Retrieval-Augmented Generation Framework for Multimodal Chagas Disease Detection
- Reducing Simulation Dependence in Neutrino Telescopes with Masked Point Transformers
- PRESOL: a web-based computational setting for feature-based flare forecasting
- Microscaling Floating Point Formats for Large Language Models
- Smooth Quasar-Convex Optimization with Constraints
- Bias beyond Borders: Global Inequalities in AI-Generated Music
- Multi-bit Audio Watermarking
- ShapeGen3DCP: A Deep Learning Framework for Layer Shape Prediction in 3D Concrete Printing
- Variational Secret Common Randomness Extraction
- Multidata Causal Discovery for Statistical Hurricane Intensity Forecasting
- SoundReactor: Frame-level Online Video-to-Audio Generation
- High-Fidelity Speech Enhancement via Discrete Audio Tokens
- Quantum Fisher information matrices from Rényi relative entropies
- Consistent End-to-End Estimation for Counterfactual Fairness
- Development and Validation of a Dynamic Kidney Failure Prediction Model based on Deep Learning: A Real-World Study with External Validation
- Neuro-Symbolic AI for Analytical Solutions of Differential Equations
- Riemannian Variational Flow Matching for Material and Protein Design
- Catalyst GFlowNet for electrocatalyst design: A hydrogen evolution reaction case study
- Policy Gradient Guidance Enables Test Time Control
- Poolformer: Recurrent Networks with Pooling for Long-Sequence Modeling
- C2AL: Cohort-Contrastive Auxiliary Learning for Large-scale Recommendation Systems
- xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity
- PUL-Inter-slice Defender: An Anomaly Detection Solution for Distributed Slice Mobility Attacks
- Transformers Discover Molecular Structure Without Graph Priors
- Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps
- Fine-Grained Urban Traffic Forecasting on Metropolis-Scale Road Networks
- Knowledge Distillation Detection for Open-weights Models
- Robust Tangent Space Estimation via Laplacian Eigenvector Gradient Orthogonalization
- KaVa: Latent Reasoning via Compressed KV-Cache Distillation
- Hybrid Predictive Modeling of Malaria Incidence in the Amhara Region, Ethiopia: Integrating Multi-Output Regression and Time-Series Forecasting
- Combining complex Langevin dynamics with score-based and energy-based diffusion models
- Learning to Play Multi-Follower Bayesian Stackelberg Games
- Financial Stability Implications of Generative AI: Taming the Animal Spirits
- Comparative Field Deployment of Reinforcement Learning and Model Predictive Control for Residential HVAC
- Universal Dynamic Regret and Constraint Violation Bounds for Constrained Online Convex Optimization
- Randomized Gradient Subspaces for Efficient Large Language Model Training
- Multi-marginal temporal Schrödinger Bridge Matching for video generation from unpaired data
- A Methodology for Transparent Logic-Based Classification Using a Multi-Task Convolutional Tsetlin Machine
- StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold
- Moon: A Modality Conversion-based Efficient Multivariate Time Series Anomaly Detection
- Private Federated Multiclass Post-hoc Calibration
- PepCompass: Navigating peptide embedding spaces using Riemannian Geometry
- Normality Calibration in Semi-supervised Graph Anomaly Detection
- FairContrast: Enhancing Fairness through Contrastive learning and Customized Augmenting Methods on Tabular Data
- Mathematical Modeling and Convergence Analysis of Deep Neural Networks with Dense Layer Connectivities in Deep Learning
- Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions
- Learning Model Representations Using Publicly Available Model Hubs
- PENEX: AdaBoost-Inspired Neural Network Regularization
- Hybrid Deep Learning Modeling Approach to Predict Natural Gas Consumption of Home Subscribers on Limited Data
- DAG DECORation: Continuous Optimization for Structure Learning under Hidden Confounding
- Posterior Collapse as a Phase Transition in Variational Autoencoders
- CAT: Curvature-Adaptive Transformers for Geometry-Aware Learning
- Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking
- Support Basis: Fast Attention Beyond Bounded Entries
- PASTA: A Unified Framework for Offline Assortment Learning
- ActiNet: Activity intensity classification of wrist-worn accelerometers using self-supervised deep learning
- Accelerating Attention with Basis Decomposition
- Finite-Time Bounds for Distributionally Robust TD Learning with Linear Function Approximation
- Workplace Location Choice Model based on Deep Neural Network
- Private and Fair Machine Learning: Revisiting the Disparate Impact of Differentially Private SGD
- Learning Regularization Functionals for Inverse Problems: A Comparative Study
- Octax: Accelerated CHIP-8 Arcade Environments for Reinforcement Learning in JAX
- Neural non-canonical Hamiltonian dynamics for long-time simulations
- Sensitivity, Specificity, and Consistency: A Tripartite Evaluation of Privacy Filters for Synthetic Data Generation
- Black-Box Combinatorial Optimization with Order-Invariant Reinforcement Learning
- Learning Representations Through Contrastive Neural Model Checking
- Explicit Discovery of Nonlinear Symmetries from Dynamic Data
- Compositional meta-learning through probabilistic task inference
- How Well Can Preference Optimization Generalize Under Noisy Feedback?
- Fine-tuning LLMs with variational Bayesian last layer for high-dimensional Bayesian optimization
- PEL-NAS: Search Space Partitioned Architecture Prompt Co-Evolutionary LLM-driven Hardware-Aware Neural Architecture Search
- Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets
- Realistic CDSS Drug Dosing with End-to-end Recurrent Q-learning for Dual Vasopressor Control
- Flock: A Knowledge Graph Foundation Model via Learning on Random Walks
- CarbonX: An Open-Source Tool for Computational Decarbonization Using Time Series Foundation Models
- On Integer Programming for the Binarized Neural Network Verification Problem
- Round-trip Reinforcement Learning: Self-Consistent Training for Better Chemical LLMs
- NVIDIA AI Aerial: AI-Native Wireless Communications
- TimeSeriesScientist: A General-Purpose AI Agent for Time Series Analysis
- Executable Counterfactuals: Improving LLMs' Causal Reasoning Through Code
- MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models
- Large-Scale Bayesian Causal Discovery with Interventional Data
- TetriServe: Efficient DiT Serving for Heterogeneous Image Generation
- Gradient Shaping Beyond Clipping: A Functional Perspective on Update Magnitude Control
- Securing generative artificial intelligence with parallel magnetic tunnel junction true randomness
- Machines are more productive than humans until they aren't, and vice versa
- Accelerating Long-Term Molecular Dynamics with Physics-Informed Time-Series Forecasting
- ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models
- Network-Level Vehicle Delay Estimation at Heterogeneous Signalized Intersections
- Quantum-inspired Benchmark for Estimating Intrinsic Dimension
- Self-Supervised Representation Learning as Mutual Information Maximization
- RheOFormer: A generative transformer model for simulation of complex fluids and flows
- Selective Underfitting in Diffusion Models
- Fine-Tuning Masked Diffusion for Provable Self-Correction
- Edge Artificial Intelligence: A Systematic Review of Evolution, Taxonomic Frameworks, and Future Horizons
- SoftAdaClip: A Smooth Clipping Strategy for Fair and Private Model Training
- SCOPED: Score-Curvature Out-of-distribution Proximity Evaluator for Diffusion
- Fixing That Free Lunch: When, Where, and Why Synthetic Data Fails in Model-Based Policy Optimization
- Time-o1: Time-Series Forecasting Needs Transformed Label Alignment
- Search-Based Software Engineering and AI Foundation Models: Current Landscape and Future Roadmap
- PiCa: Parameter-Efficient Fine-Tuning with Column Space Projection
- What happens when generative AI models train recursively on each others' outputs?
- Enhanced DACER Algorithm with High Diffusion Efficiency
- CodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning
- Localized Forest Fire Risk Prediction: A Department-Aware Approach for Operational Decision Support
- MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement
- PlaceFM: A Training-free Geospatial Foundation Model of Places using Large-Scale Point of Interest Data
- Can LLMs Find Fraudsters? Multi-level LLM Enhanced Graph Fraud Detection
- VAR-MATH: Probing True Mathematical Reasoning in LLMs via Symbolic Multi-Instance Benchmarks
- nDNA -- the Semantic Helix of Artificial Cognition
- R2 v2: The Pareto-compliant R2 Indicator for Better Benchmarking in Bi-objective Optimization
- QSpec: Speculative Decoding with Complementary Quantization Schemes
- Faster LLM Inference using DBMS-Inspired Preemption and Cache Replacement Policies
- Unraveling Indirect In-Context Learning Using Influence Functions
- Paper Quality Assessment based on Individual Wisdom Metrics from Open Peer Review
- Forget Forgetting: Continual Learning in a World of Abundant Memory
- CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
- Knowledge-guided machine learning for county-level corn yield prediction under drought
- When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models
- Towards Effective E-Participation of Citizens in the European Union: The Development of AskThePublic
- AI-Powered Inverse Design of Ku-Band SIW Resonant Structures by Iterative Residual Correction Network
- How to Find Fantastic Papers: Self-Rankings as a Powerful Predictor of Scientific Impact Beyond Peer Review
- Comparing Contrastive and Triplet Loss in Audio-Visual Embedding: Intra-Class Variance and Greediness Analysis
- SIEVE: Towards Verifiable Certification for Code-datasets
- Go witheFlow: Real-time Emotion Driven Audio Effects Modulation
- GRACE: A Language Model Framework for Explainable Inverse Reinforcement Learning
- Detection of Chagas Disease from the ECG: The George B. Moody PhysioNet Challenge 2025
- DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning
- How to Combat Reactive and Dynamic Jamming Attacks with Reinforcement Learning
- Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
- A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks
- Towards end-to-end ASP computation
- Forms of Understanding for XAI-Explanations
- Goal Recognition Design for General Behavioral Agents using Machine Learning
- A Flexible Method for Behaviorally Measuring Alignment Between Human and Artificial Intelligence Using Representational Similarity Analysis
- AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents
- Neurosymbolic Association Rule Mining from Tabular Data
- Schema Generation for Large Knowledge Graphs Using Large Language Models
- Latency-aware Multimodal Federated Learning over UAV Networks
- Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
- Rethinking the shape convention of an MLP
- SingMOS-Pro: A Comprehensive Benchmark for Singing Quality Assessment
- Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets
- NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications
- A Modular Theory of Subjective Consciousness for Natural and Artificial Minds
- FINCH: Financial Intelligence using Natural language for Contextualized SQL Handling
- Small is Sufficient: Reducing the World AI Energy Consumption Through Model Selection
- HRTFformer: A Spatially-Aware Transformer for Personalized HRTF Upsampling in Immersive Audio Rendering
- Are LLMs Better GNN Helpers? Rethinking Robust Graph Learning under Deficiencies with Iterative Refinement
- Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enhancement
- Clarifying Semantics of In-Context Examples for Unit Test Generation
- The Current State of AI Bias Bounties: An Overview of Existing Programmes and Research
- KAIROS: Unified Training for Universal Non-Autoregressive Time Series Forecasting
- Unlocking Symbol-Level Precoding Efficiency Through Tensor Equivariant Neural Network
- VarCoNet: A variability-aware self-supervised framework for functional connectome extraction from resting-state fMRI
- BioinfoMCP: A Unified Platform Enabling MCP Interfaces in Agentic Bioinformatics
- Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression
- The Three Regimes of Offline-to-Online Reinforcement Learning
- RealClass: A Framework for Classroom Speech Simulation with Public Datasets and Game Engines
- Pharmacophore-Guided Generative Design of Novel Drug-Like Molecules
- Understanding Adversarial Transfer: Why Representation-Space Attacks Fail Where Data-Space Attacks Succeed
- Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information
- Predictive Modeling and Explainable AI for Veterinary Safety Profiles, Residue Assessment, and Health Outcomes Using Real-World Data and Physicochemical Properties
- Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization
- From Supervision to Exploration: What Does Protein Language Model Learn During Reinforcement Learning?
- Enhancing Noise Robustness of Parkinson's Disease Telemonitoring via Contrastive Feature Augmentation
- BioBlobs: Differentiable Graph Partitioning for Protein Representation Learning
- Source-Free Cross-Domain Continual Learning
- The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM
- Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning
- Learning Time-Series Representations by Hierarchical Uniformity-Tolerance Latent Balancing
- Shift-Invariant Attribute Scoring for Kolmogorov-Arnold Networks via Shapley Value
- Representational Alignment Across Model Layers and Brain Regions with Hierarchical Optimal Transport
- BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals
- Quantum-Assisted Correlation Clustering
- Mamba Outpaces Reformer in Stock Prediction with Sentiments from Top Ten LLMs
- Kant: An Efficient Unified Scheduling System for Large-Scale AI Clusters
- IoT-MCP: Bridging LLMs and IoT Systems Through Model Context Protocol
- RSTGCN: Railway-centric Spatio-Temporal Graph Convolutional Network for Train Delay Prediction
- Budgeted Broadcast: An Activity-Dependent Pruning Rule for Neural Network Efficiency
- Identifying Information-Transfer Nodes in a Recurrent Neural Network Reveals Dynamic Representations
- Noisy-Pair Robust Representation Alignment for Positive-Unlabeled Learning
- An Analysis of the New EU AI Act and A Proposed Standardization Framework for Machine Learning Fairness
- Emergent evaluation hubs in a decentralizing large language model ecosystem
- Evaluating New AI Cell Foundation Models on Challenging Kidney Pathology Cases Unaddressed by Previous Foundation Models
- Microsaccade-Inspired Probing: Positional Encoding Perturbations Reveal LLM Misbehaviours
- Enhancing the development of Cherenkov Telescope Array control software with Large Language Models
- DeMuon: A Decentralized Muon for Matrix Optimization over Graphs
- Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
- Neural Network Surrogates for Free Energy Computation of Complex Chemical Systems
- BioVERSE: Representation Alignment of Biomedical Modalities to LLMs for Multi-Modal Reasoning
- Learning a Dense Reasoning Reward Model from Expert Demonstration via Inverse Reinforcement Learning
- To Mask or to Mirror: Human-AI Alignment in Collective Reasoning
- Zero-shot reasoning for simulating scholarly peer-review
- ReTabAD: A Benchmark for Restoring Semantic Context in Tabular Anomaly Detection
- Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning
- FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models
- LOGicalThought: Logic-Based Ontological Grounding of LLMs for High-Assurance Reasoning
- Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models
- AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning
- AgentRec: Next-Generation LLM-Powered Multi-Agent Collaborative Recommendation with Adaptive Intelligence
- Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CDMPs
- Understanding the Geospatial Reasoning Capabilities of LLMs: A Trajectory Recovery Perspective
- GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents
- MetaboT: AI-based agent for natural language-based interaction with metabolomics knowledge graphs
- A cybersecurity AI agent selection and decision support framework
- REBot: From RAG to CatRAG with Semantic Enrichment and Graph Routing
- Human-AI Teaming Co-Learning in Military Operations
- OR-Toolformer: Modeling and Solving Operations Research Problems with Tool Augmented Large Language Models
- Modeling Others' Minds as Code
- Cyber Academia-Chemical Engineering (CA-ChemE): A Living Digital Town for Self-Directed Research Evolution and Emergent Scientific Discovery
- The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation
- Retrieval-Augmented Framework for LLM-Based Clinical Decision Support
- Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents
- OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models
- A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
- AIReg-Bench: Benchmarking Language Models That Assess AI Regulation Compliance
- Lateral Tree-of-Thoughts Surpasses ToT by Incorporating Logically-Consistent, Low-Utility Candidates
- Towards Interpretable and Inference-Optimal COT Reasoning with Sparse Autoencoder-Guided Generation
- Differentially Private Clustering in Data Streams
- Authentication Security of PRF GNSS Ranging
- An efficient quantum algorithm for computing $S$-units and its applications
- Privacy-Aware Sequential Learning
- Odontoceti: Ultra-Fast DAG Consensus with Two Round Commitment
- Adaptive Federated Learning Defences via Trust-Aware Deep Q-Networks
- Bypassing Prompt Guards in Production with Controlled-Release Prompting
- TAIBOM: Bringing Trustworthiness to AI-Enabled Systems
- FalseCrashReducer: Mitigating False Positive Crashes in OSS-Fuzz-Gen Using Agentic AI
- UpSafe°C: Upcycling for Controllable Safety in Large Language Models
- Reproducible Builds for Quantum Computing
- A Quantitative Security Analysis of S-boxes in the NIST Lightweight Cryptography Finalists
- Differentially Private Federated Learning: A Systematic Review
- Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks
- It's not Easy: Applying Supervised Machine Learning to Detect Malicious Extensions in the Chrome Web Store
- Fine-Tuning Jailbreaks under Highly Constrained Black-Box Settings: A Three-Pronged Approach
- Integrated Security Mechanisms for Weight Protection in Memristive Crossbar Arrays
- Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks
- E-FuzzEdge: Optimizing Embedded Device Security with Scalable In-Place Fuzzing
- Securing IoT Devices in Smart Cities: A Review of Proposed Solutions
- POLAR: Automating Cyber Threat Prioritization through LLM-Powered Assessment
- Evaluating the Robustness of a Production Malware Detection System to Transferable Adversarial Attacks
- Towards Imperceptible Adversarial Defense: A Gradient-Driven Shield against Facial Manipulations
- Constructions of Efficiently Implementable Boolean Functions with Provable Nonlinearity/Resiliency/Algebraic Immunity Trade-Offs
- Secure Multi-Modal Data Fusion in Federated Digital Health Systems via MCP
- Mirage Fools the Ear, Mute Hides the Truth: Precise Targeted Adversarial Attacks on Polyphonic Sound Event Detection Systems
- NoMod: A Non-modular Attack on Module Learning With Errors
- Testing Stability and Robustness in Three Cryptographic Chaotic Systems
- TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling
- Ranking Items from Discrete Ratings: The Cost of Unknown User Thresholds
- Contrastive Retrieval Heads Improve Attention-Based Re-Ranking
- Reliable Decision Making via Calibration Oriented Retrieval Augmented Generation
- Handling Heterophily in Recommender Systems with Wavelet Hypergraph Diffusion
- REALM: Recursive Relevance Modeling for LLM-based Document Re-Ranking
- Shilling Recommender Systems by Generating Side-feature-aware Fake User Profiles
- Gendered Inequalities in Online Harms: Fear, Safety Work, and Online Participation
- The Measurement Imbalance in Agentic AI Evaluation Undermines Industry Productivity Claims
- MIRAGE: Patient-Specific Mixed Reality Coaching for MRI via Depth-Only Markerless Registration and Immersive VR
- Automatic inference of an anatomically meaningful solid wood texture from a single photograph
- ViscoReg: Neural Signed Distance Functions via Viscosity Solutions
- Location Matters: Leveraging Multi-Resolution Geo-Embeddings for Housing Search
- Are LLMs ready to help non-expert users to make charts of official statistics data?
- Optimal signals assignment for eBay View Item page
- MetaSynth: Multi-Agent Metadata Generation from Implicit Feedback in Black-Box Systems
- IoDResearch: Deep Research on Private Heterogeneous Data via the Internet of Data
- Towards Human-Centered RegTech: Unpacking Professionals' Strategies and Needs for Using LLMs Safely
- Who is responsible? Social Identity, Robot Errors and Blame Attribution
- Komitee Equal Shares: Choosing Together as Voters and as Groups with a Co-designed Virtual Budget Algorithm
- Human-Robo-advisor collaboration in decision-making: Evidence from a multiphase mixed methods experimental study
- Agentic Reasoning and Refinement through Semantic Interaction
- EvolveCaptions: Empowering DHH Users Through Real-Time Collaborative Captioning
- A Locally Executable AI System for Improving Preoperative Patient Communication: A Multi-Domain Clinical Evaluation
- Multimodal Feedback for Task Guidance in Augmented Reality
- Multimodal Foundation Models for Early Disease Detection
- Understanding Dynamic Human-Robot Proxemics in the Case of Four-Legged Canine-Inspired Robots
- How AI and Human Behaviors Shape Psychosocial Effects of Extended Chatbot Use: A Longitudinal Randomized Controlled Study
- Design and Evaluation of Generative Agent-based Platform for Human-Assistant Interaction Research: A Tale of 10 User Studies
- Software Engineering for Self-Adaptive Robotics: A Research Agenda
- Manim for STEM Education: Visualizing Complex Problems Through Animation
- Beyond Divergence: Characterizing Co-exploration Patterns in Collaborative Design Processes
- An Anthropologist LLM to Elicit Users' Moral Preferences through Role-Play
- An Optical Measurement System for Open-Source Tracking of Jaw Motions
- How can AI agents support journalists' work? An experiment with designing an LLM-driven intelligent reporting system
- LegiScout: A Visual Tool for Understanding Complex Legislation
- Theory is Shapes
- The Command Line GUIde: Graphical Interfaces from Man Pages via AI
- From keywords to semantics: Perceptions of large language models in data discovery
- Dialogues with AI Reduce Beliefs in Misinformation but Build No Lasting Discernment Skills
- TimeGazer: Temporal Modeling of Predictive Gaze Stabilization for AR Interaction
- LPAC: Learnable Perception-Action-Communication Loops with Applications to Coverage Control
- One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion
- Safe Navigation of Bipedal Robots via Koopman Operator-Based Model Predictive Control
- A Tactile Feedback Approach to Path Recovery after High-Speed Impacts for Collision-Resilient Drones
- Interactive Expressive Motion Generation Using Dynamic Movement Primitives
- FalconWing: An Ultra-Light Indoor Fixed-Wing UAV Platform for Vision-Based Autonomy
- Physics-Constrained Robot Grasp Planning for Dynamic Tool Use
- DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation
- ReactEMG: Zero-Shot, Low-Latency Intent Detection via sEMG
- CRAFT: Coaching Reinforcement Learning Autonomously using Foundation Models for Multi-Robot Coordination Tasks
- An effective control of large systems of active particles: An application to evacuation problem
- Sliced Distribution Matching based on Cumulative Distribution Functions with Applications to Control
- Model Evaluation of a Transformable CubeSat for Nonholonomic Attitude Reorientation Using a Drop Tower
- SCANS: A Soft Gripper with Curvature and Spectroscopy Sensors for In-Hand Material Differentiation
- Product Digital Twin Supporting End-of-life Phase of Electric Vehicle Batteries Utilizing Product-Process-Resource Asset Network
- Performance-Guided Refinement for Visual Aerial Navigation using Editable Gaussian Splatting in FalconGym 2.0
- Retargeting Matters: General Motion Retargeting for Humanoid Motion Tracking
- ARMADA: Autonomous Online Failure Detection and Human Shared Control Empower Scalable Real-world Deployment and Adaptation
- Better Than "Better Than Nothing": Design Strategies for Enculturated Empathetic AI Robot Companions for Older Adults
- A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab
- A Robust Neural Control Design for Multi-drone Slung Payload Manipulation with Control Contraction Metrics
- Predictive Preference Learning from Human Interventions
- Cooperative Guidance for Aerial Defense in Multiagent Systems
- Data-Driven Distributionally Robust Optimal Control with State-Dependent Noise
- Statistical Uncertainty Learning for Robust Visual-Inertial State Estimation
- Symskill: Symbol and Skill Co-Invention for Data-Efficient and Real-Time Long-Horizon Manipulation
- Geometric Backstepping Control of Omnidirectional Tiltrotors Incorporating Servo-Rotor Dynamics for Robustness against Sudden Disturbances
- PolySim: Bridging the Sim-to-Real Gap for Humanoid Control via Multi-Simulator Dynamics Randomization
- Contrastive Representation Regularization for Vision-Language-Action Models
- Dual-Mode Magnetic Continuum Robot for Targeted Drug Delivery
- An Anytime, Scalable and Complete Algorithm for Embedding a Manufacturing Procedure in a Smart Factory
- Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving
- What Matters in RL-Based Methods for Object-Goal Navigation? An Empirical Study and A Unified Framework
- Like Playing a Video Game: Spatial-Temporal Optimization of Foot Trajectories for Controlled Football Kicking in Bipedal Robots
- GreenhouseSplat: A Dataset of Photorealistic Greenhouse Simulations for Mobile Robotics
- TACOS: Task Agnostic COordinator of a multi-drone System
- SPARC: Spine with Prismatic and Revolute Compliance for Quadruped Robot
- Reducing Discomfort in Driving Simulators: Motion Cueing for Motion Sickness Mitigation
- EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction
- LangGrasp: Leveraging Fine-Tuned LLMs for Language Interactive Robot Grasping with Ambiguous Instructions
- Stand Up, NAO! Increasing the Reliability of Stand-Up Motions Through Error Compensation in Position Control
- Kilometer-Scale GNSS-Denied UAV Navigation via Heightmap Gradients: A Winning System from the SPRIN-D Challenge
- Safe Motion Planning and Control Using Predictive and Adaptive Barrier Methods for Autonomous Surface Vessels
- A Stochastic Framework for Continuous-Time State Estimation of Continuum Robots
- INSIGHT: INference-time Sequence Introspection for Generating Help Triggers in Vision-Language-Action Models
- Beyond Collision Cones: Dynamic Obstacle Avoidance for Nonholonomic Robots via Dynamic Parabolic Control Barrier Functions
- How Well do Diffusion Policies Learn Kinematic Constraint Manifolds?
- AFFORD2ACT: Affordance-Guided Automatic Keypoint Selection for Generalizable and Lightweight Robotic Manipulation
- Differentiable Skill Optimisation for Powder Manipulation in Laboratory Automation
- Touching the tumor boundary: A pilot study on ultrasound based virtual fixtures for breast-conserving surgery
- VL-KnG: Visual Scene Understanding for Navigation Goal Identification using Spatiotemporal Knowledge Graphs
- Pose Estimation of a Thruster-Driven Bioinspired Multi-Link Robot
- Online Hierarchical Policy Learning using Physics Priors for Robot Navigation in Unknown Environments
- Real-time Multi-Plane Segmentation Based on GPU Accelerated High-Resolution 3D Voxel Mapping for Legged Robot Locomotion
- MiniBEE: A New Form Factor for Compact Bimanual Dexterity
- FailSafe: Reasoning and Recovery from Failures in Vision-Language-Action Models
- Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive
- Neural Network Parameter-optimization of Gaussian pmDAGs
- Asymptotic theory of in-context learning by linear attention
- Policy-Oriented Binary Classification: Improving (KD-)CART Final Splits for Subpopulation Targeting
- Golden Ratio Weighting Prevents Model Collapse
- Online Multivariate Regularized Distributional Regression for High-dimensional Probabilistic Electricity Price Forecasting
- SIM-Shapley: A Stable and Computationally Efficient Approach to Shapley Value Approximation
- WWAggr: A Window Wasserstein-based Aggregation for Ensemble Change Point Detection
- A fast and effective kernel two-sample test for large-scale data
- Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits
- Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization
- Gaussian DP for Reporting Differential Privacy Guarantees in Machine Learning
- Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity
- Near-Optimal Sample Complexities of Divergence-based S-rectangular Distributionally Robust Reinforcement Learning
- To Augment or Not to Augment? Diagnosing Distributional Symmetry Breaking
- A theoretical framework for M-posteriors: frequentist guarantees and robustness properties
- DiffKnock: Diffusion-based Knockoff Statistics for Neural Networks Inference
- Scalable Asynchronous Federated Modeling for Spatial Data
- Predictively Oriented Posteriors
- Lower Bounds on Adversarial Robustness for Multiclass Classification with General Loss Functions
- Adaptive Heterogeneous Mixtures of Normalising Flows for Robust Variational Inference
- Inferring Optical Tissue Properties from Photoplethysmography using Hybrid Amortized Inference
- Ensemble Threshold Calibration for Stable Sensitivity Control
- Reinforcement Learning with Action-Triggered Observations
- Flatness-Aware Stochastic Gradient Langevin Dynamics
- Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification
- Efficiently Generating Correlated Sample Paths from Multi-step Time Series Foundation Models
- Drop-Muon: Update Less, Converge Faster
- Risk Phase Transitions in Spiked Regression: Alignment Driven Benign and Catastrophic Overfitting
- AI Foundation Model for Time Series with Innovations Representation
- A reproducible comparative study of categorical kernels for Gaussian process regression, with new clustering-based nested kernels
- Deep Hedging Under Non-Convexity: Limitations and a Case for AlphaZero
- Precise Dynamics of Diagonal Linear Networks: A Unifying Analysis by Dynamical Mean-Field Theory
- Uniform-in-time convergence bounds for Persistent Contrastive Divergence Algorithms
- Adaptive Kernel Selection for Stein Variational Gradient Descent
- Non-Asymptotic Analysis of Data Augmentation for Precision Matrix Estimation
- Hybrid Physics-ML Framework for Pan-Arctic Permafrost Infrastructure Risk at Record 2.9-Million Observation Scale
- Efficient Probabilistic Visualization of Local Divergence of 2D Vector Fields with Independent Gaussian Uncertainty
- Safe Reinforcement Learning-Based Vibration Control: Overcoming Training Risks with LQR Guidance
- Low Rank Gradients and Where to Find Them
- On the Identifiability of Latent Action Policies
- Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity
- Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling
- How Can Time Series Analysis Benefit From Multiple Modalities? A Survey and Outlook
- Audio-Enhanced Vision-Language Modeling with Latent Space Broadening for High Quality Data Expansion
- NeoARCADE: Robust Calibration for Distance Estimation to Support Assistive Drones for the Visually Impaired
- Feature Representation Transferring to Lightweight Models via Perception Coherence
- Learning to Weight Parameters for Training Data Attribution
- AniMaker: Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
- Development of a Mobile Application for at-Home Analysis of Retinal Fundus Images
- Towards Methane Detection Onboard Satellites
- SUPER-Net: Trustworthy Image Segmentation via Uncertainty Propagation in Encoder-Decoder Networks
- Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes Prosthesis
- Momentum-SAM: Sharpness Aware Minimization without Computational Overhead
- Hierarchical place recognition with omnidirectional images and curriculum learning-based loss functions
- Subspace Node Pruning
- NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
- Continuous Wrist Control on the Hannes Prosthesis: a Vision-based Shared Autonomy Framework
- Oh-A-DINO: Understanding and Enhancing Attribute-Level Information in Self-Supervised Object-Centric Representations
- One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
- RGS-DR: Deferred Reflections and Residual Shading in 2D Gaussian Splatting
- LRFusionPR: A Polar BEV-Based LiDAR-Radar Fusion Network for Place Recognition
- Fusing Foveal Fixations Using Linear Retinal Transformations and Bayesian Experimental Design
- PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
- Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings
- GARLIC: GAussian Representation LearnIng for spaCe partitioning
- Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
- LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR
- Concept Unlearning by Modeling Key Steps of Diffusion Process
- VITA: Vision-to-Action Flow Matching Policy
- LiDAR-HMR: 3D Human Mesh Recovery from LiDAR
- Meta-Transfer Derm-Diagnosis: Exploring Few-Shot Learning and Transfer Learning for Skin Disease Classification in Long-Tail Distribution
- There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models
- What Makes a Good Dataset for Knowledge Distillation?
- VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
- Post-hoc Probabilistic Vision-Language Models
- DreamOmni: Unified Image Generation and Editing
- A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion
- Diffusion Adversarial Post-Training for One-Step Video Generation
- Multiple Queries with Multiple Keys: A Precise Prompt Matching Paradigm for Prompt-based Continual Learning
- L4P: Towards Unified Low-Level 4D Vision Perception
- How far can we go with ImageNet for Text-to-Image generation?
- What are You Looking at? Modality Contribution in Multimodal Medical Deep Learning
- An Improved Pure Fully Connected Neural Network for Rice Grain Classification
- Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation
- VaPR -- Vision-language Preference alignment for Reasoning
- Towards Photonic Band Diagram Generation with Transformer-Latent Diffusion Models
- Unsupervised Dynamic Feature Selection for Robust Latent Spaces in Vision Tasks
- GFSR-Net: Guided Focus via Segment-Wise Relevance Network for Interpretable Deep Learning in Medical Imaging
- ZK-WAGON: Imperceptible Watermark for Image Generation Models using ZK-SNARKs
- ROI-GS: Interest-based Local Quality 3D Gaussian Splatting
- $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
- A Multicentric Dataset for Training and Benchmarking Breast Cancer Segmentation in H&E Slides
- Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects
- SpurBreast: A Curated Dataset for Investigating Spurious Correlations in Real-world Breast MRI Classification
- DisCo-Layout: Disentangling and Coordinating Semantic and Physical Refinement in a Multi-Agent Framework for 3D Indoor Layout Synthesis
- Uncovering Semantic Selectivity of Latent Groups in Higher Visual Cortex with Mutual Information-Guided Diffusion
- Measurement-Guided Consistency Model Sampling for Inverse Problems
- Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning
- Test-Time Anchoring for Discrete Diffusion Posterior Sampling
- Continual Personalization for Diffusion Models
- Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models
- Ultra-Efficient Decoding for End-to-End Neural Compression and Reconstruction
- On the Role of Domain Experts in Creating Effective Tutoring Systems
- Aligning Video Models with Human Social Judgments via Behavior-Guided Fine-Tuning
- ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations
- MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics
- Median2Median: Zero-shot Suppression of Structured Noise in Images
- Beyond Simple Fusion: Adaptive Gated Fusion for Robust Multimodal Sentiment Analysis
- Inferring Dynamic Physical Properties from Video Foundation Models
- Clink! Chop! Thud! -- Learning Object Sounds from Real-World Interactions
- StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions
- Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity
- Development and Evaluation of an AI-Driven Telemedicine System for Prenatal Healthcare
- JaneEye: A 12-nm 2K-FPS 18.9-$\mu$J/Frame Event-based Eye Tracking Accelerator
- Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
- From 2D to 3D, Deep Learning-based Shape Reconstruction in Magnetic Resonance Imaging: A Review
- MorphGen: Controllable and Morphologically Plausible Generative Cell-Imaging
- An Efficient Quality Metric for Video Frame Interpolation Based on Motion-Field Divergence
- VENTURA: Adapting Image Diffusion Models for Unified Task Conditioned Navigation
- Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
- GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
- Cross-Breed Pig Identification Using Auricular Vein Pattern Recognition: A Machine Learning Approach for Small-Scale Farming Applications
- MMDEW: Multipurpose Multiclass Density Estimation in the Wild
- TempoControl: Temporal Attention Guidance for Text-to-Video Models
- RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
- DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
- From Frames to Clips: Efficient Key Clip Selection for Long-Form Video Understanding
- Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities
- NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes
- microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
- VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL
- Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
- Learning to Generate Object Interactions with Physics-Guided Video Diffusion
- MultiModal Action Conditioned Video Generation
- VideoNSA: Native Sparse Attention Scales Video Understanding
- NoiseShift: Resolution-Aware Noise Recalibration for Better Low-Resolution Image Generation
- Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction
- Automated Defect Detection for Mass-Produced Electronic Components Based on YOLO Object Detection Models
- Foundation Visual Encoders Are Secretly Few-Shot Anomaly Detectors
- ClustViT: Clustering-based Token Merging for Semantic Segmentation
- Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
- TriAlignXA: An Explainable Trilemma Alignment Framework for Trustworthy Agri-product Grading
- 4DGS-Craft: Consistent and Interactive 4D Gaussian Splatting Editing
- Pure-Pass: Fine-Grained, Adaptive Masking for Dynamic Token-Mixing Routing in Lightweight Image Super-Resolution
- Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using GPT-4o: Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework
- LiLa-Net: Lightweight Latent LiDAR Autoencoder for 3D Point Cloud Reconstruction
- kabr-tools: Automated Framework for Multi-Species Behavioral Monitoring
- GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing
- Zero-shot Human Pose Estimation using Diffusion-based Inverse solvers
- VGDM: Vision-Guided Diffusion Model for Brain Tumor Detection and Segmentation
- Mapping Historic Urban Footprints in France: Balancing Quality, Scalability and AI Techniques
- When Tracking Fails: Analyzing Failure Modes of SAM2 for Point-Based Tracking in Surgical Videos
- FRIEREN: Federated Learning with Vision-Language Regularization for Segmentation
- FideDiff: Efficient Diffusion Model for High-Fidelity Image Motion Deblurring
- LadderMoE: Ladder-Side Mixture of Experts Adapters for Bronze Inscription Recognition
- VirDA: Reusing Backbone for Unsupervised Domain Adaptation with Visual Reprogramming
- Discrete Facial Encoding: A Framework for Data-driven Facial Display Discovery
- Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale
- UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction
- An Efficient Deep Template Matching and In-Plane Pose Estimation Method via Template-Aware Dynamic Convolution
- Look Less, Reason More: Rollout-Guided Adaptive Pixel-Space Reasoning
- Uncovering Overconfident Failures in CXR Models via Augmentation-Sensitivity Risk Scoring
- FreeViS: Training-free Video Stylization with Inconsistent References
- MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
- Holistic Order Prediction in Natural Scenes
- PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
- LOBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction
- Pack and Force Your Memory: Long-form and Consistent Video Generation
- Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving
- Leveraging Prior Knowledge of Diffusion Model for Person Search
- DisCo: Reinforcement with Diversity Constraints for Multi-Human Generation
- GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings
- Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories
- Purrception: Variational Flow Matching for Vector-Quantized Image Generation
- AortaDiff: A Unified Multitask Diffusion Framework For Contrast-Free AAA Imaging
- WALT: Web Agents that Learn Tools
- MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
- Towards Better Optimization For Listwise Preference in Diffusion Models
- Growing Visual Generative Capacity for Pre-Trained MLLMs
- Robust Classification of Oral Cancer with Limited Training Data
- Consistent Assistant Domains Transformer for Source-free Domain Adaptation
- Guiding Multimodal Large Language Models with Blind and Low Vision People Visual Questions for Proactive Visual Interpretations
- ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models
- NPN: Non-Linear Projections of the Null-Space for Imaging Inverse Problems
- Automated Genomic Interpretation via Concept Bottleneck Models for Medical Robotics
- VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
- Joint Deblurring and 3D Reconstruction for Macrophotography
- Synergizing LLMs and Knowledge Graphs: A Novel Approach to Software Repository-Related Question Answering
- MathArena: Evaluating LLMs on Uncontaminated Math Competitions
- Differential Information Distribution: A Bayesian Perspective on Direct Preference Optimization
- DrKGC: Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion across General and Biomedical Domains
- Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering
- Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
- Efficient Whole Slide Pathology VQA via Token Compression
- LVTINO: LAtent Video consisTency INverse sOlver for High Definition Video Restoration
- Image Generation Based on Image Style Extraction
- EvoStruggle: A Dataset Capturing the Evolution of Struggle across Activities and Skill Levels
- SPUS: A Lightweight and Parameter-Efficient Foundation Model for PDEs
- Should I Share this Translation? Evaluating Quality Feedback for User Reliance on Machine Translation
- MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
- Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning
- No Language Data Left Behind: A Comparative Study of CJK Language Datasets in the Hugging Face Ecosystem
- Reason to Rote: Rethinking Memorization in Reasoning
- Flexible Feature Distillation for Large Language Models
- Reasoning Models are Test Exploiters: Rethinking Multiple-Choice
- AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs
- Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Spatial Reasoning
- Interpretable Text Embeddings and Text Similarity Explanation: A Survey
- When Disagreements Elicit Robustness: Investigating Self-Repair Capabilities under LLM Multi-Agent Disagreements
- FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean4
- Probabilistic Reasoning with LLMs for k-anonymity Estimation
- TLUE: A Tibetan Language Understanding Evaluation Benchmark
- Boundless Byte Pair Encoding: Breaking the Pre-tokenization Barrier
- Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward
- WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms
- Design and Application of Multimodal Large Language Model Based System for End to End Automation of Accident Dataset Generation
- OntoURL: A Benchmark for Evaluating Large Language Models on Symbolic Ontological Understanding, Reasoning and Learning
- LEXam: Benchmarking Legal Reasoning on 340 Law Exams
- ABBA-Adapters: Efficient and Expressive Fine-Tuning of Foundation Models
- MolLangBench: A Comprehensive Benchmark for Language-Prompted Molecular Structure Recognition, Editing, and Generation
- BiasLab: Toward Explainable Political Bias Detection with Dual-Axis Annotations and Rationale Indicators
- Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation
- Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
- When Models Reason in Your Language: Controlling Thinking Language Comes at the Cost of Accuracy
- StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
- The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models
- Study on LLMs for Promptagator-Style Dense Retriever Training
- ExGRPO: Learning to Reason from Experience
- The Unreasonable Effectiveness of Scaling Agents for Computer Use
- RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
- Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks
- Interactive Training: Feedback-Driven Neural Network Optimization
- Superficial Safety Alignment Hypothesis
- Self-Consistency Falls Short! The Adverse Effects of Positional Bias on Long-Context Problems
- Reasoning over User Preferences: Knowledge Graph-Augmented LLMs for Explainable Conversational Recommendations
- Push the Limit of Multi-modal Emotion Recognition by Prompting LLMs with Receptive-Field-Aware Attention Weighting
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
- Adapting Large Language Models for Character-based Augmentative and Alternative Communication
- Out-of-Distribution Detection using Synthetic Data Generation
- From Videos to Indexed Knowledge Graphs -- Framework to Marry Methods for Multimodal Content Analysis and Understanding
- Information Seeking for Robust Decision Making under Partial Observability
- InvThink: Towards AI Safety via Inverse Reasoning
- Synthetic Prefixes to Mitigate Bias in Real-Time Neural Query Autocomplete
- Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression
- Bridging Collaborative Filtering and Large Language Models with Dynamic Alignment, Multimodal Fusion and Evidence-grounded Explanations
- PychoBench: Evaluating the Psychology Intelligence of Large Language Models
- LLM4Rec: Large Language Models for Multimodal Generative Recommendation with Causal Debiasing
- Quagmires in SFT-RL Post-Training: When High SFT Scores Mislead and What to Use Instead
- Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
- Position: Privacy Is Not Just Memorization!
- Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness
- Improving AGI Evaluation: A Data Science Perspective
- Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction
- Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
- Constrained Adaptive Rejection Sampling
- Do AI Models Perform Human-like Abstract Reasoning Across Modalities?
- A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports
- Jailbreaking LLMs via Semantically Relevant Nested Scenarios with Targeted Toxic Knowledge
- Utilizing Modern Large Language Models (LLM) for Financial Trend Analysis and Digest Creation
- Automated Extraction of Material Properties using LLM-based AI Agents
- RSAVQ: Riemannian Sensitivity-Aware Vector Quantization for Large Language Models
- RLP: Reinforcement as a Pretraining Objective
- LLM-based Multi-Agent Blackboard System for Information Discovery in Data Science
- Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models
- Aristotle: IMO-level Automated Theorem Proving
- MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments
- WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents
- Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort
- Fine-tuning with RAG for Improving LLM Learning of New Skills
- Optimal Stopping vs Best-of-$N$ for Inference Time Optimization
- VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning
- LSPO: Length-aware Dynamic Sampling for Policy Optimization in LLM Reasoning
- Extracting O*NET Features from the NLx Corpus to Build Public Use Aggregate Labor Market Data
- Style Over Story: A Process-Oriented Study of Authorial Creativity in Large Language Models
- Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage
- Chain-of-Thought Reasoning in Streaming Full-Duplex End-to-End Spoken Dialogue Systems
- The Disparate Impacts of Speculative Decoding
- RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization
- Learning to Reason for Hallucination Span Detection
- ARUQULA -- An LLM based Text2SPARQL Approach using ReAct and Knowledge Graph Exploration Utilities
- Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
- More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration
- Enhanced Arabic-language cyberbullying detection: deep embedding and transformer (BERT) approaches
- AccurateRAG: A Framework for Building Accurate Retrieval-Augmented Question-Answering Applications
- Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
- InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents
- Parallel Scaling Law: Unveiling Reasoning Generalization through A Cross-Linguistic Perspective
- From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens
- F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data
- Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
- Control the Temperature: Selective Sampling for Diverse and High-Quality LLM Outputs
- How Do Language Models Compose Functions?
- Format Inertia: A Failure Mechanism of LLMs in Medical Pre-Consultation
- What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?
- Machine-interpretable Engineering Design Standards for Valve Specification
- Can LLMs Refuse Questions They Do Not Know? Measuring Knowledge-Aware Refusal in Factual Tasks
- Comparison of Unsupervised Metrics for Evaluating Judicial Decision Extraction
- Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network
- Syntactic Blind Spots: How Misalignment Leads to LLMs Mathematical Errors
- SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
- Model Merging to Maintain Language-Only Performance in Developmentally Plausible Multimodal Models
- REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration
- Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey
- Inverse Language Modeling towards Robust and Grounded LLMs
- Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
- Taking a SEAT: Predicting Value Interpretations from Sentiment, Emotion, Argument, and Topic Annotations
- Exploring Database Normalization Effects on SQL Generation
- LLM-Based Multi-Task Bangla Hate Speech Detection: Type, Severity, and Target
- TUMIX: Multi-Agent Test-Time Scaling with Tool-Use Mixture
- Evaluation Sheet for Deep Research: A Use Case for Academic Survey Writing
- HiSpec: Hierarchical Speculative Decoding for LLMs
- TAG-EQA: Text-And-Graph for Event Question Answering via Structured Prompting Strategies
- A-VERT: Agnostic Verification with Embedding Ranking Targets
- One More Question is Enough, Expert Question Decomposition (EQD) Model for Domain Quantitative Reasoning
- ReSSFormer: A Recursive Sparse Structured Transformer for Scalable and Long-Context Reasoning
- CLUE: Non-parametric Verification from Experience via Hidden-State Clustering
- A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation
- RAG-BioQA: Retrieval-Augmented Generation for Long-Form Biomedical Question Answering
- Efficient Training of Robust Traditional Chinese LLaMA-1B on a Single Consumer GPU: Continual Pre-training, SFT, and DPO
- AMAS: Adaptively Determining Communication Topology for LLM-based Multi-Agent System
- NLP Methods for Detecting Novel LLM Jailbreaks and Keyword Analysis with BERT
- Learning to Look at the Other Side: A Semantic Probing Study of Word Embeddings in LLMs with Enabled Bidirectional Attention
- SoK: Measuring What Matters for Closed-Loop Security Agents
- MDSEval: A Meta-Evaluation Benchmark for Multimodal Dialogue Summarization
- FOR-Prompting: From Objection to Revision via an Asymmetric Prompting Protocol
- A Comparative Analysis of Sparse Autoencoder and Activation Difference in Language Model Steering
- Let's Play Across Cultures: A Large Multilingual, Multicultural Benchmark for Assessing Language Models' Understanding of Sports
- SSTAG: Structure-Aware Self-Supervised Learning Method for Text-Attributed Graphs
- LOCA: Logical Chain Augmentation for Scientific Corpus Cleaning
- GemDetox at TextDetox CLEF 2025: Enhancing a Massively Multilingual Model for Text Detoxification on Low-resource Languages
- Efficient Uncertainty Estimation for LLM-based Entity Linking in Tabular Data
- GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models
- Do Bias Benchmarks Generalise? Evidence from Voice-based Evaluation of Gender Bias in SpeechLLMs
- Longitudinal Monitoring of LLM Content Moderation of Social Issues
- RJE: A Retrieval-Judgment-Exploration Framework for Efficient Knowledge Graph Question Answering with LLMs
- Measuring Algorithmic Partisanship via Zero-Shot Classification and Its Implications on Political Discourse
- In AI Sweet Harmony: Sociopragmatic Guardrail Bypasses and Evaluation-Awareness in OpenAI gpt-oss-20b
- OpenAI's GPT-OSS-20B Model and Safety Alignment Issues in a Low-Resource Language
- AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees
- Think Twice, Generate Once: Safeguarding by Progressive Self-Reflection
- TraceDet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models
- LLM Based Sentiment Classification From Bangladesh E-Commerce Reviews
- EEFSUVA: A New Mathematical Olympiad Benchmark
- Who is In Charge? Dissecting Role Conflicts in Instruction Following
- Enhancing Transformer-Based Rerankers with Synthetic Data and LLM-Based Supervision
- Geometric Structures and Patterns of Meaning: A PHATE Manifold Analysis of Chinese Character Embeddings
- Trustworthy Summarization via Uncertainty Quantification and Risk Awareness in Large Language Models
- Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks
- Computational Social Linguistics for Telugu Cultural Preservation: Novel Algorithms for Chandassu Metrical Pattern Recognition
- LLMRank: Understanding LLM Strengths for Model Routing
- GRPO++: Enhancing Dermatological Reasoning under Low Resource Settings
- Confidence-Aware Routing for Large Language Model Reliability Enhancement: A Multi-Signal Approach to Pre-Generation Hallucination Mitigation
- Silent Tokens, Loud Effects: Padding in LLMs
- CIFLEX: Contextual Instruction Flow for Sub-task Execution in Multi-Turn Interactions with a Single On-Device LLM
- SKYLENAGE Technical Report: Mathematical Reasoning and Contest-Innovation Benchmarks for Multi-Level Math Evaluation
- Redundancy-as-Masking: Formalizing the Artificial Age Score (AAS) to Model Memory Aging in Generative AI
- Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing
- Feasibility of Structuring Stress Documentation Using an Ontology-Guided Large Language Model
- SeMob: Semantic Synthesis for Dynamic Urban Mobility Prediction
- Uncovering Implicit Bias in Large Language Models with Concept Learning Dataset
- Towards Open-Ended Discovery for Low-Resource NLP
- Discourse vs emissions: Analysis of corporate narratives, symbolic practices, and mimicry through LLMs
- Context Matters: Comparison of commercial large language tools in veterinary medicine
- ClaimCheck: Real-Time Fact-Checking with Small Language Models
Research Sources: 724 | Generated: 10/3/2025