Updated on 2025.10.08

This page, maintained by Leheng Li, lists papers he is interested in. The source code for this site is available here.

3D

Publish Date Title Authors PDF Code
2025-10-07 Human3R: Everyone Everywhere All at Once Yue Chen et.al. 2510.06219 null
2025-10-07 Dropping the D: RGB-D SLAM Without the Depth Sensor Mert Kiray et.al. 2510.06216 null
2025-10-07 ShapeGen4D: Towards High Quality 4D Shape Generation from Videos Jiraphon Yenphraphai et.al. 2510.06208 null
2025-10-07 DYMO-Hair: Generalizable Volumetric Dynamics Modeling for Robot Hair Manipulation Chengyang Zhao et.al. 2510.06199 null
2025-10-07 Vision-Guided Targeted Grasping and Vibration for Robotic Pollination in Controlled Environments Jaehwan Jeong et.al. 2510.06146 null
2025-10-07 Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images Aditya Prakash et.al. 2510.06145 null
2025-10-07 GLVD: Guided Learned Vertex Descent Pol Caselles Rico et.al. 2510.06046 null
2025-10-07 Coordinate-Consistent Localization via Continuous-Time Calibration and Fusion of UWB and SLAM Observations Tien-Dat Nguyen et.al. 2510.05992 null
2025-10-07 ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving Yongxuan Lyu et.al. 2510.05752 null
2025-10-07 PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction Ziqiao Meng et.al. 2510.05613 null
2025-10-07 HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video Hongchi Xia et.al. 2510.05560 null
2025-10-07 GO-Flock: Goal-Oriented Flocking in 3D Unknown Environments with Depth Maps Yan Rui Tan et.al. 2510.05553 null
2025-10-07 Human Action Recognition from Point Clouds over Time James Dickens et.al. 2510.05506 null
2025-10-07 ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars Peizhi Yan et.al. 2510.05488 null
2025-10-06 AD-NODE: Adaptive Dynamics Learning with Neural ODEs for Mobile Robots Control Shao-Yi Yu et.al. 2510.05443 null
2025-10-06 Active Semantic Perception Huayi Tang et.al. 2510.05430 null
2025-10-06 SegMASt3R: Geometry Grounded Segment Matching Rohit Jayanti et.al. 2510.05051 null
2025-10-06 Efficient Navigation in Unknown Indoor Environments with Vision-Language Models D. Schwartz et.al. 2510.04991 null
2025-10-06 Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion Xin Li et.al. 2510.04947 null
2025-10-06 From Actions to Kinesics: Extracting Human Psychological States through Bodily Movements Cheyu Lin et.al. 2510.04844 null
2025-10-06 Detailed Aerial Mapping of Photovoltaic Power Plants Through Semantically Significant Keypoints Viktor Kozák et.al. 2510.04840 null
2025-10-06 Flow Matching for Conditional MRI-CT and CBCT-CT Image Synthesis Arnela Hadzic et.al. 2510.04823 null
2025-10-06 Did you just see that? Arbitrary view synthesis for egocentric replay of operating room workflows from ambient sensors Han Zhang et.al. 2510.04802 null
2025-10-06 A Comparative Study of Vision Transformers and CNNs for Few-Shot Rigid Transformation and Fundamental Matrix Estimation Alon Kaya et.al. 2510.04794 null
2025-10-06 Hands-Free Heritage: Automated 3D Scanning for Cultural Heritage Digitization Javed Ahmad et.al. 2510.04781 null
2025-10-06 Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction Chi Yan et.al. 2510.04759 null
2025-10-06 Object-Centric Representation Learning for Enhanced 3D Scene Graph Prediction KunHo Heo et.al. 2510.04714 null
2025-10-06 Label-Efficient Cross-Modality Generalization for Liver Segmentation in Multi-Phase MRI Quang-Khai Bui-Tran et.al. 2510.04705 null
2025-10-06 Bio-Inspired Robotic Houbara: From Development to Field Deployment for Behavioral Studies Lyes Saad Saoud et.al. 2510.04692 null
2025-10-06 C3Editor: Achieving Controllable Consistency in 2D Model for 3D Editing Zeng Tao et.al. 2510.04539 null
2025-10-06 3Dify: a Framework for Procedural 3D-CG Generation Assisted by LLMs Using MCP and RAG Shun-ichiro Hayashi et.al. 2510.04536 null
2025-10-06 VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery Nonghai Zhang et.al. 2510.04479 null
2025-10-05 RAP: 3D Rasterization Augmented End-to-End Planning Lan Feng et.al. 2510.04333 null
2025-10-05 CARE-PD: A Multi-Site Anonymized Clinical Dataset for Parkinson’s Disease Gait Assessment Vida Adeli et.al. 2510.04312 null
2025-10-05 Scaling Sequence-to-Sequence Generative Neural Rendering Shikun Liu et.al. 2510.04236 null
2025-10-05 Diffusion Low Rank Hybrid Reconstruction for Sparse View Medical Imaging Zongyin Deng et.al. 2510.04069 null
2025-10-05 MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation Zhenyu Pan et.al. 2510.04057 null
2025-10-05 Fit Pixels, Get Labels: Meta-learned Implicit Networks for Image Segmentation Kushal Vyas et.al. 2510.04021 null
2025-10-04 Sliding Window Attention for Learned Video Compression Alexander Kopte et.al. 2510.03926 null
2025-10-04 Talking Tennis: Language Feedback from 3D Biomechanical Action Recognition Arushi Dashore et.al. 2510.03921 null
2025-10-04 OpenFLAME: Federated Visual Positioning System to Enable Large-Scale Augmented Reality Applications Sagar Bharadwaj et.al. 2510.03915 null
2025-10-04 Bridge Thinking and Acting: Unleashing Physical Potential of VLM with Generalizable Action Expert Mingyu Liu et.al. 2510.03896 null
2025-10-04 Seeing the Bigger Picture: 3D Latent Mapping for Mobile Manipulation Policy Learning Sunghwan Kim et.al. 2510.03885 null
2025-10-04 DHQA-4D: Perceptual Quality Assessment of Dynamic 4D Digital Human Yunhao Li et.al. 2510.03874 null
2025-10-04 PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis Saja Al-Dabet et.al. 2510.03873 null
2025-10-04 Efficiency vs. Efficacy: Assessing the Compression Ratio-Dice Score Relationship through a Simple Benchmarking Framework for Cerebrovascular 3D Segmentation Shimaa Elbana et.al. 2510.03769 null
2025-10-03 SketchPlan: Diffusion Based Drone Planning From Human Sketches Sixten Norelius et.al. 2510.03545 null
2025-10-03 Platonic Transformers: A Solid Choice For Equivariance Mohammad Mohaiminul Islam et.al. 2510.03511 null
2025-10-03 Digital-Twin Evaluation for Proactive Human-Robot Collision Avoidance via Prediction-Guided A-RRT* Vadivelan Murugesan et.al. 2510.03496 null
2025-10-03 Spatial-ViLT: Enhancing Visual Spatial Reasoning through Multi-Task Learning Chashi Mahiul Islam et.al. 2510.03441 null
2025-10-03 Style Brush: Guided Style Transfer for 3D Objects Áron Samuel Kovács et.al. 2510.03433 null
2025-10-03 Real-time nonlinear inversion of magnetic resonance elastography with operator learning Juampablo E. Heras Rivera et.al. 2510.03372 null
2025-10-03 Unified Unsupervised Anomaly Detection via Matching Cost Filtering Zhe Zhang et.al. 2510.03363 null
2025-10-02 Sonar Image Datasets: A Comprehensive Survey of Resources, Challenges, and Applications Larissa S. Gomes et.al. 2510.03353 null
2025-10-02 Visual Odometry with Transformers Vlardimir Yugay et.al. 2510.03348 null
2025-09-30 Universal Beta Splatting Rong Liu et.al. 2510.03312 null
2025-10-03 Memory Forcing: Spatio-Temporal Memory for Consistent Scene Generation on Minecraft Junchao Huang et.al. 2510.03198 null
2025-10-03 Dynamic Prompt Generation for Interactive 3D Medical Image Segmentation Training Tidiane Camaret Ndir et.al. 2510.03189 null
2025-10-03 ROGR: Relightable 3D Objects using Generative Relighting Jiapeng Tang et.al. 2510.03163 null
2025-10-03 GeoComplete: Geometry-Aware Diffusion for Reference-Driven Image Completion Beibei Lin et.al. 2510.03110 null
2025-10-03 Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields Zhiting Mei et.al. 2510.03104 null
2025-10-03 3D-CovDiffusion: 3D-Aware Diffusion Policy for Coverage Path Planning Chenyuan Chen et.al. 2510.03011 null
2025-10-03 Towards Scalable and Consistent 3D Editing Ruihao Xia et.al. 2510.02994 null
2025-10-03 PyRadiomics-cuda: a GPU-accelerated 3D features extraction from medical images within PyRadiomics Jakub Lisowski et.al. 2510.02894 null
2025-10-03 GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting Xinran Zhang et.al. 2510.02884 null
2025-10-03 Flow with the Force Field: Learning 3D Compliant Flow Matching Policies from Force and Demonstration-Guided Simulation Data Tianyu Li et.al. 2510.02738 null
2025-10-03 From Tokens to Nodes: Semantic-Guided Motion Control for Dynamic 3D Gaussian Splatting Jianing Chen et.al. 2510.02732 null
2025-10-03 Visualizing Spatial Point Clouds: A Task-Oriented Taxonomy Mahsa Partovi et.al. 2510.02651 null
2025-10-02 Ego-Exo 3D Hand Tracking in the Wild with a Mobile Multi-Camera Rig Patrick Rim et.al. 2510.02601 null
2025-10-02 PhysHMR: Learning Humanoid Control Policies from Vision for Physically Plausible Human Motion Reconstruction Qiao Feng et.al. 2510.02566 null
2025-10-02 StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions Bo-Hsu Ke et.al. 2510.02314 null
2025-10-02 Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities Mario Medrano-Paredes et.al. 2510.02264 null
2025-10-02 GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation Weijia Dou et.al. 2510.02186 null
2025-10-02 DisCo-Layout: Disentangling and Coordinating Semantic and Physical Refinement in a Multi-Agent Framework for 3D Indoor Layout Synthesis Jialin Gao et.al. 2510.02178 null
2025-10-02 EC3R-SLAM: Efficient and Consistent Monocular Dense SLAM with Feed-Forward 3D Reconstruction Lingxiang Hu et.al. 2510.02080 null
2025-10-02 GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing Mengtian Li et.al. 2510.02034 null
2025-10-02 LiLa-Net: Lightweight Latent LiDAR Autoencoder for 3D Point Cloud Reconstruction Mario Resino et.al. 2510.02028 null
2025-10-02 ROI-GS: Interest-based Local Quality 3D Gaussian Splatting Quoc-Anh Bui et.al. 2510.01978 null
2025-10-02 Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving Cornelius Schröder et.al. 2510.01829 null
2025-10-02 An Anytime, Scalable and Complete Algorithm for Embedding a Manufacturing Procedure in a Smart Factory Christopher Leet et.al. 2510.01770 null
2025-10-02 LOBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction Sheng-Hsiang Hung et.al. 2510.01767 null
2025-10-03 UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction Jin Cao et.al. 2510.01669 null
2025-10-02 Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale Yongbo Chen et.al. 2510.01665 null
2025-10-02 Discrete Facial Encoding: A Framework for Data-driven Facial Display Discovery Minh Tran et.al. 2510.01662 null
2025-10-02 Joint Deblurring and 3D Reconstruction for Macrophotography Yifan Zhao et.al. 2510.01640 null
2025-10-02 MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics Changmin Lee et.al. 2510.01619 null
2025-10-02 ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations Qiyuan Zeng et.al. 2510.01607 null
2025-10-02 Real-time Multi-Plane Segmentation Based on GPU Accelerated High-Resolution 3D Voxel Mapping for Legged Robot Locomotion Shun Niijima et.al. 2510.01592 null
2025-10-01 From 2D to 3D, Deep Learning-based Shape Reconstruction in Magnetic Resonance Imaging: A Review Emma McMillian et.al. 2510.01296 null
2025-10-01 EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory Jiahao Wang et.al. 2510.01183 null
2025-10-01 Audio Driven Real-Time Facial Animation for Social Telepresence Jiye Lee et.al. 2510.01176 null
2025-10-01 KeySG: Hierarchical Keyframe-Based 3D Scene Graphs Abdelrhman Werby et.al. 2510.01049 null
2025-10-01 A Scene is Worth a Thousand Features: Feed-Forward Camera Localization from a Collection of Image Features Axel Barroso-Laguna et.al. 2510.00978 null
2025-10-01 PAL-Net: A Point-Wise CNN with Patch-Attention for 3D Facial Landmark Localization Ali Shadman Yazdi et.al. 2510.00910 null
2025-10-01 AI-CNet3D: An Anatomically-Informed Cross-Attention Network with Multi-Task Consistency Fine-tuning for 3D Glaucoma Classification Roshan Kenia et.al. 2510.00882 null
2025-10-01 PhraseStereo: The First Open-Vocabulary Stereo Image Segmentation Dataset Thomas Campagnolo et.al. 2510.00818 null
2025-10-01 Beyond one-hot encoding? Journey into compact encoding for large multi-class segmentation Aaron Kujawa et.al. 2510.00667 null
2025-10-01 Enabling High-Frequency Cross-Modality Visual Positioning Service for Accurate Drone Landing Haoyang Wang et.al. 2510.00646 null
2025-10-01 Multi-level Dynamic Style Transfer for NeRFs Zesheng Li et.al. 2510.00592 null
2025-10-01 Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation Taeyun Woo et.al. 2510.00527 null
2025-10-01 Affordance-Guided Diffusion Prior for 3D Hand Reconstruction Naru Suzuki et.al. 2510.00506 null
2025-10-01 A Fast and Precise Method for Searching Rectangular Tumor Regions in Brain MR Images Hidenori Takeshima et.al. 2510.00505 null
2025-10-01 From Human Hands to Robot Arms: Manipulation Skills Transfer via Trajectory Alignment Han Zhou et.al. 2510.00491 null
2025-10-01 Domain-Specialized Interactive Segmentation Framework for Meningioma Radiotherapy Planning Junhyeok Lee et.al. 2510.00416 null
2025-09-30 TTT3R: 3D Reconstruction as Test-Time Training Xingyu Chen et.al. 2509.26645 null
2025-09-30 MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation Zhuoyang Liu et.al. 2509.26642 null
2025-09-30 Learning Generalizable Shape Completion with SIM(3) Equivariance Yuqing Wang et.al. 2509.26631 null
2025-09-30 HART: Human Aligned Reconstruction Transformer Xiyi Chen et.al. 2509.26621 null
2025-09-30 DA$^2$: Depth Anything in Any Direction Haodong Li et.al. 2509.26618 null
2025-09-30 Memory-Efficient 2D/3D Shape Assembly of Robot Swarms Shuoyu Yue et.al. 2509.26518 null
2025-09-30 DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance Jijun Xiang et.al. 2509.26498 null
2025-09-30 Stylos: Multi-View 3D Stylization with Single-Forward Gaussian Splatting Hanzhou Liu et.al. 2509.26455 null
2025-09-30 Continuous Space-Time Video Super-Resolution with 3D Fourier Fields Alexander Becker et.al. 2509.26325 null
2025-09-30 ISyHand: A Dexterous Multi-finger Robot Hand with an Articulated Palm Benjamin A. Richardson et.al. 2509.26236 null
2025-09-30 3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation Balamurugan Thambiraja et.al. 2509.26233 null
2025-09-30 Text-to-Scene with Large Reasoning Models Frédéric Berdoz et.al. 2509.26091 null
2025-09-30 EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models Seamie Hayes et.al. 2509.26087 null
2025-09-30 GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts Zhenyu Shu et.al. 2509.26055 null
2025-09-30 PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion Zhiwei Zhang et.al. 2509.26008 null
2025-09-30 Towards Human Engagement with Realistic AI Combat Pilots Ardian Selmonaj et.al. 2509.26002 null
2025-09-30 PinPoint3D: Fine-Grained 3D Part Segmentation from a Few Clicks Bojun Zhang et.al. 2509.25970 null
2025-10-01 A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI Arvind Murari Vepa et.al. 2509.25889 null
2025-09-30 Vector sketch animation generation with differentialable motion trajectories Xinding Zhu et.al. 2509.25857 null
2025-09-30 IPDRecon: Image-Plane Geometric Decoding for View-Invariant Indoor Scene Reconstruction Mingyang Li et.al. 2509.25744 null
2025-09-30 Dragging with Geometry: From Pixels to Geometry-Guided Image Editing Xinyu Pu et.al. 2509.25740 null
2025-09-30 LieHMR: Autoregressive Human Mesh Recovery with $SO(3)$ Diffusion Donghwan Kim et.al. 2509.25739 null
2025-09-30 Using Images from a Video Game to Improve the Detection of Truck Axles Leandro Arab Marcomini et.al. 2509.25644 null
2025-09-29 GaussianLens: Localized High-Resolution Reconstruction via On-Demand Gaussian Densification Yijia Weng et.al. 2509.25603 null
2025-09-29 Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments Zihan Zhang et.al. 2509.25542 null
2025-09-29 LLM-RG: Referential Grounding in Outdoor Scenarios using Large Language Models Pranav Saxena et.al. 2509.25528 null
2025-09-29 Robust Visual Localization in Compute-Constrained Environments by Salient Edge Rendering and Weighted Hamming Similarity Tu-Hoa Pham et.al. 2509.25520 null
2025-10-01 DepthLM: Metric Depth From Vision Language Models Zhipeng Cai et.al. 2509.25413 null
2025-09-29 Computational Design and Single-Wire Sensing of 3D Printed Objects with Integrated Capacitive Touchpoints S. Sandra Bae et.al. 2509.25387 null
2025-10-01 Editing Physiological Signals in Videos Using Latent Representations Tianwen Zhou et.al. 2509.25348 null
2025-09-29 VGGT-X: When VGGT Meets Dense Novel View Synthesis Yang Liu et.al. 2509.25191 null
2025-09-29 Visual Jigsaw Post-Training Improves MLLMs Penghao Wu et.al. 2509.25190 null
2025-09-29 PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos Ting-Hsuan Liao et.al. 2509.25183 null
2025-09-29 Triangle Splatting+: Differentiable Rendering with Opaque Triangles Jan Held et.al. 2509.25122 null
2025-09-29 Unsupervised Representation Learning for 3D Mesh Parameterization with Semantic and Visibility Objectives AmirHossein Zamani et.al. 2509.25094 null
2025-09-29 UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation Guanjun Wu et.al. 2509.25079 null
2025-10-02 GEM: 3D Gaussian Splatting for Efficient and Accurate Cryo-EM Reconstruction Huaizhi Qu et.al. 2509.25075 null
2025-09-29 LVT: Large-Scale Scene Reconstruction via Local View Transformers Tooba Imtiaz et.al. 2509.25001 null
2025-09-29 PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion Yuyang Yin et.al. 2509.24997 null
2025-09-29 Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes Yuhan Wang et.al. 2509.24986 null
2025-09-29 On-the-Fly Data Augmentation for Brain Tumor Segmentation Ishika Jain et.al. 2509.24973 null
2025-09-29 Social 3D Scene Graphs: Modeling Human Actions and Relations for Interactive Service Robots Ermanno Bartoli et.al. 2509.24966 null
2025-09-29 Real-time Recognition of Human Interactions from a Single RGB-D Camera for Socially-Aware Robot Navigation Thanh Long Nguyen et.al. 2509.24907 null
2025-09-29 DWGS: Enhancing Sparse-View Gaussian Splatting with Hybrid-Loss Depth Estimation and Bidirectional Warping Yu Ma et.al. 2509.24893 null
2025-09-29 Finding an Initial Probe Pose in Teleoperated Robotic Echocardiography via 2D LiDAR-Based 3D Reconstruction Mariadas Capsran Roshan et.al. 2509.24867 null
2025-09-29 UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections Zeyu Cai et.al. 2509.24817 null
2025-09-29 TACO-Net: Topological Signatures Triumph in 3D Object Classification Anirban Ghosh et.al. 2509.24802 null
2025-09-29 SkyLink: Unifying Street-Satellite Geo-Localization via UAV-Mediated 3D Scene Alignment Hongyang Zhang et.al. 2509.24783 null
2025-10-03 ExGS: Extreme 3D Gaussian Compression with Diffusion Priors Jiaqi Chen et.al. 2509.24758 null
2025-09-29 NeuralPVS: Learned Estimation of Potentially Visible Sets Xiangyu Wang et.al. 2509.24677 null
2025-09-29 PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control Haozhuo Zhang et.al. 2509.24591 null
2025-09-29 BFSM: 3D Bidirectional Face-Skull Morphable Model Zidu Wang et.al. 2509.24577 null
2025-09-29 CORE-3D: Context-aware Open-vocabulary Retrieval by Embeddings in 3D Mohamad Amin Mirzaei et.al. 2509.24528 null
2025-09-29 NeoWorld: Neural Simulation of Explorable Virtual Worlds via Progressive 3D Unfolding Yanpeng Zhao et.al. 2509.24441 null
2025-10-01 Proxy-GS: Efficient 3D Gaussian Splatting via Proxy Mesh Yuanyuan Gao et.al. 2509.24421 null
2025-09-29 RapidMV: Leveraging Spatio-Angular Representations for Efficient and Consistent Text-to-Multi-View Synthesis Seungwook Kim et.al. 2509.24410 null
2025-09-29 Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy Haijier Chen et.al. 2509.24385 null
2025-09-29 DINOReg: Strong Point Cloud Registration with Vision Foundation Model Congjia Chen et.al. 2509.24370 null
2025-09-29 SONAR: Semantic-Object Navigation with Aggregated Reasoning through a Cross-Modal Inference Paradigm Yao Wang et.al. 2509.24321 null
2025-09-29 ASIA: Adaptive 3D Segmentation using Few Image Annotations Sai Raj Kishore Perla et.al. 2509.24288 null
2025-09-29 Robust Partial 3D Point Cloud Registration via Confidence Estimation under Global Context Yongqiang Wang et.al. 2509.24275 null
2025-09-29 Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds Yongqiang Wang et.al. 2509.24273 null
2025-09-29 Cycle Diffusion Model for Counterfactual Image Generation Fangrui Huang et.al. 2509.24267 null
2025-09-29 Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse-view Videos Yingdong Hu et.al. 2509.24209 null
2025-09-29 An Efficient 3D Latent Diffusion Model for T1-contrast Enhanced MRI Generation Zach Eidex et.al. 2509.24194 null
2025-09-29 Tumor Synthesis conditioned on Radiomics Jonghun Kim et.al. 2509.24182 null
2025-09-29 LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis Moxin Zhao et.al. 2509.24165 null
2025-09-29 Neural Visibility of Point Sets Jun-Hao Wang et.al. 2509.24150 null
2025-09-29 A Novel Model for 3D Motion Planning for a Generalized Dubins Vehicle with Pitch and Yaw Rate Constraints Deepak Prakash Kumar et.al. 2509.24143 null
2025-09-28 BOSfM: A View Planning Framework for Optimal 3D Reconstruction of Agricultural Scenes Athanasios Bacharis et.al. 2509.24126 null
2025-09-28 Unified Multi-Modal Interactive & Reactive 3D Motion Generation via Rectified Flow Prerit Gupta et.al. 2509.24099 null
2025-09-28 WireBend-kit: A Computational Design and Fabrication Toolkit for Wirebending Custom 3D Wireframe Structures Faraz Faruqi et.al. 2509.24083 null
2025-09-28 SIE3D: Single-image Expressive 3D Avatar generation via Semantic Embedding and Perceptual Expression Loss Zhiqi Huang et.al. 2509.24004 null
2025-09-28 RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization Dongki Jung et.al. 2509.23991 null
2025-09-28 CrashSplat: 2D to 3D Vehicle Damage Segmentation in Gaussian Splatting Dragoş-Andrei Chileban et.al. 2509.23947 null
2025-09-28 AssemblyHands-X: Modeling 3D Hand-Body Coordination for Understanding Bimanual Human Activities Tatsuro Banno et.al. 2509.23888 null
2025-09-28 Learning Adaptive Pseudo-Label Selection for Semi-Supervised 3D Object Detection Taehun Kong et.al. 2509.23880 null
2025-09-28 Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric Bingyang Cui et.al. 2509.23841 null
2025-09-28 Uni4D-LLM: A Unified SpatioTemporal-Aware VLM for 4D Understanding and Generation Hanyu Zhou et.al. 2509.23828 null
2025-09-28 Controllable Generation of Large-Scale 3D Urban Layouts with Semantic and Structural Guidance Mengyuan Niu et.al. 2509.23804 null
2025-09-28 GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State Guole Shen et.al. 2509.23737 null
2025-09-28 M3DLayout: A Multi-Source Dataset of 3D Indoor Layouts and Structured Descriptions for 3D Generation Yiheng Zhang et.al. 2509.23728 null
2025-09-28 Diff-3DCap: Shape Captioning with Diffusion Models Zhenyu Shu et.al. 2509.23718 null
2025-09-28 StrucADT: Generating Structure-controlled 3D Point Clouds with Adjacency Diffusion Transformer Zhenyu Shu et.al. 2509.23709 null
2025-09-28 MSD-KMamba: Bidirectional Spatial-Aware Multi-Modal 3D Brain Segmentation via Multi-scale Self-Distilled Fusion Strategy Dayu Tan et.al. 2509.23677 null
2025-09-28 Color-Pair Guided Robust Zero-Shot 6D Pose Estimation and Tracking of Cluttered Objects on Edge Devices Xingjian Yang et.al. 2509.23647 null
2025-09-28 Sparse-Up: Learnable Sparse Upsampling for 3D Generation with High-Fidelity Textures Lu Xiao et.al. 2509.23646 null
2025-09-28 BioVessel-Net and RetinaMix: Unsupervised Retinal Vessel Segmentation from OCTA Images Cheng Huang et.al. 2509.23617 null
2025-09-28 InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects Xinhao Cai et.al. 2509.23612 null
2025-09-28 FlowLUT: Efficient Image Enhancement via Differentiable LUTs and Iterative Flow Matching Liubing Hu et.al. 2509.23608 null
2025-09-28 ZeroScene: A Zero-Shot Framework for 3D Scene Generation from a Single Image and Controllable Texture Editing Xiang Tang et.al. 2509.23607 null
2025-09-28 Generalizable Coarse-to-Fine Robot Manipulation via Language-Aligned 3D Keypoints Jianshu Hu et.al. 2509.23575 null
2025-09-28 RAVEN: Resilient Aerial Navigation via Open-Set Semantic Memory and Behavior Adaptation Seungchan Kim et.al. 2509.23563 null
2025-09-28 From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations Javed Ahmad et.al. 2509.23555 null
2025-09-28 OVSeg3R: Learn Open-vocabulary Instance Segmentation from 2D via 3D Reconstruction Hongyang Li et.al. 2509.23541 null
2025-09-27 Orientation-anchored Hyper-Gaussian for 4D Reconstruction from Casual Videos Junyi Wu et.al. 2509.23492 null
2025-09-27 3DPCNet: Pose Canonicalization for Robust Viewpoint-Invariant 3D Kinematic Analysis from Monocular RGB cameras Tharindu Ekanayake et.al. 2509.23455 null
2025-09-30 FM-SIREN & FM-FINER: Nyquist-Informed Frequency Multiplier for Implicit Neural Representation with Periodic Activation Mohammed Alsakabi et.al. 2509.23438 null
2025-09-27 WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving Ziyue Zhu et.al. 2509.23402 null
2025-09-27 UniPose: Unified Cross-modality Pose Prior Propagation towards RGB-D data for Weakly Supervised 3D Human Pose Estimation Jinghong Zheng et.al. 2509.23376 null
2025-09-27 Code Arcades: 3d Visualization of Classes, Dependencies and Software Metrics Anthony Savidis et.al. 2509.23297 null
2025-09-27 OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting Atakan Topaloglu et.al. 2509.23258 null
2025-09-27 Unsupervised Online 3D Instance Segmentation with Synthetic Sequences and Dynamic Loss Yifan Zhang et.al. 2509.23194 null
2025-09-27 Confidence-Calibrating Regularization for Robust Brain MRI Segmentation Under Domain Shift Behraj Khan et.al. 2509.23176 null
2025-09-27 Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction Bolin Chen et.al. 2509.23169 null
2025-09-27 Open-Vocabulary Spatio-Temporal Scene Graph for Robot Perception and Teleoperation Planning Yi Wang et.al. 2509.23107 null
2025-09-27 GeLoc3r: Enhancing Relative Camera Pose Regression with Geometric Consistency Regularization Jingxing Li et.al. 2509.23038 null
2025-09-27 Desensitizing for Improving Corruption Robustness in Point Cloud Classification through Adversarial Training Zhiqiang Tian et.al. 2509.23010 null
2025-09-27 ARSS: Taming Decoder-only Autoregressive Visual Generation for View Synthesis From Single View Wenbin Teng et.al. 2509.23008 null
2025-09-26 Learning Unified Representation of 3D Gaussian Splatting Yuelin Xin et.al. 2509.22917 null
2025-09-26 Convolutional Set Transformer Federico Chinello et.al. 2509.22889 null
2025-09-26 ControlEvents: Controllable Synthesis of Event Camera Data with Foundational Prior from Image Diffusion Models Yixuan Hu et.al. 2509.22864 null
2025-09-26 Empart: Interactive Convex Decomposition for Converting Meshes to Parts Brandon Vu et.al. 2509.22847 null
2025-09-26 See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Chih Yao Hu et.al. 2509.22653 null
2025-09-26 JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation Shuang Zeng et.al. 2509.22548 null
2025-09-26 EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model Andrii Litvynchuk et.al. 2509.22527 null
2025-09-26 HELIOS: Hierarchical Exploration for Language-grounded Interaction in Open Scenes Katrina Ashton et.al. 2509.22498 null
2025-09-26 EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer Zhehao Dong et.al. 2509.22407 null
2025-09-26 Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss Javier Sequeiro González et.al. 2509.22394 null
2025-09-26 Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation Jinpeng Lu et.al. 2509.22307 null
2025-09-26 MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning Jinkun Hao et.al. 2509.22281 null
2025-09-26 GS-2M: Gaussian Splatting for Joint Mesh Reconstruction and Material Decomposition Dinh Minh Nguyen et.al. 2509.22276 null
2025-09-26 Polysemous Language Gaussian Splatting via Matching-based Mask Lifting Jiayu Ding et.al. 2509.22225 null
2025-09-26 Rigidity-Aware 3D Gaussian Deformation from a Single Image Jinhyeok Kim et.al. 2509.22222 null
2025-09-26 MultiMat: Multimodal Program Synthesis for Procedural Materials using Large Multimodal Models Jonas Belouadi et.al. 2509.22151 null
2025-09-26 Joint graph entropy knowledge distillation for point cloud classification and robustness against corruptions Zhiqiang Tian et.al. 2509.22150 null
2025-09-26 Large Material Gaussian Model for Relightable 3D Generation Jingrui Ye et.al. 2509.22112 null
2025-09-26 Comparative Analysis of GAN and Diffusion for MRI-to-CT translation Emily Honey et.al. 2509.22049 null
2025-09-26 Rate-Distortion Optimized Communication for Collaborative Perception Genjia Liu et.al. 2509.21994 null
2025-09-29 PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data Zhe Zhu et.al. 2509.21965 null
2025-09-26 TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation Qihang Wang et.al. 2509.21905 null
2025-09-26 Drag4D: Align Your Motion with Text-Driven 3D Scene Generation Minjun Kang et.al. 2509.21888 null
2025-09-26 SRHand: Super-Resolving Hand Images and 3D Shapes via View/Pose-aware Neural Image Representations and Explicit 3D Meshes Minje Kim et.al. 2509.21859 null
2025-09-26 Dynamic Novel View Synthesis in High Dynamic Range Kaixuan Zhang et.al. 2509.21853 null
2025-09-26 DiTraj: training-free trajectory control for video diffusion transformer Cheng Lei et.al. 2509.21839 null
2025-09-25 PowerGS: Display-Rendering Power Co-Optimization for Neural Rendering in Power-Constrained XR Systems Weikai Lin et.al. 2509.21702 null
2025-09-25 MORPH: Shape-agnostic PDE Foundation Models Mahindra Singh Rautela et.al. 2509.21670 null
2025-09-25 FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction Yixiang Dai et.al. 2509.21657 null
2025-09-25 QuadGPT: Native Quadrilateral Mesh Generation with Autoregressive Models Jian Liu et.al. 2509.21420 null
2025-09-23 TUN3D: Towards Real-World Scene Understanding from Unposed Images Anton Konushin et.al. 2509.21388 null
2025-09-25 Quantized Visual Geometry Grounded Transformer Weilun Feng et.al. 2509.21302 null
2025-09-25 GMP$^{3}$: Learning-Driven, Bellman-Guided Trajectory Planning for UAVs in Real-Time on SE(3) Babak Salamat et.al. 2509.21264 null
2025-09-25 Dense Semantic Matching with VGGT Prior Songlin Yang et.al. 2509.21263 null
2025-09-25 Decipher-MR: A Vision-Language Foundation Model for 3D MRI Representations Zhijian Yang et.al. 2509.21249 null
2025-09-25 Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets Team Hunyuan3D et.al. 2509.21245 null
2025-09-25 CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling Yuze He et.al. 2509.21114 null
2025-09-25 Cross-Modal Instructions for Robot Motion Generation William Barron et.al. 2509.21107 null
2025-09-25 OmniPlantSeg: Species Agnostic 3D Point Cloud Organ Segmentation for High-Resolution Plant Phenotyping Across Modalities Andreas Gilson et.al. 2509.21038 null
2025-09-25 Multi-Robot Vision-Based Task and Motion Planning for EV Battery Disassembly and Sorting Abdelaziz Shaarawy et.al. 2509.21020 null
2025-09-25 Marching Neurons: Accurate Surface Extraction for Neural Implicit Shapes Christian Stippel et.al. 2509.21007 null
2025-09-25 BactoBot: A Low-Cost, Bacteria-Inspired Soft Underwater Robot for Marine Exploration Rubaiyat Tasnim Chowdhury et.al. 2509.20964 null
2025-09-25 Finding 3D Positions of Distant Objects from Noisy Camera Movement and Semantic Segmentation Sequences Julius Pesonen et.al. 2509.20906 null
2025-09-25 ArchGPT: Understanding the World’s Architectures with Large Multimodal Models Yuze Wang et.al. 2509.20858 null
2025-09-25 ARMesh: Autoregressive Mesh Generation via Next-Level-of-Detail Prediction Jiabao Lei et.al. 2509.20824 null
2025-09-25 MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM Yuxuan Zhou et.al. 2509.20757 null
2025-09-25 FreeInsert: Personalized Object Insertion with Geometric and Style Control Yuhong Zhang et.al. 2509.20756 null
2025-09-26 SeamCrafter: Enhancing Mesh Seam Generation for Artist UV Unwrapping via Reinforcement Learning Duoteng Xu et.al. 2509.20725 null
2025-09-24 Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections Jing Wu et.al. 2509.20607 null
2025-09-24 Large Pre-Trained Models for Bimanual Manipulation in 3D Hanna Yurchyk et.al. 2509.20579 null
2025-09-24 MELEGROS: Monolithic Elephant-inspired Gripper with Optical Sensors Petr Trunin et.al. 2509.20510 null
2025-09-24 SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent Yandan Yang et.al. 2509.20414 null
2025-09-23 SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment Binod Singh et.al. 2509.20401 null
2025-09-23 SeHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing Yiyu Li et.al. 2509.20400 null
2025-09-24 PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation Chen Wang et.al. 2509.20358 null
2025-09-26 mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies Remo Steiner et.al. 2509.20297 null
2025-09-24 4D Driving Scene Generation With Stereo Forcing Hao Lu et.al. 2509.20251 null
2025-09-24 An Anisotropic Cross-View Texture Transfer with Multi-Reference Non-Local Attention for CT Slice Interpolation Kwang-Hyun Uhm et.al. 2509.20242 null
2025-09-24 PU-Gaussian: Point Cloud Upsampling using 3D Gaussian Representation Mahmoud Khater et.al. 2509.20207 null
2025-09-24 C-3TO: Continuous 3D Trajectory Optimization on Neural Euclidean Signed Distance Fields Guillermo Gil et.al. 2509.20084 null
2025-09-24 DB-TSDF: Directional Bitmask-based Truncated Signed Distance Fields for Efficient Volumetric Mapping Jose E. Maese et.al. 2509.20081 null
2025-09-24 Queryable 3D Scene Representation: A Multi-Modal Framework for Semantic Reasoning and Robotic Task Planning Xun Li et.al. 2509.20077 null
2025-09-25 OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving Pei Liu et.al. 2509.19973 null
2025-09-24 Generalist Robot Manipulation beyond Action Labeled Data Alexander Spiridonov et.al. 2509.19958 null
2025-09-24 AJAHR: Amputated Joint Aware 3D Human Mesh Recovery Hyunjin Cho et.al. 2509.19939 null
2025-09-24 GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes Guo Chen et.al. 2509.19937 null
2025-09-24 Aerial-Ground Image Feature Matching via 3D Gaussian Splatting-based Intermediate View Rendering Jiangxue Yu et.al. 2509.19898 null
2025-09-24 Generalized Shortest Path-based Superpixels for 3D Spherical Image Segmentation Rémi Giraud et.al. 2509.19895 null
2025-09-25 StrCGAN: A Generative Framework for Stellar Image Restoration Shantanusinh Parmar et.al. 2509.19805 null
2025-09-24 BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting Yixun Zhang et.al. 2509.19793 null
2025-09-24 PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction Yufei Han et.al. 2509.19726 null
2025-09-24 VIMD: Monocular Visual-Inertial Motion and Depth Estimation Saimouli Katragadda et.al. 2509.19713 null
2025-09-23 The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar William L. Muckelroy III et.al. 2509.19644 null
2025-09-23 Terra: Hierarchical Terrain-Aware 3D Scene Graph for Task-Agnostic Outdoor Mapping Chad R. Samuelson et.al. 2509.19579 null
2025-09-23 Autonomous Elemental Characterization Enabled by a Low Cost Robotic Platform Built Upon a Generalized Software Architecture Xuan Cao et.al. 2509.19541 null
2025-09-23 Real-Time Reinforcement Learning for Dynamic Tasks with a Parallel Soft Robot James Avtges et.al. 2509.19525 null
2025-09-23 VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction Weijie Wang et.al. 2509.19297 null
2025-09-23 Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Sherwin Bahmani et.al. 2509.19296 null
2025-09-24 MOIS-SAM2: Exemplar-based Segment Anything Model 2 for multilesion interactive segmentation of neurofibromas in whole-body MRI Georgii Kolokolnikov et.al. 2509.19277 null
2025-09-23 Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps Gabriel Maldonado et.al. 2509.19252 null
2025-09-23 HyKid: An Open MRI Dataset with Expert-Annotated Multi-Structure and Choroid Plexus in Pediatric Hydrocephalus Yunzhi Xu et.al. 2509.19218 null
2025-09-23 SlicerROS2: A Research and Development Module for Image-Guided Robotic Interventions Laura Connolly et.al. 2509.19076 null
2025-09-23 WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction Hung Nguyen et.al. 2509.19073 null
2025-09-23 Seeing Through Reflections: Advancing 3D Scene Reconstruction in Mirror-Containing Environments with Gaussian Splatting Zijing Guo et.al. 2509.18956 null
2025-09-23 Eva-VLA: Evaluating Vision-Language-Action Models’ Robustness Under Real-World Physical Variations Hanqing Liu et.al. 2509.18953 null
2025-09-23 Lang2Morph: Language-Driven Morphological Design of Robotic Hands Yanyuan Qiao et.al. 2509.18937 null
2025-09-23 SynapFlow: A Modular Framework Towards Large-Scale Analysis of Dendritic Spines Pamela Osuna-Vargas et.al. 2509.18926 null
2025-09-23 LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models Amirhesam Aghanouri et.al. 2509.18917 null
2025-09-23 DeblurSplat: SfM-free 3D Gaussian Splatting with Event Camera for Robust Deblurring Pengteng Li et.al. 2509.18898 null
2025-09-23 RS3DBench: A Comprehensive Benchmark for 3D Spatial Perception in Remote Sensing Jiayu Wang et.al. 2509.18897 null
2025-09-23 VGGT-DP: Generalizable Robot Control via Vision Foundation Models Shijia Ge et.al. 2509.18778 null
2025-09-23 FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation Zhaorui Wang et.al. 2509.18759 null
2025-09-23 3D Flow Diffusion Policy: Visuomotor Policy Learning via Generating Flow in 3D Space Sangjun Noh et.al. 2509.18676 null
2025-09-23 MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving Yuzhi Wu et.al. 2509.18613 null
2025-09-23 End-to-End Crop Row Navigation via LiDAR-Based Deep Reinforcement Learning Ana Luiza Mineiro et.al. 2509.18608 null
2025-09-23 Event-guided 3D Gaussian Splatting for Dynamic Human and Scene Reconstruction Xiaoting Yin et.al. 2509.18566 null
2025-09-23 GeoRemover: Removing Objects and Their Causal Visual Artifacts Zixin Zhu et.al. 2509.18538 null
2025-09-23 BridgeSplat: Bidirectionally Coupled CT and Non-Rigid Gaussian Splatting for Deformable Intraoperative Surgical Navigation Maximilian Fehrentz et.al. 2509.18501 null
2025-09-22 CPT-4DMR: Continuous sPatial-Temporal Representation for 4D-MRI Reconstruction Xinyang Wu et.al. 2509.18427 null
2025-09-22 TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird’s Eye View Perception and Planning Reeshad Khan et.al. 2509.18372 null
2025-09-22 OrthoLoC: UAV 6-DoF Localization and Calibration Using Orthographic Geodata Oussema Dhaouadi et.al. 2509.18350 null
2025-09-22 The Landform Contextual Mesh: Automatically Fusing Surface and Orbital Terrain for Mars 2020 Marsette Vona et.al. 2509.18330 null
2025-09-24 Rethinking Pulmonary Embolism Segmentation: A Study of Current Approaches and Challenges with an Open Weight Model Yixin Zhang et.al. 2509.18308 null
2025-09-22 PEEK: Guiding and Minimal Image Representations for Zero-Shot Generalization of Robot Manipulation Policies Jesse Zhang et.al. 2509.18282 null
2025-09-22 VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models Geonung Kim et.al. 2509.17985 null
2025-09-22 Multi-needle Localization for Pelvic Seed Implant Brachytherapy based on Tip-handle Detection and Matching Zhuo Xiao et.al. 2509.17931 null
2025-09-22 ProDyG: Progressive Dynamic Scene Reconstruction via Gaussian Splatting from Monocular Videos Shi Chen et.al. 2509.17864 null
2025-09-22 Selecting Optimal Camera Views for Gait Analysis: A Multi-Metric Assessment of 2D Projections Dong Chen et.al. 2509.17805 null
2025-09-22 Effect of Appearance and Animation Realism on the Perception of Emotionally Expressive Virtual Humans Nabila Amadou et.al. 2509.17803 null
2025-09-22 From Restoration to Reconstruction: Rethinking 3D Gaussian Splatting for Underwater Scenes Guoxi Huang et.al. 2509.17789 null
2025-09-23 RoboSeek: You Need to Interact with Your Objects Yibo Peng et.al. 2509.17783 null
2025-09-22 Automated Labeling of Intracranial Arteries with Uncertainty Quantification Using Deep Learning Javier Bisbal et.al. 2509.17726 null
2025-09-22 RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion Geonho Bang et.al. 2509.17712 null
2025-09-22 SD-VLM: Spatial Measuring and Understanding with Depth-Encoded Vision-Language Models Pingyi Chen et.al. 2509.17664 null
2025-09-22 Evict3R: Training-Free Token Eviction for Memory-Bounded Streaming Visual Geometry Transformers Soroush Mahdi et.al. 2509.17650 null
2025-09-22 VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video Yu Liu et.al. 2509.17647 null
2025-09-22 MRN: Harnessing 2D Vision Foundation Models for Diagnosing Parkinson’s Disease with Limited 3D MR Data Ding Shaodong et.al. 2509.17566 null
2025-09-22 Unified Multimodal Coherent Field: Synchronous Semantic-Spatial-Vision Fusion for Brain Tumor Segmentation Mingda Zhang et.al. 2509.17520 null
2025-09-22 Stable Video-Driven Portraits Mallikarjun B. R. et.al. 2509.17476 null
2025-09-22 MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception Changwon Kang et.al. 2509.17462 null
2025-09-23 Hierarchical Neural Semantic Representation for 3D Semantic Correspondence Keyu Du et.al. 2509.17431 null
2025-09-23 EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device Gunjan Chhablani et.al. 2509.17430 null
2025-09-22 FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR Junzhe Wu et.al. 2509.17390 null
2025-09-22 3D Printable Soft Liquid Metal Sensors for Delicate Manipulation Tasks Lois Liow et.al. 2509.17389 null
2025-09-22 AERO-MPPI: Anchor-Guided Ensemble Trajectory Optimization for Agile Mapless Drone Navigation Xin Chen et.al. 2509.17340 null
2025-09-22 SmokeSeer: 3D Gaussian Splatting for Smoke Removal and Scene Reconstruction Neham Jain et.al. 2509.17329 null
2025-09-21 Task-Oriented Communications for 3D Scene Representation: Balancing Timeliness and Fidelity Xiangmin Xu et.al. 2509.17282 null
2025-09-21 Learning and Optimization with 3D Orientations Alexandros Ntagkas et.al. 2509.17274 null
2025-09-21 SPFSplatV2: Efficient Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views Ranran Huang et.al. 2509.17246 null
2025-09-21 DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction Bo Liu et.al. 2509.17232 null
2025-09-21 High Resolution UDF Meshing via Iterative Networks Federico Stella et.al. 2509.17212 null
2025-09-21 Point-RTD: Replaced Token Denoising for Pretraining Transformer Models on Point Clouds Gunner Stone et.al. 2509.17207 null
2025-09-21 Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation Gunner Stone et.al. 2509.17206 null
2025-09-21 Certifiably Optimal Doppler Positioning using Opportunistic LEO Satellites Baoshan Song et.al. 2509.17198 null
2025-09-21 Beat on Gaze: Learning Stylized Generation of Gaze and Head Dynamics Chengwei Shi et.al. 2509.17168 null
2025-09-21 Imagine2Act: Leveraging Object-Action Motion Consistency from Imagined Goals for Robotic Manipulation Liang Heng et.al. 2509.17125 null
2025-09-21 CoBEVMoE: Heterogeneity-aware Feature Fusion with Dynamic Mixture-of-Experts for Collaborative Perception Lingzhao Kong et.al. 2509.17107 null
2025-09-23 HyRF: Hybrid Radiance Fields for Memory-efficient and High-quality Novel View Synthesis Zipeng Wang et.al. 2509.17083 null
2025-09-21 Efficient 3D Scene Reconstruction and Simulation from Sparse Endoscopic Views Zhenya Yang et.al. 2509.17027 null
2025-09-21 SemanticGarment: Semantic-Controlled Generation and Editing of 3D Gaussian Garments Ruiyan Wang et.al. 2509.16960 null
2025-09-21 Leveraging RGB Images for Pre-Training of Event-Based Hand Pose Estimation Ruicong Liu et.al. 2509.16949 null
2025-09-21 ConfidentSplat: Confidence-Weighted Depth Fusion for Accurate 3D Gaussian Splatting SLAM Amanuel T. Dufera et.al. 2509.16863 null
2025-09-23 L2M-Reg: Building-level Uncertainty-aware Registration of Outdoor LiDAR Point Clouds and Semantic 3D City Models Ziyang Xu et.al. 2509.16832 null
2025-09-20 SMART-3D: Three-Dimensional Self-Morphing Adaptive Replanning Tree Priyanshu Agrawal et.al. 2509.16812 null
2025-09-20 MedGS: Gaussian Splatting for Multi-Modal 3D Medical Imaging Kacper Marzol et.al. 2509.16806 null
2025-09-20 MMPart: Harnessing Multi-Modal Large Language Models for Part-Aware 3D Generation Omid Bonakdar et.al. 2509.16768 null
2025-09-20 HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis Heyuan Li et.al. 2509.16748 null
2025-09-23 Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment Xin Lei Lin et.al. 2509.16727 null
2025-09-20 Text-Scene: A Scene-to-Language Parsing Framework for 3D Scene Understanding Haoyuan Li et.al. 2509.16721 null
2025-09-20 SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving Haiming Zhang et.al. 2509.16588 null
2025-09-20 Person Identification from Egocentric Human-Object Interactions using 3D Hand Pose Muhammad Hamza et.al. 2509.16557 null
2025-09-20 ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting Xiaoyang Yan et.al. 2509.16552 null
2025-09-20 No Need for Real 3D: Fusing 2D Vision with Pseudo 3D Representations for Robotic Manipulation Learning Run Yu et.al. 2509.16532 null
2025-09-20 RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation Tianyi Yan et.al. 2509.16500 null
2025-09-20 Octree Latent Diffusion for Semantic 3D Scene Generation and Completion Xujia Zhang et.al. 2509.16483 null
2025-09-19 Explainable Gait Abnormality Detection Using Dual-Dataset CNN-LSTM Models Parth Agarwal et.al. 2509.16472 null
2025-09-19 TractoTransformer: Diffusion MRI Streamline Tractography using CNN and Transformer Networks Itzik Waizman et.al. 2509.16429 null
2025-09-23 3D Gaussian Flats: Hybrid 2D/3D Photometric Scene Reconstruction Maria Taktasheva et.al. 2509.16423 null
2025-09-19 StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes Zhengri Wu et.al. 2509.16415 null
2025-09-19 From Canopy to Ground via ForestGen3D: Learning Cross-Domain Generation of 3D Forest Structure from Aerial-to-Terrestrial LiDAR Juan Castorena et.al. 2509.16346 null
2025-09-19 Neural Atlas Graphs for Dynamic Scene Decomposition and Editing Jan Philipp Schneider et.al. 2509.16336 null
2025-09-19 Recovering Parametric Scenes from Very Few Time-of-Flight Pixels Carter Sifferman et.al. 2509.16132 null
2025-09-19 RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars Weiyi Xiong et.al. 2509.16119 null
2025-09-19 SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features Jinyuan Qu et.al. 2509.16098 null
2025-09-19 DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation Yue Su et.al. 2509.16063 null
2025-09-19 Graph-based Point Cloud Surface Reconstruction using B-Splines Stuti Pathak et.al. 2509.16050 null
2025-09-19 Towards Sharper Object Boundaries in Self-Supervised Depth Estimation Aurélien Cecille et.al. 2509.15987 null
2025-09-19 The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection Katharina Eckstein et.al. 2509.15947 null
2025-09-19 PAN: Pillars-Attention-Based Network for 3D Object Detection Ruan Bispo et.al. 2509.15935 null
2025-09-19 Sparse Multiview Open-Vocabulary 3D Detection Olivier Moliner et.al. 2509.15924 null
2025-09-19 A CARLA-based Simulation of Electrically Driven Forklifts David Claus et.al. 2509.15909 null
2025-09-19 MoAngelo: Motion-Aware Neural Surface Reconstruction for Dynamic Scenes Mohamed Ebbed et.al. 2509.15892 null
2025-09-19 RangeSAM: Leveraging Visual Foundation Models for Range-View represented LiDAR segmentation Paul Julius Kühn et.al. 2509.15886 null
2025-09-19 Self-Supervised Cross-Modal Learning for Image-to-Point Cloud Registration Xingmei Wang et.al. 2509.15882 null
2025-09-19 Improving Robotic Manipulation with Efficient Geometry-Aware Vision Encoder An Dinh Vuong et.al. 2509.15880 null
2025-09-19 ENSAM: an efficient foundation model for interactive segmentation of 3D medical images Elias Stenhede et.al. 2509.15874 null
2025-09-19 Zero-Shot Visual Grounding in 3D Gaussians via View Retrieval Liwei Liao et.al. 2509.15871 null
2025-09-19 Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation Weimin Bai et.al. 2509.15772 null
2025-09-19 GP3: A 3D Geometry-Aware Policy with Multi-View Images for Robotic Manipulation Quanhao Qian et.al. 2509.15733 null
2025-09-19 SGMAGNet: A Baseline Model for 3D Cloud Phase Structure Reconstruction on a New Passive Active Satellite Benchmark Chi Yang et.al. 2509.15706 null
2025-09-19 SCENEFORGE: Enhancing 3D-text alignment with Structured Scene Compositions Cristian Sbrolli et.al. 2509.15693 null
2025-09-19 Camera Splatting for Continuous View Optimization Gahye Lee et.al. 2509.15677 null
2025-09-19 FingerSplat: Contactless Fingerprint 3D Reconstruction and Generation based on 3D Gaussian Splatting Yuwei Jia et.al. 2509.15648 null
2025-09-19 GS-Scale: Unlocking Large-Scale 3D Gaussian Splatting Training via Host Offloading Donghyun Lee et.al. 2509.15645 null
2025-09-19 Implicit Modeling for 3D-printed Multi-material Computational Object Design via Python Charles Wade et.al. 2509.15562 null
2025-09-22 MS-GS: Multi-Appearance Sparse-View 3D Gaussian Splatting in the Wild Deming Li et.al. 2509.15548 null
2025-09-19 STARC: See-Through-Wall Augmented Reality Framework for Human-Robot Collaboration in Emergency Response Shenghai Yuan et.al. 2509.15507 null
2025-09-18 GiAnt: A Bio-Inspired Hexapod for Adaptive Terrain Navigation and Object Detection Aasfee Mosharraf Bhuiyan et.al. 2509.15264 null
2025-09-18 Causal Reasoning Elicits Controllable 3D Scene Generation Shen Chen et.al. 2509.15249 null
2025-09-17 GenCAD-3D: CAD Program Generation using Multimodal Latent Space Alignment and Synthetic Dataset Balancing Nomi Yu et.al. 2509.15246 null
2025-09-17 ProFusion: 3D Reconstruction of Protein Complex Structures from Multi-view AFM Images Jaydeep Rade et.al. 2509.15242 null
2025-09-17 ChannelFlow-Tools: A Standardized Dataset Creation Pipeline for 3D Obstructed Channel Flows Shubham Kavane et.al. 2509.15236 null
2025-09-18 Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model Fangjinhua Wang et.al. 2509.15220 null
2025-09-18 Semi-Supervised 3D Medical Segmentation from 2D Natural Images Pretrained Model Pak-Hei Yeung et.al. 2509.15167 null
2025-09-19 RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes Fang Li et.al. 2509.15123 null
2025-09-18 Semantic-LiDAR-Inertial-Wheel Odometry Fusion for Robust Localization in Large-Scale Dynamic Environments Haoxuan Jiang et.al. 2509.14999 null
2025-09-19 SPATIALGEN: Layout-guided 3D Indoor Scene Generation Chuan Fang et.al. 2509.14981 null
2025-09-18 Beyond Random Masking: A Dual-Stream Approach for Rotation-Invariant Point Cloud Masked Autoencoders Xuanhua Yin et.al. 2509.14975 null
2025-09-18 RoboEye: Enhancing 2D Robotic Object Identification with Selective 3D Geometric Keypoint Matching Xingwu Zhang et.al. 2509.14966 null
2025-09-21 Seeing 3D Through 2D Lenses: 3D Few-Shot Class-Incremental Learning via Cross-Modal Geometric Rectification Tuo Xiang et.al. 2509.14958 null
2025-09-18 Human Interaction for Collaborative Semantic SLAM using Extended Reality Laura Ribeiro et.al. 2509.14949 null
2025-09-18 NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation Antoine Legrand et.al. 2509.14890 null
2025-09-18 Radiology Report Conditional 3D CT Generation with Multi Encoder Latent diffusion Model Sina Amirrajab et.al. 2509.14780 null
2025-09-18 FMGS-Avatar: Mesh-Guided 2D Gaussian Splatting with Foundation Model Priors for 3D Monocular Avatar Reconstruction Jinlong Fan et.al. 2509.14739 null
2025-09-18 RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI Cong Tai et.al. 2509.14687 null
2025-09-18 Efficient 3D Perception on Embedded Systems via Interpolation-Free Tri-Plane Lifting and Volume Fusion Sibaek Lee et.al. 2509.14641 null
2025-09-18 HybridMamba: A Dual-domain Mamba for 3D Medical Image Segmentation Weitong Wu et.al. 2509.14609 null
2025-09-19 AToken: A Unified Tokenizer for Vision Jiasen Lu et.al. 2509.14476 null
2025-09-17 Perception-Integrated Safety Critical Control via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting Dario Tscholl et.al. 2509.14421 null
2025-09-17 Investigating the Ways in Which Mobile Phone Images with Open-Source Data Can Be Used to Create an Augmented Virtual Environment (AVE) Russell Beale et.al. 2509.14374 null
2025-09-17 MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping Zhihao Cao et.al. 2509.14191 null
2025-09-17 BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection Rongyu Zhang et.al. 2509.14151 null
2025-09-17 GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model Ali Abouzeid et.al. 2509.14117 null
2025-09-17 Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction Yifan Mo et.al. 2509.13938 null
2025-09-17 White Aggregation and Restoration for Few-shot 3D Point Cloud Semantic Segmentation Jiyun Im et.al. 2509.13907 null
2025-09-17 EvHand-FPV: Efficient Event-Based 3D Hand Tracking from First-Person View Zhen Xu et.al. 2509.13883 null
2025-09-17 Consistent View Alignment Improves Foundation Models for 3D Medical Image Segmentation Puru Vaish et.al. 2509.13846 null
2025-09-17 HGACNet: Hierarchical Graph Attention Network for Cross-Modal Point Cloud Completion Yadan Zeng et.al. 2509.13692 null
2025-09-17 CraftMesh: High-Fidelity Generative Mesh Manipulation via Poisson Seamless Fusion James Jincheng et.al. 2509.13688 null
2025-09-17 Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction Yumin Li et.al. 2509.13652 null
2025-09-17 SAMIR, an efficient registration framework via robust feature learning from SAM Yue He et.al. 2509.13629 null
2025-09-17 Rest2Visual: Predicting Visually Evoked fMRI from Resting-State Scans Chuyang Zhou et.al. 2509.13612 null
2025-09-17 A Generalization of CLAP from 3D Localization to Image Processing, A Connection With RANSAC & Hough Transforms Ruochen Hou et.al. 2509.13605 null
2025-09-16 Object Pose Estimation through Dexterous Touch Amir-Hossein Shahidzadeh et.al. 2509.13591 null
2025-09-16 Semantic 3D Reconstructions with SLAM for Central Airway Obstruction Ayberk Acar et.al. 2509.13541 null
2025-09-16 MemGS: Memory-Efficient Gaussian Splatting for Real-Time SLAM Yinlong Bai et.al. 2509.13536 null
2025-09-16 ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors Romain Hardy et.al. 2509.13525 null
2025-09-16 Improving 3D Gaussian Splatting Compression by Scene-Adaptive Lattice Vector Quantization Hao Xu et.al. 2509.13482 null
2025-09-16 Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization Yujia Lin et.al. 2509.13474 null
2025-09-18 MapAnything: Universal Feed-Forward Metric 3D Reconstruction Nikhil Keetha et.al. 2509.13414 null
2025-09-16 Generative AI Pipeline for Interactive Prompt-driven 2D-to-3D Vascular Reconstruction for Fontan Geometries from Contrast-Enhanced X-Ray Fluoroscopy Imaging Prahlad G Menon et.al. 2509.13372 null
2025-09-15 3D Reconstruction of Coronary Vessel Trees from Biplanar X-Ray Images Using a Geometric Approach Ethan Koland et.al. 2509.13358 null
2025-09-13 Label-Efficient Grasp Joint Prediction with Point-JEPA Jed Guzelkabaagac et.al. 2509.13349 null
2025-09-16 3D Aware Region Prompted Vision Language Model An-Chieh Cheng et.al. 2509.13317 null
2025-09-16 Temporally Smooth Mesh Extraction for Procedural Scenes with Long-Range Camera Trajectories using Spacetime Octrees Zeyu Ma et.al. 2509.13306 null
2025-09-17 StyleSculptor: Zero-Shot Style-Controllable 3D Asset Generation with Texture-Geometry Dual Guidance Zefan Qu et.al. 2509.13301 null
2025-09-16 More performant and scalable: Rethinking contrastive vision-language pre-training of radiology in the LLM era Yingtai Li et.al. 2509.13175 null
2025-09-16 Enhancing Dual Network Based Semi-Supervised Medical Image Segmentation with Uncertainty-Guided Pseudo-Labeling Yunyao Lu et.al. 2509.13084 null
2025-09-16 DVDP: An End-to-End Policy for Mobile Robot Visual Docking with RGB-D Perception Haohan Min et.al. 2509.13024 null
2025-09-16 Dream3DAvatar: Text-Controlled 3D Avatar Reconstruction from a Single Image Gaofeng Liu et.al. 2509.13013 null
2025-09-16 Improving Accuracy and Efficiency of Implicit Neural Representations: Making SIREN a WINNER Hemanth Chandravamsi et.al. 2509.12980 null
2025-09-16 Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings Abdalla Arafa et.al. 2509.12938 null
2025-09-16 4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar Xiao Tang et.al. 2509.12931 null
2025-09-16 Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation Qianguang Zhao et.al. 2509.12878 null
2025-09-16 Exploring Metric Fusion for Evaluation of NeRFs Shreyas Shivakumara et.al. 2509.12836 null
2025-09-16 Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation Biwen Lei et.al. 2509.12815 null
2025-09-16 SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation Jingdong Zhang et.al. 2509.12721 null
2025-09-16 DisorientLiDAR: Physical Attacks on LiDAR-based Localization Yizhen Lao et.al. 2509.12595 null
2025-09-15 DinoAtten3D: Slice-Level Attention Aggregation of DinoV2 for 3D Brain MRI Anomaly Classification Fazle Rafsani et.al. 2509.12512 null
2025-09-15 Axis-Aligned 3D Stalk Diameter Estimation from RGB-D Imagery Benjamin Vail et.al. 2509.12511 null
2025-09-15 Artist-Created Mesh Generation from Raw Observation Yao He et.al. 2509.12501 null
2025-09-15 Towards Foundational Models for Single-Chip Radar Tianshu Huang et.al. 2509.12482 null
2025-09-15 Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles Àlmos Veres-Vitàlyos et.al. 2509.12458 null
2025-09-15 Deep learning for 3D point cloud processing – from approaches, tasks to its implications on urban and environmental applications Zhenxin Zhang et.al. 2509.12452 null
2025-09-15 DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction Mayank Patel et.al. 2509.12430 null
2025-09-15 An integrated process for design and control of lunar robotics using AI and simulation Daniel Lindmark et.al. 2509.12367 null
2025-09-15 3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review Salma Galaaoui et.al. 2509.12197 null
2025-09-15 HoloGarment: 360° Novel View Synthesis of In-the-Wild Garments Johanna Karras et.al. 2509.12187 null
2025-09-15 LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury M. Bolhassani et.al. 2509.12155 null
2025-09-15 3DViT-GAT: A Unified Atlas-Based 3D Vision Transformer and Graph Learning Framework for Major Depressive Disorder Detection Using Structural MRI Data Nojod M. Alotaibi et.al. 2509.12143 null
2025-09-15 End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI Yihong Chen et.al. 2509.12090 null
2025-09-15 Progressive Flow-inspired Unfolding for Spectral Compressive Imaging Xiaodong Wang et.al. 2509.12079 null
2025-09-15 U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT Zhi Qin Tan et.al. 2509.12069 null
2025-09-15 End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data Farahdiba Zarin et.al. 2509.12068 null
2025-09-15 Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation Sebastian Diaz et.al. 2509.12062 null
2025-09-15 E2-BKI: Evidential Ellipsoidal Bayesian Kernel Inference for Uncertainty-aware Gaussian Semantic Mapping Junyoung Kim et.al. 2509.11964 null
2025-09-15 Learning to Generate 4D LiDAR Sequences Ao Liang et.al. 2509.11959 null
2025-09-16 Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI Bo Cao et.al. 2509.11924 null
2025-09-15 Integrating Prior Observations for Incremental 3D Scene Graph Prediction Marian Renz et.al. 2509.11895 null
2025-09-15 BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation Francis Xiatian Zhang et.al. 2509.11885 null
2025-09-15 Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting Yi-Hsin Li et.al. 2509.11853 null
2025-09-16 MSMA: Multi-Scale Feature Fusion For Multi-Attribute 3D Face Reconstruction From Unconstrained Images Danling Cao et.al. 2509.11763 null
2025-09-15 ParaEQsA: Parallel and Asynchronous Embodied Questions Scheduling and Answering Haisheng Wang et.al. 2509.11663 null
2025-09-15 A Controllable 3D Deepfake Generation Framework with Gaussian Splatting Wending Liu et.al. 2509.11624 null
2025-09-15 Inference-stage Adaptation-projection Strategy Adapts Diffusion Policy to Cross-manipulators Scenarios Xiangtong Yao et.al. 2509.11621 null
2025-09-15 Gaussian-Plus-SDF SLAM: High-fidelity 3D Reconstruction at 150+ fps Zhexi Peng et.al. 2509.11574 null
2025-09-14 Beyond Frame-wise Tracking: A Trajectory-based Paradigm for Efficient Point Cloud Tracking BaiChen Fan et.al. 2509.11453 null
2025-09-14 MultiMAE for Brain MRIs: Robustness to Missing Inputs Using Multi-Modal Masked Autoencoder Ayhan Can Erdur et.al. 2509.11442 null
2025-09-14 On the Skinning of Gaussian Avatars Nikolaos Zioulis et.al. 2509.11411 null
2025-09-14 3De Interactive Lenses for Visualization in Virtual Environments Roberta C. R. Mota et.al. 2509.11410 null
2025-09-14 3D Gaussian Modeling and Ray Marching of OpenVDB datasets for Scientific Visualization Isha Sharma et.al. 2509.11377 null
2025-09-14 ROSGS: Relightable Outdoor Scenes With Gaussian Splatting Lianjun Liao et.al. 2509.11275 null
2025-09-14 Scaling Up Forest Vision with Synthetic Data Yihang She et.al. 2509.11201 null
2025-09-14 SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion Zhiwen Yang et.al. 2509.11171 null
2025-09-14 Multispectral-NeRF: a multispectral modeling approach based on neural radiance fields Hong Zhang et.al. 2509.11169 null
2025-09-14 No Mesh, No Problem: Estimating Coral Volume and Surface from Sparse Multi-View Images Diego Eustachio Farchione et.al. 2509.11164 null
2025-09-14 ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations Zheng Li et.al. 2509.11125 null
2025-09-14 SVR-GS: Spatially Variant Regularization for Probabilistic Masks in 3D Gaussian Splatting Ashkan Taghipour et.al. 2509.11116 null
2025-09-14 WildSmoke: Ready-to-Use Dynamic 3D Smoke Assets from a Single Video in the Wild Yuqiu Liu et.al. 2509.11114 null
2025-09-14 3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment Nhut Le et.al. 2509.11097 null
2025-09-14 SH-SAS: An Implicit Neural Representation for Complex Spherical-Harmonic Scattering Fields for 3D Synthetic Aperture Sonar Omkar Shailendra Vengurlekar et.al. 2509.11087 null
2025-09-13 AD-GS: Alternating Densification for Sparse-Input 3D Gaussian Splatting Gurutva Patle et.al. 2509.11003 null
2025-09-13 Nav-R1: Reasoning and Navigation in Embodied Scenes Qingxiang Liu et.al. 2509.10884 null
2025-09-13 OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds Chongyu Wang et.al. 2509.10842 null
2025-09-13 Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios Simone Mosco et.al. 2509.10841 null
2025-09-13 InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts Weipeng Zhong et.al. 2509.10813 null
2025-09-12 Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation Hao Zhang et.al. 2509.10687 null
2025-09-12 A Comparison and Evaluation of Fine-tuned Convolutional Neural Networks to Large Language Models for Image Classification and Segmentation of Brain Tumors on MRI Felicia Liu et.al. 2509.10683 null
2025-09-12 T2Bs: Text-to-Character Blendshapes via Video Generation Jiahao Luo et.al. 2509.10678 null
2025-09-12 Building a General SimCLR Self-Supervised Foundation Model Across Neurological Diseases to Advance 3D Brain MRI Diagnoses Emily Kaczmarek et.al. 2509.10620 null
2025-09-12 SSL-AD: Spatiotemporal Self-Supervised Learning for Generalizability and Adaptability Across Alzheimer’s Prediction Tasks and Datasets Emily Kaczmarek et.al. 2509.10453 null
2025-09-12 MCL-AD: Multimodal Collaboration Learning for Zero-Shot 3D Anomaly Detection Gang Li et.al. 2509.10282 null
2025-09-12 Robustness and Diagnostic Performance of Super-Resolution Fetal Brain MRI Ema Masterl et.al. 2509.10257 null
2025-09-15 On the Geometric Accuracy of Implicit and Primitive-based Representations Derived from View Rendering Constraints Elias De Smijter et.al. 2509.10241 null
2025-09-12 Leveraging Multi-View Weak Supervision for Occlusion-Aware Multi-Human Parsing Laura Bragagnolo et.al. 2509.10093 null
2025-09-12 Design and Evaluation of Two Spherical Systems for Mobile 3D Mapping Marawan Khalil et.al. 2509.10032 null
2025-09-16 Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images Danling Cao et.al. 2509.10024 null
2025-09-12 Event Camera Guided Visual Media Restoration & 3D Reconstruction: A Survey Aupendu Kar et.al. 2509.09971 null
2025-09-12 Online 3D Multi-Camera Perception through Robust 2D Tracking and Depth-based Late Aggregation Vu-Minh Le et.al. 2509.09946 null
2025-09-12 Segment Anything for Cell Tracking Zhu Chen et.al. 2509.09943 null
2025-09-11 Purge-Gate: Backpropagation-Free Test-Time Adaptation for Point Clouds Classification via Token Purging Moslem Yazdanpanah et.al. 2509.09785 null
2025-09-09 Australian Supermarket Object Set (ASOS): A Benchmark Dataset of Physical Objects and 3D Models for Robotics and Computer Vision Akansel Cosgun et.al. 2509.09720 null
2025-09-11 SpatialVID: A Large-Scale Video Dataset with Spatial Annotations Jiahao Wang et.al. 2509.09676 null
2025-09-11 Geometric Neural Distance Fields for Learning Human Motion Priors Zhengdi Yu et.al. 2509.09667 null
2025-09-11 ObjectReact: Learning Object-Relative Control for Visual Navigation Sourav Garg et.al. 2509.09594 null
2025-09-11 Invisible Attributes, Visible Biases: Exploring Demographic Shortcuts in MRI-based Alzheimer’s Disease Classification Akshit Achara et.al. 2509.09558 null
2025-09-11 InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation Sirui Xu et.al. 2509.09555 null
2025-09-11 DualTrack: Sensorless 3D Ultrasound needs Local and Global Context Paul F. R. Wilson et.al. 2509.09530 null
2025-09-11 SMapper: A Multi-Modal Data Acquisition Platform for SLAM Benchmarking Pedro Miguel Bastos Soares et.al. 2509.09509 null
2025-09-11 Resource-Efficient Glioma Segmentation on Sub-Saharan MRI Freedmore Sidume et.al. 2509.09469 null
2025-09-12 OmniEVA: Embodied Versatile Planner via Task-Adaptive 3D-Grounded and Embodiment-aware Reasoning Yuecheng Liu et.al. 2509.09332 null
2025-09-11 Unified Start, Personalized End: Progressive Pruning for Efficient 3D Medical Image Segmentation Linhao Li et.al. 2509.09267 null
2025-09-11 Virtual staining for 3D X-ray histology of bone implants Sarah C. Irvine et.al. 2509.09235 null
2025-09-11 Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and Enhancement Jiesi Hu et.al. 2509.09232 null
2025-09-11 CWSSNet: Hyperspectral Image Classification Enhanced by Wavelet Domain Convolution Yulin Tong et.al. 2509.09163 null
2025-09-11 Mind Meets Space: Rethinking Agentic Spatial Intelligence from a Neuroscience-inspired Perspective Bui Duc Manh et.al. 2509.09154 null
2025-09-11 Video Understanding by Design: How Datasets Shape Architectures and Insights Lei Wang et.al. 2509.09151 null
2025-09-11 Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation Yuiko Uchida et.al. 2509.09143 null
2025-09-11 AEOS: Active Environment-aware Optimal Scanning Control for UAV LiDAR-Inertial Odometry in Complex Scenes Jianping Li et.al. 2509.09141 null
2025-09-11 KoopMotion: Learning Almost Divergence Free Koopman Flow Fields for Motion Planning Alice Kate Li et.al. 2509.09074 null
2025-09-11 Enhancing 3D Medical Image Understanding with Pretraining Aided by 2D Multimodal Large Language Models Qiuhui Chen et.al. 2509.09064 null
2025-09-10 Integrating Anatomical Priors into a Causal Diffusion Model Binxu Li et.al. 2509.09054 null
2025-09-10 Rapid Manufacturing of Lightweight Drone Frames Using Single-Tow Architected Composites Md Habib Ullah Khan et.al. 2509.09024 null
2025-09-10 UltrON: Ultrasound Occupancy Networks Magdalena Wysocki et.al. 2509.08991 null
2025-09-10 iMatcher: Improve matching in point cloud registration via local-to-global geometric consistency learning Karim Slimani et.al. 2509.08982 null
2025-09-10 Live(r) Die: Predicting Survival in Colorectal Liver Metastasis Muhammad Alberb et.al. 2509.08935 null
2025-09-09 Morphology-Preserving Remeshing Approach to Particulate Microstructures via Harmonic Decomposition Mahmoud Shaqfa et.al. 2509.08855 null
2025-09-10 SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video David Stotko et.al. 2509.08828 null
2025-09-10 Calib3R: A 3D Foundation Model for Multi-Camera to Robot Calibration and 3D Metric-Scaled Scene Reconstruction Davide Allegro et.al. 2509.08813 null
2025-09-10 CrowdQuery: Density-Guided Query Module for Enhanced 2D and 3D Detection in Crowded Scenes Marius Dähling et.al. 2509.08738 null
2025-09-10 TANGO: Traversability-Aware Navigation with Local Metric Control for Topological Goals Stefan Podgorski et.al. 2509.08699 null
2025-09-10 X-Part: high fidelity and structure coherent shape decomposition Xinhao Yan et.al. 2509.08643 null
2025-09-10 Implicit Shape-Prior for Few-Shot Assisted 3D Segmentation Mathilde Monvoisin et.al. 2509.08580 null
2025-09-10 Semantic Causality-Aware Vision-Based 3D Occupancy Prediction Dubing Chen et.al. 2509.08388 null
2025-09-10 InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection Zhongyu Xia et.al. 2509.08374 null
2025-09-10 Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration Hyeonseok Kim et.al. 2509.08280 null
2025-09-10 Symmetry Interactive Transformer with CNN Framework for Diagnosis of Alzheimer’s Disease Using Structural MRI Zheng Yang et.al. 2509.08243 null
2025-09-09 Zero-Shot Metric Depth Estimation via Monocular Visual-Inertial Rescaling for Autonomous Aerial Navigation Steven Yang et.al. 2509.08159 null
2025-09-09 APML: Adaptive Probabilistic Matching Loss for Robust 3D Point Cloud Reconstruction Sasan Sharifipour et.al. 2509.08104 null
2025-09-08 CardioComposer: Flexible and Compositional Anatomical Structure Generation with Disentangled Geometric Guidance Karim Kadry et.al. 2509.08015 null
2025-09-11 3D and 4D World Modeling: A Survey Lingdong Kong et.al. 2509.07996 null
2025-09-09 One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation Zheng Geng et.al. 2509.07978 null
2025-09-09 Dynamic Scene 3D Reconstruction of an Uncooperative Resident Space Object Bala Prenith Reddy Gopu et.al. 2509.07932 null
2025-09-09 Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model Zhuoxu Huang et.al. 2509.07825 null
2025-09-09 SplatFill: 3D Scene Inpainting via Depth-Guided Gaussian Splatting Mahtab Dahaghin et.al. 2509.07809 null
2025-09-09 HairGS: Hair Strand Reconstruction based on 3D Gaussian Splatting Yimin Pan et.al. 2509.07774 null
2025-09-09 XSRD-Net: EXplainable Stroke Relapse Detection Christian Gapp et.al. 2509.07772 null
2025-09-09 Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer’s Disease Fangqi Cheng et.al. 2509.07613 null
2025-09-09 Attention Maps in 3D Shape Classification for Dental Stage Estimation with Class Node Graph Attention Networks Barkin Buyukcakir et.al. 2509.07581 null
2025-09-09 PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image Peng Li et.al. 2509.07552 null
2025-09-09 HU-based Foreground Masking for 3D Medical Masked Image Modeling Jin Lee et.al. 2509.07534 null
2025-09-09 MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection Saad Lahlali et.al. 2509.07507 null
2025-09-09 OmniMap: A General Mapping Framework Integrating Optics, Geometry, and Semantics Yinan Deng et.al. 2509.07500 null
2025-09-09 DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning Wenzhi Guo et.al. 2509.07493 null
2025-09-09 DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation Ze-Xin Yin et.al. 2509.07435 null
2025-09-08 Efficient Multi-Agent Coordination via Dynamic Joint-State Graph Construction Yanlin Zhou et.al. 2509.07234 null
2025-09-08 On design, analysis, and hybrid manufacturing of microstructured blade-like geometries Pablo Antolin et.al. 2509.07044 null
2025-09-07 MEGS$^{2}$: Memory-Efficient Gaussian Splatting via Spherical Gaussians and Unified Pruning Jiarui Chen et.al. 2509.07021 null
2025-09-06 Human-in-the-Loop: Quantitative Evaluation of 3D Models Generation by Large Language Models Ahmed R. Sadik et.al. 2509.07010 null
2025-09-08 H$_{2}$OT: Hierarchical Hourglass Tokenizer for Efficient Video Pose Transformers Wenhao Li et.al. 2509.06956 null
2025-09-08 Intraoperative 2D/3D Registration via Spherical Similarity Learning and Inference-Time Differentiable Levenberg-Marquardt Optimization Minheng Chen et.al. 2509.06890 null
2025-09-08 Matching Shapes Under Different Topologies: A Topology-Adaptive Deformation Guided Approach Aymen Merrouche et.al. 2509.06862 null
2025-09-08 SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis Zhengqing Chen et.al. 2509.06798 null
2025-09-10 P3-SAM: Native 3D Part Segmentation Changfeng Ma et.al. 2509.06784 null
2025-09-08 UrbanTwin: High-Fidelity Synthetic Replicas of Roadside Lidar Datasets Muhammad Shahbaz et.al. 2509.06781 null
2025-09-11 Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training Ruicheng Zhang et.al. 2509.06723 null
2025-09-08 Cortex-Synth: Differentiable Topology-Aware 3D Skeleton Synthesis with Hierarchical Graph Attention Mohamed Zayaan S et.al. 2509.06705 null
2025-09-08 Towards In-Air Ultrasonic QR Codes: Deep Learning for Classification of Passive Reflector Constellations Wouter Jansen et.al. 2509.06615 null
2025-09-08 From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans Marilyn Keller et.al. 2509.06607 null
2025-09-08 LiHRA: A LiDAR-Based HRI Dataset for Automated Risk Monitoring Methods Frederik Plahl et.al. 2509.06597 null
2025-09-08 CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis Xin Kong et.al. 2509.06579 null
2025-09-08 From Rigging to Waving: 3D-Guided Diffusion for Natural Animation of Hand-Drawn Characters Jie Zhou et.al. 2509.06573 null
2025-09-08 Predicting Brain Tumor Response to Therapy using a Hybrid Deep Learning and Radiomics Approach Daniil Tikhonov et.al. 2509.06511 null
2025-09-08 Does DINOv3 Set a New Medical Vision Standard? Che Liu et.al. 2509.06467 null
2025-09-08 A Statistical 3D Stomach Shape Model for Anatomical Analysis Erez Posner et.al. 2509.06464 null
2025-09-08 Cross3DReg: Towards a Large-scale Real-world Cross-source Point Cloud Registration Benchmark Zongyi Xu et.al. 2509.06456 null
2025-09-08 Real-time Photorealistic Mapping for Situational Awareness in Robot Teleoperation Ian Page et.al. 2509.06433 null
2025-09-11 Musculoskeletal simulation of limb movement biomechanics in Drosophila melanogaster Pembe Gizem Özdil et.al. 2509.06426 null
2025-09-08 3DOF+Quantization: 3DGS quantization for large scenes with limited Degrees of Freedom Matthieu Gendrin et.al. 2509.06400 null
2025-09-08 Towards scalable organ level 3D plant segmentation: Bridging the data algorithm computing gap Ruiming Du et.al. 2509.06329 null
2025-09-08 Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes Mohsen Gholami et.al. 2509.06266 null
2025-09-07 O$^3$Afford: One-Shot 3D Object-to-Object Affordance Grounding for Generalizable Robotic Manipulation Tongxuan Tian et.al. 2509.06233 null
2025-09-07 Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen) Yifei Ren et.al. 2509.06191 null
2025-09-07 SpecSwin3D: Generating Hyperspectral Imagery from Multispectral Data via Transformer Networks Tang Sui et.al. 2509.06122 null
2025-09-07 MedSeqFT: Sequential Fine-tuning Foundation Models for 3D Medical Image Segmentation Yiwen Ye et.al. 2509.06096 null
2025-09-07 Robotic Manipulation Framework Based on Semantic Keypoints for Packing Shoes of Different Sizes, Shapes, and Softness Yi Dong et.al. 2509.06048 null
2025-09-07 Motion Aware ViT-based Framework for Monocular 6-DoF Spacecraft Pose Estimation Jose Sosa et.al. 2509.06000 null
2025-09-07 S-LAM3D: Segmentation-Guided Monocular 3D Object Detection via Feature Space Fusion Diana-Alexandra Sas et.al. 2509.05999 null
2025-09-07 Imagining Alternatives: Towards High-Resolution 3D Counterfactual Medical Image Generation via Language Guidance Mohamed Mohamed et.al. 2509.05978 null
2025-09-07 Spatial-Aware Self-Supervision for Medical 3D Imaging with Multi-Granularity Observable Tasks Yiqin Zhang et.al. 2509.05967 null
2025-09-07 Neural Bloom: A Deep Learning Approach to Real-Time Lighting Rafal Karp et.al. 2509.05963 null
2025-09-07 StripDet: Strip Attention-Based Lightweight 3D Object Detection from Point Cloud Weichao Wang et.al. 2509.05954 null
2025-09-07 Near Real-Time Dust Aerosol Detection with 3D Convolutional Neural Networks on MODIS Data Caleb Gates et.al. 2509.05887 null
2025-09-06 Programming tension in 3D printed networks inspired by spiderwebs Thijs Masmeijer et.al. 2509.05855 null
2025-09-06 CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation In-Jae Lee et.al. 2509.05785 null
2025-09-06 3DPillars: Pillar-based two-stage 3D object detection Jongyoun Noh et.al. 2509.05780 null
2025-09-06 Posterior shape models revisited: Improving 3D reconstructions from partial data using target specific models Jonathan Aellen et.al. 2509.05776 null
2025-09-06 JRN-Geo: A Joint Perception Network based on RGB and Normal images for Cross-view Geo-localization Hongyu Zhou et.al. 2509.05696 null
2025-09-06 MonoGlass3D: Monocular 3D Glass Detection with Plane Regression and Adaptive Feature Fusion Kai Zhang et.al. 2509.05599 null
2025-09-06 PaMO: Parallel Mesh Optimization for Intersection-Free Low-Poly Modeling on the GPU Seonghun Oh et.al. 2509.05595 null
2025-09-06 Reconstruction and Reenactment Separated Method for Realistic Gaussian Head Zhiling Ye et.al. 2509.05582 null
2025-09-06 OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision Ruixun Liu et.al. 2509.05578 null
2025-09-05 Visibility-Aware Language Aggregation for Open-Vocabulary Segmentation in 3D Gaussian Splatting Sen Wang et.al. 2509.05515 null
2025-09-05 Microrobot Vascular Parkour: Analytic Geometry-based Path Planning with Real-time Dynamic Obstacle Avoidance Yanda Yang et.al. 2509.05500 null
2025-09-05 Veriserum: A dual-plane fluoroscopic dataset with knee implant phantoms for deep learning in medical imaging Jinhao Wang et.al. 2509.05483 null
2025-09-02 INF-3DP: Implicit Neural Fields for Collision-Free Multi-Axis 3D Printing Jiasheng Qu et.al. 2509.05345 null
2025-09-04 Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control Haruo Fujiwara et.al. 2509.05285 null
2025-09-08 LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation Yinglin Duan et.al. 2509.05263 null
2025-09-05 Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet Mohammad Saeid et.al. 2509.05198 null
2025-09-05 SGS-3D: High-Fidelity 3D Instance Segmentation via Reliable Semantic Mask Splitting and Growing Chaolei Wang et.al. 2509.05144 null
2025-09-05 A Scalable Attention-Based Approach for Image-to-3D Texture Mapping Arianna Rampini et.al. 2509.05131 null
2025-09-05 GeoSplat: A Deep Dive into Geometry-Constrained Gaussian Splatting Yangming Li et.al. 2509.05075 null
2025-09-05 LUIVITON: Learned Universal Interoperable VIrtual Try-ON Cong Cao et.al. 2509.05030 null
2025-09-05 Ground-Aware Octree-A* Hybrid Path Planning for Memory-Efficient 3D Navigation of Ground Vehicles Byeong-Il Ham et.al. 2509.04950 null
2025-09-05 SynGen-Vision: Synthetic Data Generation for training industrial vision models Alpana Dubey et.al. 2509.04894 null
2025-09-05 CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus Hannah Schieber et.al. 2509.04859 null
2025-09-05 Pose-Free 3D Quantitative Phase Imaging of Flowing Cellular Populations Enze Ye et.al. 2509.04848 null
2025-09-04 Domain Adaptation for Different Sensor Configurations in 3D Object Detection Satoshi Tanaka et.al. 2509.04711 null
2025-09-04 Planning from Point Clouds over Continuous Actions for Multi-object Rearrangement Kallol Saha et.al. 2509.04645 null
2025-09-04 Few-step Flow for 3D Generation via Marginal-Data Transport Distillation Zanwei Zhou et.al. 2509.04406 null
2025-09-04 SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer Jimin Xu et.al. 2509.04379 null
2025-09-04 PAOLI: Pose-free Articulated Object Learning from Sparse-view Images Jianning Deng et.al. 2509.04276 null
2025-09-04 TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models Yuxin Gong et.al. 2509.04269 null
2025-09-04 TensoIS: A Step Towards Feed-Forward Tensorial Inverse Subsurface Scattering for Perlin Distributed Heterogeneous Media Ashish Tiwari et.al. 2509.04047 null
2025-09-04 SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation Han Huang et.al. 2509.03999 null
2025-09-04 TopoSculpt: Betti-Steered Topological Sculpting of 3D Fine-grained Tubular Shapes Minghui Zhang et.al. 2509.03938 null
2025-09-04 LMVC: An End-to-End Learned Multiview Video Coding Framework Xihua Sheng et.al. 2509.03922 null
2025-09-04 OccTENS: 3D Occupancy World Model via Temporal Next-Scale Prediction Bu Jin et.al. 2509.03887 null
2025-09-04 MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting Yuheng Li et.al. 2509.03800 null
2025-09-03 ContraGS: Codebook-Condensed and Trainable Gaussian Splatting for Fast, Memory-Efficient Reconstruction Sankeerth Durvasula et.al. 2509.03775 null
2025-09-03 Low-Cost Open-Source Ambidextrous Robotic Hand with 23 Direct-Drive servos for American Sign Language Alphabet Kelvin Daniel Gonzalez Amador et.al. 2509.03690 null
2025-09-03 Reg3D: Reconstructive Geometry Instruction Tuning for 3D Scene Understanding Hongpei Zheng et.al. 2509.03635 null
2025-09-03 treeX: Unsupervised Tree Instance Segmentation in Dense Forest Point Clouds Josafat-Mattias Burmeister et.al. 2509.03633 null
2025-09-03 PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection Qihang Zhou et.al. 2509.03277 null
2025-09-03 SynBT: High-quality Tumor Synthesis for Breast Tumor Segmentation by 3D Diffusion Model Hongxu Yang et.al. 2509.03267 null
2025-09-03 PI3DETR: Parametric Instance Detection of 3D Point Cloud Edges with a Geometry-Aware 3DETR Fabio F. Oberweger et.al. 2509.03262 null
2025-09-03 Efficient Active Training for Deep LiDAR Odometry Beibei Zhou et.al. 2509.03211 null
2025-09-03 Preserving instance continuity and length in segmentation through connectivity-aware loss computation Karol Szustakowski et.al. 2509.03154 null
2025-09-03 Temporally-Aware Diffusion Model for Brain Progression Modelling with Bidirectional Temporal Regularisation Mattia Litrico et.al. 2509.03141 null
2025-09-03 TRELLIS-Enhanced Surface Features for Comprehensive Intracranial Aneurysm Analysis Clément Hervé et.al. 2509.03095 null
2025-09-03 Isolated Bangla Handwritten Character Classification using Transfer Learning Abdul Karim et.al. 2509.03061 null
2025-09-03 Resilient Multimodal Industrial Surface Defect Detection with Uncertain Sensors Availability Shuai Jiang et.al. 2509.02962 null
2025-09-03 High-Fidelity Digital Twins for Bridging the Sim2Real Gap in LiDAR-Based ITS Perception Muhammad Shahbaz et.al. 2509.02904 null
2025-09-02 Robotic 3D Flower Pose Estimation for Small-Scale Urban Farms Harsh Muriki et.al. 2509.02870 null
2025-09-02 Improving the Resilience of Quadrotors in Underground Environments by Combining Learning-based and Safety Controllers Isaac Ronald Ward et.al. 2509.02808 null
2025-09-02 FastVGGT: Training-Free Acceleration of Visual Geometry Transformer You Shen et.al. 2509.02560 null
2025-09-02 Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots Minghuan Liu et.al. 2509.02530 null
2025-09-02 Enhancing Fitness Movement Recognition with Attention Mechanism and Pre-Trained Feature Extractors Shanjid Hasan Nishat et.al. 2509.02511 null
2025-09-02 Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework Nina Wiedemann et.al. 2509.02474 null
2025-09-02 TeRA: Rethinking Text-guided Realistic 3D Avatar Generation Yanwen Wang et.al. 2509.02466 null
2025-09-02 U-ARM: Ultra low-cost general teleoperation interface for robot manipulation Yanwen Zou et.al. 2509.02437 null
2025-09-02 Decoupling Bidirectional Geometric Representations of 4D cost volume with 2D convolution Xiaobao Wei et.al. 2509.02415 null
2025-09-02 Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion Zeren Xiong et.al. 2509.02357 null
2025-09-02 OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds Longrong Yang et.al. 2509.02322 null
2025-09-03 Sem-RaDiff: Diffusion-Based 3D Radar Semantic Perception in Cluttered Agricultural Environments Ruibin Zhang et.al. 2509.02283 null
2025-09-02 Think2Sing: Orchestrating Structured Motion Subtitles for Singing-Driven 3D Head Animation Zikai Huang et.al. 2509.02278 null
2025-09-02 GRMM: Real-Time High-Fidelity Gaussian Morphable Head Model with Learned Residuals Mohit Mendiratta et.al. 2509.02141 null
2025-09-02 2D Gaussian Splatting with Semantic Alignment for Image Inpainting Hongyu Li et.al. 2509.01964 null
2025-09-02 AI-Driven Marine Robotics: Emerging Trends in Underwater Perception and Ecosystem Monitoring Scarlett Raine et.al. 2509.01878 null
2025-09-02 Doctoral Thesis: Geometric Deep Learning For Camera Pose Prediction, Registration, Depth Estimation, and 3D Reconstruction Xueyang Kang et.al. 2509.01873 null
2025-09-01 Articulated Object Estimation in the Wild Abdelrhman Werby et.al. 2509.01708 null
2025-09-01 TransForSeg: A Multitask Stereo ViT for Joint Stereo Segmentation and 3D Force Estimation in Catheterization Pedram Fekri et.al. 2509.01605 null
2025-09-01 ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association Ganlin Zhang et.al. 2509.01584 null
2025-09-01 Unified Supervision For Vision-Language Modeling in 3D Computed Tomography Hao-Chih Lee et.al. 2509.01554 null
2025-09-01 FGO-SLAM: Enhancing Gaussian SLAM with Globally Consistent Opacity Radiance Field Fan Zhu et.al. 2509.01547 null
2025-09-01 A Continuous-Time Consistency Model for 3D Point Cloud Generation Sebastian Eilermann et.al. 2509.01492 null
2025-09-01 PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point Clouds Liu Qifeng et.al. 2509.01487 null
2025-09-01 Im2Haircut: Single-view Strand-based Hair Reconstruction for Human Avatars Vanessa Sklyarova et.al. 2509.01469 null
2025-09-01 RibPull: Implicit Occupancy Fields and Medial Axis Extraction for CT Ribcage Scans Emmanouil Nikolakakis et.al. 2509.01402 null
2025-09-01 M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision Che Liu et.al. 2509.01360 null
2025-09-01 Guided Model-based LiDAR Super-Resolution for Resource-Efficient Automotive scene Segmentation Alexandros Gkillas et.al. 2509.01317 null
2025-09-01 Towards More Diverse and Challenging Pre-training for Point Cloud Learning: Self-Supervised Cross Reconstruction with Decoupled Views Xiangdong Zhang et.al. 2509.01250 null
2025-09-01 Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation Lee Chae-Yeon et.al. 2509.01242 null
2025-09-01 DcMatch: Unsupervised Multi-Shape Matching with Dual-Level Consistency Tianwei Ye et.al. 2509.01204 null
2025-09-01 RealMat: Realistic Materials with Diffusion and Reinforcement Learning Xilong Zhou et.al. 2509.01134 null
2025-09-01 Robix: A Unified Model for Robot Interaction, Reasoning and Planning Huang Fang et.al. 2509.01106 null
2025-09-01 Bidirectional Sparse Attention for Faster Video Diffusion Training Chenlu Zhan et.al. 2509.01085 null
2025-09-01 TARA: A Low-Cost 3D-Printed Robotic Arm for Accessible Robotics Education Thays Leach Mitre et.al. 2509.01043 null
2025-08-31 Towards Integrating Multi-Spectral Imaging with Gaussian Splatting Josef Grün et.al. 2509.00989 null
2025-09-03 GS-TG: 3D Gaussian Splatting Accelerator with Tile Grouping for Reducing Redundant Sorting while Preserving Rasterization Efficiency Joongho Jo et.al. 2509.00911 null
2025-09-03 UPGS: Unified Pose-aware Gaussian Splatting for Dynamic Scene Deblurring Zhijing Wu et.al. 2509.00831 null
2025-08-31 SWAGSplatting: Semantic-guided Water-scene Augmented Gaussian Splatting Zhuodong Jiang et.al. 2509.00800 null
2025-08-31 OmniReason: A Temporal-Guided Vision-Language-Action Framework for Autonomous Driving Pei Liu et.al. 2509.00789 null
2025-08-31 InterPose: Learning to Generate Human-Object Interactions from Large-Scale Web Videos Yangsong Zhang et.al. 2509.00767 null
2025-08-31 MarkSplatter: Generalizable Watermarking for 3D Gaussian Splatting Model via Splatter Image Structure Xiufeng Huang et.al. 2509.00757 null
2025-08-31 MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation Aviral Chharia et.al. 2509.00649 null
2025-08-30 Embodied Spatial Intelligence: from Implicit Scene Modeling to Spatial Reasoning Jiading Fang et.al. 2509.00465 null
2025-08-30 AGS: Accelerating 3D Gaussian Splatting SLAM via CODEC-Assisted Frame Covisibility Detection Houshu He et.al. 2509.00433 null
2025-08-30 Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation Jialiang Kang et.al. 2509.00379 null
2025-08-30 Adaptive Point-Prompt Tuning: Fine-Tuning Heterogeneous Foundation Models for 3D Point Cloud Analysis Mengke Li et.al. 2509.00374 null
2025-08-30 Autonomous Aggregate Sorting in Construction and Mining via Computer Vision-Aided Robotic Arm Systems Md. Taherul Islam Shawon et.al. 2509.00339 null
2025-08-29 3D-LATTE: Latent Space 3D Editing from Textual Instructions Maria Parelli et.al. 2509.00269 null
2025-08-29 MicroLabVR: Interactive 3D Visualization of Simulated Spatiotemporal Microbiome Data in Virtual Reality Simon Burbach et.al. 2508.21736 null
2025-08-29 CAD2DMD-SET: Synthetic Generation Tool of Digital Measurement Device CAD Model Datasets for fine-tuning Large Vision-Language Models João Valente et.al. 2508.21732 null
2025-08-29 Temporal Flow Matching for Learning Spatio-Temporal Trajectories in 4D Longitudinal Medical Imaging Nico Albert Disch et.al. 2508.21580 null
2025-08-29 Complete Gaussian Splats from a Single Image with Denoising Diffusion Models Ziwei Liao et.al. 2508.21542 null
2025-08-29 Scale-GS: Efficient Scalable Gaussian Splatting via Redundancy-filtering Training on Streaming Content Jiayu Yang et.al. 2508.21444 null
2025-08-29 Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image Qingran Miao et.al. 2508.21371 null
2025-08-29 Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning Yuquan Bi et.al. 2508.21363 null
2025-08-29 ARGS: Advanced Regularization on Aligning Gaussians over the Surface Jeong Uk Lee et.al. 2508.21344 null
2025-08-29 Mini Autonomous Car Driving based on 3D Convolutional Neural Networks Pablo Moraes et.al. 2508.21271 null
2025-08-28 PHD: Personalized 3D Human Body Fitting with Point Diffusion Hsuan-I Ho et.al. 2508.21257 null
2025-08-28 SYNBUILD-3D: A large, multi-modal, and semantically rich synthetic dataset of 3D building models at Level of Detail 4 Kevin Mayer et.al. 2508.21169 null
2025-08-28 RadGS-Reg: Registering Spine CT with Biplanar X-rays via Joint 3D Radiative Gaussians Reconstruction and 3D/3D Registration Ao Shen et.al. 2508.21154 null
2025-08-27 ScanMove: Motion Prediction and Transfer for Unregistered Body Meshes Thomas Besnier et.al. 2508.21095 null
2025-08-28 Multi-View 3D Point Tracking Frano Rajič et.al. 2508.21060 null
2025-08-28 ActLoc: Learning to Localize on the Move via Active Viewpoint Selection Jiajie Li et.al. 2508.20981 null
2025-08-28 DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes Yajiao Xiong et.al. 2508.20965 null
2025-08-28 PLUME: Procedural Layer Underground Modeling Engine Gabriel Manuel Garcia et.al. 2508.20926 null
2025-08-28 Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation Krit Duangprom et.al. 2508.20830 null
2025-08-28 Surfel-based 3D Registration with Equivariant SE(3) Features Xueyang Kang et.al. 2508.20789 null
2025-08-28 SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding Jiawen Lin et.al. 2508.20758 null
2025-08-28 CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network Reza Akbari Movahed et.al. 2508.20734 null
2025-08-28 Task-Oriented Edge-Assisted Cross-System Design for Real-Time Human-Robot Interaction in Industrial Metaverse Kan Chen et.al. 2508.20664 null
2025-08-28 AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images Shiqi Xin et.al. 2508.20623 null
2025-08-28 Optimization-Based Calibration for Intravascular Ultrasound Volume Reconstruction Karl-Philippe Beaudet et.al. 2508.20605 null
2025-08-28 Embracing Aleatoric Uncertainty: Generating Diverse 3D Human Motion Zheng Qin et.al. 2508.20604 null
2025-08-28 GLaRE: A Graph-based Landmark Region Embedding Network for Emotion Recognition Debasis Maji et.al. 2508.20579 null
2025-08-28 Enhancing Pseudo-Boxes via Data-Level LiDAR-Camera Fusion for Unsupervised 3D Object Detection Mingqian Ji et.al. 2508.20530 null
2025-08-28 Adam SLAM - the last mile of camera calibration with 3DGS Matthieu Gendrin et.al. 2508.20526 null
2025-08-28 IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection Xuanming Cao et.al. 2508.20492 null
2025-08-28 Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts Zixuan Hu et.al. 2508.20488 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-28 Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation Xiaochuan Li et.al. 2508.20470 null
2025-08-28 Prediction of Distant Metastasis for Head and Neck Cancer Patients Using Multi-Modal Tumor and Peritumoral Feature Fusion Network Zizhao Tang et.al. 2508.20469 null
2025-08-27 MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces Zhen Xuen Brandon Low et.al. 2508.20256 null
2025-08-27 Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study Max Torop et.al. 2508.20188 null
2025-08-27 Is the medical image segmentation problem solved? A survey of current developments and future directions Guoping Xu et.al. 2508.20139 null
2025-08-26 A Machine Learning Approach to Volumetric Computations of Solid Pulmonary Nodules Yihan Zhou et.al. 2508.20127 null
2025-08-27 Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images Changha Shin et.al. 2508.20080 null
2025-08-27 OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations Peng-Hao Hsu et.al. 2508.20063 null
2025-08-27 Visio-Verbal Teleimpedance Interface: Enabling Semi-Autonomous Control of Physical Interaction via Eye Tracking and Speech Henk H. A. Jekel et.al. 2508.20037 null
2025-08-27 Integrating SAM Supervision for 3D Weakly Supervised Point Cloud Segmentation Lechun You et.al. 2508.19909 null
2025-08-27 Multispectral LiDAR data for extracting tree points in urban and suburban areas Narges Takhtkeshha et.al. 2508.19881 null
2025-08-27 Multimodal Conditional MeshGAN for Personalized Aneurysm Growth Prediction Long Chen et.al. 2508.19862 null
2025-08-27 MAPo: Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction Han Jiao et.al. 2508.19786 null
2025-08-27 FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers Yue Wu et.al. 2508.19754 null
2025-08-27 LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation Yupeng Zhang et.al. 2508.19699 null
2025-08-27 SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction Gangjian Zhang et.al. 2508.19688 null
2025-08-27 Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception Yang Li et.al. 2508.19638 null
2025-08-27 Generalizing Monocular 3D Object Detection Abhinav Kumar et.al. 2508.19593 null
2025-08-27 DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View Tian Qiu et.al. 2508.19508 null
2025-08-25 2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks Utsav Ratna Tuladhar et.al. 2508.19303 null
2025-08-25 CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy Cunmin Zhao et.al. 2508.19300 null
2025-08-25 Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation Alexandros Gkillas et.al. 2508.19290 null
2025-08-26 VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space Lin Li et.al. 2508.19247 null
2025-08-26 Articulate3D: Zero-Shot Text-Driven 3D Object Posing Oishi Deb et.al. 2508.19244 null
2025-08-26 Style4D-Bench: A Benchmark Suite for 4D Stylization Beiqi Chen et.al. 2508.19243 null
2025-08-26 LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding Julian Ost et.al. 2508.19204 null
2025-08-26 Dual Enhancement on 3D Vision-Language Perception for Monocular 3D Visual Grounding Yuzhen Li et.al. 2508.19165 null
2025-08-26 Random forest-based out-of-distribution detection for robust lung cancer segmentation Aneesh Rangnekar et.al. 2508.19112 null
2025-08-26 GReAT: leveraging geometric artery data to improve wall shear stress assessment Julian Suk et.al. 2508.19030 null
2025-08-26 RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation Siyuan You et.al. 2508.19003 null
2025-08-26 Can we make NeRF-based visual localization privacy-preserving? Maxime Pietrantoni et.al. 2508.18971 null
2025-08-26 PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads Shashikant Verma et.al. 2508.18944 null
2025-08-26 ColorGS: High-fidelity Surgical Scene Reconstruction with Colored Gaussian Splatting Qun Ji et.al. 2508.18696 null
2025-08-26 AgriChrono: A Multi-modal Dataset Capturing Crop Growth and Lighting Variability with a Field Robot Jaehwan Jeong et.al. 2508.18694 null
2025-08-26 ROSE: Remove Objects with Side Effects in Videos Chenxuan Miao et.al. 2508.18633 null
2025-08-26 SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis Xiaohao Sun et.al. 2508.18597 null
2025-08-25 Real-time 3D Visualization of Radiance Fields on Light Field Displays Jonghyun Kim et.al. 2508.18540 null
2025-08-29 Adaptive Visual Navigation Assistant in 3D RPGs Kaijie Xu et.al. 2508.18539 null
2025-08-25 SAT-SKYLINES: 3D Building Generation from Satellite Imagery and Coarse Geometric Priors Zhangyu Jin et.al. 2508.18531 null
2025-08-25 DoGFlow: Self-Supervised LiDAR Scene Flow via Cross-Modal Doppler Guidance Ajinkya Khoche et.al. 2508.18506 null
2025-08-25 FastAvatar: Instant 3D Gaussian Splatting for Faces from Single Unconstrained Poses Hao Liang et.al. 2508.18389 null
2025-08-23 SERES: Semantic-aware neural reconstruction from sparse views Bo Xu et.al. 2508.18314 null
2025-08-22 Towards Training-Free Underwater 3D Object Detection from Sonar Point Clouds: A Comparison of Traditional and Deep Learning Approaches M. Salman Shaukat et.al. 2508.18293 null
2025-08-25 ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models Haitang Feng et.al. 2508.18271 null
2025-08-25 GSVisLoc: Generalizable Visual Localization for Gaussian Splatting Scene Representations Fadi Khatib et.al. 2508.18242 null
2025-08-21 PriorFormer: A Transformer for Real-time Monocular 3D Human Pose Estimation with Versatile Geometric Priors Mohamed Adjel et.al. 2508.18238 null
2025-08-25 Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance Ayce Idil Aytekin et.al. 2508.18213 null
2025-09-02 EventTracer: Fast Path Tracing-based Event Stream Rendering Zhenyang Li et.al. 2508.18071 null
2025-08-25 Topology Aware Neural Interpolation of Scalar Fields Mohamed Kissi et.al. 2508.17995 null
2025-08-25 SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization Junyuan Deng et.al. 2508.17972 null
2025-08-25 A holistic perception system of internal and external monitoring for ground autonomous vehicles: AutoTRUST paradigm Alexandros Gkillas et.al. 2508.17969 null
2025-08-25 Beam Geometry and Input Dimensionality: Impact on Sparse-Sampling Artifact Correction for Clinical CT with U-Nets Tina Dorosti et.al. 2508.17961 null
2025-08-25 EndoUFM: Utilizing Foundation Models for Monocular depth estimation of endoscopic images Xinning Yao et.al. 2508.17916 null
2025-08-25 Camera Pose Refinement via 3D Gaussian Splatting Lulu Hao et.al. 2508.17876 null
2025-08-25 HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation Xiping Wang et.al. 2508.17832 null
2025-08-25 CubeDN: Real-time Drone Detection in 3D Space from Dual mmWave Radar Cubes Yuan Fang et.al. 2508.17831 null
2025-08-25 MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting Hanzhi Chang et.al. 2508.17811 null
2025-08-25 DroneKey: Drone 3D Pose Estimation in Image Sequences using Gated Key-representation and Pose-adaptive Learning Seo-Bin Hwang et.al. 2508.17746 null
2025-08-25 MEVITA: Open-Source Bipedal Robot Assembled from E-Commerce Components via Sheet Metal Welding Kento Kawaharazuka et.al. 2508.17684 null
2025-08-28 Generating Human-AI Collaborative Design Sequence for 3D Assets via Differentiable Operation Graph Xiaoyang Huang et.al. 2508.17645 null
2025-08-25 Wound3DAssist: A Practical Framework for 3D Wound Assessment Remi Chierchia et.al. 2508.17635 null
2025-08-25 GWM: Towards Scalable Gaussian World Models for Robotic Manipulation Guanxing Lu et.al. 2508.17600 null
2025-08-25 TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints Vinh-Thuan Ly et.al. 2508.17595 null
2025-08-25 IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data Meida Chen et.al. 2508.17579 null
2025-08-24 Random-phase Gaussian Wave Splatting for Computer-generated Holography Brian Chao et.al. 2508.17480 null
2025-09-01 Investigating Domain Gaps for Indoor 3D Object Detection Zijing Zhao et.al. 2508.17439 null
2025-08-20 Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels Long Le et.al. 2508.17437 null
2025-08-24 MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling Haoyu Wang et.al. 2508.17404 null
2025-08-26 PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation Xiaoyang Hao et.al. 2508.17239 null
2025-08-24 4D Visual Pre-training for Robot Learning Chengkai Hou et.al. 2508.17230 null
2025-08-24 VROOM - Visual Reconstruction over Onboard Multiview Yajat Yadav et.al. 2508.17172 null
2025-08-23 DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method Qingwen Zhang et.al. 2508.17054 null
2025-08-23 PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models Xianjing Cheng et.al. 2508.17050 null
2025-08-23 M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments Dmitry Yudin et.al. 2508.17044 null
2025-08-23 DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration Jiayi Li et.al. 2508.17034 null
2025-08-23 Fiducial Marker Splatting for High-Fidelity Robotics Simulations Diram Tabaa et.al. 2508.17012 null
2025-08-23 A Survey of Deep Learning-based Point Cloud Denoising Jinxi Wang et.al. 2508.17011 null
2025-08-23 Align 3D Representation and Text Embedding for 3D Content Personalization Qi Song et.al. 2508.16932 null
2025-08-23 Structural Energy-Guided Sampling for View-Consistent Text-to-3D Qing Zhang et.al. 2508.16917 null
2025-08-23 MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation Prerit Gupta et.al. 2508.16911 null
2025-08-23 Relative Navigation and Dynamic Target Tracking for Autonomous Underwater Proximity Operations David Baxter et.al. 2508.16901 null
2025-08-23 Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network Pouya Shiri et.al. 2508.16897 null
2025-08-23 A Workflow for Map Creation in Autonomous Vehicle Simulations Zubair Islam et.al. 2508.16856 null
2025-08-22 Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes Xinhao Xiang et.al. 2508.16812 null
2025-08-21 BrainPath: Generating Subject-Specific Brain Aging Trajectories Yifan Li et.al. 2508.16667 null
2025-08-22 MV-RAG: Retrieval Augmented Multiview Diffusion Yosef Dayani et.al. 2508.16577 null
2025-08-22 Real-time 3D Light-field Viewing with Eye-tracking on Conventional Displays Trung Hieu Pham et.al. 2508.16535 null
2025-08-26 Comparative Analysis of UAV Path Planning Algorithms for Efficient Navigation in Urban 3D Environments Hichem Cheriet et.al. 2508.16515 null
2025-08-22 On Kinodynamic Global Planning in a Simplicial Complex Environment: A Mixed Integer Approach Otobong Jerome et.al. 2508.16511 null
2025-08-22 Arbitrary-Scale 3D Gaussian Super-Resolution Huimin Zeng et.al. 2508.16467 null
2025-08-25 HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images Anilkumar Swamy et.al. 2508.16465 null
2025-08-22 HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction Sara Rojas et.al. 2508.16433 null
2025-08-22 SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather Edoardo Palladin et.al. 2508.16408 null
2025-08-22 Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars NVIDIA et.al. 2508.16401 null
2025-08-22 Self-Validated Learning for Particle Separation: A Correctness-Based Self-Training Framework Without Human Labels Philipp D. Lösel et.al. 2508.16224 null
2025-08-22 4D Virtual Imaging Platform for Dynamic Joint Assessment via Uni-Plane X-ray and 2D-3D Registration Hao Tang et.al. 2508.16138 null
2025-08-22 Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables Wontae Kim et.al. 2508.16121 null
2025-08-22 A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection Qifeng Liu et.al. 2508.16069 null
2025-08-22 Advances and Trends in the 3D Reconstruction of the Shape and Motion of Animals Ziqi Li et.al. 2508.16062 null
2025-08-22 NeuralMeshing: Complete Object Mesh Extraction from Casual Captures Floris Erich et.al. 2508.16026 null
2025-08-21 Self-Aligning EPM Connector: A Versatile Solution for Adaptive and Multi-Modal Interfaces Bingchao Wang et.al. 2508.16008 null
2025-08-21 GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System Hung-Jui Huang et.al. 2508.15990 null
2025-08-21 UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation Zhaodong Jiang et.al. 2508.15972 null
2025-08-21 Text-Driven 3D Hand Motion Generation from Sign Language Data Léore Bensabath et.al. 2508.15902 null
2025-08-21 Active Prostate Phantom with Multiple Chambers Sizhe Tian et.al. 2508.15873 null
2025-08-21 SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass Yanxu Meng et.al. 2508.15769 null
2025-08-21 ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling Jinhyung Park et.al. 2508.15767 null
2025-08-21 CM2LoD3: Reconstructing LoD3 Building Models Using Semantic Conflict Maps Franz Hanke et.al. 2508.15672 null
2025-08-25 Hessian-Based Lightweight Neural Network HessNet for State-of-the-Art Brain Vessel Segmentation on a Minimal Training Dataset Alexandra Bernadotte et.al. 2508.15660 null
2025-08-21 Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance Shuchao Pang et.al. 2508.15650 null
2025-08-21 Fast globally optimal Truncated Least Squares point cloud registration with fixed rotation axis Ivo Ivanov et.al. 2508.15613 null
2025-08-21 Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising Jin Ye et.al. 2508.15553 null
2025-08-21 MExECON: Multi-view Extended Explicit Clothed humans Optimized via Normal integration Fulden Ece Uğur et.al. 2508.15500 null
2025-08-21 Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework Zongqi He et.al. 2508.15457 null
2025-08-25 DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians Cong Wang et.al. 2508.15376 null
2025-08-21 Image-Conditioned 3D Gaussian Splat Quantization Xinshuang Liu et.al. 2508.15372 null
2025-08-21 RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features Olga Matykina et.al. 2508.15353 null
2025-08-21 Mag-Match: Magnetic Vector Field Features for Map Matching and Registration William McDonald et.al. 2508.15300 null
2025-08-21 BasketLiDAR: The First LiDAR-Camera Multimodal Dataset for Professional Basketball MOT Ryunosuke Hayashi et.al. 2508.15299 null
2025-08-21 Collaborative Multi-Modal Coding for High-Quality 3D Generation Ziang Cao et.al. 2508.15228 null
2025-08-25 MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion Xuyang Chen et.al. 2508.15169 null
2025-08-21 Reliable Multi-view 3D Reconstruction for ‘Just-in-time’ Edge Environments Md. Nurul Absur et.al. 2508.15158 null
2025-08-21 Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors Jeonghyun Noh et.al. 2508.15151 null
2025-08-20 Virtual Community: An Open World for Humans, Robots, and Society Qinhong Zhou et.al. 2508.14893 null
2025-08-20 Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds Jia Lu et.al. 2508.14892 null
2025-08-20 GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects Licheng Shen et.al. 2508.14891 null
2025-08-22 MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Bingquan Dai et.al. 2508.14879 null
2025-08-20 Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization Canyu Zhao et.al. 2508.14811 null
2025-08-20 Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels Fabian Holst et.al. 2508.14767 null
2025-08-20 GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting Jiaxin Wei et.al. 2508.14717 null
2025-08-20 GeMS: Efficient Gaussian Splatting for Extreme Motion Blur Gopi Raju Matta et.al. 2508.14682 null
2025-08-20 UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling Peiming Li et.al. 2508.14604 null
2025-08-20 Safety-Critical Learning for Long-Tail Events: The TUM Traffic Accident Dataset Walter Zimmer et.al. 2508.14567 null
2025-08-20 GOGS: High-Fidelity Geometry and Relighting for Glossy Objects via Gaussian Surfels Xingyuan Yang et.al. 2508.14563 null
2025-08-20 Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization Sukhyun Jeong et.al. 2508.14561 null
2025-08-20 From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound Max Krähenmann et.al. 2508.14552 null
2025-08-20 LookOut: Real-World Humanoid Egocentric Navigation Boxiao Pan et.al. 2508.14466 null
2025-08-20 D^3-Talker: Dual-Branch Decoupled Deformation Fields for Few-Shot 3D Talking Head Synthesis Yuhang Guo et.al. 2508.14449 null
2025-08-20 Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting Gyusam Chang et.al. 2508.14443 null
2025-08-20 HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation Bing Han et.al. 2508.14431 null
2025-08-20 Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation Zhujun Li et.al. 2508.14358 null
2025-08-19 Pixels to Play: A Foundation Model for 3D Gameplay Yuguang Yue et.al. 2508.14295 null
2025-08-21 GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting Elena Alegret et.al. 2508.14278 null
2025-08-19 Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning Said Djafar Said et.al. 2508.14276 null
2025-08-19 SLAM-based Safe Indoor Exploration Strategy Omar Mostafa et.al. 2508.14235 null
2025-08-19 RynnEC: Bringing MLLMs into Embodied World Ronghao Dang et.al. 2508.14160 null
2025-08-19 Automated surgical planning with nnU-Net: delineation of the anatomy in hepatobiliary phase MRI Karin A. Olthof et.al. 2508.14133 null
2025-08-18 3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models Jolanta Mozyrska et.al. 2508.14122 null
2025-08-19 LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Chin-Yang Lin et.al. 2508.14041 null
2025-08-19 Distilled-3DGS: Distilled 3D Gaussian Splatting Lintao Xiang et.al. 2508.14037 null
2025-08-19 GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation Ken Deng et.al. 2508.14036 null
2025-08-19 Online 3D Gaussian Splatting Modeling with Novel View Selection Byeonggwon Lee et.al. 2508.14014 null
2025-08-19 ResPlan: A Large-Scale Vector-Graph Dataset of 17,000 Residential Floor Plans Mohamed Abouagour et.al. 2508.14006 null
2025-08-19 Self-Supervised Sparse Sensor Fusion for Long Range Perception Edoardo Palladin et.al. 2508.13995 null
2025-08-19 Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment Samuel Seligardi et.al. 2508.13989 null
2025-08-19 OmViD: Omni-supervised active learning for video action detection Aayush Rana et.al. 2508.13983 null
2025-08-19 ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving Xianda Guo et.al. 2508.13977 null
2025-08-19 Augmenting cobots for sheet-metal SMEs with 3D object recognition and localisation Martijn Cramer et.al. 2508.13964 null
2025-08-19 Real-Time, Population-Based Reconstruction of 3D Bone Models via Very-Low-Dose Protocols Yiqun Lin et.al. 2508.13947 null
2025-08-19 PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis Chunji Lv et.al. 2508.13911 null
2025-08-21 Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction Niklas Bubeck et.al. 2508.13826 null
2025-08-19 Is-NeRF: In-scattering Neural Radiance Field for Blurred Images Nan Luo et.al. 2508.13808 null
2025-08-19 Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing Feng-Lin Liu et.al. 2508.13797 null
2025-08-19 VisionLaw: Inferring Interpretable Intrinsic Dynamics from Visual Observations via Bilevel Optimization Jiajing Lin et.al. 2508.13792 null
2025-08-19 Shape-from-Template with Generalised Camera Agniva Sengupta et.al. 2508.13791 null
2025-08-19 Blast Hole Seeking and Dipping – The Navigation and Perception Framework in a Mine Site Inspection Robot Liyang Liu et.al. 2508.13785 null
2025-08-19 Deep Biomechanically-Guided Interpolation for Keypoint-Based Brain Shift Registration Tiago Assis et.al. 2508.13762 null
2025-08-19 Unleashing Semantic and Geometric Priors for 3D Scene Completion Shiyuan Chen et.al. 2508.13601 null
2025-08-19 The 9th AI City Challenge Zheng Tang et.al. 2508.13564 null
2025-08-19 Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics Yuchen Yang et.al. 2508.13562 null
2025-08-22 FLAIR: Frequency and Locality-Aware Implicit Neural Representations Sukhun Ko et.al. 2508.13544 null
2025-08-19 EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors Shikun Zhang et.al. 2508.13537 null
2025-08-19 FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention Liangyu Fu et.al. 2508.13483 null
2025-08-18 Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction Sedigheh Dargahi et.al. 2508.13340 null
2025-08-18 InnerGS: Internal Scenes Rendering via Factorized 3D Gaussian Splatting Shuxin Liang et.al. 2508.13287 null
2025-08-17 PreSem-Surf: RGB-D Surface Reconstruction with Progressive Semantic Modeling and SG-MLP Pre-Rendering Mechanism Yuyan Ye et.al. 2508.13228 null
2025-08-18 4DNeX: Feed-Forward 4D Generative Modeling Made Easy Zhaoxi Chen et.al. 2508.13154 null
2025-08-18 IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion Wenhao Hu et.al. 2508.13153 null
2025-08-24 Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping Siddharth Khandelwal et.al. 2508.13065 null
2025-08-18 IntelliCap: Intelligent Guidance for Consistent View Sampling Ayaka Yasunaga et.al. 2508.13043 null
2025-08-18 Multi-Phase Automated Segmentation of Dental Structures in CBCT Using a Lightweight Auto3DSeg and SegResNet Implementation Dominic LaBella et.al. 2508.12962 null
2025-08-18 MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation Wei Wei et.al. 2508.12948 null
2025-08-18 Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models Jianshu Zeng et.al. 2508.12945 null
2025-08-18 CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction Zhiwei Ning et.al. 2508.12917 null
2025-08-18 CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis Jiayi Wang et.al. 2508.12900 null
2025-08-18 MCTR: Midpoint Corrected Triangulation for Autonomous Racing via Digital Twin Simulation in CARLA Junhao Ye et.al. 2508.12729 null
2025-08-18 Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting Kangjie Chen et.al. 2508.12720 null
2025-08-18 Neural Rendering for Sensor Adaptation in 3D Object Detection Felix Embacher et.al. 2508.12695 null
2025-08-18 Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection Zhongyao Li et.al. 2508.12684 null
2025-08-18 Stable Diffusion-Based Approach for Human De-Occlusion Seung Young Noh et.al. 2508.12663 null
2025-08-18 DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video Hao Wen et.al. 2508.12644 null
2025-08-18 Synthesizing Accurate and Realistic T1-weighted Contrast-Enhanced MR Images using Posterior-Mean Rectified Flow Bastian Brandstötter et.al. 2508.12640 null
2025-08-19 WIPES: Wavelet-based Visual Primitives Wenhao Zhang et.al. 2508.12615 null
2025-08-17 Segmenting Thalamic Nuclei: T1 Maps Provide a Reliable and Efficient Solution Anqi Feng et.al. 2508.12508 null
2025-08-17 FractMorph: A Fractional Fourier-Based Multi-Domain Transformer for Deformable Image Registration Shayan Kebriti et.al. 2508.12445 null
2025-08-21 TiP4GEN: Text to Immersive Panorama 4D Scene Generation Ke Xing et.al. 2508.12415 null
2025-08-19 SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes Jun Zeng et.al. 2508.12410 null
2025-08-17 Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR Fatemeh Ghorbani Lohesara et.al. 2508.12336 null
2025-08-17 Semi-Infinite Programming for Collision-Avoidance in Optimal and Model Predictive Control Yunfan Gao et.al. 2508.12335 null
2025-08-17 Improving Densification in 3D Gaussian Splatting for High-Fidelity Rendering Xiaobin Deng et.al. 2508.12313 null
2025-08-17 In vivo 3D ultrasound computed tomography of musculoskeletal tissues with generative neural physics Zhijun Zeng et.al. 2508.12226 null
2025-08-17 Splat Feature Solver Butian Xiong et.al. 2508.12216 null
2025-08-16 RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis Wenqing Wang et.al. 2508.12163 null
2025-08-16 VELVET-Med: Vision and Efficient Language Pre-training for Volumetric Imaging Tasks in Medicine Ziyang Zhang et.al. 2508.12108 null
2025-08-16 Enhancing 3D point accuracy of laser scanner through multi-stage convolutional neural network for applications in construction Qinyuan Fan et.al. 2508.12089 null
2025-08-16 VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models Haidong Xu et.al. 2508.12081 null
2025-08-16 OASIS: Real-Time Opti-Acoustic Sensing for Intervention Systems in Unstructured Environments Amy Phung et.al. 2508.12071 null
2025-08-16 InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes Hongyuan Liu et.al. 2508.12015 null
2025-08-16 UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding Yueming Xu et.al. 2508.11952 null
2025-08-16 Transferable Class Statistics and Multi-scale Feature Approximation for 3D Object Detection Hao Peng et.al. 2508.11951 null
2025-08-16 OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation Jilei Mao et.al. 2508.11898 null
2025-08-16 ComplicitSplat: Downstream Models are Vulnerable to Blackbox Attacks by 3D Gaussian Splat Camouflages Matthew Hull et.al. 2508.11854 null
2025-08-15 Towards Understanding 3D Vision: the Role of Gaussian Curvature Sherlon Almeida da Silva et.al. 2508.11825 null
2025-08-15 CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion Zhe Zhu et.al. 2508.11603 null
2025-08-15 Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting Simona Kocour et.al. 2508.11431 null
2025-08-15 RMFAT: Recurrent Multi-scale Feature Atmospheric Turbulence Mitigator Zhiming Liu et.al. 2508.11409 null
2025-08-15 G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration Ramil Khafizov et.al. 2508.11379 null
2025-08-15 AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis Zonglin Wu et.al. 2508.11375 null
2025-08-15 HOID-R1: Reinforcement Learning for Open-World Human-Object Interaction Detection Reasoning with Multimodal Large Language Model Zhenhao Zhang et.al. 2508.11350 null
2025-08-15 Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking Haonan Zhang et.al. 2508.11323 null
2025-08-15 Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction Muzammil Khan et.al. 2508.11282 null
2025-08-15 Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds Pei He et.al. 2508.11265 null
2025-08-15 Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception Junjie Wang et.al. 2508.11256 null
2025-08-15 StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation Seungmi Lee et.al. 2508.11203 null
2025-08-15 CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector Abhinav Kumar et.al. 2508.11185 null
2025-08-14 HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing Xinjie Gao et.al. 2508.11106 null
2025-08-14 Data-Driven Abdominal Phenotypes of Type 2 Diabetes in Lean, Overweight, and Obese Cohorts Lucas W. Remedios et.al. 2508.11063 null
2025-08-14 Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset Wentao Mo et.al. 2508.11058 null
2025-08-20 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation Nikolaos Gkanatsios et.al. 2508.11002 null
2025-08-12 Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction Cheng Chen et.al. 2508.10936 null
2025-08-18 HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffusion Model Qi Liu et.al. 2508.10935 null
2025-08-12 ViPE: Video Pose Engine for 3D Geometric Perception Jiahui Huang et.al. 2508.10934 null
2025-08-14 Quantum Visual Fields with Neural Amplitude Encoding Shuteng Wang et.al. 2508.10900 null
2025-08-14 Puppeteer: Rig and Animate Your 3D Models Chaoyue Song et.al. 2508.10898 null
2025-08-14 Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning Mengyuan Liu et.al. 2508.10897 null
2025-08-14 STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer Yushi Lan et.al. 2508.10893 null
2025-08-14 TexVerse: A Universe of 3D Objects with High-Resolution Textures Yibo Zhang et.al. 2508.10868 null
2025-08-14 An Efficient Model-Driven Groupwise Approach for Atlas Construction Ziwei Zou et.al. 2508.10743 null
2025-08-14 Novel View Synthesis using DDIM Inversion Sehajdeep Singh et.al. 2508.10688 null
2025-08-14 Physics-Informed Joint Multi-TE Super-Resolution with Implicit Neural Representation for Robust Fetal T2 Mapping Busra Bulut et.al. 2508.10680 null
2025-08-14 DIVA-VQA: Detecting Inter-frame Variations in UGC Video Quality Xinyi Wang et.al. 2508.10605 null
2025-08-14 SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving Philipp Wolters et.al. 2508.10567 null
2025-08-15 PTQAT: A Hybrid Parameter-Efficient Quantization Algorithm for 3D Perception Tasks Xinhao Wang et.al. 2508.10557 null
2025-08-14 Multi-Sample Anti-Aliasing and Constrained Optimization for 3D Gaussian Splatting Zheng Zhou et.al. 2508.10507 null
2025-08-14 STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes Keishi Ishihara et.al. 2508.10427 null
2025-08-14 SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection Chaesong Park et.al. 2508.10411 null
2025-08-14 Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models Hyundo Lee et.al. 2508.10382 null
2025-08-14 VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation Ryota Tanaka et.al. 2508.10281 null
2025-08-14 Deep Learning for Crack Detection: A Review of Learning Paradigms, Generalizability, and Datasets Xinan Zhang et.al. 2508.10256 null
2025-08-13 EntropyGS: An Efficient Entropy Coding on 3D Gaussian Splatting Yuning Huang et.al. 2508.10227 null
2025-08-13 B-repLer: Semantic B-rep Latent Editor using Large Language Models Yilin Liu et.al. 2508.10201 null
2025-08-18 From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation Ke Niu et.al. 2508.10118 null
2025-08-13 A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation Shuting He et.al. 2508.09977 null
2025-08-13 PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image Geonhee Sim et.al. 2508.09973 null
2025-08-13 LIA-X: Interpretable Latent Portrait Animator Yaohui Wang et.al. 2508.09959 null
2025-08-13 E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras Chaoran Feng et.al. 2508.09912 null
2025-08-13 HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics Weiqi Li et.al. 2508.09858 null
2025-08-13 Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes Yuekun Wu et.al. 2508.09855 null
2025-08-13 ARI3D: A Software for Interactive Quantification of Regions in X-Ray CT 3D Images Jan Phillipp Albrecht et.al. 2508.09849 null
2025-08-13 RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians Shenxing Wei et.al. 2508.09830 null
2025-08-13 TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos Jinxi Li et.al. 2508.09811 null
2025-08-13 Automated Segmentation of Coronal Brain Tissue Slabs for 3D Neuropathology Jonathan Williams Ramirez et.al. 2508.09805 null
2025-08-13 MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention Xin Du et.al. 2508.09802 null
2025-08-13 Surg-InvNeRF: Invertible NeRF for 3D tracking and reconstruction in surgical vision Gerardo Loza et.al. 2508.09681 null
2025-08-13 GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors Xingyilang Yin et.al. 2508.09667 null
2025-08-13 Noise-adapted Neural Operator for Robust Non-Line-of-Sight Imaging Lianfang Wang et.al. 2508.09655 null
2025-08-13 TOTNet: Occlusion-Aware Temporal Tracking for Robust Ball Detection in Sports Videos Hao Xu et.al. 2508.09650 null
2025-08-13 The Brain Resection Multimodal Image Registration (ReMIND2Reg) 2025 Challenge Reuben Dorent et.al. 2508.09649 null
2025-08-13 Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors Giorgos Karvounas et.al. 2508.09629 null
2025-08-14 Semantic-aware DropSplat: Adaptive Pruning of Redundant Gaussians for 3D Aerial-View Segmentation Xu Tang et.al. 2508.09626 null
2025-08-13 MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography Daniel Barco et.al. 2508.09616 null
2025-08-13 DualPhys-GS: Dual Physically-Guided 3D Gaussian Splatting for Underwater Scene Reconstruction Jiachen Li et.al. 2508.09610 null
2025-08-15 SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing Heyi Sun et.al. 2508.09597 null
2025-08-13 CaRoBio: 3D Cable Routing with a Bio-inspired Gripper Fingernail Jiahui Zuo et.al. 2508.09558 null
2025-08-14 Iterative Volume Fusion for Asymmetric Stereo Matching Yuanting Gao et.al. 2508.09543 null
2025-08-13 SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images Xuejun Huang et.al. 2508.09479 null
2025-08-13 CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios Jialei Xu et.al. 2508.09470 null
2025-08-13 DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation Haoxiang Shi et.al. 2508.09444 null
2025-08-13 Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving Guangxun Zhu et.al. 2508.09404 null
2025-08-12 X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents Guoxian Song et.al. 2508.09383 null
2025-08-12 Gradient-Direction-Aware Density Control for 3D Gaussian Splatting Zheng Zhou et.al. 2508.09239 null
2025-08-12 Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices Ya Zou et.al. 2508.09136 null
2025-08-13 GeoVLA: Empowering 3D Representations in Vision-Language-Action Models Lin Sun et.al. 2508.09071 null
2025-08-12 A new dataset and comparison for multi-camera frame synthesis Conall Daly et.al. 2508.09068 null
2025-08-12 VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception Fuhao Chang et.al. 2508.09061 null
2025-08-12 DASC: Depth-of-Field Aware Scene Complexity Metric for 3D Visualization on Light Field Display Kamran Akbar et.al. 2508.08928 null
2025-08-12 Masked Clustering Prediction for Unsupervised Point Cloud Pre-training Bin Ren et.al. 2508.08910 null
2025-08-12 GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments Lin Zeng et.al. 2508.08867 null
2025-08-12 DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI Bo-Hsun Chen et.al. 2508.08831 null
2025-08-12 3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs Noor Ahmed et.al. 2508.08821 null
2025-08-12 MonoPartNeRF: Human Reconstruction from Monocular Video via Part-Based Neural Radiance Fields Yao Lu et.al. 2508.08798 null
2025-08-12 SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA) Trong-Thuan Nguyen et.al. 2508.08781 null
2025-08-12 ROD: RGB-Only Fast and Efficient Off-road Freespace Detection Tong Sun et.al. 2508.08697 null
2025-08-14 Yan: Foundational Interactive Video Generation Deheng Ye et.al. 2508.08601 null
2025-08-12 RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space Jingyun Liang et.al. 2508.08588 null
2025-08-12 Bio-Generative Design Morphology with Radiolaria: An application of a Nature-Based Generative Shape Grammar for Geometrical Design of Space Frames Michael Kleiss et.al. 2508.08572 null
2025-08-12 Revisiting the City Tower Project: Geometric Principles and Structural Morphology in the Works of Louis I. Kahn and Anne Tyng Aysan Mokhtarimousavi et.al. 2508.08561 null
2025-08-11 Empowering Children to Create AI-Enabled Augmented Reality Experiences Lei Zhang et.al. 2508.08467 null
2025-08-11 Enhanced Liver Tumor Detection in CT Images Using 3D U-Net and Bat Algorithm for Hyperparameter Optimization Nastaran Ghorbani et.al. 2508.08452 null
2025-08-11 ImageDDI: Image-enhanced Molecular Motif Sequence Representation for Drug-Drug Interaction Prediction Yuqin He et.al. 2508.08338 null
2025-08-11 Learning an Implicit Physics Model for Image-based Fluid Simulation Emily Yue-Ting Jia et.al. 2508.08254 null
2025-08-11 ReferSplat: Referring Segmentation in 3D Gaussian Splatting Shuting He et.al. 2508.08252 null
2025-08-11 LL3M: Large Language 3D Modelers Sining Lu et.al. 2508.08228 null
2025-08-11 SAGOnline: Segment Any Gaussians Online Wentao Sun et.al. 2508.08219 null
2025-08-11 Spatial-ORMLLM: Improve Spatial Relation Understanding in the Operating Room with Multimodal Large Language Model Peiqi He et.al. 2508.08199 null
2025-08-11 Emergent morphogenesis via planar fabrication enabled by a reduced model of composites Yupeng Zhang et.al. 2508.08198 null
2025-08-12 3D Human Mesh Estimation from Single View RGBD Ozhan Suat et.al. 2508.08178 null
2025-08-13 CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data Chongke Bi et.al. 2508.08173 null
2025-08-11 FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting Yitong Yang et.al. 2508.08136 null
2025-08-11 GRASPTrack: Geometry-Reasoned Association via Segmentation and Projection for Multi-Object Tracking Xudong Han et.al. 2508.08117 null
2025-08-11 3D Plant Root Skeleton Detection and Extraction Jiakai Lin et.al. 2508.08094 null
2025-08-11 Matrix-3D: Omnidirectional Explorable 3D World Generation Zhongqi Yang et.al. 2508.08086 null
2025-08-11 S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix Peng Dai et.al. 2508.08048 null
2025-08-11 Aerial Target Encirclement and Interception with Noisy Range Observations Fen Liu et.al. 2508.08046 null
2025-08-11 TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation Huawei Sun et.al. 2508.08038 null
2025-08-11 Mitigating Biases in Surgical Operating Rooms with Geometry Tony Danjun Wang et.al. 2508.08028 null
2025-08-11 TrackOR: Towards Personalized Intelligent Operating Rooms Through Robust Tracking Tony Danjun Wang et.al. 2508.07968 null
2025-08-11 Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection Jakub Binda et.al. 2508.07923 null
2025-08-11 Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models Johanna P. Müller et.al. 2508.07903 null
2025-08-11 NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction Tianle Zeng et.al. 2508.07897 null
2025-08-11 Autonomous Navigation of Cloud-Controlled Quadcopters in Confined Spaces Using Multi-Modal Perception and LLM-Driven High Semantic Reasoning Shoaib Ahmmad et.al. 2508.07885 null
2025-08-11 Vertex Features for Neural Global Illumination Rui Su et.al. 2508.07852 null
2025-08-11 Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images Konrad Reuter et.al. 2508.07851 null
2025-08-11 CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving Qi Xiang et.al. 2508.07838 null
2025-08-11 DiTVR: Zero-Shot Diffusion Transformer for Video Restoration Sicheng Gao et.al. 2508.07811 null
2025-08-11 Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning Bao Li et.al. 2508.07804 null
2025-08-11 MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks Yushen Xu et.al. 2508.07803 null
2025-08-11 Forecasting Continuous Non-Conservative Dynamical Systems in SO(3) Lennart Bastian et.al. 2508.07775 null
2025-08-13 Multi-view Normal and Distance Guidance Gaussian Splatting for Surface Reconstruction Bo Jia et.al. 2508.07701 null
2025-08-11 Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing Weitao Wang et.al. 2508.07700 null
2025-08-11 GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions Helong Huang et.al. 2508.07650 null
2025-08-11 Breaking Down and Building Up: Mixture of Skill-Based Vision-and-Language Navigation Agents Tianyi Ma et.al. 2508.07642 null
2025-08-11 End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy Zifan Wang et.al. 2508.07611 null
2025-08-12 Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring Ludan Zhang et.al. 2508.07552 null
2025-08-11 CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts Junuk Cha et.al. 2508.07540 null
2025-08-10 Novel View Synthesis with Gaussian Splatting: Impact on Photogrammetry Model Accuracy and Resolution Pranav Chougule et.al. 2508.07483 null
2025-08-10 CharacterShot: Controllable and Consistent 4D Character Animation Junyao Gao et.al. 2508.07409 null
2025-08-10 DIP-GS: Deep Image Prior For Gaussian Splatting Sparse View Recovery Rajaei Khatib et.al. 2508.07372 null
2025-08-10 GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction Qilin Zhang et.al. 2508.07355 null
2025-08-10 Navigation and Exploration with Active Inference: from Biology to Industry Daria de Tinguy et.al. 2508.07269 null
2025-08-10 Fading the Digital Ink: A Universal Black-Box Attack Framework for 3DGS Watermarking Systems Qingyuan Zeng et.al. 2508.07263 null
2025-08-12 Understanding Dynamic Scenes in Ego Centric 4D Point Clouds Junsheng Huang et.al. 2508.07251 null
2025-08-10 3D Gaussian Representations with Motion Trajectory Field for Dynamic Scene Reconstruction Xuesong Li et.al. 2508.07182 null
2025-08-10 CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion Xiaotong Lin et.al. 2508.07162 null
2025-08-09 DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit Aiden Swann et.al. 2508.07118 null
2025-08-09 AugLift: Boosting Generalization in Lifting-based 3D Human Pose Estimation Nikolai Warner et.al. 2508.07112 null
2025-08-09 Communication-Efficient Multi-Agent 3D Detection via Hybrid Collaboration Yue Hu et.al. 2508.07092 null
2025-08-09 ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting Sandro Papais et.al. 2508.07089 null
2025-08-09 TeSO: Representing and Compressing 3D Point Cloud Scenes with Textured Surfel Octree Yueyu Hu et.al. 2508.07083 null
2025-08-09 SAGCNet: Spatial-Aware Graph Completion Network for Missing Slice Imputation in Population CMR Imaging Junkai Liu et.al. 2508.07041 null
2025-08-09 3DGS-VBench: A Comprehensive Video Quality Evaluation Benchmark for 3DGS Compression Yuke Xing et.al. 2508.07038 null
2025-08-12 HiMat: DiT-based Ultra-High Resolution SVBRDF Generation Zixiong Wang et.al. 2508.07011 null
2025-08-09 Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments Gian Mario Favero et.al. 2508.07006 null
2025-08-09 EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events Siyu Chen et.al. 2508.07003 null
2025-08-09 Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View Ulas Gunes et.al. 2508.06968 null
2025-08-09 Hybrid Machine Learning Framework for Predicting Geometric Deviations from 3D Surface Metrology Hamidreza Samadi et.al. 2508.06845 null
2025-08-09 Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling Aarav Mehta et.al. 2508.06805 null
2025-08-09 DiffUS: Differentiable Ultrasound Rendering from Volumetric Imaging Noe Bertramo et.al. 2508.06768 null
2025-08-09 VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions Yash Garg et.al. 2508.06757 null
2025-08-08 Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video Jixuan He et.al. 2508.06715 null
2025-08-08 Fourier Optics and Deep Learning Methods for Fast 3D Reconstruction in Digital Holography Justin London et.al. 2508.06703 null
2025-08-08 CoDe-NeRF: Neural Rendering via Dynamic Coefficient Decomposition Wenpeng Xing et.al. 2508.06632 null
2025-08-08 LightSwitch: Multi-view Relighting with Material-guided Diffusion Yehonathan Litman et.al. 2508.06494 null
2025-08-08 MotionSwap Om Patil et.al. 2508.06430 null
2025-08-08 FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation Wenbin Teng et.al. 2508.06392 null
2025-08-08 ViPro-2: Unsupervised State Estimation via Integrated Dynamics for Guiding Video Prediction Patrick Takenaka et.al. 2508.06335 null
2025-08-08 L2Calib: $SE(3)$-Manifold Reinforcement Learning for Robust Extrinsic Calibration with Degenerate Motion Resilience Baorun Li et.al. 2508.06330 null
2025-08-08 Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? Xin Ci Wong et.al. 2508.06327 null
2025-08-08 Real-Time 3D Vision-Language Embedding Mapping Christian Rauch et.al. 2508.06291 null
2025-08-08 Situationally-aware Path Planning Exploiting 3D Scene Graphs Saad Ejaz et.al. 2508.06283 null
2025-08-08 XAG-Net: A Cross-Slice Attention and Skip Gating Network for 2.5D Femur MRI Segmentation Byunghyun Ko et.al. 2508.06258 null
2025-08-08 PA-HOI: A Physics-Aware Human and Object Interaction Dataset Ruiyan Wang et.al. 2508.06205 null
2025-08-08 AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection Zhaopeng Gu et.al. 2508.06203 null
2025-08-08 UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting Wenpeng Xing et.al. 2508.06169 null
2025-08-08 Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation YoungChan Choi et.al. 2508.06136 null
2025-08-12 GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving Jian Wang et.al. 2508.06113 null
2025-08-08 MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment Gui Zou et.al. 2508.06104 null
2025-08-08 Towards MR-Based Trochleoplasty Planning Michael Wehrli et.al. 2508.06076 null
2025-08-08 LV-Net: Anatomy-aware lateral ventricle shape modeling with a case study on Alzheimer’s disease, the Australian Imaging Biomarkers and Lifestyle flagship study of ageing Wonjung Park et.al. 2508.06055 null
2025-08-08 Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts Kiran Chhatre et.al. 2508.06032 null
2025-08-08 ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors Minsu Kim et.al. 2508.06014 null
2025-08-08 AnimateScene: Camera-controllable Animation in Any Scene Qingyang Liu et.al. 2508.05982 null
2025-08-08 A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image Yanxing Liang et.al. 2508.05950 null
2025-08-08 Enhancing Construction Site Analysis and Understanding with 3D Segmentation Sri Ramana Saketh Vasanthawada et.al. 2508.05922 null
2025-08-07 HOLODECK 2.0: Vision-Language-Guided 3D World Generation with Editing Zixuan Bian et.al. 2508.05899 null
2025-08-07 MZEN: Multi-Zoom Enhanced NeRF for 3-D Reconstruction with Unknown Camera Poses Jong-Ik Park et.al. 2508.05819 null
2025-08-07 Optimization-Free Style Transfer for 3D Gaussian Splats Raphael Du Sablon et.al. 2508.05813 null
2025-08-07 MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss Can Zhao et.al. 2508.05772 null
2025-08-07 GAP: Gaussianize Any Point Clouds with Text Guidance Weiqi Zhang et.al. 2508.05631 null
2025-08-07 Physically Controllable Relighting of Photographs Chris Careaga et.al. 2508.05626 null
2025-08-07 Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity Yuhan Zhang et.al. 2508.05609 null
2025-08-07 Robust adaptive fuzzy sliding mode control for trajectory tracking for of cylindrical manipulator Van Cuong Pham et.al. 2508.05584 null
2025-08-07 Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis Kunyu Feng et.al. 2508.05580 null
2025-08-07 Point cloud segmentation for 3D Clothed Human Layering Davide Garavaso et.al. 2508.05531 null
2025-08-07 Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking Zewei Wu et.al. 2508.05514 null
2025-08-07 MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Shibo Wang et.al. 2508.05506 null
2025-08-07 Symmetry Understanding of 3D Shapes via Chirality Disentanglement Weikang Wang et.al. 2508.05505 null
2025-08-07 Computational Design and Fabrication of Modular Robots with Untethered Control Manas Bhargava et.al. 2508.05410 null
2025-08-07 CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation Hamza Kalisch et.al. 2508.05375 null
2025-08-07 3DGabSplat: 3D Gabor Splatting for Frequency-adaptive Radiance Field Rendering Junyu Zhou et.al. 2508.05343 null
2025-08-08 CF3: Compact and Fast 3D Feature Fields Hyunjoon Lee et.al. 2508.05254 null
2025-08-07 Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer Junyi Wang et.al. 2508.05240 null
2025-08-07 EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery Bingyu Yang et.al. 2508.05205 null
2025-08-07 Refining Gaussian Splatting: A Volumetric Densification Approach Mohamed Abdul Gafoor et.al. 2508.05187 null
2025-08-07 Learning to See and Act: Task-Aware View Planning for Robotic Manipulation Yongjie Bai et.al. 2508.05186 null
2025-08-07 FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction Mohammed Daba et.al. 2508.05153 null
2025-08-07 FedGIN: Federated Learning with Dynamic Global Intensity Non-linear Augmentation for Organ Segmentation using Multi-modal Images Sachin Dudda Nagaraju et.al. 2508.05137 null
2025-08-07 A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding Mahmoud Chick Zaouali et.al. 2508.05064 null
2025-08-07 DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion Yifeng Huang et.al. 2508.05060 null
2025-08-07 MAG-Nav: Language-Driven Object Navigation Leveraging Memory-Reserved Active Grounding Weifan Zhang et.al. 2508.05021 null
2025-08-07 Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion Shenglun Chen et.al. 2508.04984 null
2025-08-07 UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS Zhihao Guo et.al. 2508.04968 null
2025-08-07 Laplacian Analysis Meets Dynamics Modelling: Gaussian Splatting for 4D Reconstruction Yifan Zhou et.al. 2508.04966 null
2025-08-07 Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting Zijian Wang et.al. 2508.04965 null
2025-08-06 CryoGS: Gaussian Splatting for Cryo-EM Homogeneous Reconstruction Suyi Chen et.al. 2508.04929 null
2025-08-06 LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan et.al. 2508.04847 null
2025-08-06 Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models Mehrdad Moradi et.al. 2508.04818 null
2025-08-05 Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy Shuo Chen et.al. 2508.04728 null
2025-08-06 Occupancy Learning with Spatiotemporal Memory Ziyang Leng et.al. 2508.04705 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics Ye Pan et.al. 2508.04687 null
2025-08-06 PixCuboid: Room Layout Estimation from Multi-view Featuremetric Alignment Gustav Hanning et.al. 2508.04659 null
2025-08-06 OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment Tongfan Guan et.al. 2508.04611 null
2025-08-06 $NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything Lingfeng Zhang et.al. 2508.04598 null
2025-08-06 Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline Linqing Zhao et.al. 2508.04597 null
2025-08-06 LA-CaRe-CNN: Cascading Refinement CNN for Left Atrial Scar Segmentation Franz Thaler et.al. 2508.04553 null
2025-08-06 Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds Haodong Zhu et.al. 2508.04508 null
2025-08-06 MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos Daisheng Jin et.al. 2508.04505 null
2025-08-06 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation Shuzhou Yang et.al. 2508.04467 null
2025-08-06 Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models Yinan Yu et.al. 2508.04406 null
2025-08-06 RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization Yanyan Li et.al. 2508.04335 null
2025-08-07 Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research Ke Li et.al. 2508.04326 null
2025-08-06 MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction Yaopeng Lou et.al. 2508.04297 null
2025-08-06 PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space Chenlei Lv et.al. 2508.04286 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition Jiahui Li et.al. 2508.04224 null
2025-08-06 Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification Jianxun Yu et.al. 2508.04205 null
2025-08-06 IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control Lijuan Liu et.al. 2508.04147 null
2025-08-06 DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting Zexu Huang et.al. 2508.04099 null
2025-08-06 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Yi-Ting Chen et.al. 2508.04090 null
2025-08-06 RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting Zhan Li et.al. 2508.04078 null
2025-08-06 Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation Jiayi He et.al. 2508.04049 null
2025-08-06 JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation Zheng Zhang et.al. 2508.03997 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 Inland-LOAM: Voxel-Based Structural Semantic Mapping for Inland Waterways Zhongbi Luo et.al. 2508.03672 null
2025-08-05 OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World Katherine Liu et.al. 2508.03669 null
2025-08-06 Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images Xiangyu Sun et.al. 2508.03643 null
2025-08-05 FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation Nassim Ali Ousalah et.al. 2508.03618 null
2025-08-05 CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models Ana Lawry Aguila et.al. 2508.03594 null
2025-08-05 Spatial Imputation Drives Cross-Domain Alignment for EEG Classification Hongjun Liu et.al. 2508.03437 null
2025-08-05 WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval Junlong Ren et.al. 2508.03343 null
2025-08-05 Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion Wentao Qu et.al. 2508.03252 null
2025-08-05 Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing Hongyu Shen et.al. 2508.03227 null
2025-08-05 Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling Heng Wu et.al. 2508.03186 null
2025-08-05 Duplex-GS: Proxy-Guided Weighted Blending for Real-Time Order-Independent Gaussian Splatting Weihang Liu et.al. 2508.03180 null
2025-08-05 H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction Heng Jia et.al. 2508.03118 null
2025-08-05 Point2Act: Efficient 3D Distillation of Multimodal LLMs for Zero-Shot Context-Aware Grasping Sang Min Kim et.al. 2508.03099 null
2025-08-05 RobustGS: Unified Boosting of Feedforward 3D Gaussian Splatting under Low-Quality Conditions Anran Wu et.al. 2508.03077 null
2025-08-05 SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation Bo Zhang et.al. 2508.03069 null
2025-08-05 A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation Tongxu Zhang et.al. 2508.03057 null
2025-08-05 SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting Liheng Zhang et.al. 2508.03017 null
2025-08-05 ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion Meng Zhou et.al. 2508.03008 null
2025-08-05 GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring Linji Wang et.al. 2508.02988 null
2025-08-04 Evaluation of 3D Counterfactual Brain MRI Generation Pengwei Sun et.al. 2508.02880 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing Mikołaj Zieliński et.al. 2508.02831 null
2025-08-04 PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation Zongyou Yang et.al. 2508.02806 null
2025-08-04 PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting Yijun Xu et.al. 2508.02660 null
2025-08-04 RL-U$^2$Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart Segmentation Jierui Qu et.al. 2508.02557 null
2025-08-04 Uncertainty-Aware Perception-Based Control for Autonomous Racing Jelena Trisovic et.al. 2508.02494 null
2025-08-05 Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting Jianchao Wang et.al. 2508.02493 null
2025-08-06 GR-Gaussian: Graph-Based Radiative Gaussian Splatting for Sparse-View CT Reconstruction Yikuang Yuluo et.al. 2508.02408 null
2025-08-04 Correspondence-Free Fast and Robust Spherical Point Pattern Registration Anik Sarker et.al. 2508.02339 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering Fangxin Liu et.al. 2508.02304 null
2025-08-04 Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection Jae-Young Kang et.al. 2508.02288 null
2025-08-04 SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion Rui Qian et.al. 2508.02261 null
2025-08-04 GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting Lei Yao et.al. 2508.02172 null
2025-08-04 Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes Tom Fischer et.al. 2508.02157 null
2025-08-04 ScrewSplat: An End-to-End Method for Articulated Object Recognition Seungyeon Kim et.al. 2508.02146 null
2025-08-04 VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling Yuru Xiao et.al. 2508.02129 null
2025-08-04 REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification Hongzhao Chen et.al. 2508.02104 null
2025-08-04 StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion Haoxin Yang et.al. 2508.02056 null
2025-08-04 Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure Ziling Wang et.al. 2508.02034 null
2025-08-04 On-the-Fly Object-aware Representative Point Selection in Point Cloud Xiaoyu Zhang et.al. 2508.01980 null
2025-08-04 From Photons to Physics: Autonomous Indoor Drones and the Future of Objective Property Assessment Petteri Teikari et.al. 2508.01965 null
2025-08-03 Less is More: AMBER-AFNO – a New Benchmark for Lightweight 3D Medical Image Segmentation Andrea Dosi et.al. 2508.01941 null
2025-08-03 MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning Akash Venkateshwaran et.al. 2508.01907 null
2025-08-03 Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems Zhongliang Guo et.al. 2508.01845 null
2025-08-03 OmniEvent: Unified Event Representation Learning Weiqi Yan et.al. 2508.01842 null
2025-08-03 Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Yufei Zhang et.al. 2508.01835 null
2025-08-03 Skip priors and add graph-based anatomical information, for point-based Couinaud segmentation Xiaotong Zhang et.al. 2508.01785 null
2025-08-05 VPN: Visual Prompt Navigation Shuo Feng et.al. 2508.01766 null
2025-08-03 AG$^2$aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing Zhaonan Wang et.al. 2508.01740 null
2025-08-03 OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping Danyang Li et.al. 2508.01723 null
2025-08-03 LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving Luqi Cheng et.al. 2508.01704 null
2025-08-03 Register Anything: Estimating “Corresponding Prompts” for Segment Anything Model Shiqi Huang et.al. 2508.01697 null
2025-08-03 DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing Yufeng Chi et.al. 2508.01684 null
2025-08-03 DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding Hanqing Wang et.al. 2508.01651 null
2025-08-03 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Na Zhang et.al. 2508.01650 null
2025-08-03 Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection Hanxi Li et.al. 2508.01591 null
2025-08-03 A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction Hua Yu et.al. 2508.01585 null
2025-08-03 Deeply Supervised Multi-Task Autoencoder for Biological Brain Age estimation using three dimensional T$_1$-weighted magnetic resonance imaging Mehreen Kanwal et.al. 2508.01565 null
2025-08-03 Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion Sara Shoouri et.al. 2508.01562 null
2025-08-02 Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning Jack Zeng et.al. 2508.01522 null
2025-08-02 EfficientGFormer: Multimodal Brain Tumor Segmentation via Pruned Graph-Augmented Transformer Fatemeh Ziaeetabar et.al. 2508.01465 null
2025-08-02 Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians Quankai Gao et.al. 2508.01464 null
2025-08-02 Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation Sikha O K et.al. 2508.01460 null
2025-08-05 3DRot: 3D Rotation Augmentation for RGB-Based 3D Tasks Shitian Yang et.al. 2508.01423 null
2025-08-02 ReMu: Reconstructing Multi-layer 3D Clothed Human from Image Layers Onat Vuran et.al. 2508.01381 null
2025-08-02 P3P Made Easy Seong Hun Lee et.al. 2508.01312 null
2025-08-02 C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor Haoquan Lu et.al. 2508.01311 null
2025-08-02 CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis Alec Sargood et.al. 2508.01292 null
2025-08-02 Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching Chuang-Wei Liu et.al. 2508.01275 null
2025-08-05 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh Shuangkang Fang et.al. 2508.01242 null
2025-08-02 OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS Han Ling et.al. 2508.01239 null
2025-08-02 Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system Jiyong Kim et.al. 2508.01230 null
2025-08-02 MoGaFace: Momentum-Guided and Texture-Aware Gaussian Avatars for Consistent Facial Geometry Yujian Liu et.al. 2508.01218 null
2025-08-02 Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization? Bolei Chen et.al. 2508.01216 null
2025-08-02 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Zhan Shi et.al. 2508.01197 null
2025-08-02 Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning Xinhang Wan et.al. 2508.01184 null
2025-08-02 No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views Ranran Huang et.al. 2508.01171 null
2025-08-02 DELTAv2: Accelerating Dense 3D Tracking Tuan Duc Ngo et.al. 2508.01170 null
2025-08-02 OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding Dianyi Yang et.al. 2508.01150 null
2025-08-02 Design of Q8bot: A Miniature, Low-Cost, Dynamic Quadruped Built with Zero Wires Yufeng Wu et.al. 2508.01149 null
2025-08-02 UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation Chaitanya Patel et.al. 2508.01126 null
2025-08-01 DreamSat-2.0: Towards a General Single-View Asteroid 3D Reconstruction Santiago Diaz et.al. 2508.01079 null
2025-08-01 Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation Fenghe Tang et.al. 2508.01064 null
2025-08-01 Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans Theo Di Piazza et.al. 2508.01045 null
2025-08-01 3D Reconstruction via Incremental Structure From Motion Muhammad Zeeshan et.al. 2508.01019 null
2025-08-01 Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection Cheng-You Lu et.al. 2508.01014 null
2025-08-01 Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF Massoud Pourmandi et.al. 2508.00967 null
2025-07-31 Investigating Crossing Perception in 3D Graph Visualisation Ying Zhang et.al. 2508.00950 null
2025-08-01 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation Wenxuan Guo et.al. 2508.00823 null
2025-08-01 Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning Alexander Nikitas Dimopoulos et.al. 2508.00822 null
2025-08-01 GECO: Geometrically Consistent Embedding with Lightspeed Inference Regine Hartwig et.al. 2508.00746 null
2025-08-01 Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR Adwait Chandorkar et.al. 2508.00744 null
2025-08-04 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Junzhe Lu et.al. 2508.00599 null
2025-08-01 OmniUnet: A Multimodal Network for Unstructured Terrain Segmentation on Planetary Rovers Using RGB, Depth, and Thermal Imagery Raul Castilla-Arquillo et.al. 2508.00580 null
2025-08-04 LesiOnTime – Joint Temporal and Clinical Modeling for Small Breast Lesion Segmentation in Longitudinal DCE-MRI Mohammed Kamran et.al. 2508.00496 null
2025-08-01 HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection Jiaping Cao et.al. 2508.00473 null
2025-08-01 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Nan Xiang et.al. 2508.00428 null
2025-08-01 Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Seunggeun Chi et.al. 2508.00427 null
2025-08-01 Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents Janika Deborah Gajo et.al. 2508.00400 null
2025-08-01 Occlusion-robust Stylization for Drawing-based 3D Animation Sunjae Yoon et.al. 2508.00398 null
2025-08-01 SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies Liang Han et.al. 2508.00366 null
2025-08-01 Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering Yan Gong et.al. 2508.00358 null
2025-08-01 Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging Tianshuang Qiu et.al. 2508.00354 null
2025-08-01 AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer Jin Lyu et.al. 2508.00298 null
2025-08-01 Towards Robust Semantic Correspondence: A Benchmark and Insights Wenyue Chong et.al. 2508.00272 null
2025-08-05 Multimodal Referring Segmentation: A Survey Henghui Ding et.al. 2508.00265 null
2025-08-01 PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting Wentao Sun et.al. 2508.00259 null
2025-08-01 Weakly Supervised Intracranial Aneurysm Detection and Segmentation in MR angiography via Multi-task UNet with Vesselness Prior Erin Rainville et.al. 2508.00235 null
2025-07-31 Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs Bhavya Goyal et.al. 2508.00169 null
2025-07-31 GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation Tomasz Szczepański et.al. 2508.00155 null
2025-07-31 Stress-Aware Resilient Neural Training Ashkan Shakarami et.al. 2508.00098 null
2025-07-31 Punching Bag vs. Punching Person: Motion Transferability in Videos Raiyaan Abdullah et.al. 2508.00085 null
2025-07-31 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang et.al. 2507.23785 null
2025-07-31 Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions Li Siyao et.al. 2507.23778 null
2025-07-31 SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting Di Li et.al. 2507.23772 null
2025-08-05 Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic Liu Li et.al. 2507.23763 null
2025-07-31 Enhanced Velocity Field Modeling for Gaussian Video Reconstruction Zhenyang Li et.al. 2507.23704 null
2025-07-31 Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents Shaofei Cai et.al. 2507.23698 null
2025-07-31 High-resolution eikonal imaging and uncertainty quantification of the Kilauea caldera Angela F. Gao et.al. 2507.23692 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes Xiaohan Li et.al. 2507.23677 null
2025-07-31 DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation Yuchen Zhou et.al. 2507.23599 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization Maxime Pietrantoni et.al. 2507.23569 null
2025-07-31 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection Yung-Hsu Yang et.al. 2507.23567 null
2025-08-01 H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation Hongzhe Bi et.al. 2507.23523 null
2025-07-31 Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Mutian Xu et.al. 2507.23483 null
2025-07-31 FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Donghyun Lee et.al. 2507.23480 null
2025-07-31 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding Ting Huang et.al. 2507.23478 null
2025-07-31 NeRF Is a Valuable Assistant for 3D Gaussian Splatting Shuangkang Fang et.al. 2507.23374 null
2025-07-31 MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting Xingyue Peng et.al. 2507.23340 null
2025-08-01 Training-free Geometric Image Editing on Diffusion Models Hanshen Zhu et.al. 2507.23300 null
2025-07-31 iLRM: An Iterative Large 3D Reconstruction Model Gyeongjin Kang et.al. 2507.23277 null
2025-07-31 GSFusion: Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting Jaeseok Park et.al. 2507.23273 null
2025-07-31 Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2 Solha Kang et.al. 2507.23272 null
2025-07-30 Details Matter for Indoor Open-vocabulary 3D Instance Segmentation Sanghun Jung et.al. 2507.23134 null
2025-07-30 Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation Zheyuan Zhang et.al. 2507.23110 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-07-30 Adaptive Time-step Training for Enhancing Spike-Based Neural Radiance Fields Ranxi Lin et.al. 2507.23033 null
2025-07-30 Learning to Prune Branches in Modern Tree-Fruit Orchards Abhinav Jain et.al. 2507.23015 null
2025-07-30 Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction Zhensheng Yuan et.al. 2507.23006 null
2025-07-30 Viser: Imperative, Web-based 3D Visualization in Python Brent Yi et.al. 2507.22885 null
2025-07-30 DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Qingcheng Zhao et.al. 2507.22825 null
2025-07-30 Wall Shear Stress Estimation in Abdominal Aortic Aneurysms: Towards Generalisable Neural Surrogate Models Patryk Rygiel et.al. 2507.22817 null
2025-07-30 Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques Weide Liu et.al. 2507.22791 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks Hang Su et.al. 2507.22733 null
2025-07-30 Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints Thuy Tran et.al. 2507.22699 null
2025-07-30 Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation Hongbin Lin et.al. 2507.22668 null
2025-07-30 trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images MohammadAmin Alamalhoda et.al. 2507.22635 null
2025-07-30 Estimating 2D Camera Motion with Hybrid Motion Basis Haipeng Li et.al. 2507.22480 null
2025-07-30 UAVScenes: A Multi-Modal Dataset for UAVs Sijie Wang et.al. 2507.22412 null
2025-07-30 UFV-Splatter: Pose-Free Feed-Forward 3D Gaussian Splatting Adapted to Unfavorable Views Yuki Fujimura et.al. 2507.22342 null
2025-07-30 A Segmentation Framework for Accurate Diagnosis of Amyloid Positivity without Structural Images Penghan Zhu et.al. 2507.22336 null
2025-07-29 Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception Christian Ellis et.al. 2507.22194 null
2025-07-29 Enhancing efficiency in paediatric brain tumour segmentation using a pathologically diverse single-center clinical dataset A. Piffer et.al. 2507.22152 null
2025-07-29 Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos Ziren Gong et.al. 2507.22052 null
2025-07-29 ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports Mohammed Baharoon et.al. 2507.22030 null
2025-07-29 Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images Yutao Hu et.al. 2507.22024 null
2025-07-29 XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation Raju Ningappa Mulawade et.al. 2507.22020 null
2025-07-29 DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments Yufei Jia et.al. 2507.21981 null
2025-07-29 PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction Jiahui Ren et.al. 2507.21960 null
2025-07-31 MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors Shouyi Lu et.al. 2507.21872 null
2025-07-29 VidFuncta: Towards Generalizable Neural Representations for Ultrasound Videos Julia Wolleb et.al. 2507.21863 null
2025-07-29 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels HunyuanWorld Team et.al. 2507.21809 null
2025-07-29 AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion Zhishu Liu et.al. 2507.21778 null
2025-07-29 Multi-UAV Deployment in Obstacle-Cluttered Environments with LOS Connectivity Yuda Chen et.al. 2507.21772 null
2025-07-30 No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering Linye Wei et.al. 2507.21572 null
2025-07-29 Multi-View Reconstruction with Global Context for 3D Anomaly Detection Yihan Sun et.al. 2507.21555 null
2025-07-29 LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments Junhao Chen et.al. 2507.21517 null
2025-07-29 ST-DAI: Single-shot 2.5D Spatial Transcriptomics with Intra-Sample Domain Adaptive Imputation for Cost-efficient 3D Reconstruction Jiahe Qian et.al. 2507.21516 null
2025-07-29 BANG: Dividing 3D Assets via Generative Exploded Dynamics Longwen Zhang et.al. 2507.21493 null
2025-07-29 Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval Zhichuan Wang et.al. 2507.21489 null
2025-07-28 Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View Zitong Zhang et.al. 2507.21371 null
2025-08-03 Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy Jicheng Yuan et.al. 2507.21358 null
2025-07-28 DEM-NeRF: A Neuro-Symbolic Method for Scientific Discovery through Physics-Informed Simulation Wenkai Tan et.al. 2507.21350 null
2025-07-28 GLCP: Global-to-Local Connectivity Preservation for Tubular Structure Segmentation Feixiang Zhou et.al. 2507.21328 null
2025-07-28 VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction Martin de La Gorce et.al. 2507.21311 null
2025-07-28 Fluidically Innervated Lattices Make Versatile and Durable Tactile Sensors Annan Zhang et.al. 2507.21225 null
2025-08-03 Reconstructing 4D Spatial Intelligence: A Survey Yukang Cao et.al. 2507.21045 null
2025-07-28 GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction Tianhao Li et.al. 2507.20963 null
2025-07-28 $S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping Ruoyu Fan et.al. 2507.20854 null
2025-07-28 An Efficient Machine Learning Framework for Forest Height Estimation from Multi-Polarimetric Multi-Baseline SAR data Francesca Razzano et.al. 2507.20798 null
2025-07-28 KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video Zhuoer Yin et.al. 2507.20763 null
2025-07-28 Methods for the Segmentation of Reticular Structures Using 3D LiDAR Data: A Comparative Evaluation Francisco J. Soler Mora et.al. 2507.20589 null
2025-07-28 M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast Jiacheng Lu et.al. 2507.20582 null
2025-07-28 Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation Hyung Kyu Kim et.al. 2507.20568 null
2025-07-28 MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization Hyung Kyu Kim et.al. 2507.20562 null
2025-07-28 Uni-Mapper: Unified Mapping Framework for Multi-modal LiDARs in Complex and Dynamic Environments Gilhwan Kang et.al. 2507.20538 null
2025-07-28 Enhancing Spatial Reasoning through Visual and Textual Thinking Xun Liang et.al. 2507.20529 null
2025-07-28 GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections Haiyang Bai et.al. 2507.20512 null
2025-07-28 Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features Shiyang Liu et.al. 2507.20480 null
2025-07-29 From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos Chenjian Gao et.al. 2507.20331 null
2025-07-27 Decomposing Densification in Gaussian Splatting for Faster 3D Scene Reconstruction Binxiao Huang et.al. 2507.20239 null
2025-07-27 NeuroVoxel-LM: Language-Aligned 3D Perception via Dynamic Voxelization and Meta-Embedding Shiyu Liu et.al. 2507.20110 null
2025-07-26 High-Speed Event Vision-Based Tactile Roller Sensor for Large Surface Measurements Akram Khairi et.al. 2507.19914 null
2025-07-30 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-26 Taking Language Embedded 3D Gaussian Splatting into the Wild Yuze Wang et.al. 2507.19830 null
2025-07-25 GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting David Bauer et.al. 2507.19718 null
2025-07-25 DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations Ziren Gong et.al. 2507.19474 null
2025-07-25 Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization Pol Francesch Huc et.al. 2507.19459 null
2025-07-25 NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography Kirsten W. H. Maas et.al. 2507.19328 null
2025-07-25 3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering Wei-Hsing Huang et.al. 2507.19133 null
2025-07-25 Gaussian Set Surface Reconstruction through Per-Gaussian Optimization Zhentao Huang et.al. 2507.18923 null
2025-07-24 SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time Yun Chen et.al. 2507.18713 null
2025-07-24 Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping Chong Cheng et.al. 2507.18541 null
2025-07-24 G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM Gyuhyeon Pak et.al. 2507.18344 null
2025-07-24 LONG3R: Long Sequence Streaming 3D Reconstruction Zhuoguang Chen et.al. 2507.18255 null
2025-07-24 PS-GS: Gaussian Splatting for Multi-View Photometric Stereo Yixiao Chen et.al. 2507.18231 null
2025-07-24 High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details Jun Zhou et.al. 2507.18023 null
2025-07-24 Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners Kostas Karakontis et.al. 2507.17519 null
2025-07-23 Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field Yuzhe Zhu et.al. 2507.17351 null
2025-07-23 Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting Hyeongmin Lee et.al. 2507.17336 null
2025-07-24 PolarAnything: Diffusion-based Polarimetric Image Synthesis Kailong Zhang et.al. 2507.17268 null
2025-07-22 StreamME: Simplify 3D Gaussian Avatar within Live Stream Luchuan Song et.al. 2507.17029 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-22 Sparse-View 3D Reconstruction: Recent Advances and Open Challenges Tanveer Younis et.al. 2507.16406 null
2025-07-22 Dens3R: A Foundation Model for 3D Geometry Prediction Xianze Fang et.al. 2507.16290 null
2025-07-22 LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images Guichen Huang et.al. 2507.16144 null
2025-07-21 Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS Jisu Shin et.al. 2507.15748 null
2025-07-21 DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting Hung Nguyen et.al. 2507.15690 null
2025-07-21 Hi$^2$-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing Boni Hu et.al. 2507.15683 null
2025-07-21 Gaussian Splatting with Discretized SDF for Relightable Assets Zuo-Liang Zhu et.al. 2507.15629 null
2025-07-28 SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting Zihui Gao et.al. 2507.15602 null
2025-07-21 ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting Ruijie Zhu et.al. 2507.15454 null
2025-07-25 GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing Minnan Pei et.al. 2507.15300 null
2025-07-20 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline Kaishva Chintan Shah et.al. 2507.14924 null
2025-07-20 Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction Xiufeng Huang et.al. 2507.14921 null
2025-07-20 An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks Xinyi Wu et.al. 2507.14798 null
2025-07-30 Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey Jiahui Zhang et.al. 2507.14501 null
2025-07-19 Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation Han Gong et.al. 2507.14454 null
2025-07-19 Adaptive 3D Gaussian Splatting Video Streaming Han Gong et.al. 2507.14432 null
2025-08-01 C-DOG: Multi-View Multi-instance Feature Association Using Connected δ-Overlap Graphs Yung-Hong Sun et.al. 2507.14095 null
2025-07-18 TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views Hsiang-Hui Hung et.al. 2507.13929 null
2025-07-18 Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading Efstratios Geronikolakis et.al. 2507.13917 null
2025-07-21 PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations Yu Wei et.al. 2507.13891 null
2025-07-18 EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation Seungjun Moon et.al. 2507.13648 null
2025-07-18 Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation Masahiro Ogawa et.al. 2507.13628 null
2025-07-19 AutoPartGen: Autogressive 3D Part Generation and Discovery Minghao Chen et.al. 2507.13346 null
2025-07-16 VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians Siyuan Yao et.al. 2507.12667 null
2025-07-16 NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting Kuangshi Ai et.al. 2507.12621 null
2025-07-21 Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition Beizhen Zhao et.al. 2507.12498 null
2025-07-19 SpatialTrackerV2: 3D Point Tracking Made Easy Yuxi Xiao et.al. 2507.12462 null
2025-07-16 Revealing the Ancient Beauty: Digital Reconstruction of Temple Tiles using Computer Vision Arkaprabha Basu et.al. 2507.12195 null
2025-07-16 DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi Navid Hasanzadeh et.al. 2507.12132 null
2025-07-16 BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images Davide Di Nucci et.al. 2507.12095 null
2025-07-16 SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation Beining Xu et.al. 2507.12027 null
2025-07-16 HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing Tielong Wang et.al. 2507.11971 null
2025-07-16 Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark Jingqian Wu et.al. 2507.11931 null
2025-07-16 CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning Peiwen Xia et.al. 2507.11834 null
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-21 Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Hayeon Kim et.al. 2507.11061 null
2025-07-14 ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions Shivangi Aneja et.al. 2507.10542 null
2025-07-14 Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry Geyou Zhang et.al. 2507.10009 null
2025-07-19 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. 2507.09993 null
2025-07-14 VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling Zihang Zeng et.al. 2507.09987 null
2025-07-11 From images to properties: a NeRF-driven framework for granular material parameter inversion Cheng-Hsi Hsiao et.al. 2507.09005 null
2025-07-11 An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan Mengyuan Liu et.al. 2507.08690 null
2025-07-11 Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance Gábor Baranyi et.al. 2507.08624 null
2025-07-11 Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT Wei Zhang et.al. 2507.08448 null
2025-07-11 RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting Ji Hyun Seo et.al. 2507.08434 null
2025-07-11 CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations Wenbo Cui et.al. 2507.08262 null
2025-07-10 Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction Hyungjun Doh et.al. 2507.08137 null
2025-07-18 RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Chong Cheng et.al. 2507.08136 null
2025-07-10 Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions Longfei Li et.al. 2507.07978 null
2025-07-10 RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection Yongyang Zhou et.al. 2507.07733 null

Diffusion

Publish Date Title Authors PDF Code
2025-10-07 Fine-grained Defocus Blur Control for Generative Image Models Ayush Shrivastava et.al. 2510.06215 null
2025-10-07 Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models Jiahao Wang et.al. 2510.06209 null
2025-10-07 On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond Chenxiao Yang et.al. 2510.06190 null
2025-10-07 Thermodynamic Performance Limits for Score-Based Diffusion Models Nathan X. Kodama et.al. 2510.06174 null
2025-10-07 Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images Aditya Prakash et.al. 2510.06145 null
2025-10-07 CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits Kangyu Wang et.al. 2510.06133 null
2025-10-07 Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation Jiawei Mao et.al. 2510.06131 null
2025-10-07 Phase-induced switching of ferromagnetic insulators in Josephson spin valves A. A. Mazanik et.al. 2510.06109 null
2025-10-07 Complete Synchronization and Pattern Selection through Amplitude Dynamics and Diffusion in Heterogeneous Oscillatory Media Nicolas Thomé et.al. 2510.06083 null
2025-10-07 Mechanistic-statistical inference of mosquito dynamics from mark-release-recapture data Nga Nguyen et.al. 2510.06080 null
2025-10-07 Controllable Audio-Visual Viewpoint Generation from 360° Spatial Information Christian Marinoni et.al. 2510.06060 null
2025-10-07 Edit-Based Flow Matching for Temporal Point Processes David Lüdke et.al. 2510.06050 null
2025-10-07 The gamma-ray emission from Radio Galaxies and their contribution to the Isotropic Gamma-Ray Background A. Circiello et.al. 2510.06047 null
2025-10-07 Emergent Directedness in Social Contagion Fabian Tschofenig et.al. 2510.06012 null
2025-10-07 ECTSpeech: Enhancing Efficient Speech Synthesis via Easy Consistency Tuning Tao Zhu et.al. 2510.05984 null
2025-10-07 Diffusion-Based Image Editing for Breaking Robust Watermarks Yunyi Ni et.al. 2510.05978 null
2025-10-07 Diffusion Models for Low-Light Image Enhancement: A Multi-Perspective Taxonomy and Performance Analysis Eashan Adhikarla et.al. 2510.05976 null
2025-10-07 Quantum Lattice Boltzmann Method for Multiple Time Steps Without Reinitialization for Linear Advection-Diffusion Problems Aaron Nagel et.al. 2510.05965 null
2025-10-07 $\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection Yanran Zhang et.al. 2510.05891 null
2025-10-07 Dynamics of Choline Chloride based Deep Eutectic Solvents: Neutron Scattering Study Rinesh T. et.al. 2510.05882 null
2025-10-07 The Safety Challenge of World Models for Embodied AI Agents: A Review Lorenzo Baraldi et.al. 2510.05865 null
2025-10-07 FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders Riccardo Fosco Gramaccioni et.al. 2510.05829 null
2025-10-07 StereoSync: Spatially-Aware Stereo Audio Generation from Video Christian Marinoni et.al. 2510.05828 null
2025-10-07 First experimental measurements of biophotons from Astrocytes and Glioblastoma cell cultures L. De Paolis et.al. 2510.05792 null
2025-10-07 Models of topological barriers and molecular motors of bacterial DNA Marc Joyeux et.al. 2510.05790 null
2025-10-07 New Insights into Involutory and Orthogonal MDS Matrices Yogesh Kumar et.al. 2510.05766 null
2025-10-07 RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases Lang Qin et.al. 2510.05764 null
2025-10-07 Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis Sedat Dogan et.al. 2510.05761 null
2025-10-07 Vipera: Blending Visual and LLM-Driven Guidance for Systematic Auditing of Text-to-Image Generative AI Yanwei Huang et.al. 2510.05742 null
2025-10-07 Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies Chunsan Hong et.al. 2510.05725 null
2025-10-07 Data Factory with Minimal Human Effort Using VLMs Jiaojiao Ye et.al. 2510.05722 null
2025-10-07 DiffSDA: Unsupervised Diffusion Sequential Disentanglement Across Modalities Hedi Zisling et.al. 2510.05717 null
2025-10-07 AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models Shihao Zhu et.al. 2510.05715 null
2025-10-07 Hedging of exotic options in Hawkes jump-diffusion models by Malliavin calculus Ayub Ahmadi et.al. 2510.05689 null
2025-10-07 When and How to Cut Classical Concerts? A Multimodal Automated Video Editing Approach Daniel Gonzálbez-Biosca et.al. 2510.05661 null
2025-10-07 Teleportraits: Training-Free People Insertion into Any Scene Jialu Gao et.al. 2510.05660 null
2025-10-07 Beyond Spectral Peaks: Interpreting the Cues Behind Synthetic Image Detection Sara Mandelli et.al. 2510.05633 null
2025-10-07 Generative AI-Driven Hierarchical Multi-Agent Framework for Zero-Touch Optical Networks Yao Zhang et.al. 2510.05625 null
2025-10-07 PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction Ziqiao Meng et.al. 2510.05613 null
2025-10-07 Efficient Conditional Generation on Scale-based Visual Autoregressive Models Jiaqi Liu et.al. 2510.05610 null
2025-10-07 Improving Chain-of-Thought Efficiency for Autoregressive Image Generation Zeqi Gu et.al. 2510.05593 null
2025-10-07 Probing orbital currents through inverse orbital Hall and Rashba effects E. Santos et.al. 2510.05543 null
2025-10-07 Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation Sam Sartor et.al. 2510.05532 null
2025-10-07 Be Tangential to Manifold: Discovering Riemannian Metric for Diffusion Models Shinnosuke Saito et.al. 2510.05509 null
2025-10-07 High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training Zhuoyi Huang et.al. 2510.05492 null
2025-10-06 Surface Excess Energy Governs the Non-Monotonic Behavior of Active Diffusivity with Activity A. Arango-Restrepo et.al. 2510.05435 null
2025-10-06 See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models Kebin Contreras et.al. 2510.05408 null
2025-10-06 LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation Yang Xiao et.al. 2510.05367 null
2025-10-06 Mitigating Diffusion Model Hallucinations with Dynamic Guidance Kostas Triaridis et.al. 2510.05356 null
2025-10-06 Domain Decomposition-Based Coupling of High-Fidelity Finite Element and Reduced Order Operator Inference Models Using the Schwarz Alternating Method Ian Moore et.al. 2510.05350 null
2025-10-06 A System Level Approach to LQR Control of the Diffusion Equation Addie McCurdy et.al. 2510.05345 null
2025-10-06 Learning the detector in optical tomography Zijian Wang et.al. 2510.05341 null
2025-10-06 Machine Learning Interatomic Potentials Enable Molecular Dynamics Simulations of Doped MoS2 Abrar Faiyad et.al. 2510.05339 null
2025-10-06 Resonance with quasinormal modes in long-range kinks’ collisions J. G. F. Campos et.al. 2510.05311 null
2025-10-06 Scalarized Hot Neutron Stars Containing Hyperons and $Δ$-Resonances in Different Evolution Regimes Fahimeh Rahimi et.al. 2510.05302 null
2025-10-06 A Data-Driven Prism: Multi-View Source Separation with Diffusion Model Priors Sebastian Wagner-Carena et.al. 2510.05205 null
2025-10-06 Paper2Video: Automatic Video Generation from Scientific Papers Zeyu Zhu et.al. 2510.05096 null
2025-10-06 VChain: Chain-of-Visual-Thought for Reasoning in Video Generation Ziqi Huang et.al. 2510.05094 null
2025-10-06 Character Mixing for Video Generation Tingting Liao et.al. 2510.05093 null
2025-10-06 Factuality Matters: When Image Generation and Editing Meet Structured Visuals Le Zhuo et.al. 2510.05091 null
2025-10-06 Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models Runchu Tian et.al. 2510.05090 null
2025-10-06 SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder Ronen Kamenetsky et.al. 2510.05081 null
2025-10-06 SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Dachuan Shi et.al. 2510.05069 null
2025-10-06 Spectral Properties of Anomalous Microwave Emission in 144 Galactic Clouds Roke Cepeda-Arroita et.al. 2510.05067 null
2025-10-06 StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation Mingyu Liu et.al. 2510.05057 null
2025-10-06 No-reference Quality Assessment of Contrast-distorted Images using Contrast-enhanced Pseudo Reference Mohammad-Ali Mahmoudpour et.al. 2510.05053 null
2025-10-06 Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts Jihoon Lee et.al. 2510.05040 null
2025-10-06 Graph-Aware Diffusion for Signal Generation Sergio Rozada et.al. 2510.05036 null
2025-10-06 Comparing fine-tuning strategies of MACE machine learning force field for modeling Li-ion diffusion in LiF for batteries Nada Alghamdi et.al. 2510.05020 null
2025-10-06 Bridging Text and Video Generation: A Survey Nilay Kumar et.al. 2510.04999 null
2025-10-06 SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization Théophane Vallaeys et.al. 2510.04961 null
2025-10-06 Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion Xin Li et.al. 2510.04947 null
2025-10-06 Steady-State Spread Bounds for Graph Diffusion via Laplacian Regularisation Ardavan Rahimian et.al. 2510.04924 null
2025-10-06 Effect of ice nucleating proteins on the structure-property relationships of ice: A molecular dynamics study A. K. Shargh et.al. 2510.04892 null
2025-10-06 Flow-Matching Based Refiner for Molecular Conformer Generation Xiangyang Xu et.al. 2510.04878 null
2025-10-06 Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails Siwei Han et.al. 2510.04860 null
2025-10-06 Efficient structure-preserving scheme for chemotaxis PDEs with singular sensitivity in crime and epidemic modeling Rui Wang et.al. 2510.04826 null
2025-10-06 Did you just see that? Arbitrary view synthesis for egocentric replay of operating room workflows from ambient sensors Han Zhang et.al. 2510.04802 null
2025-10-06 A behavioral reinvestigation of the effect of long ties on social contagions Luca Lazzaro et.al. 2510.04785 null
2025-10-06 ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs Wonjun Kang et.al. 2510.04767 null
2025-10-06 Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba Baher Mohammad et.al. 2510.04738 null
2025-10-06 Sub-Gaussian heat kernel estimates for reflected diffusion on inner uniform domains Riku Anttila et.al. 2510.04725 null
2025-10-06 BGRem: A background noise remover for astronomical images based on a diffusion model R. Nicolaas et.al. 2510.04718 null
2025-10-06 ReactDiff: Fundamental Multiple Appropriate Facial Reaction Diffusion Model Luo Cheng et.al. 2510.04712 null
2025-10-06 ID-Consistent, Precise Expression Generation with Blendshape-Guided Diffusion Foivos Paraperas Papantoniou et.al. 2510.04706 null
2025-10-06 ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement Habin Lim et.al. 2510.04668 null
2025-10-06 Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents Zeyi Zhang et.al. 2510.04637 null
2025-10-06 The Role of Acoustic Instability in Cosmic-Ray Self-Confinement Antonio Capanema et.al. 2510.04635 null
2025-10-06 Exploring the Power of Diffusion Large Language Models for Software Engineering: An Empirical Investigation Jingyao Zhang et.al. 2510.04605 null
2025-10-06 Investigating into mechanisms of high temperature strength of refractory high-entropy alloys Sai Anandhi Seetharaman et.al. 2510.04589 null
2025-10-06 Improved probabilistic regression using diffusion models Carlo Kneissl et.al. 2510.04583 null
2025-10-07 Constrained Dikin-Langevin diffusion for polyhedra James Chok et.al. 2510.04582 null
2025-10-06 Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers Juncheng Wang et.al. 2510.04577 null
2025-10-06 SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator Yuhta Takida et.al. 2510.04576 null
2025-10-07 LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning Haoqiang Kang et.al. 2510.04573 null
2025-10-06 3Dify: a Framework for Procedural 3D-CG Generation Assisted by LLMs Using MCP and RAG Shun-ichiro Hayashi et.al. 2510.04536 null
2025-10-06 TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling Hyunmin Cho et.al. 2510.04533 null
2025-10-06 Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion Satoshi Hayakawa et.al. 2510.04525 null
2025-10-06 Toward a Unified Geometry Understanding: Riemannian Diffusion Framework for Graph Generation and Prediction Yisen Gao et.al. 2510.04522 null
2025-10-06 Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation Zijing Hu et.al. 2510.04504 null
2025-10-06 Non-Monotone Traveling Waves of the Weak Competition Lotka-Volterra System Chiun-Chuan Chen et.al. 2510.04501 null
2025-10-06 Identifying non-equilibrium fluctuations in Intracellular Motion Using Recurrent Neural Networks Tomas Basile et.al. 2510.04485 null
2025-10-06 TBStar-Edit: From Image Editing Pattern Shifting to Consistency Enhancement Hao Fang et.al. 2510.04483 null
2025-10-06 A Diffusion-based Generative Machine Learning Paradigm for Contingency Screening Quan Tran et.al. 2510.04470 null
2025-10-06 REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization Qiyuan He et.al. 2510.04450 null
2025-10-06 Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size Farid Bozorgnia et.al. 2510.04440 null
2025-10-06 spd-metrics-id: A Python Package for SPD-Aware Distance Metrics in Connectome Fingerprinting and Beyond Kaosar Uddin et.al. 2510.04438 null
2025-10-06 PAD-TRO: Projection-Augmented Diffusion for Direct Trajectory Optimization Jushan Chen et.al. 2510.04436 null
2025-10-05 On the Origin of Carrier Loss in Mg-Doped N-Polar GaN Masahiro Kamiyama et.al. 2510.04381 null
2025-10-05 Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction Yuhao Luo et.al. 2510.04365 null
2025-10-05 Score-based generative emulation of impact-relevant Earth system model outputs Shahine Bouabid et.al. 2510.04358 null
2025-10-05 Reliable and Scalable Robot Policy Evaluation with Imperfect Simulators Apurva Badithela et.al. 2510.04354 null
2025-10-05 On strong solution of a multidimensional SDE: extension of Yamada – Watanabe’s theorem A. A. Lyappieva et.al. 2510.04329 null
2025-10-05 FoilDiff: A Hybrid Transformer Backbone for Diffusion-based Modelling of 2D Airfoil Flow Fields Kenechukwu Ogbuagu et.al. 2510.04325 null
2025-10-05 ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation Jay Zhangjie Wu et.al. 2510.04290 null
2025-10-05 The best performance in the CARE 2025 – Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation Jincan Lou et.al. 2510.04243 null
2025-10-05 Diffusion-Assisted Distillation for Self-Supervised Graph Representation Learning with MLPs Seong Jin Ahn et.al. 2510.04241 null
2025-10-05 Flexible Locomotion Learning with Diffusion Model Predictive Control Runhan Huang et.al. 2510.04234 null
2025-10-05 MASC: Boosting Autoregressive Image Generation with a Manifold-Aligned Semantic Clustering Lixuan He et.al. 2510.04220 null
2025-10-05 World-To-Image: Grounding Text-to-Image Generation with Agent-Driven World Knowledge Moo Hyun Son et.al. 2510.04201 null
2025-10-05 Let Features Decide Their Own Solvers: Hybrid Feature Caching for Diffusion Transformers Shikang Zheng et.al. 2510.04188 null
2025-10-05 Relief of EGFR/FOS-downregulated miR-103a by loganin alleviates NF-kappaB-triggered inflammation and gut barrier disruption in colitis Yan Li et.al. 2510.04176 null
2025-10-05 Drax: Speech Recognition with Discrete Flow Matching Aviv Navon et.al. 2510.04162 null
2025-10-05 GDiffuSE: Diffusion-based speech enhancement with noise model guidance Efrayim Yanir et.al. 2510.04157 null
2025-10-05 ObCLIP: Oblivious CLoud-Device Hybrid Image Generation with Privacy Preservation Haoqi Wu et.al. 2510.04153 null
2025-10-05 Self Speculative Decoding for Diffusion Large Language Models Yifeng Gao et.al. 2510.04147 null
2025-10-05 Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models Minseo Kim et.al. 2510.04146 null
2025-10-05 Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation Seunghyun Lee et.al. 2510.04125 null
2025-10-07 Harnessing LLM for Noise-Robust Cognitive Diagnosis in Web-Based Intelligent Education Systems Guixian Zhang et.al. 2510.04093 null
2025-10-05 What Makes Diffusion Language Models Super Data Learners? Zitian Gao et.al. 2510.04071 null
2025-10-05 Diffusion Low Rank Hybrid Reconstruction for Sparse View Medical Imaging Zongyin Deng et.al. 2510.04069 null
2025-10-05 Approaching the scaling limit of transport through lattices with dephasing Subhajit Sarkar et.al. 2510.04062 null
2025-10-05 Variational Diffusion Unlearning: A Variational Inference Framework for Unlearning in Diffusion Models under Data Constraints Subhodip Panda et.al. 2510.04058 null
2025-10-05 Prompt-to-Prompt: Text-Based Image Editing Via Cross-Attention Mechanisms – The Research of Hyperparameters and Novel Mechanisms to Enhance Existing Frameworks Linn Bieske et.al. 2510.04034 null
2025-10-05 Principled and Tractable RL for Reasoning with Diffusion Language Models Anthony Zhan et.al. 2510.04019 null
2025-10-05 Dual Pruning and Sorting-Free Overestimation for Average-Utility Sequential Pattern Mining Kai Cao et.al. 2510.04014 null
2025-10-05 Optimal estimation of a factorizable density using diffusion models with ReLU neural networks Jianqing Fan et.al. 2510.03994 null
2025-10-05 Long time evolution of a pair of 2D viscous point vortices Ping Zhang et.al. 2510.03991 null
2025-10-04 A discrete data assimilation algorithm for the reconstruction of Gray–Scott dynamics Tsiry Avisoa Randrianasolo et.al. 2510.03972 null
2025-10-04 Global weak martingale solutions to a stochastic two-sidedly degenerate aggregation-diffusion equation issued from biology Mostafa Bendahmane et.al. 2510.03947 null
2025-10-04 Super-resolution image projection over an extended depth of field using a diffractive decoder Hanlong Chen et.al. 2510.03938 null
2025-10-04 Self-Speculative Masked Diffusions Andrew Campbell et.al. 2510.03929 null
2025-10-04 High-order, Compact, and Symmetric Finite Difference Methods for a $d$-Dimensional Hypercube Qiwei Feng et.al. 2510.03927 null
2025-10-04 Generating Human Motion Videos using a Cascaded Text-to-Video Framework Hyelin Nam et.al. 2510.03909 null
2025-10-04 Rare Text Semantics Were Always There in Your Diffusion Transformer Seil Kang et.al. 2510.03886 null
2025-10-04 Adversarial Agent Collaboration for C to Rust Translation Tianyu Li et.al. 2510.03879 null
2025-10-04 PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis Saja Al-Dabet et.al. 2510.03873 null
2025-10-04 SDAKD: Student Discriminator Assisted Knowledge Distillation for Super-Resolution Generative Adversarial Networks Nikolaos Kaparinos et.al. 2510.03870 null
2025-10-04 Mirage: Unveiling Hidden Artifacts in Synthetic Images with Large Vision-Language Models Pranav Sharma et.al. 2510.03840 null
2025-10-04 Proximal Diffusion Neural Sampler Wei Guo et.al. 2510.03824 null
2025-10-04 Contrastive-SDE: Guiding Stochastic Differential Equations with Contrastive Learning for Unpaired Image-to-Image Translation Venkata Narendra Kotyada et.al. 2510.03821 null
2025-10-04 Diverse Text-to-Image Generation via Contrastive Noise Optimization Byungjun Kim et.al. 2510.03813 null
2025-10-04 A Variational Method for Conformable Fractional Equations Using Rank-One Updates Maatank Parashar et.al. 2510.03778 null
2025-10-04 Bridging the Gap Between Multimodal Foundation Models and World Models Xuehai He et.al. 2510.03727 null
2025-10-04 Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models Leander Girrbach et.al. 2510.03721 null
2025-10-04 Non-negative diffusion bridge of the McKean-Vlasov type: analysis of singular diffusion and application to fish migration Hidekazu Yoshioka et.al. 2510.03692 null
2025-10-03 Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner Cai Zhou et.al. 2510.03206 null
2025-10-03 Memory Forcing: Spatio-Temporal Memory for Consistent Scene Generation on Minecraft Junchao Huang et.al. 2510.03198 null
2025-10-03 Product-Quantised Image Representation for High-Quality Image Synthesis Denis Zavadski et.al. 2510.03191 null
2025-10-03 HESS J1831$-$098 – Exploring a pulsar halo scenario with H.E.S.S. data Karim Sabri et.al. 2510.03183 null
2025-10-03 UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization Qing Huang et.al. 2510.03161 null
2025-10-03 Mask2IV: Interaction-Centric Video Generation via Mask Trajectories Gen Li et.al. 2510.03135 null
2025-10-03 HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion Shiyi Zhang et.al. 2510.03122 null
2025-10-03 Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction Kaisi Guan et.al. 2510.03117 null
2025-10-03 GeoComplete: Geometry-Aware Diffusion for Reference-Driven Image Completion Beibei Lin et.al. 2510.03110 null
2025-10-03 Deciphering the radio-star formation correlation on kpc scales. IV. Radio halos of highly-inclined Virgo cluster spiral galaxies B. Vollmer et.al. 2510.03098 null
2025-10-03 Distilled Protein Backbone Generation Liyang Xie et.al. 2510.03095 null
2025-10-03 Latent Diffusion Unlearning: Protecting Against Unauthorized Personalization Through Trajectory Shifted Perturbations Naresh Kumar Devulapally et.al. 2510.03089 null
2025-10-03 What Drives Compositional Generalization in Visual Generative Models? Karim Farid et.al. 2510.03075 null
2025-10-03 Self-consistent model of cosmic ray penetration into molecular clouds: Effect of energy losses D. O. Chernyshov et.al. 2510.03073 null
2025-10-03 Rogue waves in extended Gross-Pitaevskii Models with a Lee-Huang-Yang correction Sathyanarayanan Chandramouli et.al. 2510.03063 null
2025-10-03 When and Where do Events Switch in Multi-Event Video Generation? Ruotong Liao et.al. 2510.03049 null
2025-10-03 Physics-Constrained Inc-GAN for Tunnel Propagation Modeling from Sparse Line Measurements Yang Zhou et.al. 2510.03019 null
2025-10-03 Learning Robust Diffusion Models from Imprecise Supervision Dong-Dong Wu et.al. 2510.03016 null
2025-10-03 3D-CovDiffusion: 3D-Aware Diffusion Policy for Coverage Path Planning Chenyuan Chen et.al. 2510.03011 null
2025-10-03 TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency Juntong Wang et.al. 2510.02987 null
2025-10-03 Multi-faceted light pollution modelling and its application to the decline of artificial illuminance in France Rolf Buhler et.al. 2510.02977 null
2025-10-03 Long-Time Analysis of Stochastic Heavy Ball Dynamics for Convex Optimization and Monotone Equations Radu Ioan Bot et.al. 2510.02951 null
2025-10-03 Stationarity preserving nodal Finite Element methods for multi-dimensional linear hyperbolic balance laws via a Global Flux quadrature formulation Wasilij Barsukow et.al. 2510.02928 null
2025-10-03 Probing a theoretical framework for a Photonic Extreme Learning Machine Vicente Rocha et.al. 2510.02918 null
2025-10-03 SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos Amir Dellali et.al. 2510.02916 null
2025-10-03 DMark: Order-Agnostic Watermarking for Diffusion Large Language Models Linyu Wu et.al. 2510.02902 null
2025-10-03 Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models Tianren Ma et.al. 2510.02880 null
2025-10-03 Dust scattering halo of 4U 1630-47: High resolution X-ray and mm observations constrain source and molecular cloud distances E. Kalemci et.al. 2510.02879 null
2025-10-03 Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech Hieu-Nghia Huynh-Nguyen et.al. 2510.02848 null
2025-10-03 TridentServe: A Stage-level Serving System for Diffusion Pipelines Yifei Xia et.al. 2510.02838 null
2025-10-03 Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise Steve Hong et.al. 2510.02826 null
2025-10-03 PromptMap: Supporting Exploratory Text-to-Image Generation Yuhan Guo et.al. 2510.02814 null
2025-10-03 TeV Emission from PSR B1055-52 with HESS: Evidence for a Pulsar Halo Tina Wach et.al. 2510.02802 null
2025-10-03 SongFormer: Scaling Music Structure Analysis with Heterogeneous Supervision Chunbo Hao et.al. 2510.02797 null
2025-10-03 Periodic Event-Triggered Prescribed Time Control of Euler-Lagrange Systems under State and Input Constraints Chidre Shravista Kashyap et.al. 2510.02769 null
2025-10-03 Neural Jump ODEs as Generative Models Robert A. Crowell et.al. 2510.02757 null
2025-10-03 Wide-field GMRT imaging of X-shaped Radio-Galaxies: Spectral properties of 4C32.25 and 4C61.23 E. Retana-Montenegro et.al. 2510.02753 null
2025-10-03 Denoising and Augmentation: A Dual Use of Diffusion Model for Enhanced CSI Recovery Yupeng Li et.al. 2510.02744 null
2025-10-03 Dale meets Langevin: A Multiplicative Denoising Diffusion Model Nishanth Shetty et.al. 2510.02730 null
2025-10-03 Flow Matching for Measure Transport and Feedback Stabilization of Control-Affine Systems Karthik Elamvazhuthi et.al. 2510.02706 null
2025-10-03 RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization Kai Fukazawa et.al. 2510.02695 null
2025-10-03 Fine-Tuning Diffusion Models via Intermediate Distribution Shaping Gautham Govind Anil et.al. 2510.02692 null
2025-10-03 Ohta-Kawasaki Model Reveals Patterns on Multicomponent Vesicles Wangbo Luo et.al. 2510.02688 null
2025-10-03 Smart-GRPO: Smartly Sampling Noise for Efficient RL of Flow-Matching Models Benjamin Yu et.al. 2510.02654 null
2025-10-03 Dispersion Relations and Pole-Skipping in a Holographic Charmonium Model with Rotating Plasma Luiz F. Ferreira et.al. 2510.02647 null
2025-10-03 Deep Generative Continual Learning using Functional LoRA: FunLoRA Victor Enescu et.al. 2510.02631 null
2025-10-02 Input-Aware Sparse Attention for Real-Time Co-Speech Video Generation Beijia Lu et.al. 2510.02617 null
2025-10-02 UMI-on-Air: Embodiment-Aware Guidance for Embodiment-Agnostic Visuomotor Policies Harsh Gupta et.al. 2510.02614 null
2025-10-02 PEO: Training-Free Aesthetic Quality Enhancement in Pre-Trained Text-to-Image Diffusion Models with Prompt Embedding Optimization Hovhannes Margaryan et.al. 2510.02599 null
2025-10-02 Surface Wave Solutions in 1D and 2D for the Broer-Kaup-Boussinesq-Kupershmidt (BKBK) System Darryl D. Holm et.al. 2510.02577 null
2025-10-02 How Confident are Video Models? Empowering Video Models to Express their Uncertainty Zhiting Mei et.al. 2510.02571 null
2025-10-02 Learning Microswimmer Collision Dynamics and Predicting Diffusivities using a Neural-Network-Assisted Boltzmann Approach Haruki Hayano et.al. 2510.02559 null
2025-10-02 Stable determination of the nonlinear parameter in the non-diffusive Westervelt equation from the Dirichlet-to-Neumann map Mike Wendels et.al. 2510.02553 null
2025-10-02 Active-Learning Inspired Ab Initio Theory-Experiment Loop Approach for Management of Material Defects: Application to Superconducting Qubits Sarvesh Chaudhari et.al. 2510.02544 null
2025-10-02 Self-supervised diffusion model fine-tuning for costate initialization using Markov chain Monte Carlo Jannik Graebner et.al. 2510.02527 null
2025-10-02 Graph Generation with Spectral Geodesic Flow Matching Xikun Huang et.al. 2510.02520 null
2025-10-02 Learning a distance measure from the information-estimation geometry of data Guy Ohayon et.al. 2510.02514 null
2025-10-02 Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling Kulunu Dharmakeerthi et.al. 2510.02499 null
2025-10-02 The Entangled Feedback Impacts of Supernovae in Coarse- versus High-Resolution Galaxy Simulations Eric Zhang et.al. 2510.02432 null
2025-10-02 Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity Eric Tillmann Bill et.al. 2510.02315 null
2025-10-02 Inferring Dynamic Physical Properties from Video Foundation Models Guanqi Zhan et.al. 2510.02311 null
2025-10-02 NoiseShift: Resolution-Aware Noise Recalibration for Better Low-Resolution Image Generation Ruozhen He et.al. 2510.02307 null
2025-10-02 Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive Tyler Farghly et.al. 2510.02305 null
2025-10-02 Knowledge Distillation Detection for Open-weights Models Qin Shi et.al. 2510.02302 null
2025-10-02 Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models Runqian Wang et.al. 2510.02300 null
2025-10-02 Continual Personalization for Diffusion Models Yu-Chien Liao et.al. 2510.02296 null
2025-10-02 Test-Time Anchoring for Discrete Diffusion Posterior Sampling Litu Rout et.al. 2510.02291 null
2025-10-02 MultiModal Action Conditioned Video Generation Yichen Li et.al. 2510.02287 null
2025-10-02 Learning to Generate Object Interactions with Physics-Guided Video Diffusion David Romero et.al. 2510.02284 null
2025-10-02 Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Justin Cui et.al. 2510.02283 null
2025-10-02 Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps Kyoungjun Park et.al. 2510.02274 null
2025-10-02 Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning Tianchong Jiang et.al. 2510.02268 null
2025-10-02 NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes Shiyi Zhang et.al. 2510.02266 null
2025-10-02 DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing Zihan Zhou et.al. 2510.02253 null
2025-10-02 TempoControl: Temporal Attention Guidance for Text-to-Video Models Shira Schiber et.al. 2510.02226 null
2025-10-02 Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification Zeqi Ye et.al. 2510.02216 null
2025-10-02 DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning Hanyang Zhao et.al. 2510.02212 null
2025-10-02 Measurement-Guided Consistency Model Sampling for Inverse Problems Amirreza Tanevardi et.al. 2510.02208 null
2025-10-02 Chaotic many-body quantum dynamics, spectral correlations, and energy diffusion J. T. Chalker et.al. 2510.02198 null
2025-10-02 Uncovering Semantic Selectivity of Latent Groups in Higher Visual Cortex with Mutual Information-Guided Diffusion Yule Wang et.al. 2510.02182 null
2025-10-02 Policy Gradient Guidance Enables Test Time Control Jianing Qi et.al. 2510.02148 null
2025-10-02 FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models Karan Dua et.al. 2510.02133 null
2025-10-02 SoundReactor: Frame-level Online Video-to-Audio Generation Koichi Saito et.al. 2510.02110 null
2025-10-02 Quantum Effects or Theoretical Artifacts? A Computational Reanalysis of Hydrogen at High-Pressure Stefano Racioppi et.al. 2510.02098 null
2025-10-02 VGDM: Vision-Guided Diffusion Model for Brain Tumor Detection and Segmentation Arman Behnam et.al. 2510.02086 null
2025-10-02 Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions Zhaoyi Li et.al. 2510.02081 null
2025-10-02 Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects Georgios Kouros et.al. 2510.02069 null
2025-10-02 MSRepaint: Multiple Sclerosis Repaint with Conditional Denoising Diffusion Implicit Model for Bidirectional Lesion Filling and Synthesis Jinwei Zhang et.al. 2510.02063 null
2025-10-02 Zero-shot Human Pose Estimation using Diffusion-based Inverse solvers Sahil Bhandary Karnoor et.al. 2510.02043 null
2025-10-02 RAD@home discovery of extragalactic radio rings and odd radio circles: clues to their origins Ananda Hota et.al. 2510.01999 null
2025-10-02 $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models Yujie Zhou et.al. 2510.01982 null
2025-10-02 ZK-WAGON: Imperceptible Watermark for Image Generation Models using ZK-SNARKs Aadarsh Anantha Ramakrishnan et.al. 2510.01967 null
2025-10-02 StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold Zhizhong Li et.al. 2510.01938 null
2025-10-02 Dark characterization of Ti/Al LEKIDs for the search of axions in the W-band Victor Rollano et.al. 2510.01913 null
2025-10-02 A probabilistic representation for the gradient in a linear parabolic PDE with Neumann boundary condition Abdelatif Benchérif Madani et.al. 2510.01898 null
2025-10-02 Multi-marginal temporal Schrödinger Bridge Matching for video generation from unpaired data Thomas Gravier et.al. 2510.01894 null
2025-10-02 Fisher information and trajectorial interpretation to the Itô–Langevin relative entropy dissipation Jiaming Chen et.al. 2510.01870 null
2025-10-04 NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications Ying-Ren Chien et.al. 2510.01850 null
2025-10-02 Non-Gaussian Rotational Diffusion and Swing Motion of Dumbbell Probes in Two Dimensional Colloids Jeongmin Kim et.al. 2510.01847 null
2025-10-02 Leveraging Prior Knowledge of Diffusion Model for Person Search Giyeol Kim et.al. 2510.01841 null
2025-10-02 Representation and Integration by Parts Formulas for Affine Processes Arturo Kohatsu-Higa et.al. 2510.01839 null
2025-10-02 Intermediate diffusive-ballistic electron conduction around mesoscopic defects in graphene Toni Markovic et.al. 2510.01821 null
2025-10-02 Mean-field theory of the Santa Fe model revisited: a systematic derivation from an exact BBGKY hierarchy for the zero-intelligence limit-order book model Taiki Wakatsuki et.al. 2510.01814 null
2025-10-02 Efficient manifold evolution algorithm using adaptive B-Spline interpolation Muhammad Ammad et.al. 2510.01790 null
2025-10-03 Pack and Force Your Memory: Long-form and Consistent Video Generation Xiaofei Wu et.al. 2510.01784 null
2025-10-02 Unsupervised Dynamic Feature Selection for Robust Latent Spaces in Vision Tasks Bruno Corcuera et.al. 2510.01758 null
2025-10-02 Towards Photonic Band Diagram Generation with Transformer-Latent Diffusion Models Valentin Delchevalerie et.al. 2510.01749 null
2025-10-02 Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis Ashiyana Abdul Majeed et.al. 2510.01730 null
2025-10-02 First passage times to T cell activation Tony Wong et.al. 2510.01694 null
2025-10-03 UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction Jin Cao et.al. 2510.01669 null
2025-10-02 FideDiff: Efficient Diffusion Model for High-Fidelity Image Motion Deblurring Xiaoyang Liu et.al. 2510.01641 null
2025-10-02 Finite isoresidual covers in strata of $k$-differentials Dawei Chen et.al. 2510.01630 null
2025-10-02 Local linearization for estimating the diffusion parameter of nonlinear stochastic wave equations with spatially correlated noise Guoping Liu et.al. 2510.01627 null
2025-10-02 NPN: Non-Linear Projections of the Null-Space for Imaging Inverse Problems Roman Jacome et.al. 2510.01608 null
2025-10-02 Securing generative artificial intelligence with parallel magnetic tunnel junction true randomness Youwei Bao et.al. 2510.01598 null
2025-10-02 TetriServe: Efficient DiT Serving for Heterogeneous Image Generation Runyu Lu et.al. 2510.01565 null
2025-10-02 MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models Kevin Zhai et.al. 2510.01549 null
2025-10-02 Growing Visual Generative Capacity for Pre-Trained MLLMs Hanyu Wang et.al. 2510.01546 null
2025-10-02 Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models Shaoan Xie et.al. 2510.01544 null
2025-10-02 Towards Better Optimization For Listwise Preference in Diffusion Models Jiamu Bai et.al. 2510.01540 null
2025-10-01 Correlation estimates for Brownian particles with singular interactions Mitia Duerinckx et.al. 2510.01507 null
2025-10-01 AortaDiff: A Unified Multitask Diffusion Framework For Contrast-Free AAA Imaging Yuxuan Ou et.al. 2510.01498 null
2025-10-01 Purrception: Variational Flow Matching for Vector-Quantized Image Generation Răzvan-Andrei Matişan et.al. 2510.01478 null
2025-10-03 SCOPED: Score-Curvature Out-of-distribution Proximity Evaluator for Diffusion Brett Barkley et.al. 2510.01456 null
2025-10-01 Diffusion Modeling of the Three-Dimensional Magnetic Field in the Sun’s Corona Daniel E. da Silva et.al. 2510.01441 null
2025-10-01 DiffKnock: Diffusion-based Knockoff Statistics for Neural Networks Inference Heng Ge et.al. 2510.01418 null
2025-10-01 How Well do Diffusion Policies Learn Kinematic Constraint Manifolds? Lexi Foland et.al. 2510.01404 null
2025-10-01 Localized Pattern Formation and Oscillatory Instabilities in a Three-component Gierer Meinhardt Model Chunyi Gai et.al. 2510.01401 null
2025-10-01 DisCo: Reinforcement with Diversity Constraints for Multi-Human Generation Shubhankar Borse et.al. 2510.01399 null
2025-10-01 VENTURA: Adapting Image Diffusion Models for Unified Task Conditioned Navigation Arthur Zhang et.al. 2510.01388 null
2025-10-01 Fine-Tuning Masked Diffusion for Provable Self-Correction Jaeyeon Kim et.al. 2510.01384 null
2025-10-01 Selective Underfitting in Diffusion Models Kiwhan Song et.al. 2510.01378 null
2025-10-01 Microquasars as the major contributors to Galactic cosmic rays around the “knee” Samy Kaci et.al. 2510.01369 null
2025-10-01 Image Generation Based on Image Style Extraction Shuochen Chang et.al. 2510.01347 null
2025-10-01 Discovery of diffuse gamma-ray emission in the vicinity of G172.8+1.5: An old supernova remnant with different turbulence properties Yuan Li et.al. 2510.01340 null
2025-10-01 LVTINO: LAtent Video consisTency INverse sOlver for High Definition Video Restoration Alessio Spagnoletti et.al. 2510.01339 null
2025-10-01 Dynamical Excitation as a probe of planetary origins Brad M. S. Hansen et.al. 2510.01332 null
2025-10-01 Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling Huangjie Zheng et.al. 2510.01329 null
2025-10-01 Combining complex Langevin dynamics with score-based and energy-based diffusion models Gert Aarts et.al. 2510.01328 null
2025-10-01 IMAGEdit: Let Any Subject Transform Fei Shen et.al. 2510.01186 null
2025-10-01 Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models Yanbo Xu et.al. 2510.01184 null
2025-10-01 EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory Jiahao Wang et.al. 2510.01183 null
2025-10-01 Vanishing Acts: Quantifying Black Hole Formation with the DSNB Signal Tim Charissé et.al. 2510.01177 null
2025-10-01 Audio Driven Real-Time Facial Animation for Social Telepresence Jiye Lee et.al. 2510.01176 null
2025-10-01 Code2Video: A Code-centric Paradigm for Educational Video Generation Yanzhe Chen et.al. 2510.01174 null
2025-10-01 Multi-Marginal Flow Matching with Adversarially Learnt Interpolants Oskar Kviman et.al. 2510.01159 null
2025-10-01 Superpositions of Quantum Gaussian Processes Lorenzo Braccini et.al. 2510.01156 null
2025-10-01 Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition Jiahang Cao et.al. 2510.01068 null
2025-10-01 ReSWD: ReSTIR’d, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction Mark Boss et.al. 2510.01061 null
2025-10-01 Authentic Discrete Diffusion Model Xiao Li et.al. 2510.01047 null
2025-10-01 Gated X-TFC: Soft Domain Decomposition for Forward and Inverse Problems in Sharp-Gradient PDEs Vikas Dwivedi et.al. 2510.01039 null
2025-10-01 Secure and reversible face anonymization with diffusion models Pol Labarbarie et.al. 2510.01031 null
2025-10-01 Syntax-Guided Diffusion Language Models with User-Integrated Personalization Ruqian Zhang et.al. 2510.01028 null
2025-10-01 Equivariant Geometric Scattering Networks via Vector Diffusion Wavelets David R. Johnson et.al. 2510.01022 null
2025-10-01 Molecular Mobility of Extraterrestrial Ices: Surface Diffusion in Astrochemistry and Planetary Science N. F. W. Ligterink et.al. 2510.01018 null
2025-10-01 ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning Yuxiang Guo et.al. 2510.01010 null
2025-10-02 SoftCFG: Uncertainty-guided Stable Guidance for Visual Autoregressive Model Dongli Xu et.al. 2510.00996 null
2025-10-01 Riemannian Consistency Model Chaoran Cheng et.al. 2510.00983 null
2025-10-01 JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation Siheng Wan et.al. 2510.00974 null
2025-09-30 Stitch: Training-Free Position Control in Multimodal Diffusion Transformers Jessica Bader et.al. 2509.26644 null
2025-09-30 Query-Kontext: An Unified Multimodal Model for Image Generation and Editing Yuxin Song et.al. 2509.26641 null
2025-09-30 Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training Junlin Han et.al. 2509.26625 null
2025-09-30 DiffCamera: Arbitrary Refocusing on Images Yiyang Wang et.al. 2509.26599 null
2025-09-30 Stable Cinemetrics: Structured Taxonomy and Evaluation for Professional Video Generation Agneet Chatterjee et.al. 2509.26555 null
2025-09-30 Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents Zhen Yang et.al. 2509.26539 null
2025-09-30 HilbertA: Hilbert Attention for Image Generation with Diffusion Models Shaoyi Zheng et.al. 2509.26538 null
2025-09-30 Stab-QRAM: An All-Clifford Quantum Random Access Memory for Special Data Guangyi Li et.al. 2509.26494 null
2025-09-30 Contrastive Diffusion Guidance for Spatial Inverse Problems Sattwik Basu et.al. 2509.26489 null
2025-09-30 dParallel: Learnable Parallel Decoding for dLLMs Zigeng Chen et.al. 2509.26488 null
2025-09-30 Closures of moment expansion of anisotropic active Brownian particles Timothée Gautry et.al. 2509.26453 null
2025-09-30 Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models Donghoon Kim et.al. 2509.26436 null
2025-10-01 AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size Guanxi Lu et.al. 2509.26432 null
2025-09-30 MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation Chenhui Zhu et.al. 2509.26391 null
2025-09-30 The Effective Reactivity for Capturing Brownian Motion by Partially Reactive Patches on a Spherical Surface Denis S. Grebenkov et.al. 2509.26381 null
2025-09-30 Go with Your Gut: Scaling Confidence for Autoregressive Image Generation Harold Haodong Chen et.al. 2509.26376 null
2025-09-30 Competition of small targets in planar domains: from Dirichlet to Robin and Steklov boundary condition Denis S. Grebenkov et.al. 2509.26367 null
2025-09-30 Data-to-Energy Stochastic Dynamics Kirill Tamogashev et.al. 2509.26364 null
2025-09-30 Universal critical dynamics near the chiral phase transition and the QCD critical point Yunxin Ye et.al. 2509.26355 null
2025-09-30 Fast-dLLM v2: Efficient Block-Diffusion LLM Chengyue Wu et.al. 2509.26328 null
2025-09-30 Anomaly detection for generic failure monitoring in robotic assembly, screwing and manipulation Niklas Grambow et.al. 2509.26308 null
2025-09-30 Two-component diffuse Galactic gamma-ray emission revealed with Fermi-LAT Qi-Ling Chen et.al. 2509.26290 null
2025-09-30 3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation Balamurugan Thambiraja et.al. 2509.26233 null
2025-09-30 IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance Jiayi Guo et.al. 2509.26231 null
2025-09-30 Basic Cycle Ratio: Cost-Effective Ranking of Influential Spreaders from Local and Global Perspectives Wenxin Zheng et.al. 2509.26220 null
2025-09-30 Exact rate of convergence for the empirical measure of a subordinated process in $p$-Wasserstein distance René L. Schilling et.al. 2509.26188 null
2025-09-30 BABY 1L: First Tritium Breeding Campaign Results Rémi Delaporte-Mathurin et.al. 2509.26174 null
2025-09-30 Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models Yuansen Liu et.al. 2509.26165 null
2025-09-30 Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case Synthesis Kyeongryeol Go et.al. 2509.26158 null
2025-09-30 EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model Ruixiao Dong et.al. 2509.26127 null
2025-10-01 Tracer diffusion coefficients in a sheared granular gas. Exact results David González Méndez et.al. 2509.26115 null
2025-09-30 EVODiff: Entropy-aware Variance Optimized Diffusion Inference Shigui Li et.al. 2509.26096 null
2025-09-30 The diffusion-driven orthorhombic to tetragonal transition in YBa$_2$Cu$_3$O$_7$ derived with a machine learning interatomic potential Davide Gambino et.al. 2509.26095 null
2025-09-30 Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation Guoqing Hu et.al. 2509.26063 null
2025-09-30 Initial traces and solvability of the fast diffusion equation with power-type nonlinearity Kazuhiro Ishige et.al. 2509.26054 null
2025-09-30 PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution Shian Du et.al. 2509.26025 null
2025-09-30 New Fourth-Order Grayscale Indicator-Based Telegraph Diffusion Model for Image Despeckling Rajendra K. Ray et.al. 2509.26010 null
2025-10-02 VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing Abdelilah Aitrouga et.al. 2509.25998 null
2025-09-30 Exact Solutions to the Quantum Schrödinger Bridge Problem Mykola Bordyuh et.al. 2509.25980 null
2025-09-30 Weak-strong uniqueness for general cross-diffusion systems with volume filling Maria Heitzinger et.al. 2509.25978 null
2025-09-30 Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning Xiao Zhang et.al. 2509.25977 null
2025-09-30 CO3: Contrasting Concepts Compose Better Debottam Dutta et.al. 2509.25940 null
2025-09-30 Bringing Emerging Architectures to Sequence Labeling in NLP Ana Ezquerro et.al. 2509.25918 null
2025-10-01 LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models Guolei Huang et.al. 2509.25896 null
2025-10-01 A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI Arvind Murari Vepa et.al. 2509.25889 null
2025-09-30 Kinetics of the photochromic effect in oxygen-containing rare-earth hydrides Dmitrii Moldarev et.al. 2509.25887 null
2025-09-30 Training-Free Reward-Guided Image Editing via Trajectory Optimal Control Jinho Chang et.al. 2509.25845 null
2025-09-30 HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis Ziyu Zhang et.al. 2509.25842 null
2025-10-01 Act to See, See to Act: Diffusion-Driven Perception-Action Interplay for Adaptive Policies Jing Wang et.al. 2509.25822 null
2025-09-30 Pre-equilibrium charm quark dynamics and their impact on D-Meson observables Manu Kurian et.al. 2509.25806 null
2025-09-30 Numerical approximations to invariant measures of hybrid stochastic differential equations with superlinear coefficients via the backward Euler-Maruyama method Wei Liu et.al. 2509.25799 null
2025-09-30 PUREVQ-GAN: Defending Data Poisoning Attacks through Vector-Quantized Bottlenecks Alexander Branch et.al. 2509.25792 null
2025-09-30 Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation Mingyu Kang et.al. 2509.25776 null
2025-09-30 PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models Jeongjae Lee et.al. 2509.25774 null
2025-09-30 Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs Jia Jun Cheng Xian et.al. 2509.25771 null
2025-09-30 Quasi-Monte Carlo methods for uncertainty quantification of tumor growth modeled by a parametric semi-linear parabolic reaction-diffusion equation Alexander D. Gilbert et.al. 2509.25753 null
2025-09-30 ART-VITON: Measurement-Guided Latent Diffusion for Artifact-Free Virtual Try-On Junseo Park et.al. 2509.25749 null
2025-09-30 LieHMR: Autoregressive Human Mesh Recovery with $SO(3)$ Diffusion Donghwan Kim et.al. 2509.25739 null
2025-09-30 LaTo: Landmark-tokenized Diffusion Transformer for Fine-grained Human Face Editing Zhenghao Zhang et.al. 2509.25731 null
2025-09-30 Controlled Generation for Private Synthetic Text Zihao Zhao et.al. 2509.25729 null
2025-09-30 How Diffusion Models Memorize Juyeop Kim et.al. 2509.25705 null
2025-09-30 Radiative hydrodynamic simulations of FIP fractionation in solar flares Jeffrey W. Reep et.al. 2509.25695 null
2025-09-30 Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors Amelie Minji Kim et.al. 2509.25685 null
2025-09-30 dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought Junjie Wen et.al. 2509.25681 null
2025-09-30 Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting Jason Stock et.al. 2509.25631 null
2025-09-30 Mean Field Type Control Problems Driven by Jump-diffusions Alain Bensoussan et.al. 2509.25614 null
2025-09-29 RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance Tianlang Chen et.al. 2509.25604 null
2025-09-29 MoReFlow: Motion Retargeting Learning through Unsupervised Flow Matching Wontaek Kim et.al. 2509.25600 null
2025-09-29 Machine Learning Algorithms for Improving Black Box Optimization Solvers Morteza Kimiaei et.al. 2509.25592 null
2025-09-29 IRIS: Intrinsic Reward Image Synthesis Yihang Chen et.al. 2509.25562 null
2025-09-29 Spatiotemporal Forecasting of Incidents and Congestion with Implications for Sustainable Traffic Control Tony Kinchen et.al. 2509.25515 null
2025-09-29 Non-Gaussian statistics of concentration fluctuations in free liquid diffusion Marco Bussoletti et.al. 2509.25511 null
2025-09-29 Analysis of a Cahn–Hilliard model for viscoelastoplastic two-phase flows Fan Cheng et.al. 2509.25508 null
2025-09-29 Kinetic Monte Carlo prediction of the morphology of pentaerythritol tetranitrate Jacob Jeffries et.al. 2509.25490 null
2025-09-29 Noise estimation of SDE from a single data trajectory Munawar Ali et.al. 2509.25484 null
2025-09-29 Translation from Wearable PPG to 12-Lead ECG Hui Ji et.al. 2509.25480 null
2025-09-29 Exponential Hedging for the Ornstein-Uhlenbeck Process in the Presence of Linear Price Impact Yan Dolinsky et.al. 2509.25472 null
2025-09-29 Generating Differentially Private Networks with a Modified Erdős-Rényi Model Huaiyuan Rao et.al. 2509.25431 null
2025-09-29 Stochastic dynamics on evolving geometric graphs Alexei Daletskii et.al. 2509.25427 null
2025-09-29 Electropolishing-Induced Topographic Defects in Niobium: Insights and Implications for Superconducting Radio Frequency Applications Oleksandr Hryhorenko et.al. 2509.25423 null
2025-09-29 Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization Jiacheng Shi et.al. 2509.25416 null
2025-09-29 FlashOmni: A Unified Sparse Attention Engine for Diffusion Transformers Liang Qiao et.al. 2509.25401 null
2025-09-29 Let Physics Guide Your Protein Flows: Topology-aware Unfolding and Generation Yogesh Verma et.al. 2509.25379 null
2025-09-29 Safe and Stable Control via Lyapunov-Guided Diffusion Models Xiaoyuan Cheng et.al. 2509.25375 null
2025-09-29 Diffusion with doubly stochastic resetting Maxence Arutkin et.al. 2509.25365 null
2025-09-29 The spatially-resolved effect of mergers on the stellar mass assembly of MaNGA galaxies Eirini Angeloudi et.al. 2509.25340 null
2025-09-29 LUMA: Low-Dimension Unified Motion Alignment with Dual-Path Anchoring for Text-to-Motion Diffusion Model Haozhe Jia et.al. 2509.25304 null
2025-09-29 Learning to Parallel: Accelerating Diffusion Large Language Models via Adaptive Parallel Decoding Wenrui Bao et.al. 2509.25188 null
2025-09-29 FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation Yunyang Ge et.al. 2509.25187 null
2025-09-29 Guided Diffusion for the Discovery of New Superconductors Pawan Prakash et.al. 2509.25186 null
2025-09-29 DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder Junyu Chen et.al. 2509.25182 null
2025-09-29 A bound-preserving multinumerics scheme for steady-state convection-diffusion equations Maurice S. Fabien et.al. 2509.25181 null
2025-10-01 DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space Wenkun He et.al. 2509.25180 null
2025-09-29 GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs Aryan Yazdan Parast et.al. 2509.25178 null
2025-09-29 Personalized Vision via Visual In-Context Learning Yuxin Jiang et.al. 2509.25172 null
2025-09-29 TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion Sophia Tang et.al. 2509.25171 null
2025-09-29 GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models Peter Holderrieth et.al. 2509.25170 null
2025-09-29 Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models Bowei Chen et.al. 2509.25162 null
2025-09-29 Rolling Forcing: Autoregressive Long Video Diffusion in Real Time Kunhao Liu et.al. 2509.25161 null
2025-09-29 GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts Fan Yuan et.al. 2509.25160 null
2025-09-29 LayerD: Decomposing Raster Graphic Designs into Layers Tomoyuki Suzuki et.al. 2509.25134 null
2025-09-29 Score Distillation of Flow Matching Models Mingyuan Zhou et.al. 2509.25127 null
2025-09-29 Diffuse Domain Methods with Dirichlet Boundary Conditions Luke Benfield et.al. 2509.25115 null
2025-09-29 MANI-Pure: Magnitude-Adaptive Noise Injection for Adversarial Purification Xiaoyi Huang et.al. 2509.25082 null
2025-09-29 Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI Bogdan Raonić et.al. 2509.25080 null
2025-09-29 UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation Guanjun Wu et.al. 2509.25079 null
2025-09-29 Interstellar Dust-Catalyzed Molecular Hydrogen Formation Enabled by Nuclear Quantum Effects Xiaolong Yang et.al. 2509.25070 null
2025-09-29 Collective transport efficiency of microswimmer swarms optimized by tactic run-tumble dynamics Maggie Liu et.al. 2509.25068 null
2025-09-29 CharGen: Fast and Fluent Portrait Modification Jan-Niklas Dihlmann et.al. 2509.25058 null
2025-09-29 Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models Shuchen Xue et.al. 2509.25050 null
2025-09-29 Scaling Synthetic Task Generation for Agents via Exploration Ram Ramrakhya et.al. 2509.25047 null
2025-09-29 Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct Haoyang Zheng et.al. 2509.25035 null
2025-09-29 Lagrangian description and quantification of scalar mixing in fluid flows from particle tracks Anna Klünker et.al. 2509.25030 null
2025-09-29 STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation Xiaoxiao Ma et.al. 2509.25027 null
2025-09-29 Score-based Membership Inference on Diffusion Models Mingxing Rao et.al. 2509.25003 null
2025-09-29 PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion Yuyang Yin et.al. 2509.24997 null
2025-09-29 Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator Da Saem Lee et.al. 2509.24995 null
2025-09-29 SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation Shuang Liang et.al. 2509.24980 null
2025-09-30 Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel Haotian Dong et.al. 2509.24979 null
2025-09-29 DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern Lekang Yang et.al. 2509.24975 null
2025-09-29 Double Descent as a Lens for Sample Efficiency in Autoregressive vs. Discrete Diffusion Models Ahmad Fraij et.al. 2509.24974 null
2025-09-29 VIVALDy: A Hybrid Generative Reduced-Order Model for Turbulent Flows, Applied to Vortex-Induced Vibrations Niccolò Tonioni et.al. 2509.24965 null
2025-09-29 Sharp behavior of semilinear damped wave equations driven by mixed local-nonlocal operators Wenhui Chen et.al. 2509.24940 null
2025-09-29 Scalable GANs with Transformers Sangeek Hyun et.al. 2509.24935 null
2025-09-29 Precision calculation of $^3$He$(\alpha,\gamma)^7$Be for solar physics Ratna Khadka et.al. 2509.24931 null
2025-09-29 SAGA-SR: Semantically and Acoustically Guided Audio Super-Resolution Jaekwon Im et.al. 2509.24924 null
2025-09-29 From Code to Action: Hierarchical Learning of Diffusion-VLM Policies Markus Peschl et.al. 2509.24917 null
2025-09-29 Segmentor-Guided Counterfactual Fine-Tuning for Image Synthesis Tian Xia et.al. 2509.24913 null
2025-09-29 When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis Xiang Li et.al. 2509.24912 null
2025-09-29 DRCP: Diffusion on Reinforced Cooperative Perception for Perceiving Beyond Limits Lantao Li et.al. 2509.24903 null
2025-09-29 OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing Zhihong Chen et.al. 2509.24900 null
2025-09-29 Attention Surgery: An Efficient Recipe to Linearize Your Video Diffusion Transformer Mohsen Ghafoorian et.al. 2509.24899 null
2025-09-29 RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Yang Shi et.al. 2509.24897 null
2025-09-29 VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines Mostafa Mohaimen Akand Faisal et.al. 2509.24891 null
2025-09-29 MMRQA: Signal-Enhanced Multimodal Large Language Models for MRI Quality Assessment Fankai Jia et.al. 2509.24888 null
2025-09-29 Response to dynamic shape changes in suspensions of hard rectangles Denis Dertli et.al. 2509.24885 null
2025-09-29 ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation Jiuhong Xiao et.al. 2509.24878 null
2025-09-29 Environment-Aware Satellite Image Generation with Diffusion Models Nikos Kostagiolas et.al. 2509.24875 null
2025-09-29 Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation Lei Tong et.al. 2509.24798 null
2025-09-29 Fidelity-Aware Data Composition for Robust Robot Generalization Zizhao Tong et.al. 2509.24797 null
2025-09-29 Collision types and times in interacting particle systems Sergio Andraus et.al. 2509.24790 null
2025-09-29 FESTIM v2.0: Upgraded framework for multi-species hydrogen transport and enhanced performance James Dark et.al. 2509.24760 null
2025-09-29 ExGS: Extreme 3D Gaussian Compression with Diffusion Priors Jiaqi Chen et.al. 2509.24758 null
2025-09-29 Fabrication of hydrogen-bonded metal inorganic-organic complex glasses by ligand-tuning approach Tianzhao Xu et.al. 2509.24755 null
2025-09-29 Geometric structure of stationary problem for spatial 1D self-diffusion equation with logistic growth Yu ICHIDA et.al. 2509.24752 null
2025-09-29 Direct numerical simulation of two-phase flows with surfactant-induced surface viscous effects Debashis Panda et.al. 2509.24722 null
2025-09-29 MAD: Manifold Attracted Diffusion Dennis Elbrächter et.al. 2509.24710 null
2025-09-29 Enhancing Physical Plausibility in Video Generation by Reasoning the Implausibility Yutong Hao et.al. 2509.24702 null
2025-09-29 SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Junsong Chen et.al. 2509.24695 null
2025-09-29 The influence of solute induced memory on interface migration Chad W. Sinclair et.al. 2509.24668 null
2025-09-29 Learning Object-Centric Representations Based on Slots in Real World Scenarios Adil Kaan Akan et.al. 2509.24652 null
2025-09-29 VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning Yixuan Zhou et.al. 2509.24650 null
2025-09-30 RIFLE: Removal of Image Flicker-Banding via Latent Diffusion Enhancement Libo Zhu et.al. 2509.24644 null
2025-09-29 PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control Haozhuo Zhang et.al. 2509.24591 null
2025-09-29 SAIP: A Plug-and-Play Scale-adaptive Module in Diffusion-based Inverse Problems Lingyu Wang et.al. 2509.24580 null
2025-09-29 U-DiT Policy: U-shaped Diffusion Transformers for Robotic Manipulation Linzhi Wu et.al. 2509.24579 null
2025-09-29 SCOPE: Semantic Conditioning for Sim2Real Category-Level Object Pose Estimation in Robotics Peter Hönig et.al. 2509.24572 null
2025-09-29 Training-Free Multimodal Guidance for Video to Audio Generation Eleonora Grassucci et.al. 2509.24550 null
2025-09-29 Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis Kaizhen Zhu et.al. 2509.24531 null
2025-09-29 CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models Zheyuan Hu et.al. 2509.24526 null
2025-09-29 The role of viral dynamics and infectivity in models of oncolytic virotherapy for tumours with different motility David Morselli et.al. 2509.24522 null
2025-09-29 Flow Crossover and Parallel Outflow during Collisionless Magnetic Reconnection Theerasarn Pianpanit et.al. 2509.24513 null
2025-09-29 A Novel Preprocessing Unit for Effective Deep Learning based Classification and Grading of Diabetic Retinopathy Pranoti Nage et.al. 2509.24497 null
2025-09-29 LaMoGen: Laban Movement-Guided Diffusion for Text-to-Motion Generation Heechang Kim et.al. 2509.24469 null
2025-09-29 An Agent-Based Framework for Automated Higher-Voice Harmony Generation Nia D’Souza Ganapathy et.al. 2509.24463 null
2025-09-29 Alternatives To Next Token Prediction In Text Generation – A Survey Charlie Wyatt et.al. 2509.24435 null
2025-09-29 UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark Ailing Zhang et.al. 2509.24427 null
2025-09-29 CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers Kai Liu et.al. 2509.24416 null
2025-09-29 Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance Runwu Shi et.al. 2509.24395 null
2025-09-29 LLaDA-MoE: A Sparse MoE Diffusion Language Model Fengqi Zhu et.al. 2509.24389 null
2025-09-29 Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Xin Qiu et.al. 2509.24372 null
2025-09-29 From Satellite to Street: A Hybrid Framework Integrating Stable Diffusion and PanoGAN for Consistent Cross-View Synthesis Khawlah Bajbaa et.al. 2509.24369 null
2025-09-29 Watermarking Diffusion Language Models Thibaud Gloaguen et.al. 2509.24368 null
2025-09-29 Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models Jitai Hao et.al. 2509.24365 null
2025-09-29 DRIFT: Divergent Response in Filtered Transformations for Robust Adversarial Defense Amira Guesmi et.al. 2509.24359 null
2025-09-29 NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis Yixuan Ren et.al. 2509.24353 null
2025-09-29 Hyperspherical Latents Improve Continuous-Token Autoregressive Generation Guolin Ke et.al. 2509.24335 null
2025-09-29 Wavelet-Assisted Mamba for Satellite-Derived Sea Surface Temperature Super-Resolution Wankun Chen et.al. 2509.24334 null
2025-09-29 3D Structure of Jet-induced Diffusion Wake Zhong Yang et.al. 2509.24315 null
2025-09-29 A study of Universal ODE approaches to predicting soil organic carbon Satyanarayana Raju G. V. V et.al. 2509.24306 null
2025-09-29 High-Precision Temperature Estimation Based on Magnetic Nanoparticles Dominated by Brownian Relaxation under Combined AC and DC Magnetic Fields Zhongzhou Du et.al. 2509.24301 null
2025-09-29 DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models Zherui Li et.al. 2509.24296 null
2025-09-29 ASIA: Adaptive 3D Segmentation using Few Image Annotations Sai Raj Kishore Perla et.al. 2509.24288 null
2025-09-29 Collisional Baryon-Dominated Dwarf Galaxies: A New Probe of Bursty Feedback and Dark Matter Physics Yi-Ying Wang et.al. 2509.24270 null
2025-09-29 Cycle Diffusion Model for Counterfactual Image Generation Fangrui Huang et.al. 2509.24267 null
2025-09-29 FreeAction: Training-Free Techniques for Enhanced Fidelity of Trajectory-to-Video Generation Seungwook Kim et.al. 2509.24241 null
2025-09-29 Geometry-induced criticality in $p$-adic scaling limits of random walks Rahul Rajkumar et.al. 2509.24234 null
2025-09-29 Non-Invasive Detection of PROState Cancer with Novel Time-Dependent Diffusion MRI and AI-Enhanced Quantitative Radiological Interpretation: PROS-TD-AI Baltasar Ramos et.al. 2509.24227 null
2025-09-29 Semantic Editing with Coupled Stochastic Differential Equations Jianxin Zhang et.al. 2509.24223 null
2025-09-29 The role of the solid-melt interface in accelerating the self-catalyzed growth kinetics of III-V semiconductors Zhucong Xi et.al. 2509.24206 null
2025-09-30 UniVid: The Open-Source Unified Video Model Jiabin Luo et.al. 2509.24200 null
2025-09-29 An Efficient 3D Latent Diffusion Model for T1-contrast Enhanced MRI Generation Zach Eidex et.al. 2509.24194 null
2025-09-29 Simulating Post-Neoadjuvant Chemotherapy Breast Cancer MRI via Diffusion Model with Prompt Tuning Jonghun Kim et.al. 2509.24185 null
2025-09-29 Tumor Synthesis conditioned on Radiomics Jonghun Kim et.al. 2509.24182 null
2025-09-29 LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis Moxin Zhao et.al. 2509.24165 null
2025-09-29 Asymmetric VAE for One-Step Video Super-Resolution Acceleration Jianze Li et.al. 2509.24142 null
2025-09-28 GANji: A Framework for Introductory AI Image Generation Chandon Hamel et.al. 2509.24128 null
2025-09-28 Progressive Layer Stripping Analysis for HVSR Interpretation Mersad Fathizadeh et.al. 2509.24121 null
2025-09-28 GeoFunFlow: Geometric Function Flow Matching for Inverse Operator Learning over Complex Geometries Sifan Wang et.al. 2509.24117 null
2025-09-28 BTC-SAM: Leveraging LLMs for Generation of Bias Test Cases for Sentiment Analysis Models Zsolt T. Kardkovács et.al. 2509.24101 null
2025-09-26 Pixel Motion Diffusion is What We Need for Robot Control E-Ro Nguyen et.al. 2509.22652 null
2025-09-26 RefAM: Attention Magnets for Zero-Shot Referral Segmentation Anna Kukleva et.al. 2509.22650 null
2025-09-26 Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs Xingyu Fu et.al. 2509.22646 null
2025-09-26 Language Models Can Learn from Verbal Feedback Without Scalar Rewards Renjie Luo et.al. 2509.22638 null
2025-09-26 Scale-Wise VAR is Secretly Discrete Diffusion Amandeep Kumar et.al. 2509.22636 null
2025-09-26 Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance Luc Boudier et.al. 2509.22635 null
2025-09-26 LongLive: Real-time Interactive Long Video Generation Shuai Yang et.al. 2509.22622 null
2025-09-26 Exact solutions of open quantum Brownian motions on the real line for two-level systems Manuel D. de la Iglesia et.al. 2509.22604 null
2025-09-26 Transport Based Mean Flows for Generative Modeling Elaheh Akbari et.al. 2509.22592 null
2025-09-26 EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation Yuan Xu et.al. 2509.22578 null
2025-09-26 UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration Qi Mao et.al. 2509.22570 null
2025-09-26 ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Generative Models Xiaocheng Zou et.al. 2509.22551 null
2025-09-26 EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model Andrii Litvynchuk et.al. 2509.22527 null
2025-09-26 JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation Guillem Capellera et.al. 2509.22522 null
2025-09-26 A phenotype-structured reaction-diffusion model of avascular glioma growth Francesca Ballatore et.al. 2509.22519 null
2025-09-26 Group Critical-token Policy Optimization for Autoregressive Image Generation Guohui Zhang et.al. 2509.22485 null
2025-09-26 Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation Chen Li et.al. 2509.22476 null
2025-09-26 Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs) Nikita Kornilov et.al. 2509.22459 null
2025-09-26 Overclocking Electrostatic Generative Models Daniil Shlenskii et.al. 2509.22454 null
2025-09-26 LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer Song Fei et.al. 2509.22414 null
2025-09-26 EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer Zhehao Dong et.al. 2509.22407 null
2025-09-26 Closing the Safety Gap: Surgical Concept Erasure in Visual Autoregressive Models Xinhao Zhong et.al. 2509.22400 null
2025-09-26 Gradient-based multi-focus image fusion with focus-aware saliency enhancement Haoyu Li et.al. 2509.22392 null
2025-09-26 SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis Marie Brockschmidt et.al. 2509.22352 null
2025-09-26 Decoding quantum low density parity check codes with diffusion Zejun Liu et.al. 2509.22347 null
2025-09-26 RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer Wangbo Zhao et.al. 2509.22323 null
2025-09-26 NIFTY: a Non-Local Image Flow Matching for Texture Synthesis Pierrick Chatillon et.al. 2509.22318 null
2025-09-26 Self-organization mechanism in Bridgman-grown MnBi2Te4/(Bi2Te3)n: influence on layer sequence and magnetic properties Paweł Skupiński et.al. 2509.22303 null
2025-09-26 HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models Seyedmorteza Sadat et.al. 2509.22300 null
2025-09-26 Jailbreaking on Text-to-Video Models via Scene Splitting Strategy Wonjun Lee et.al. 2509.22292 null
2025-09-26 Wavelength-scale noise-resistant on-chip spectrometer Jianbo Yu et.al. 2509.22286 null
2025-09-26 Conditional Denoising Diffusion Autoencoders for Wireless Semantic Communications Mehdi Letafati et.al. 2509.22282 null
2025-09-26 FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing Junyi Wu et.al. 2509.22244 null
2025-09-26 The moving patch model with fractional diffusion Sebastián Flores-Sepúlveda et.al. 2509.22234 null
2025-09-26 Question-Driven Analysis and Synthesis: Building Interpretable Thematic Trees with LLMs for Text Clustering and Controllable Generation Tiago Fernandes Tavares et.al. 2509.22211 null
2025-09-26 MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training Haoyun Li et.al. 2509.22199 null
2025-09-26 DragGANSpace: Latent Space Exploration and Control for GANs Kirsten Odendaal et.al. 2509.22169 null
2025-09-26 REFINE-CONTROL: A Semi-supervised Distillation Method For Conditional Image Generation Yicheng Jiang et.al. 2509.22139 null
2025-09-26 Guidance Watermarking for Diffusion Models Enoal Gesny et.al. 2509.22126 null
2025-09-26 Countering adversarial evasion in regression analysis David Benfield et.al. 2509.22113 null
2025-09-26 Large Material Gaussian Model for Relightable 3D Generation Jingrui Ye et.al. 2509.22112 null
2025-09-26 50 mm $\times$ 50 mm Cesium Atomic Vapor Cell for Terahertz Imaging: Implementation and Application Bin Zhang et.al. 2509.22098 null
2025-09-26 Factor-Based Conditional Diffusion Model for Portfolio Optimization Xuefeng Gao et.al. 2509.22088 null
2025-09-26 SpecXNet: A Dual-Domain Convolutional Network for Robust Deepfake Detection Inzamamul Alam et.al. 2509.22070 null
2025-09-26 High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling Chao Huang et.al. 2509.22063 null
2025-09-26 Comparative Analysis of GAN and Diffusion for MRI-to-CT translation Emily Honey et.al. 2509.22049 null
2025-09-26 Latent Diffusion: Multi-Dimension Stable Diffusion Latent Space Explorer Zhihua Zhong et.al. 2509.22038 null
2025-09-26 Stage-wise Dynamics of Classifier-Free Guidance in Diffusion Models Cheng Jin et.al. 2509.22007 null
2025-09-26 Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors Youxu Shi et.al. 2509.21997 null
2025-09-26 FailureAtlas: Mapping the Failure Landscape of T2I Models via Active Exploration Muxi Chen et.al. 2509.21995 null
2025-09-26 Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation Abdelrahman Eldesokey et.al. 2509.21989 null
2025-09-26 Hybrid Diffusion for Simultaneous Symbolic and Continuous Planning Sigmund Hennum Høeg et.al. 2509.21983 null
2025-09-26 Electric-field effect on spin diffusion length in solids: An ab initio study beyond the drift-diffusion model Junqing Xu et.al. 2509.21962 null
2025-09-26 MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning Tao Wu et.al. 2509.21953 null
2025-09-26 Modeling the Equilibrium Vacancy Concentration in Multi-Principal Element Alloys from First-Principles Damien K. J. Lee et.al. 2509.21944 null
2025-09-26 Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning Xianghua Zeng et.al. 2509.21942 null
2025-09-26 SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet Woosung Joung et.al. 2509.21938 null
2025-09-26 EqDiff-CT: Equivariant Conditional Diffusion model for CT Image Synthesis from CBCT Alzahra Altalib et.al. 2509.21913 null
2025-09-26 Discrete Guidance Matching: Exact Guidance for Discrete Flow Matching Zhengyan Wan et.al. 2509.21912 null
2025-09-26 Logarithmic evolutions in solutions to the convection-diffusion equation of Burgers type Masakazu Yamamoto et.al. 2509.21909 null
2025-09-26 Error Analysis of Discrete Flow with Generator Matching Zhengyan Wan et.al. 2509.21906 null
2025-09-26 TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation Qihang Wang et.al. 2509.21905 null
2025-09-26 Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers Jibin Song et.al. 2509.21893 null
2025-09-26 Drag4D: Align Your Motion with Text-Driven 3D Scene Generation Minjun Kang et.al. 2509.21888 null
2025-09-26 StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing Liyang Chen et.al. 2509.21887 null
2025-09-26 Abductive Logical Rule Induction by Bridging Inductive Logic Programming and Multimodal Large Language Models Yifei Peng et.al. 2509.21874 null
2025-09-26 Deepfakes: we need to re-think the concept of “real” images Janis Keuper et.al. 2509.21864 null
2025-09-26 DiTraj: training-free trajectory control for video diffusion transformer Cheng Lei et.al. 2509.21839 null
2025-09-26 On the Complexity Theory of Masked Discrete Diffusion: From $\mathrm{poly}(1/ε)$ to Nearly $ε$-Free Xunpeng Huang et.al. 2509.21835 null
2025-09-26 MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation Yu Shang et.al. 2509.21797 null
2025-09-26 LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE Yu Shang et.al. 2509.21790 null
2025-09-26 DeHate: A Stable Diffusion-based Multimodal Approach to Mitigate Hate Speech in Images Dwip Dalal et.al. 2509.21787 null
2025-09-26 UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models Lan Chen et.al. 2509.21760 null
2025-09-26 Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription Michael Yeung et.al. 2509.21739 null
2025-09-26 MESA Isochrones and Stellar Tracks (MIST) III. The White Dwarf Cooling Sequence Evan B. Bauer et.al. 2509.21717 null
2025-09-26 MusicWeaver: Coherent Long-Range and Editable Music Generation from a Beat-Aligned Structural Plan Xuanchen Wang et.al. 2509.21714 null
2025-09-25 Snapshot Synthetic Aperture Imaging with Boiling Speckle Janith B. Senanayaka et.al. 2509.21682 null
2025-09-25 Generating Stable Placements via Physics-guided Diffusion Models Philippe Nadeau et.al. 2509.21664 null
2025-09-25 RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with application to full waveform inversion Siming Shan et.al. 2509.21659 null
2025-09-25 FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction Yixiang Dai et.al. 2509.21657 null
2025-09-25 DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models Yinuo Ren et.al. 2509.21655 null
2025-09-25 A comprehensive equivalent circuit model for high overtone bulk acoustic resonators (HBARs) Vikrant J. Gokhale et.al. 2509.21640 null
2025-09-25 Guiding Audio Editing with Audio Language Model Zitong Lan et.al. 2509.21625 null
2025-09-25 Message passing for epidemiological interventions on networks with loops Erik Weis et.al. 2509.21596 null
2025-09-25 Transabdominal Fetal Oximetry via Diffuse Optics: Principled Analysis and Demonstration in Pregnant Ovine Models Weitai Qian et.al. 2509.21594 null
2025-09-25 What Happens Next? Anticipating Future Motion by Generating Point Trajectories Gabrijel Boduljak et.al. 2509.21592 null
2025-09-25 X-Streamer: Unified Human World Modeling with Audiovisual Interaction You Xie et.al. 2509.21574 null
2025-09-25 No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models Junno Yun et.al. 2509.21565 null
2025-09-25 ControlHair: Physically-based Video Diffusion for Controllable Dynamic Hair Rendering Weikai Lin et.al. 2509.21541 null
2025-09-25 Patch-Based Diffusion for Data-Efficient, Radiologist-Preferred MRI Reconstruction Rohan Sanda et.al. 2509.21531 null
2025-09-25 Shortcut Flow Matching for Speech Enhancement: Step-Invariant flows via single stage training Naisong Zhou et.al. 2509.21522 null
2025-09-25 DistillKac: Few-Step Image Generation via Damped Wave Equations Weiqiao Han et.al. 2509.21513 null
2025-09-25 Quantum algorithms for solving a drift-diffusion equation: analysing circuit depths Ellen Devereux et.al. 2509.21509 null
2025-09-25 SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models Arani Roy et.al. 2509.21498 null
2025-09-25 d2: Improved Techniques for Training Reasoning Diffusion Language Models Guanghan Wang et.al. 2509.21474 null
2025-09-25 Are Hallucinations Bad Estimations? Hude Liu et.al. 2509.21473 null
2025-09-25 Score-based Idempotent Distillation of Diffusion Models Shehtab Zaman et.al. 2509.21470 null
2025-09-25 Gender Stereotypes in Professional Roles Among Saudis: An Analytical Study of AI-Generated Images Using Language Models Khaloud S. AlKhalifah et.al. 2509.21466 null
2025-09-25 Viscous Growth Law in Bubble Coarsening: A Molecular Dynamics Perspective Parameshwaran A et.al. 2509.21457 null
2025-09-25 SD3.5-Flash: Distribution-Guided Distillation of Generative Flows Hmrishav Bandyopadhyay et.al. 2509.21318 null
2025-09-25 Two ADI compact difference methods for variable-exponent diffusion wave equations Hao Zhang et.al. 2509.21316 null
2025-09-25 NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics Yu Yuan et.al. 2509.21309 null
2025-09-25 Einstein@Home Searches for Gamma-ray Pulsars in the Inner Galaxy C. J. Clark et.al. 2509.21307 null
2025-09-26 Outflow-cloud interaction as the possible origin of the peculiar radio emission in the tidal disruption event AT2018cqh Lei Yang et.al. 2509.21299 null
2025-09-25 Does FLUX Already Know How to Perform Physically Plausible Image Composition? Shilin Lu et.al. 2509.21278 null
2025-09-25 Dense Semantic Matching with VGGT Prior Songlin Yang et.al. 2509.21263 null
2025-09-25 Un-Doubling Diffusion: LLM-guided Disambiguation of Homonym Duplication Evgeny Kaskov et.al. 2509.21262 null
2025-09-25 Hallucination as an Upper Bound: A New Perspective on Text-to-Image Evaluation Seyed Amir Kasaei et.al. 2509.21257 null
2025-09-25 Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets Team Hunyuan3D et.al. 2509.21245 null
2025-09-25 Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation Seyed Amir Kasaei et.al. 2509.21227 null
2025-09-25 A Unified Framework for Diffusion Model Unlearning with f-Divergence Nicola Novello et.al. 2509.21167 null
2025-09-25 DAGDiff: Guiding Dual-Arm Grasp Diffusion to Stable and Collision-Free Grasps Md Faizal Karim et.al. 2509.21145 null
2025-09-25 The Unwinnable Arms Race of AI Image Detection Till Aczel et.al. 2509.21135 null
2025-09-25 MotionFlow: Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation Guojun Lei et.al. 2509.21119 null
2025-09-25 Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks? Rostislav Makarov et.al. 2509.21087 null
2025-09-25 UniTransfer: Video Concept Transfer via Progressive Spatial and Timestep Decomposition Guojun Lei et.al. 2509.21086 null
2025-09-25 Normalizing Flows are Capable Visuomotor Policy Learning Models Simon Kristoffersson Lind et.al. 2509.21073 null
2025-09-25 SPREAD: Sampling-based Pareto front Refinement via Efficient Adaptive Diffusion Sedjro Salomon Hotegni et.al. 2509.21058 null
2025-09-25 Actor-Critic without Actor Donghyeon Ki et.al. 2509.21022 null
2025-09-25 Graphical Willmore Problems with Low-Regularity Boundary and Dirichlet Data Boris Gulyak et.al. 2509.21018 null
2025-09-25 Unbiased Parameter Estimation of Partially Observed Diffusions using Diffusion Bridges Miguel Alvarez et.al. 2509.21015 null
2025-09-25 A Single Neuron Works: Precise Concept Erasure in Text-to-Image Diffusion Models Qinqin He et.al. 2509.21008 null
2025-09-26 TF-Restormer: Complex Spectral Prediction for Speech Restoration Ui-Hyeop Shin et.al. 2509.21003 null
2025-09-25 High energy gammas and neutrinos from the Sun, Jupiter and Earth Pablo de la Torre et.al. 2509.20970 null
2025-09-25 Flow Matching in the Low-Noise Regime: Pathologies and a Contrastive Remedy Weili Zeng et.al. 2509.20952 null
2025-09-25 SMC-X: A Distributed Scalable Monte Carlo Simulation Method for Chemically Complex Alloys Xianglin Liu et.al. 2509.20949 null
2025-09-25 Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting Yanfeng Yang et.al. 2509.20928 null
2025-09-25 SimDiff: Simulator-constrained Diffusion Model for Physically Plausible Motion Generation Akihisa Watanabe et.al. 2509.20927 null
2025-09-25 Deterministic Discrete Denoising Hideyuki Suzuki et.al. 2509.20896 null
2025-09-25 AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion Junyoung Koh et.al. 2509.20891 null
2025-09-25 FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies Shuqiao Liang et.al. 2509.20890 null
2025-09-25 Holographic Brownian dynamics of a heavy particle in a boosted thermal plasma background Anirban Roy Chowdhury et.al. 2509.20889 null
2025-09-25 Nuclear Diffusion Models for Low-Rank Background Suppression in Videos Tristan S. W. Stevens et.al. 2509.20886 null
2025-09-25 Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering Zhifei Li et.al. 2509.20884 null
2025-09-25 WeFT: Weighted Entropy-driven Fine-Tuning for dLLMs Guowei Xu et.al. 2509.20863 null
2025-09-25 Causal Time Series Generation via Diffusion Models Yutong Xia et.al. 2509.20846 null
2025-09-25 Topological Catenation-induced Pore Size in 2D Olympic Network Wenbo Zhao et.al. 2509.20827 null
2025-09-25 T2I-Diff: fMRI Signal Generation via Time-Frequency Image Transform and Classifier-Free Denoising Diffusion Models Hwa Hui Tew et.al. 2509.20822 null
2025-09-25 Diffusive Scaling limit of stochastic Box-Ball systems and PushTASEP David Keating et.al. 2509.20779 null
2025-09-25 CusEnhancer: A Zero-Shot Scene and Controllability Enhancement Method for Photo Customization via ResInversion Maoye Ren et.al. 2509.20775 null
2025-09-25 Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis Maria F. Davila R et.al. 2509.20768 null
2025-09-25 FreeInsert: Personalized Object Insertion with Geometric and Style Control Yuhong Zhang et.al. 2509.20756 null
2025-09-25 RAPTOR-GEN: RApid PosTeriOR GENerator for Bayesian Learning in Biomanufacturing Wandi Xu et.al. 2509.20753 null
2025-09-25 Parallel Thinking, Sequential Answering: Bridging NAR and AR for Efficient Reasoning Qihang Ai et.al. 2509.20744 null
2025-09-25 Quantum Algorithm for Subcellular Multiscale Reaction-Diffusion Systems Margot Lockwood et.al. 2509.20668 null
2025-09-25 Atomistic Insights into Cu/amorphous-Ta$_x$N Interfacial Adhesion via Machine Learning Interatomic Potentials: Effects of Stoichiometry and Interface Construction Jeong Min Choi et.al. 2509.20662 null
2025-09-25 Scaling limit for Brownian motions on the $l$-level Sierpinski gaskets: The fractal to Euclidean crossover David A. Croydon et.al. 2509.20657 null
2025-09-25 Stray light in 3D porous nanostructures of single crystalline copper film Yu-Seong Seo et.al. 2509.20644 null
2025-09-24 FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models Amin Karimi Monsefi et.al. 2509.20624 null
2025-09-24 MMG: Mutual Information Estimation via the MMSE Gap in Diffusion Longxuan Yu et.al. 2509.20609 null
2025-09-24 The X-ray Emission of NGC 5005: An Unobscured Low-Luminosity AGN with a Weakly Accreting Broad-Line Region Anna Trindade Falcão et.al. 2509.20597 null
2025-09-24 von Kármán–Howarth Similarity of Spatial Correlations and the Distribution of Correlation Lengths in Solar Photospheric Turbulence Rohit Chhiber et.al. 2509.20590 null
2025-09-24 Burning games on strong path products Sally Ambrose et.al. 2509.20572 null
2025-09-24 PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models Mingze Yuan et.al. 2509.20570 null
2025-09-24 A Hierarchical Adaptive Diffusion Model for Flexible Protein-Protein Docking Rujie Yin et.al. 2509.20542 null
2025-09-24 Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion Tianyong Yao et.al. 2509.20538 null
2025-09-24 InstructVTON: Optimal Auto-Masking and Natural-Language-Guided Interactive Style Control for Inpainting-Based Virtual Try-On Julien Han et.al. 2509.20524 null
2025-09-24 A Recovery Theory for Diffusion Priors: Deterministic Analysis of the Implicit Prior Algorithm Oscar Leong et.al. 2509.20511 null
2025-09-24 How two-dimensional are planet-disc interactions? II. Radiation hydrodynamics and suitable cooling prescriptions Alexandros Ziampras et.al. 2509.20464 null
2025-09-24 On the Hydrodynamic Approximation of Quantum Integrable Models – An Illustration via the repulsive Lieb-Liniger Model Friedrich Hübner et.al. 2509.20445 null
2025-09-24 pop-cosmos: Star formation over 12 Gyr from generative modelling of a deep infrared-selected galaxy catalogue Sinan Deger et.al. 2509.20430 null
2025-09-24 Seedream 4.0: Toward Next-generation Multimodal Image Generation Team Seedream et.al. 2509.20427 null
2025-09-24 Adversarial Defense in Cybersecurity: A Systematic Review of GANs for Threat Detection and Mitigation Tharcisse Ndayipfukamiye et.al. 2509.20411 null
2025-09-25 EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning Xuan Ju et.al. 2509.20360 null
2025-09-24 PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation Chen Wang et.al. 2509.20358 null
2025-09-26 mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies Remo Steiner et.al. 2509.20297 null
2025-09-26 FAST: Foreground-aware Diffusion with Accelerated Sampling Trajectory for Segmentation-oriented Anomaly Synthesis Xichen Xu et.al. 2509.20295 null
2025-09-24 Biologically Plausible Learning via Bidirectional Spike-Based Distillation Changze Lv et.al. 2509.20284 null
2025-09-24 On Brinkman flows with curvature-induced phase separation in binary mixtures Pierluigi Colli et.al. 2509.20282 null
2025-09-24 Turing instability and 2-D pattern formation in reaction-diffusion systems derived from kinetic theory Stefano Boccelli et.al. 2509.20268 null
2025-09-24 Radial Variations in Residence Time Distribution for Pipe Flows Etienne Boulais et.al. 2509.20256 null
2025-09-24 AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving Jinhao Chai et.al. 2509.20253 null
2025-09-24 4D Driving Scene Generation With Stereo Forcing Hao Lu et.al. 2509.20251 null
2025-09-24 Universal Camouflage Attack on Vision-Language Models for Autonomous Driving Dehong Kong et.al. 2509.20196 null
2025-09-24 KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation Tianle Lyu et.al. 2509.20128 null
2025-09-24 Experiments on geostrophic convection: the role of the Prandtl number Hannah M. Clercx et.al. 2509.20126 null
2025-09-24 Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving Pengxiang Li et.al. 2509.20109 null
2025-09-24 First-Extinction Law for Resampling Processes Matteo Benati et.al. 2509.20101 null
2025-09-24 Incomplete Data, Complete Dynamics: A Diffusion Approach Zihan Zhou et.al. 2509.20098 null
2025-09-24 Constrained Higher-Order Binary Optimization for Wireless Communications Systems Using Ising Machines Gan Zheng et.al. 2509.20092 null
2025-09-24 Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing Zizheng Yang et.al. 2509.20091 null
2025-09-24 Hierarchy of timescales in a disordered spin-$1/2$ XX ladder Kadir Çeven et.al. 2509.20078 null
2025-09-25 From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training Tianqiao Liu et.al. 2509.20072 null
2025-09-24 Resistive switching behaviors in vertically aligned MoS$_2$ films with Cu, Ag, and Au electrodes Shuei-De Huang et.al. 2509.20061 null
2025-09-24 Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens Pin-Jui Ku et.al. 2509.20060 null
2025-09-25 Diffusion-Augmented Contrastive Learning: A Noise-Robust Encoder for Biosignal Representations Rami Zewail et.al. 2509.20048 null
2025-09-24 The role of photospheric magnetic flux diffusion in initiation of solar eruptions Xinkai Bian et.al. 2509.20040 null
2025-09-24 Development of a time calibration system for the KLM upgrade in the Belle II experiment Ziyu Liu et.al. 2509.20029 null
2025-09-24 Generative Adversarial Networks Applied for Privacy Preservation in Biometric-Based Authentication and Identification Lubos Mjachky et.al. 2509.20024 null
2025-09-24 CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion Chenhao Ji et.al. 2509.19979 null
2025-09-24 Learnable Sampler Distillation for Discrete Diffusion Models Feiyang Fu et.al. 2509.19962 null
2025-09-24 GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes Guo Chen et.al. 2509.19937 null
2025-09-25 GUIDE: A Diffusion-Based Autonomous Robot Exploration Framework Using Global Graph Inference Zijun Che et.al. 2509.19916 null
2025-09-24 Dynamically Optimal Unraveling Schemes for Simulating Lindblad Equations Yu Cao et.al. 2509.19887 null
2025-09-24 Adaptive User Interest Modeling via Conditioned Denoising Diffusion For Click-Through Rate Prediction Qihang Zhao et.al. 2509.19876 null
2025-09-24 FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models Xin Wang et.al. 2509.19870 null
2025-09-24 Parameter Estimation for Jump-Diffusion Stochastic Master Equations Weichao Liang et.al. 2509.19862 null
2025-09-24 Gauge invariance and hyperforce correlation theory for equilibrium fluid mixtures Joshua Matthes et.al. 2509.19837 null
2025-09-24 Boundary effect on asymptotic behaviour of solution to the hyperbolic-parabolic chemotaxis system Nangao Zhang et.al. 2509.19828 null
2025-09-24 An Efficient Conditional Score-based Filter for High Dimensional Nonlinear Filtering Problems Zhijun Zeng et.al. 2509.19816 null
2025-09-25 StrCGAN: A Generative Framework for Stellar Image Restoration Shantanusinh Parmar et.al. 2509.19805 null
2025-09-24 Colossal Effect of Nanopore Surface Ionic Charge on the Dynamics of Confined Water Armin Mozhdehei et.al. 2509.19802 null
2025-09-24 On The Cutoff Phenomenon For Dyson-Laguerre Processes Samuel Chan-Ashing et.al. 2509.19798 null
2025-09-24 Beyond Human Demonstrations: Diffusion-Based Reinforcement Learning to Generate Data for VLA Training Rushuai Yang et.al. 2509.19752 null
2025-09-24 Talking Head Generation via AU-Guided Landmark Prediction Shao-Yu Chang et.al. 2509.19749 null
2025-09-24 Controls on the ocean response to idealized Antarctic meltwater input Rory Basinski-Ferris et.al. 2509.19730 null
2025-09-24 PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction Yufei Han et.al. 2509.19726 null
2025-09-24 TopoCut: Learning Multi-Step Cutting with Spectral Rewards and Discrete Diffusion Policies Liquan Wang et.al. 2509.19712 null
2025-09-24 Diffusion and Flow-based Copulas: Forgetting and Remembering Dependencies David Huk et.al. 2509.19707 null
2025-09-24 Diffusion-Based Impedance Learning for Contact-Rich Manipulation Tasks Noah Geiger et.al. 2509.19696 null
2025-09-24 From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition Ling Lo et.al. 2509.19690 null
2025-09-24 Formal Safety Verification and Refinement for Generative Motion Planners via Certified Local Stabilization Devesh Nath et.al. 2509.19688 null
2025-09-24 Selective Classifier-free Guidance for Zero-shot Text-to-speech John Zheng et.al. 2509.19668 null
2025-09-24 Long-Range Dependence in Financial Markets: Empirical Evidence and Generative Modeling Challenges Yifan He et.al. 2509.19663 null
2025-09-24 Statistical Parameter Calibration with the Generalized Fluctuation Dissipation Theorem and Generative Modeling Ludovico T. Giorgini et.al. 2509.19660 null
2025-09-23 TIMED: Adversarial and Autoregressive Refinement of Diffusion-Based Time Series Generation MohammadReza EskandariNasab et.al. 2509.19638 null
2025-09-23 Connecting cosmologically decaying dark matter to neutrino physics Lea Fuß et.al. 2509.19596 null
2025-09-23 Synthesizing Artifact Dataset for Pixel-level Detection Dennis Menn et.al. 2509.19589 null
2025-09-23 DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions Zongyue Li et.al. 2509.19538 null
2025-09-23 Real-Time Reinforcement Learning for Dynamic Tasks with a Parallel Soft Robot James Avtges et.al. 2509.19525 null
2025-09-23 Frame-based Equivariant Diffusion Models for 3D Molecular Generation Mohan Guo et.al. 2509.19506 null
2025-09-23 Hierarchical null controllability of a degenerate parabolic equation with nonlocal coefficient Juan Límaco et.al. 2509.19505 null
2025-09-23 Reaction/Diffusion Competition Drives Anomalous Relaxation of Vitrimers Makayla R. Branham-Ferrari et.al. 2509.19496 null
2025-09-23 ArtiFree: Detecting and Reducing Generative Artifacts in Diffusion-based Speech Enhancement Bhawana Chhaglani et.al. 2509.19495 null
2025-09-23 Anchored Langevin Algorithms Mert Gurbuzbalaban et.al. 2509.19455 null
2025-09-23 ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation Jason Chen et.al. 2509.19454 null
2025-09-23 Two-moment cosmic ray transport in RAMSES Joki Rosdahl et.al. 2509.19447 null
2025-09-23 CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching Chen Chen et.al. 2509.19300 null
2025-09-23 Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Sherwin Bahmani et.al. 2509.19296 null
2025-09-23 OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps Bingnan Li et.al. 2509.19282 null
2025-09-23 A Gradient Flow Approach to Solving Inverse Problems with Latent Diffusion Models Tim Y. J. Wang et.al. 2509.19276 null
2025-09-23 Reconstruction of a potential parameter in time-fractional diffusion problems via a Kohn–Vogelius type functional: Theoretical aspects Hamza Kahlaoui et.al. 2509.19260 null
2025-09-23 Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps Gabriel Maldonado et.al. 2509.19252 null
2025-09-24 Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation Shufan Li et.al. 2509.19244 null
2025-09-23 Stability and Generalization of Adversarial Diffusion Training Hesam Hosseini et.al. 2509.19234 null
2025-09-23 Enabling Plant Phenotyping in Weedy Environments using Multi-Modal Imagery via Synthetic and Generated Training Data Earl Ranario et.al. 2509.19208 null
2025-09-23 Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions Ioanna Ntinou et.al. 2509.19203 null
2025-09-23 Detachment limited interlayer transport processes during SrTiO3 pulsed laser epitaxy Jeffrey G. Ulbrandt et.al. 2509.19181 null
2025-09-23 A noise-robust Monte Carlo method for electric field calculations in EMC3 William De Deyn et.al. 2509.19178 null
2025-09-23 2D implementation of Kinetic-diffusion Monte Carlo in Eiron Oskar Lappi et.al. 2509.19140 null
2025-09-23 FUNCanon: Learning Pose-Aware Action Primitives via Functional Object Canonicalization for Generalizable Robotic Manipulation Hongli Xu et.al. 2509.19102 null
2025-09-23 World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation Zhennan Jiang et.al. 2509.19080 null
2025-09-23 Diffusion Bridge Variational Inference for Deep Gaussian Processes Jian Xu et.al. 2509.19078 null
2025-09-23 WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction Hung Nguyen et.al. 2509.19073 null
2025-09-23 Dwarf Galaxies in the MATLAS Survey: Hubble Space Telescope Observations of Nuclear Star Clusters Mélina Poulain et.al. 2509.19068 null
2025-09-23 ManipForce: Force-Guided Policy Learning with Frequency-Aware Representation for Contact-Rich Manipulation Geonhyup Lee et.al. 2509.19047 null
2025-09-23 Latent Danger Zone: Distilling Unified Attention for Cross-Architecture Black-box Attacks Yang Li et.al. 2509.19044 null
2025-09-24 Improving Credit Card Fraud Detection through Transformer-Enhanced GAN Oversampling Kashaf Ul Emaan et.al. 2509.19032 null
2025-09-23 OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment Teng Xiao et.al. 2509.19018 null
2025-09-23 Pure Vision Language Action (VLA) Models: A Comprehensive Survey Dapeng Zhang et.al. 2509.19012 null
2025-09-23 Generative data augmentation for biliary tract detection on intraoperative images Cristina Iacono et.al. 2509.18958 null
2025-09-23 One-shot Embroidery Customization via Contrastive LoRA Modulation Jun Ma et.al. 2509.18948 null
2025-09-23 Soret and Dufour effects in hot and dense QCD matter Kamaljeet Singh et.al. 2509.18946 null
2025-09-23 1-bit RIS-aided Index Modulation with Quantum Annealing Ioannis Krikidis et.al. 2509.18932 null
2025-09-23 Direct Preference Optimization for Speech Autoregressive Diffusion Models Zhijun Liu et.al. 2509.18928 null
2025-09-23 Diffusive Stochastic Master Equation (SME) with dispersive qubit/cavity coupling Pierre Rouchon et.al. 2509.18925 null
2025-09-23 LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models Amirhesam Aghanouri et.al. 2509.18917 null
2025-09-23 RS3DBench: A Comprehensive Benchmark for 3D Spatial Perception in Remote Sensing Jiayu Wang et.al. 2509.18897 null
2025-09-23 How special are the dynamics of deep eutectic solvents? A Look at the Prototypical Case of Ethaline Mohammad Nadim Kamar et.al. 2509.18896 null
2025-09-23 Quantum-to-classical transition and H-theorem in surface diffusion E. E. Torres-Miyares et.al. 2509.18844 null
2025-09-23 Validation of a Reynolds-averaged numerical simulation environment to simulate high-pressure, auto-igniting hydrogen diffusion flames N. Diepstraten et.al. 2509.18841 null
2025-09-23 Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters Pin-Yen Chiu et.al. 2509.18831 null
2025-09-23 Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation Yanzuo Lu et.al. 2509.18824 null
2025-09-23 Training-Free Data Assimilation with GenCast Thomas Savary et.al. 2509.18811 null
2025-09-23 Nonlocal degenerate parabolic hyperbolic equations on bounded domains. Part II: Existence Jørgen Endal et.al. 2509.18797 null
2025-09-23 Towards Application Aligned Synthetic Surgical Image Synthesis Danush Kumar Venkatesh et.al. 2509.18796 null
2025-09-23 FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation Zhaorui Wang et.al. 2509.18759 null
2025-09-23 Complexity of Activity Patterns in a Bio-Inspired Hopfield-Type Network in Different Topologies Marco Cafiso et.al. 2509.18758 null
2025-09-23 RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images Ke Li et.al. 2509.18711 null
2025-09-23 AGSwap: Overcoming Category Boundaries in Object Fusion via Adaptive Group Swapping Zedong Zhang et.al. 2509.18699 null
2025-09-23 FlowCrypt: Flow-Based Lightweight Encryption with Near-Lossless Recovery for Cloud Photo Privacy Xiaohui Yang et.al. 2509.18696 null
2025-09-23 Advances in Large Language Models for Medicine Zhiyu Kan et.al. 2509.18690 null
2025-09-23 Query-Centric Diffusion Policy for Generalizable Robotic Assembly Ziyi Xu et.al. 2509.18686 null
2025-09-23 3D Flow Diffusion Policy: Visuomotor Policy Learning via Generating Flow in 3D Space Sangjun Noh et.al. 2509.18676 null
2025-09-23 Global Existence of Solutions for A Class of Nonlocal Reaction-Diffusion Systems and Their Diffusive Limit Md Shah Alam et.al. 2509.18645 null
2025-09-23 Well-posedness of the Electron MHD with random diffusion Ruimeng Hu et.al. 2509.18640 null
2025-09-23 Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation Yuanhuiyi Lyu et.al. 2509.18639 null
2025-09-23 Prompt-Guided Dual Latent Steering for Inversion Problems Yichen Wu et.al. 2509.18619 null
2025-09-23 Flow marching for a generative PDE foundation model Zituo Chen et.al. 2509.18611 null
2025-09-23 SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering Jiarui Hai et.al. 2509.18603 null
2025-09-23 Training-Free Multi-Style Fusion Through Reference-Based Adaptive Modulation Xu Liu et.al. 2509.18602 null
2025-09-23 SSCM: A Spatial-Semantic Consistent Model for Multi-Contrast MRI Super-Resolution Xiaoman Wu et.al. 2509.18593 null
2025-09-23 Kernel Variational Inference Flow for Nonlinear Filtering Problem Weiye Gan et.al. 2509.18589 null
2025-09-23 DS-Diffusion: Data Style-Guided Diffusion Model for Time-Series Generation Mingchun Sun et.al. 2509.18584 null
2025-09-23 Active Ornstein-Uhlenbeck particle under stochastic resetting Uma Shankari et.al. 2509.18515 null
2025-09-23 Source-Free Domain Adaptive Semantic Segmentation of Remote Sensing Images with Diffusion-Guided Label Enrichment Wenjie Liu et.al. 2509.18502 null
2025-09-23 Differentiable Light Transport with Gaussian Surfels via Adapted Radiosity for Efficient Relighting and Geometry Reconstruction Kaiwen Jiang et.al. 2509.18497 null
2025-09-23 An Advection-Diffusion Model Incorporating Investor Inertia for the Dynamics of Financial Asset Prices Diego et.al. 2509.18488 null
2025-09-22 Discrete-time diffusion-like models for speech synthesis Xiaozhou Tan et.al. 2509.18470 null
2025-09-22 Zero-Shot Visual Deepfake Detection: Can AI Predict and Prevent Fake Content Before It’s Created? Ayan Sar et.al. 2509.18461 null
2025-09-22 Learning Geometry-Aware Nonprehensile Pushing and Pulling with Dexterous Hands Yunshuang Li et.al. 2509.18455 null
2025-09-22 Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors Chang Liu et.al. 2509.18433 null
2025-09-22 Measurement Score-Based MRI Reconstruction with Automatic Coil Sensitivity Estimation Tingjun Liu et.al. 2509.18402 null
2025-09-22 Efficient Particle Acceleration in 2.5-Dimensional, Hybrid-Kinetic Simulations of Decaying, Supersonic, Plasma Turbulence Keyan Gootkin et.al. 2509.18374 null
2025-09-22 Galactic Center Gamma-Ray Emission in MHD Galaxy Formation Simulations with Full Cosmic Ray Spectra Isabel S. Sands et.al. 2509.18351 null
2025-09-22 Bootstrapping transport in the Drude-Kadanoff-Martin model Subham Dutta Chowdhury et.al. 2509.18255 null
2025-09-22 Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers Chaehyun Kim et.al. 2509.18096 null
2025-09-22 ComposeMe: Attribute-Specific Image Prompts for Controllable Human Image Generation Guocheng Gordon Qian et.al. 2509.18092 null
2025-09-22 RnGCam: High-speed video from rolling & global shutter measurements Kevin Tandi et.al. 2509.18087 null
2025-09-22 Spiffy: Multiplying Diffusion LLM Acceleration via Lossless Speculative Decoding Sudhanshu Agrawal et.al. 2509.18085 null
2025-09-22 RadarSFD: Single-Frame Diffusion with Pretrained Priors for Radar Point Clouds Bin Zhao et.al. 2509.18068 null
2025-09-22 Introduction to the relative Langlands program Raphaël Beuzart-Plessis et.al. 2509.18062 null
2025-09-22 Density convergence on Markov diffusion chaos via Stein’s method Thanh Dang et.al. 2509.18045 null
2025-09-22 Prepare Before You Act: Learning From Humans to Rearrange Initial States Yinlong Dai et.al. 2509.18043 null
2025-09-22 Microsecond-Pulsed Nanocalorimetry: A Scalable Approach for Ultrasensitive Heat Capacity Measurements Hugo Gómez-Torres et.al. 2509.18019 null
2025-09-23 StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models Haoxin Yang et.al. 2509.17993 null
2025-09-22 VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models Geonung Kim et.al. 2509.17985 null
2025-09-22 Cosmic inventory of the background fields of relativistic particles in the Universe Jonathan Biteau et.al. 2509.17954 null
2025-09-22 ComposableNav: Instruction-Following Navigation in Dynamic Environments via Composable Diffusion Zichao Hu et.al. 2509.17941 null
2025-09-22 MEF: A Systematic Evaluation Framework for Text-to-Image Models Xiaojing Dong et.al. 2509.17907 null
2025-09-23 Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark Siu Hang Ho et.al. 2509.17894 null
2025-09-22 Invariance of finite-dimensional realisations of Heath-Jarrow-Morton models under diffusion estimation Andreas Celary et.al. 2509.17875 null
2025-09-22 SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model Xiao Zhou et.al. 2509.17850 null
2025-09-22 Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology Saghir Alfasly et.al. 2509.17847 null
2025-09-22 The origin of the intra-cluster light in The Three Hundred simulations A. Contreras-Santos et.al. 2509.17831 null
2025-09-22 Folding-unfolding transition of active polymer on the reconfiguration of bidirectional tangential active force Arindam Panda et.al. 2509.17824 null
2025-09-22 ContextFlow: Training-Free Video Object Editing via Adaptive Context Enrichment Yiyang Chen et.al. 2509.17818 null
2025-09-22 Solving time-fractional diffusion equations with Robin boundary conditions via fractional Hamiltonian boundary value methods Qian Luo et.al. 2509.17793 null
2025-09-22 Elucidating the Design Space of FP4 training Robert Hu et.al. 2509.17791 null
2025-09-22 Conditional Diffusion Models for CT Image Synthesis from CBCT: A Systematic Review Alzahra Altalib et.al. 2509.17790 null
2025-09-22 I2VWM: Robust Watermarking for Image to Video Generation Guanjie Wang et.al. 2509.17773 null
2025-09-22 Qwen3-Omni Technical Report Jin Xu et.al. 2509.17765 null
2025-09-22 Multi-Agent Amodal Completion: Direct Synthesis with Fine-Grained Semantic Guidance Hongxing Fan et.al. 2509.17757 null
2025-09-22 GAN-Based Multi-Microphone Spatial Target Speaker Extraction Shrishti Saha Shetu et.al. 2509.17741 null
2025-09-22 Non-equilibrium state during proton-deuteron exchange at a liquid-liquid interface Tillmann Buttersack et.al. 2509.17724 null
2025-09-22 DINOv3-Diffusion Policy: Self-Supervised Large Visual Model for Visuomotor Diffusion Policy Learning ThankGod Egbe et.al. 2509.17684 null
2025-09-23 Clothing agnostic Pre-inpainting Virtual Try-ON Sehyun Kim et.al. 2509.17654 null
2025-09-22 SISMA: Semantic Face Image Synthesis with Mamba Filippo Botti et.al. 2509.17651 null
2025-09-22 VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video Yu Liu et.al. 2509.17647 null
2025-09-22 OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models Jinshu Chen et.al. 2509.17627 null
2025-09-22 Audio Super-Resolution with Latent Bridge Models Chang Li et.al. 2509.17609 null
2025-09-22 Measurements and scaling of X-ray total scattering from single crystals S. Gorfman et.al. 2509.17605 null
2025-09-22 Conditioning in Generative Quantum Denoising Diffusion Models Daniel Quinn et.al. 2509.17569 null
2025-09-22 Robust spectral preconditioning for high-Péclet number convection-diffusion Lukas Holbach et.al. 2509.17531 null
2025-09-22 Stable Video-Driven Portraits Mallikarjun B. R. et.al. 2509.17476 null
2025-09-22 CARINOX: Inference-time Scaling with Category-Aware Reward-based Initial Noise Optimization and Exploration Seyed Amir Kasaei et.al. 2509.17458 null
2025-09-22 Learning Dexterous Manipulation with Quantized Hand State Ying Feng et.al. 2509.17450 null
2025-09-22 Exploring Machine Learning Models for Physical Dose Calculation in Carbon Ion Therapy Using Heterogeneous Imaging Data - A Proof of Concept Study Miriam Schwarze et.al. 2509.17433 null
2025-09-22 Single-Image Depth from Defocus with Coded Aperture and Diffusion Posterior Sampling Hodaka Kawachi et.al. 2509.17427 null
2025-09-22 Diff-GNSS: Diffusion-based Pseudorange Error Estimation Jiaqi Zhu et.al. 2509.17397 null
2025-09-22 The Asymptotic Analysis of Some PDE and Steklov Eigenvalue Problems with Partially Reactive Patches in 3-D Denis S. Grebenkov et.al. 2509.17394 null
2025-09-22 Magnetically Enhanced Thermoelectric Effect Driven by Martensitic Transformation in the Weak Itinerant Ferromagnet Co$_2$NbSn Takumi Kihara et.al. 2509.17378 null
2025-09-22 Volume Density Mapper: 3D Density Reconstruction Algorithm for Molecular Clouds Guang-Xing Li et.al. 2509.17369 null
2025-09-22 SeqUDA-Rec: Sequential User Behavior Enhanced Recommendation via Global Unsupervised Data Augmentation for Personalized Content Marketing Ruihan Luo et.al. 2509.17361 null
2025-09-22 DiffQ: Unified Parameter Initialization for Variational Quantum Algorithms via Diffusion Models Chi Zhang et.al. 2509.17324 null
2025-09-22 GraphWeave: Interpretable and Robust Graph Generation via Random Walk Trajectories Rahul Nandakumar et.al. 2509.17291 null
2025-09-21 Graph Signal Generative Diffusion Models Yigit Berkay Uslu et.al. 2509.17250 null
2025-09-21 Scalable Multi Agent Diffusion Policies for Coverage Control Frederic Vatnsdal et.al. 2509.17244 null
2025-09-21 DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction Bo Liu et.al. 2509.17232 null
2025-09-21 Virtual Consistency for Audio Editing Matthieu Cervera et.al. 2509.17219 null
2025-09-21 Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation Gunner Stone et.al. 2509.17206 null
2025-09-21 Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization Wook Lee et.al. 2509.17205 null
2025-09-21 Echo-Path: Pathology-Conditioned Echo Video Generation Kabir Hamzah Muhammad et.al. 2509.17190 null
2025-09-21 Towards a unified turbulence model through multi-objective learning Zhuo-Ran Liu et.al. 2509.17189 null
2025-09-21 Ambiguous Medical Image Segmentation Using Diffusion Schrödinger Bridge Lalith Bharadwaj Baru et.al. 2509.17187 null
2025-09-21 SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction Djamel Eddine Boukhari et.al. 2509.17172 null
2025-09-21 Criticality of a stochastic modern Hopfield network model with exponential interaction function Marco Cafiso et.al. 2509.17152 null
2025-09-21 Stencil: Subject-Driven Generation with Context Guidance Gordon Chen et.al. 2509.17120 null
2025-09-21 ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting Yifei Wu et.al. 2509.17119 null
2025-09-21 $\texttt{DiffSyn}$: A Generative Diffusion Approach to Materials Synthesis Planning Elton Pan et.al. 2509.17094 null
2025-09-21 AlignedGen: Aligning Style Across Generated Images Jiexuan Zhang et.al. 2509.17088 null
2025-09-21 CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving Ruiguo Zhong et.al. 2509.17080 null
2025-09-21 Global classical solutions to a two-dimensional chemotaxis-fluid system involving signal-dependent degenerate diffusion Yansheng Ma et.al. 2509.17073 null
2025-09-21 Intention-aware Hierarchical Diffusion Model for Long-term Trajectory Anomaly Detection Chen Wang et.al. 2509.17068 null
2025-09-21 Geodesic Prototype Matching via Diffusion Maps for Interpretable Fine-Grained Recognition Junhao Jia et.al. 2509.17050 null
2025-09-21 Boundary Feller-Dynkin processes associated with Laguerre processes and Pickrell diffusions Alexander I. Bufetov et.al. 2509.17045 null
2025-09-21 When Color-Space Decoupling Meets Diffusion for Adverse-Weather Image Restoration Wenxuan Fang et.al. 2509.17024 null
2025-09-21 Multiscale solution decomposition of nonlocal-in-time problems with application in numerical computation Mengmeng Liu et.al. 2509.17020 null
2025-09-21 DocIQ: A Benchmark Dataset and Feature Fusion Network for Document Image Quality Assessment Zhichao Ma et.al. 2509.17012 null
2025-09-21 Generalized Momenta-Based Koopman Formalism for Robust Control of Euler-Lagrangian Systems Rajpal Singh et.al. 2509.17010 null
2025-09-21 Radiation Mediated Shock and Planar Shock Breakout in the Presence of Atomic Transition Lines Jonathan Morag et.al. 2509.16996 null
2025-09-21 VCE: Safe Autoregressive Image Generation via Visual Contrast Exploitation Feng Han et.al. 2509.16986 null
2025-09-21 Ledrappier-Young entropy formula for $C^1$ diffeomorphisms with dominated splitting Part 1: Unstable entropy formula and invariance principle Shaobo Gan et.al. 2509.16981 null
2025-09-21 Penalizing Boundary Activation for Object Completeness in Diffusion Models Haoyang Xu et.al. 2509.16968 null
2025-09-21 SemanticGarment: Semantic-Controlled Generation and Editing of 3D Gaussian Garments Ruiyan Wang et.al. 2509.16960 null
2025-09-21 VidCLearn: A Continual Learning Approach for Text-to-Video Generation Luca Zanchetta et.al. 2509.16956 null
2025-09-21 Machine learning meets Singular Optics II: Single-pixel Detection of Structured Light Purnesh Singh Badavath et.al. 2509.16946 null
2025-09-21 Discrete Heat Kernels on Simplicial Complexes and Its Application to Functional Brain Networks Sixtus Dakurah et.al. 2509.16908 null
2025-09-21 PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion Xuewan He et.al. 2509.16897 null
2025-09-21 A Multi-conditional Diffusion Transformer for Versatile Seismic Wave Generation Longfei Duan et.al. 2509.16874 null
2025-09-21 $\mathtt{M^3VIR}$: A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation Yuanzhi Li et.al. 2509.16873 null
2025-09-21 HOGraspFlow: Exploring Vision-based Generative Grasp Synthesis with Hand-Object Priors and Taxonomy Awareness Yitian Shi et.al. 2509.16871 null
2025-09-21 PhysHDR: When Lighting Meets Materials and Scene Geometry in HDR Reconstruction Hrishav Bakul Barua et.al. 2509.16869 null
2025-09-20 DoubleGen: Debiased Generative Modeling of Counterfactuals Alex Luedtke et.al. 2509.16842 null
2025-09-20 Factorizing Diffusion Policies for Observation Modality Prioritization Omkar Patil et.al. 2509.16830 null
2025-09-20 DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images Ozgur Kara et.al. 2509.16767 null
2025-09-20 Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees Yuchen Liang et.al. 2509.16756 null
2025-09-20 HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis Heyuan Li et.al. 2509.16748 null
2025-09-20 Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment Xin Lei Lin et.al. 2509.16727 null
2025-09-20 Animalbooth: multimodal feature enhancement for animal subject personalization Chen Liu et.al. 2509.16702 null
2025-09-20 InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention Qiang Xiang et.al. 2509.16691 null
2025-09-20 Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation Yue Ma et.al. 2509.16630 null
2025-09-20 Investigation of the Axe-shaped Radio Galaxy J1051+5523 with uGMRT Sudheesh T. P. et.al. 2509.16624 null
2025-09-20 Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing Mengqi Wang et.al. 2509.16622 null
2025-09-20 An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation Maurício do V. M. da Costa et.al. 2509.16603 null
2025-09-20 FakeChain: Exposing Shallow Cues in Multi-Step Deepfake Detection Minji Heo et.al. 2509.16602 null
2025-09-19 MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer Yanghao Li et.al. 2509.16197 null
2025-09-19 AcT2I: Evaluating and Improving Action Depiction in Text-to-Image Models Vatsal Malaviya et.al. 2509.16141 null
2025-09-19 Dynamic Classifier-Free Diffusion Guidance via Online Feedback Pinelopi Papalampidi et.al. 2509.16131 null
2025-09-19 DiffusionNFT: Online Diffusion Reinforcement with Forward Process Kaiwen Zheng et.al. 2509.16117 null
2025-09-19 KRED: Korea Research Economic Database for Macroeconomic Research Changryong Baek et.al. 2509.16115 null
2025-09-19 PRISM: Probabilistic and Robust Inverse Solver with Measurement-Conditioned Diffusion Prior for Blind Inverse Problems Yuanyun Hu et.al. 2509.16106 null
2025-09-19 Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising Shen Cheng et.al. 2509.16091 null
2025-09-19 Generating Detailed Character Motion from Blocking Poses Purvi Goel et.al. 2509.16064 null
2025-09-19 Latent Conditioned Loco-Manipulation Using Motion Priors Maciej Stępień et.al. 2509.16061 null
2025-09-19 Compose by Focus: Scene Graph-based Atomic Skills Han Qi et.al. 2509.16053 null
2025-09-19 A Note on the formulation of the Neumann boundary condition for a nonlocal problem Antonio Luiz Pereira et.al. 2509.16041 null
2025-09-19 SLaM-DiMM: Shared Latent Modeling for Diffusion Based Missing Modality Synthesis in MRI Bhavesh Sandbhor et.al. 2509.16019 null
2025-09-19 DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching Meng Yang et.al. 2509.16017 null
2025-09-19 Going with the Flow: Solving for Symmetry-Driven PDE dynamics with Physics-informed Neural Networks Michail Kavousanakis et.al. 2509.15963 null
2025-09-19 Structured Information for Improving Spatial Relationships in Text-to-Image Generation Sander Schildermans et.al. 2509.15962 null
2025-09-19 Optimal Experimental Design of a Moving Sensor for Linear Bayesian Inverse Problems Nicole Aretz et.al. 2509.15961 null
2025-09-19 Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement Gang Yang et.al. 2509.15952 null
2025-09-19 UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation Mingdong Wu et.al. 2509.15934 null
2025-09-19 Bayesian Physics Informed Neural Networks for Reliable Transformer Prognostics Ibai Ramirez et.al. 2509.15933 null
2025-09-19 Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search Zhiyu Mou et.al. 2509.15927 null
2025-09-19 An optimal-control framework for reaction diffusion systems with application to synthetic developmental biology Mohamed Amine Ouchdiri et.al. 2509.15889 null
2025-09-19 A Multidimensional Self-Adaptive Numerical Simulation Framework for Semiconductor Boltzmann Transport Equation Zeyu Zhang et.al. 2509.15879 null
2025-09-19 SAGE: Semantic-Aware Shared Sampling for Efficient Diffusion Haoran Zhao et.al. 2509.15865 null
2025-09-19 Observation of the Galactic Center in the Sub-MeV Gamma-Ray Band with an Electron-Tracking Compton Camera Tomonori Ikeda et.al. 2509.15851 null
2025-09-19 Turing Patterns in a Morphogenetic Model with Single Regulatory Function Mohamed Amine Ouchdiri et.al. 2509.15829 null
2025-09-19 QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising Qijun Yang et.al. 2509.15814 null
2025-09-19 Polynomial approximation from diffused data: unisolvence and stability Ludovico Bruni Bruno et.al. 2509.15813 null
2025-09-19 CIDER: A Causal Cure for Brand-Obsessed Text-to-Image Models Fangjian Shen et.al. 2509.15803 null
2025-09-19 Monte Carlo Tree Diffusion with Multiple Experts for Protein Design Xuefeng Liu et.al. 2509.15796 null
2025-09-19 Absence of Radio Emission Reveals an Exceptionally Weak Explosion of the Putative Historical Supernova Pa 30 Yi-xuan Shao et.al. 2509.15792 null
2025-09-19 Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation Weimin Bai et.al. 2509.15772 null
2025-09-19 Learning to Optimize Capacity Planning in Semiconductor Manufacturing Philipp Andelfinger et.al. 2509.15767 null
2025-09-19 Utility-based Privacy Preserving Data Mining Qingfeng Zhou et.al. 2509.15755 null
2025-09-19 Discovering Top-k Periodic and High-Utility Patterns Qingfeng Zhou et.al. 2509.15732 null
2025-09-19 Search for cosmic-ray induced gamma-ray emission from local galaxy clusters using Fermi-LAT data Judit Pérez-Romero et.al. 2509.15720 null
2025-09-19 Imagination at Inference: Synthesizing In-Hand Views for Robust Visuomotor Policy Inference Haoran Ding et.al. 2509.15717 null
2025-09-19 Weak Error Estimates of Ergodic Approximations for Monotone Jump-diffusion SODEs Zhihui Liu et.al. 2509.15698 null
2025-09-19 Bose’s Probabilistic Interactions, Einstein’s Objections, and Their Legacy in Quantum Optics and Stochastic Mechanics Partha Ghose et.al. 2509.15686 null
2025-09-19 Spontaneous stochasticity in the Armstrong-Vicol passive scalar Wandrille Ruffenach et.al. 2509.15683 null
2025-09-19 Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model Sidra Hanif et.al. 2509.15678 null
2025-09-19 Diffusion of gravitactic chiral active Brownian particles in an asymmetric channel Narender Khatri et.al. 2509.15630 null
2025-09-19 MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection Jun-Wei Yeow et.al. 2509.15599 null
2025-09-19 Global Existence of Solutions of Nonlocal Gierer-Meinhardt Model and Effect of Nonlocal Operator in Pattern Formation Md Shah Alam et.al. 2509.15598 null
2025-09-19 Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification Zinan Lin et.al. 2509.15591 null
2025-09-19 Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification Tian Lan et.al. 2509.15553 null
2025-09-19 PolyJuice Makes It Real: Black-Box, Universal Red Teaming for Synthetic Image Detectors Sepehr Dehdashtian et.al. 2509.15551 null
2025-09-19 Global Existence and Boundedness of Gray-Scott Model with Local and Nonlocal Diffusion Md Shah Alam et.al. 2509.15535 null
2025-09-19 Lynx: Towards High-Fidelity Personalized Video Generation Shen Sang et.al. 2509.15496 null
2025-09-18 Full Quantum Stack: Ket Platform Evandro Rosa et.al. 2509.15484 null
2025-09-18 OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data Björn Möller et.al. 2509.15479 null
2025-09-18 Efficient Multimodal Dataset Distillation via Generative Models Zhenghao Zhao et.al. 2509.15472 null
2025-09-18 $ν$SpaceSim: A Comprehensive Simulation Package for Modeling the Measurement of Cosmic Neutrinos using the Earth as the Neutrino Target and Space-based Detectors Mary Hall Reno et.al. 2509.15469 null
2025-09-18 SERVAL: Surprisingly Effective Zero-Shot Visual Document Retrieval Powered by Large Vision and Language Models Thong Nguyen et.al. 2509.15432 null
2025-09-18 Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data Victor Chardès et.al. 2509.15429 null
2025-09-18 Thin-film boundary-layer diffusion of non-equilibrium flow to kinetically limited reactive surfaces via Damköhler thermochemistry tables Jeffrey D. Engerer et.al. 2509.15427 null
2025-09-18 Spectral Characterization of Wave Scattering at a Granular-Elastic Solid Interface: From Hyperbolic Wave Propagation to Near-Parabolic Diffusion Joshua R. Tempelman et.al. 2509.15415 null
2025-09-18 Causal Fingerprints of AI Generative Models Hui Xu et.al. 2509.15406 null
2025-09-18 Caught in the Cosmic Web: Evidence for Ram-Pressure Stripping of a Low-Mass Galaxy by the Cosmic Web Nicholas Luber et.al. 2509.15405 null
2025-09-18 RaceGAN: A Framework for Preserving Individuality while Converting Racial Information for Image-to-Image Translation Mst Tasnim Pervin et.al. 2509.15391 null
2025-09-18 MaskAttn-SDXL: Controllable Region-Level Text-To-Image Generation Yu Chang et.al. 2509.15357 null
2025-09-18 LowDiff: Efficient Diffusion Sampling with Low-Resolution Condition Jiuyi Xu et.al. 2509.15342 null
2025-09-18 WALLABY Pilot Survey: A gas-rich diffuse dwarf on the baryonic Tully Fisher relation Rebecca Dudley et.al. 2509.15340 null
2025-09-18 Kuramoto Orientation Diffusion Models Yue Song et.al. 2509.15328 null
2025-09-18 Anisotropic Cosmic Ray Transport resulting from Magnetic Mirroring and Resonant Curvature Scattering Jeremiah Lübke et.al. 2509.15320 null
2025-09-18 PRISM: Phase-enhanced Radial-based Image Signature Mapping framework for fingerprinting AI-generated images Emanuele Ricco et.al. 2509.15270 null
2025-09-18 Autoguided Online Data Curation for Diffusion Model Training Valeria Pais et.al. 2509.15267 null
2025-09-18 Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model Fangjinhua Wang et.al. 2509.15220 null
2025-09-18 RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation Yuming Jiang et.al. 2509.15212 null
2025-09-18 Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning Yeongbin Seo et.al. 2509.15188 null
2025-09-18 Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation Xiaoyu Yue et.al. 2509.15185 null
2025-09-18 Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models Muhammad Ahmed Mohsin et.al. 2509.15182 null
2025-09-18 A Race Bias Free Face Aging Model for Reliable Kinship Verification Ali Nazari et.al. 2509.15177 null
2025-09-18 Unveiling TeV halos among unidentified extended TeV sources Michela Rigoselli et.al. 2509.15168 null
2025-09-18 AnoF-Diff: One-Step Diffusion-Based Anomaly Detection for Forceful Tool Use Yating Lin et.al. 2509.15153 null
2025-09-18 WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance Chenxi Song et.al. 2509.15130 null
2025-09-18 Learning Mechanistic Subtypes of Neurodegeneration with a Physics-Informed Variational Autoencoder Mixture Model Sanduni Pinnawala et.al. 2509.15124 null
2025-09-18 LOFAR 58 MHz Legacy Survey of the 3CRR Catalog J. M. Boxelaar et.al. 2509.15115 null
2025-09-18 Real-Time Streaming Mel Vocoding with Generative Flow Matching Simon Welker et.al. 2509.15085 null
2025-09-18 Forecasting and Visualizing Air Quality from Sky Images with Vision-Language Models Mohammad Saleh Vahdatpour et.al. 2509.15076 null
2025-09-19 Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue Xingyao Lin et.al. 2509.15061 null
2025-09-18 How long does it take an Elephant Random Walk to forget its training Zheng Fang et.al. 2509.15049 null
2025-09-18 AutoEdit: Automatic Hyperparameter Tuning for Image Editing Chau Pham et.al. 2509.15031 null
2025-09-19 Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation Vasiliki Ismiroglou et.al. 2509.15011 null
2025-09-19 SPATIALGEN: Layout-guided 3D Indoor Scene Generation Chuan Fang et.al. 2509.14981 null
2025-09-18 M4Diffuser: Multi-View Diffusion Policy with Manipulability-Aware Control for Robust Mobile Manipulation Ju Dong et.al. 2509.14980 null
2025-09-19 Stochastic Hamiltonian Type Jump Diffusion Systems with Countable Regimes: Strong Feller Property and Exponential Ergodicity Fubao Xi et.al. 2509.14951 null
2025-09-18 A Novel Task-Driven Diffusion-Based Policy with Affordance Learning for Generalizable Manipulation of Articulated Objects Hao Zhang et.al. 2509.14939 null
2025-09-18 Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance Francisco Messina et.al. 2509.14934 null
2025-09-18 Back to Ear: Perceptually Driven High Fidelity Music Reconstruction Kangdi Wang et.al. 2509.14912 null
2025-09-18 Finite Volumes for a dissipative free boundary problem Clément Cancès et.al. 2509.14908 null
2025-09-18 Constraining gamma-ray burst parameters with the first ultra-high energy neutrino event KM3-230213A KM3NeT Collaboration et.al. 2509.14895 null
2025-09-18 NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation Antoine Legrand et.al. 2509.14890 null
2025-09-18 CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human Nan Sun et.al. 2509.14889 null
2025-09-18 Controllable Localized Face Anonymization Via Diffusion Inpainting Ali Salar et.al. 2509.14866 null
2025-09-19 MeanFlowSE: one-step generative speech enhancement via conditional mean flow Duojia Li et.al. 2509.14858 null
2025-09-18 A class of flexible and efficient partitioned Runge-Kutta-Chebyshev methods for some time-dependent partial differential equations Xiao Tang et.al. 2509.14847 null
2025-09-18 [Re] Improving Interpretation Faithfulness for Vision Transformers Izabela Kurek et.al. 2509.14846 null
2025-09-18 Diffusion-Based Scenario Tree Generation for Multivariate Time Series Prediction and Multistage Stochastic Optimization Stelios Zarifis et.al. 2509.14832 null
2025-09-18 Spectral survey of the diffuse gas toward BL Lac in the Q band Maryvonne Gerin et.al. 2509.14822 null
2025-09-18 Acoustic Simulation Framework for Multi-channel Replay Speech Detection Michael Neri et.al. 2509.14789 null
2025-09-18 MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis Keyu An et.al. 2509.14784 null
2025-09-18 Radiology Report Conditional 3D CT Generation with Multi Encoder Latent diffusion Model Sina Amirrajab et.al. 2509.14780 null
2025-09-18 Dataset Distillation for Super-Resolution without Class Labels and Pre-trained Models Sunwoo Cho et.al. 2509.14777 null
2025-09-18 Diffuse emission from stochastic sources Anton Stall et.al. 2509.14776 null
2025-09-18 UMind: A Unified Multitask Network for Zero-Shot M/EEG Visual Decoding Chengjian Xu et.al. 2509.14772 null
2025-09-18 Hydrodynamic Attraction and Hindered Diffusion Govern First-passage Times of Swimming Microorganisms Yanis Baouche et.al. 2509.14765 null
2025-09-18 Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks Ahmed Sheta et.al. 2509.14755 null
2025-09-18 Chain-of-Thought Re-ranking for Image Retrieval Tasks Shangrong Wu et.al. 2509.14746 null
2025-09-18 UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets Pengyu Wang et.al. 2509.14738 null
2025-09-18 Towards Pre-trained Graph Condensation via Optimal Transport Yeyu Yan et.al. 2509.14722 null
2025-09-18 DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images Kazuma Nagata et.al. 2509.14685 null
2025-09-18 Enhancing Situational Awareness in Wearable Audio Devices Using a Lightweight Sound Event Localization and Detection System Jun-Wei Yeow et.al. 2509.14650 null
2025-09-18 On the algebraic stretching dynamics of variable-density mixing in shock-bubble interaction Xu Han et.al. 2509.14607 null
2025-09-18 DICE: Diffusion Consensus Equilibrium for Sparse-view CT Reconstruction Leon Suarez-Rodriguez et.al. 2509.14566 null
2025-09-18 DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising Li Gao et.al. 2509.14565 null
2025-09-18 Adaptive and Iterative Point Cloud Denoising with Score-Based Diffusion Model Zhaonan Wang et.al. 2509.14560 null
2025-09-18 Radiolunadiff: Estimation of wireless network signal strength in lunar terrain Paolo Torrado et.al. 2509.14559 null
2025-09-18 Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods Adam D. Hines et.al. 2509.14516 null
2025-09-18 A Time-Inconsistent Stochastic Optimal Control Problem in an Infinite Time Horizon Qingmeng Wei et.al. 2509.14495 null
2025-09-17 Error analysis of a fully discrete structure-preserving finite element scheme for a diffuse-interface model of tumour growth Agus L. Soenjaya et.al. 2509.14486 null
2025-09-17 AToken: A Unified Tokenizer for Vision Jiasen Lu et.al. 2509.14476 null
2025-09-17 Keywords are not always the key: A metadata field analysis for natural language search on open data portals Lisa-Yao Gan et.al. 2509.14457 null
2025-09-17 On the equivalence and optimality of transformations of diffusive systems Davide Gabrielli et.al. 2509.14450 null
2025-09-17 Diffusion-Based Unsupervised Audio-Visual Speech Separation in Noisy Environments with Noise Prior Yochai Yemini et.al. 2509.14379 null
2025-09-17 Electricity in international comparison – Future technologies in power generation Axel Kleidon et.al. 2509.14365 null
2025-09-17 DreamControl: Human-Inspired Whole-Body Humanoid Control for Scene Interaction via Guided Diffusion Dvij Kalaria et.al. 2509.14353 null
2025-09-17 Enhanced Radio Emission Between a Galaxy Cluster Pair Andrea Botteon et.al. 2509.14348 null
2025-09-17 Dichotomy in Long-Lived Radio Emission from Tidal Disruption Events AT 2020zso and AT 2021sdu: Multi-Component Outflows vs. Host Contamination Collin T. Christy et.al. 2509.14317 null
2025-09-17 FlowDrive: Energy Flow Field for End-to-End Autonomous Driving Hao Jiang et.al. 2509.14303 null
2025-09-17 D4PM: A Dual-branch Driven Denoising Diffusion Probabilistic Model with Joint Posterior Diffusion Sampling for EEG Artifacts Removal Feixue Shao et.al. 2509.14302 null
2025-09-17 SpeechOp: Inference-Time Task Composition for Generative Speech Processing Justin Lovelace et.al. 2509.14298 null
2025-09-17 GenExam: A Multidisciplinary Text-to-Image Exam Zhaokai Wang et.al. 2509.14232 null
2025-09-17 Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamics Benjamin Sterling et.al. 2509.14225 null
2025-09-17 Looking into the faintEst WIth MUSE (LEWIS): Exploring the nature of ultra-diffuse galaxies in the Hydra-I cluster IV. A study of the Globular Cluster population in four UDGs Marco Mirabile et.al. 2509.14206 null
2025-09-17 Mass Transport, Turbulent Mixing, and Inflow in Black Hole Accretion George N. Wong et.al. 2509.14202 null
2025-09-16 Gen2Real: Towards Demo-Free Dexterous Manipulation by Harnessing Generated Video Kai Ye et.al. 2509.14178 null
2025-09-17 Reaction-diffusion models of invasive tree pest spread: quantifying the spread of oak processionary moth in the UK Jamie P. McKeown et.al. 2509.14166 null
2025-09-17 Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures Chi-Sheng Chen et.al. 2509.14163 null
2025-09-17 MIMIC-D: Multi-modal Imitation for MultI-agent Coordination with Decentralized Diffusion Policies Dayi Dong et.al. 2509.14159 null
2025-09-17 An Exploratory Study on Abstract Images and Visual Representations Learned from Them Haotian Li et.al. 2509.14149 null
2025-09-17 FlightDiffusion: Revolutionising Autonomous Drone Training with Diffusion Models Generating FPV Video Valerii Serpiva et.al. 2509.14082 null
2025-09-17 Dissipativity-Based Data-Driven Decentralized Control of Interconnected Systems Taiki Nakano et.al. 2509.14047 null
2025-09-17 Cross-diffusion limits in multispecies kinetic models Ansgar Jüngel et.al. 2509.14046 null
2025-09-17 A Pearl in the Shell: an ultra-compact dwarf within the tidal debris surrounding spiral galaxy NGC 7531 David Martínez-Delgado et.al. 2509.14038 null
2025-09-17 Improving cosmological reach of a gravitational wave observatory using Deep Loop Shaping Jonas Buchli et.al. 2509.14016 null
2025-09-17 RFM-Editing: Rectified Flow Matching for Text-guided Audio Editing Liting Gao et.al. 2509.14003 null
2025-09-17 Reconstruction of strong degeneracy region for parabolic equations and systems Piermarco Cannarsa et.al. 2509.13962 null
2025-09-17 Noise-Level Diffusion Guidance: Well Begun is Half Done Harvey Mannering et.al. 2509.13936 null
2025-09-17 Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification Wenkui Yang et.al. 2509.13922 null
2025-09-17 Recovering the Coupled Treatment of Redshift-Space Distortions and the Lightcone Effect after Diffuse Foreground Removal Jennifer Feron et.al. 2509.13920 null
2025-09-17 Inverse Design of Amorphous Materials with Targeted Properties Jonas A. Finkler et.al. 2509.13916 null
2025-09-17 Using Deep Learning Methods to Detect for Ultra-diffuse Galaxies in KiDS Hao Su et.al. 2509.13910 null
2025-09-17 A Tight Quantum Algorithm for Multiple Collision Search Xavier Bonnetain et.al. 2509.13909 null
2025-09-17 PhysicalAgent: Towards General Cognitive Robotics with Foundation World Models Artem Lykov et.al. 2509.13903 null
2025-09-17 Masked Diffusion Models as Energy Minimization Sitong Chen et.al. 2509.13866 null
2025-09-17 EDITS: Enhancing Dataset Distillation with Implicit Textual Semantics Qianxin Xia et.al. 2509.13858 null
2025-09-17 Surfing on chemical waves: a simple yet dynamically rich two-sphere responsive gel swimmer Joseph J. Webber et.al. 2509.13850 null
2025-09-17 SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation Jiayi Pan et.al. 2509.13848 null
2025-09-17 Polycyclic aromatic hydrocarbons destruction in star-forming regions across 42 nearby galaxies Oleg V. Egorov et.al. 2509.13845 null
2025-09-18 BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching Hanshuai Cui et.al. 2509.13789 null
2025-09-17 Generative Image Coding with Diffusion Prior Jianhui Chang et.al. 2509.13768 null
2025-09-17 Iterative Prompt Refinement for Safer Text-to-Image Generation Jinwoo Jeon et.al. 2509.13760 null
2025-09-17 Controllable-Continuous Color Editing in Diffusion Model via Color Mapping Yuqi Yang et.al. 2509.13756 null
2025-09-17 Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval Hao Yin et.al. 2509.13754 null
2025-09-17 Heavy Traffic Diffusion Limit for a Closed Queueing Network with Single-Server and Infinite-Server Stations Amir A. Alwan et.al. 2509.13748 null
2025-09-17 Ion-modulated structure, proton transfer, and capacitance in the Pt(111)/water electric double layer Xiaoyu Wang et.al. 2509.13727 null
2025-09-17 StyleProtect: Safeguarding Artistic Identity in Fine-tuned Diffusion Models Qiuyu Tang et.al. 2509.13711 null
2025-09-17 LLM-I: LLMs are Naturally Interleaved Multimodal Creators Zirun Guo et.al. 2509.13642 null
2025-09-17 Generative Consistency Models for Estimation of Kinetic Parametric Image Posteriors in Total-Body PET Yun Zhao et.al. 2509.13614 null
2025-09-16 Cross-Distribution Diffusion Priors-Driven Iterative Reconstruction for Sparse-View CT Haodong Li et.al. 2509.13576 null
2025-09-16 ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors Romain Hardy et.al. 2509.13525 null
2025-09-16 AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions Väinö Hatanpää et.al. 2509.13523 null
2025-09-16 DEFT-VTON: Efficient Virtual Try-On with Consistent Generalised H-Transform Xingzi Xu et.al. 2509.13506 null
2025-09-16 BiasMap: Leveraging Cross-Attentions to Discover and Mitigate Hidden Social Biases in Text-to-Image Generation Rajatsubhra Chakraborty et.al. 2509.13496 null
2025-09-16 The effect of parameter drift in the transport of magnetized plasma particles P. Haerter et.al. 2509.13472 null
2025-09-18 Unified Spatiotemporal Physics-Informed Learning (USPIL): A Framework for Modeling Complex Predator-Prey Dynamics Julian Evan Chrisnanto et.al. 2509.13425 null
2025-09-16 Modeling Cosmological Evolution of Jetted Seyfert Galaxies for z<10 Julianne Goddard et.al. 2509.13418 null
2025-09-16 SOFIA Polarization Spectrum of Three Star-Forming Clouds Erin G. Cox et.al. 2509.13416 null
2025-09-16 EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing Tianyu Chen et.al. 2509.13399 null
2025-09-16 Valuation of Exotic Options and Counterparty Games Based on Conditional Diffusion Helin Zhao et.al. 2509.13374 null
2025-09-16 Runaway electron interactions with whistler waves in tokamak plasmas: energy-dependent transport scaling Yashika Ghai et.al. 2509.13271 null
2025-09-16 Beyond Private or Public: Large Language Models as Quasi-Public Goods in the AI Economy Yukun Zhang et.al. 2509.13265 null
2025-09-16 Geometry, Energy and Sensitivity in Stochastic Proton Dynamics Veronika Chronholm et.al. 2509.13223 null
2025-09-17 The Gamma Expansion of the Level Two Large Deviation Rate Functional for Reversible Diffusion Processes Claudio Landim et.al. 2509.13222 null
2025-09-18 End4: End-to-end Denoising Diffusion for Diffusion-Based Inpainting Detection Fei Wang et.al. 2509.13214 null
2025-09-16 Global existence and decay of small solutions in a viscous half Klein-Gordon equation Louis Garénaux et.al. 2509.13188 null
2025-09-16 PDE-Based Bayesian Hierarchical Modeling for Event Spread, with Application to COVID-19 Infection Mengqi Cen et.al. 2509.13174 null
2025-09-17 TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving Jiawei Wang et.al. 2509.13164 null
2025-09-16 Enhancing Video Large Language Models with Structured Multi-Video Collaborative Reasoning (early version) Zhihao He et.al. 2509.13161 null
2025-09-16 MSDNet: Efficient 4D Radar Super-Resolution via Multi-Stage Distillation Minqing Huang et.al. 2509.13149 null
2025-09-16 Discovering Mathematical Equations with Diffusion Language Model Xiaoxu Han et.al. 2509.13136 null
2025-09-16 Quantifying CO2 Distribution at the Air-Water Interface – Spatiotemporally Resolved Measurements Using Tunable Diode Laser Spectroscopy Dongfang Zhao et.al. 2509.13113 null
2025-09-16 Quantitative 3D Morphology of Cellular H2/O2/N2 Flames on a Porous-Plug Burner: Spatially Resolved Measurements of Temperature and OH Radical Zeyu Yan et.al. 2509.13106 null
2025-09-16 MIA-EPT: Membership Inference Attack via Error Prediction for Tabular Data Eyal German et.al. 2509.13046 null
2025-09-16 ReTrack: Data Unlearning in Diffusion Models through Redirecting the Denoising Trajectory Qitan Shi et.al. 2509.13007 null
2025-09-16 Difference-Based Recovery for Modulo Sampling: Tightened Bounds and Robustness Guarantees Wenyi Yan et.al. 2509.12971 null
2025-09-16 Cosmic dust as a prerequisite for the formation of complex organic molecules in space? Alexey Potapov et.al. 2509.12967 null
2025-09-16 Mathematical Study of Reaction-Diffusion in Congested Crowd Motion Noureddine Igbida et.al. 2509.12935 null
2025-09-16 The Anatomy of Alignment: Decomposing Preference Optimization by Steering Sparse Features Jeremias Ferrao et.al. 2509.12934 null
2025-09-16 Non-parametric estimation of non-linear diffusion coefficient in parabolic SPDEs Martin Andersson et.al. 2509.12921 null
2025-09-16 Neural Network Localized Orthogonal Decomposition for Numerical Homogenization of Diffusion Operators with Random Coefficients Fabian Kröpfl et.al. 2509.12896 null
2025-09-16 Runge-Kutta Approximation and Decoupled Attention for Rectified Flow Inversion and Semantic Editing Weiming Chen et.al. 2509.12888 null
2025-09-16 Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation Qianguang Zhao et.al. 2509.12878 null
2025-09-16 Bayesian Signal Separation via Plug-and-Play Diffusion-Within-Gibbs Sampling Yi Zhang et.al. 2509.12857 null
2025-09-16 Benchmarking thermostat algorithms in molecular dynamics simulations of a binary Lennard-Jones glass-former model Kumpei Shiraishi et.al. 2509.12837 null
2025-09-16 Pressure dependent structure of neat liquid methanol, CH3OH: molecular dynamics simulations with various united atom type potentials Imre Bakó et.al. 2509.12834 null
2025-09-16 A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis Javeria Amir et.al. 2509.12831 null
2025-09-17 DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval Zechao Liu et.al. 2509.12824 null
2025-09-16 A Pressure-Based Diffusion Model for Influence Maximization on Social Networks Curt Stutsman et.al. 2509.12822 null
2025-09-16 A Statistical Benchmark for Diffusion Posterior Sampling Algorithms Martin Zach et.al. 2509.12821 null
2025-09-16 Double Helix Diffusion for Cross-Domain Anomaly Image Generation Linchun Wu et.al. 2509.12787 null
2025-09-18 A-TDOM: Active TDOM via On-the-Fly 3DGS Yiwei Xu et.al. 2509.12759 null
2025-09-16 What Makes a Good Generated Image? Investigating Human and Multimodal LLM Image Preference Alignment Rishab Parthasarathy et.al. 2509.12750 null
2025-09-16 $L^2$-solutions to stochastic reaction-diffusion equations with superlinear drifts driven by space-time white noise Shijie Shang et.al. 2509.12744 null
2025-09-16 Generalizable Holographic Reconstruction via Amplitude-Only Diffusion Priors Jeongsol Kim et.al. 2509.12728 null
2025-09-16 SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation Jingdong Zhang et.al. 2509.12721 null
2025-09-16 Joint AoI and Handover Optimization in Space-Air-Ground Integrated Network Zifan Lang et.al. 2509.12716 null
2025-09-16 AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models Heng Zhang et.al. 2509.12715 null
2025-09-16 Morphological and Chemical Changes in Cd-free Colloidal QD-LEDs During Operation Ruiqi Zhang et.al. 2509.12597 null
2025-09-16 Anomalous statistics in the Langevin equation with fluctuating diffusivity: from Brownian yet non-Gaussian diffusion to anomalous diffusion and ergodicity breaking Takuma Akimoto et.al. 2509.12571 null
2025-09-16 Adaptive Sampling Scheduler Qi Wang et.al. 2509.12569 null
2025-09-16 Thermal Transport of GaN/Substrate Heterostructures under Non-Uniform Heat Source Ershuai Yin et.al. 2509.12548 null
2025-09-16 Topological Phononic Crystal on the Scale of Quasi-Ballistic Phonon Transport Keita Funayama et.al. 2509.12528 null
2025-09-15 Context-Aware Language Models for Forecasting Market Impact from Sequences of Financial News Ross Koval et.al. 2509.12519 null
2025-09-15 Image Tokenizer Needs Post-Training Kai Qiu et.al. 2509.12474 null
2025-09-15 Effects of temporal variations on wave speeds of bistable traveling waves for Lotka-Volterra competition systems Weiwei Ding et.al. 2509.12472 null
2025-09-15 PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization Dawei Xiang et.al. 2509.12446 null
2025-09-15 Diffusion-Based Generation and Imputation of Driving Scenarios from Limited Vehicle CAN Data Julian Ripper et.al. 2509.12375 null
2025-09-15 Brown Dwarf Formation Through Gravitational Collapse: Insights From 3D Numerical Simulations Adnan Ali Ahmad et.al. 2509.12336 null
2025-09-15 Radial Oscillations of Viscous Neutron Stars: Zero Diffusion Case Raissa F. P. Mendes et.al. 2509.12330 null
2025-09-15 LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence Zixin Yin et.al. 2509.12203 null
2025-09-15 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling Yang Zhou et.al. 2509.12201 null
2025-09-15 Homogeneous soil moisture fields suppress Sahelian MCS frequency Ben Maybee et.al. 2509.12118 null
2025-09-15 Predicting Structural Relaxation in Supercooled Small Molecules via Molecular Dynamics Simulations and Microscopic Theory Anh D. Phan et.al. 2509.12092 null
2025-09-15 Progressive Flow-inspired Unfolding for Spectral Compressive Imaging Xiaodong Wang et.al. 2509.12079 null
2025-09-15 AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective Yuchen Deng et.al. 2509.12052 null
2025-09-15 Layout-Conditioned Autoregressive Text-to-Image Generation via Structured Masking Zirui Zheng et.al. 2509.12046 null
2025-09-15 Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness Zixuan Fu et.al. 2509.12024 null
2025-09-15 A shortcut through the macroscopic fluctuation theory: a generalised Fick law Théotim Berlioz et.al. 2509.12017 null
2025-09-15 Data-driven Smile Design: Personalized Dental Aesthetics Outcomes Using Deep Learning Marcus Lin et.al. 2509.12001 null
2025-09-15 Optimization for Massive 3D-RIS Deployment: A Generative Diffusion Model-Based Approach Kaining Wang et.al. 2509.11969 null
2025-09-15 Learning to Generate 4D LiDAR Sequences Ao Liang et.al. 2509.11959 null
2025-09-15 Adaptive least-squares space-time finite element methods for convection-diffusion problems Christian Köthe et.al. 2509.11955 null
2025-09-15 Sphere-GAN: a GAN-based Approach for Saliency Estimation in 360° Videos Mahmoud Z. A. Wahba et.al. 2509.11948 null
2025-09-15 The Filter Echo: A General Tool for Filter Visualisation Daniel Gaa et.al. 2509.11932 null
2025-09-15 VH-Diffuser: Variable Horizon Diffusion Planner for Time-Aware Goal-Conditioned Trajectory Planning Ruijia Liu et.al. 2509.11930 null
2025-09-15 A thermodynamically consistent model for bulk-surface viscous fluid mixtures: Model derivation and mathematical analysis Patrik Knopf et.al. 2509.11925 null
2025-09-15 A nonlinear model for long-range segregation Howen Chuah et.al. 2509.11912 null
2025-09-15 Enhanced Cosmic-Ray Cooling in AGN from Dark Matter Deep Inelastic Scattering Linjie Li et.al. 2509.11906 null
2025-09-15 Bayesian recalibration of flux scale factors in diffuse radio maps using low-resolution absolute radiometers Ainulnabilah Nasirudin et.al. 2509.11894 null
2025-09-15 Numerical analysis of fluid estimation for source terms in neutral particles simulation Zhirui Tang et.al. 2509.11883 null
2025-09-15 Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation Sofia Jamil et.al. 2509.11878 null
2025-09-15 Wasserstein error estimates between telegraph processes and Brownian motion Gerardo Barrera et.al. 2509.11871 null
2025-09-15 Tenma: Robust Cross-Embodiment Robot Manipulation with Diffusion Transformer Travis Davies et.al. 2509.11865 null
2025-09-15 Understanding variations of galactic energetic particles in the heliosphere: modelling and radiation hazard assessment Miguel Orcinha et.al. 2509.11837 null
2025-09-15 Rough stochastic filtering Fabio Bugini et.al. 2509.11825 null
2025-09-15 Stochastic restarting with multiple restart conditions Johannes Aspman et.al. 2509.11809 null
2025-09-15 Modes of Mechanical Guidance of Adhesion-Independent Cell Migration Hanna Luise Gertack et.al. 2509.11801 null
2025-09-15 Dense gas properties and star formation in M 82 Fei Li et.al. 2509.11770 null
2025-09-15 Igniting VLMs toward the Embodied Space Andy Zhai et.al. 2509.11766 null
2025-09-17 Removal Attack and Defense on AI-generated Content Latent-based Watermarking De Zhang Lee et.al. 2509.11745 null
2025-09-15 DRAG: Data Reconstruction Attack using Guided Diffusion Wa-Kin Lei et.al. 2509.11724 null
2025-09-15 Controlled growth of polar altermagnets via chemical vapor transport Hiraka Haruhiro et.al. 2509.11716 null
2025-09-15 Lie symmetry analysis and similarity reductions for the tempered-fractional Keller Segel system Ghorbanali Haghighatdoost et.al. 2509.11690 null
2025-09-15 DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition Lifei Hao et.al. 2509.11661 null
2025-09-15 IS-Diff: Improving Diffusion-Based Inpainting with Better Initial Seed Yongzhe Lyu et.al. 2509.11638 null
2025-09-15 SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching Jiacheng Liu et.al. 2509.11628 null
2025-09-15 Inference-stage Adaptation-projection Strategy Adapts Diffusion Policy to Cross-manipulators Scenarios Xiangtong Yao et.al. 2509.11621 null
2025-09-15 A Phase Field Formulation of Frictional Sliding Contact for 3D Fully Eulerian Fluid Structure Interactions Biswajeet Rath et.al. 2509.11611 null
2025-09-15 Scaling to Multimodal and Multichannel Heart Sound Classification: Fine-Tuning Wav2Vec 2.0 with Synthetic and Augmented Biosignals Milan Marocchi et.al. 2509.11606 null
2025-09-15 MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment Yanyun Pu et.al. 2509.11589 null
2025-09-15 Reconstructing High-fidelity Plasma Turbulence with Data-driven Tuning of Diffusion in Low Resolution Grids Kunpeng Li et.al. 2509.11576 null
2025-09-15 The Dynamics of the Profit Rate in an Extended Okishio Framework Jihyuan Liuh et.al. 2509.11538 null
2025-09-15 Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification Suman Cha et.al. 2509.11511 null
2025-09-15 Collective Recourse for Generative Urban Visualizations Rashid Mushkani et.al. 2509.11487 null
2025-09-14 Improving LLMs’ Learning for Coreference Resolution Yujian Gan et.al. 2509.11466 null
2025-09-14 Diffusion of $^{210}\text{Pb}$ and $^{210}\text{Po}$ in Nylon P. Adhikari et.al. 2509.11464 null
2025-09-14 Fast Percolation Centrality Approximation with Importance Sampling Antonio Cruciani et.al. 2509.11454 null
2025-09-14 Mechanisms of isotope exchange between aqueous solutions and barite in low-temperature geochemical systems Chen Zhu et.al. 2509.11428 null
2025-09-14 IGA-LBM: Isogeometric lattice Boltzmann method Ye Ji et.al. 2509.11427 null
2025-09-14 Solving ill-conditioned polynomial equations using score-based priors with application to multi-target detection Rafi Beinhorn et.al. 2509.11397 null
2025-09-14 ActivePose: Active 6D Object Pose Estimation and Tracking for Robotic Manipulation Sheng Liu et.al. 2509.11364 null
2025-09-14 On the Escaping Efficiency of Distributed Adversarial Training Algorithms Ying Cao et.al. 2509.11337 null
2025-09-14 PINGS: Physics-Informed Neural Network for Fast Generative Sampling Achmad Ardani Prasha et.al. 2509.11284 null
2025-09-14 VideoAgent: Personalized Synthesis of Scientific Videos Xiao Liang et.al. 2509.11253 null
2025-09-14 Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation Chengze Li et.al. 2509.11252 null
2025-09-14 Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation Yufei Tang et.al. 2509.11213 null
2025-09-14 StegOT: Trade-offs in Steganography via Optimal Transport Chengde Lin et.al. 2509.11178 null
2025-09-14 Cryptanalysis and design for a family of plaintext non-delayed chaotic ciphers Qianxue Wang et.al. 2509.11158 null
2025-09-14 Entropic active particle transport in pulsating 3D geometries Rahul Sinha et.al. 2509.11147 null
2025-09-14 Neural cellular automata: applications to biology and beyond classical AI Benedikt Hartl et.al. 2509.11131 null
2025-09-14 Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation Nhi Kieu et.al. 2509.11102 null
2025-09-14 PanoLora: Bridging Perspective and Panoramic Video Generation with LoRA Adaptation Zeyu Dong et.al. 2509.11092 null
2025-09-14 An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data Shengke Sun et.al. 2509.11053 null
2025-09-14 Data-Efficient Ensemble Weather Forecasting with Diffusion Models Kevin Valencia et.al. 2509.11047 null
2025-09-13 General Decentralized Stochastic Optimal Control via Change of Measure: Applications to the Witsenhausen Counterexample Bhagyashri Telsang et.al. 2509.11013 null
2025-09-13 Approximation in an optimal design problem governed by the heat equation Kei Matsushima et.al. 2509.11011 null
2025-09-13 TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation Haoming Lu et.al. 2509.10980 null
2025-09-13 Development and Analysis of Chien-Physics-Informed Neural Networks for Singular Perturbation Problems Gautam Singh et.al. 2509.10945 null
2025-09-13 ToMA: Token Merge with Attention for Image Generation with Diffusion Models Wenbo Lu et.al. 2509.10918 null
2025-09-13 Robustifying Diffusion-Denoised Smoothing Against Covariate Shift Ali Hedayatnia et.al. 2509.10913 null
2025-09-13 Real-Time Super-Resolution Imaging System Based on Zero-Shot Learning for Infrared Non-Destructive Testing Pengfei Zhu et.al. 2509.10902 null
2025-09-13 Thermal diffusivity characterization of impacted composites using evaporative cryocooling excitation and inverse physics-informed neural networks Pengfei Zhu et.al. 2509.10898 null
2025-09-13 A novel IR-SRGAN assisted super-resolution evaluation of photothermal coherence tomography for impact damage in toughened thermoplastic CFRP laminates under room temperature and low temperature Pengfei Zhu et.al. 2509.10894 null
2025-09-13 Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production Liqian Feng et.al. 2509.10845 null
2025-09-13 Orbit-based structural decomposition and stellar population recovery for edge-on barred galaxies Yunpeng Jin et.al. 2509.10832 null
2025-09-13 Multi-Task Diffusion Approach For Prediction of Glioma Tumor Progression Aghiles Kebaili et.al. 2509.10824 null
2025-09-13 Hybrid Atomic Norm Sparse/Diffuse Channel Estimation Lei Lyu et.al. 2509.10770 null
2025-09-12 Using Drift Diffusion Model to Analyze Cars’ Lane Change Decisions behind Heavy Vehicles Nachuan Li et.al. 2509.10733 null
2025-09-12 The Rapid Arrival of Josiah Willard Gibbs’s Elementary Principles in Statistical Mechanics in European University Libraries Hector Giacomini et.al. 2509.10732 null
2025-09-12 Simultaneous determination of wave speed, diffusivity and nonlinearity in the Westervelt equation using complex time-periodic solutions Sebastian Acosta et.al. 2509.10718 null
2025-09-12 Maestro: Self-Improving Text-to-Image Generation via Agent Orchestration Xingchen Wan et.al. 2509.10704 null
2025-09-12 Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation Hao Zhang et.al. 2509.10687 null
2025-09-12 T2Bs: Text-to-Character Blendshapes via Video Generation Jiahao Luo et.al. 2509.10678 null
2025-09-12 Parallel and perpendicular diffusion of energetic particles in the near-Sun solar wind observed by Parker Solar Probe Nibuna Siranjeevi Madam Subashchandar et.al. 2509.10648 null
2025-09-12 Generalized Time-Reversal for Pulse Control in Diffusive Media Rohin E. McIntosh et.al. 2509.10646 null
2025-09-12 Radiation GRMHD Models of Accretion onto Stellar-Mass Black Holes: II. Super-Eddington Accretion Lizhong Zhang et.al. 2509.10638 null
2025-09-12 InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis Tao Han et.al. 2509.10441 null
2025-09-12 Inpainting-Guided Policy Optimization for Diffusion Large Language Models Siyan Zhao et.al. 2509.10396 null
2025-09-12 Immunizing Images from Text to Image Editing via Adversarial Cross-Attention Matteo Trippodo et.al. 2509.10359 null
2025-09-12 GARD: Gamma-based Anatomical Restoration and Denoising for Retinal OCT Botond Fazekas et.al. 2509.10341 null
2025-09-12 Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching Zhixin Zheng et.al. 2509.10312 null
2025-09-12 Morphogenetic mechanical metamaterials: Emerging tensor properties from self-organized structures Thomas Fromentèze et.al. 2509.10277 null
2025-09-12 MagicMirror: A Large-Scale Dataset and Benchmark for Fine-Grained Artifacts Assessment in Text-to-Image Generation Jia Wang et.al. 2509.10260 null
2025-09-12 Mask Consistency Regularization in Object Removal Hua Yuan et.al. 2509.10259 null
2025-09-12 Computational modeling of diffusive dynamics in a bouncer system with an irregular surface Luiz Antonio Barreiro et.al. 2509.10253 null
2025-09-12 Phase Transitions for Elephant Random Walks with Two memory Channels Krishanu Maulik et.al. 2509.10225 null
2025-09-12 Ionospheric Electron Heat Flow Modulates Planetary Ambipolar Electric Fields Liangliang Yuan et.al. 2509.10218 null
2025-09-12 Subordinators and time-space fractional diffusion equations Mohamed Majdoub et.al. 2509.10203 null
2025-09-12 P3D: Scalable Neural Surrogates for High-Resolution 3D Physics Simulations with Global Context Benjamin Holzschuh et.al. 2509.10186 null
2025-09-12 Convergence to equilibrium for fully discretizations of nonlocal Cahn-Hilliard equation Danni Zhang et.al. 2509.10180 null
2025-09-12 The unified gas kinetic wave-particle method for the neutron transport equation Guangwei Liu et.al. 2509.10178 null
2025-09-12 Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization Yifan Chang et.al. 2509.10140 null
2025-09-12 Turing patterns on adaptive networks Marie Dorchain et.al. 2509.10124 null
2025-09-12 Realism Control One-step Diffusion for Real-World Image Super-Resolution Zongliang Wu et.al. 2509.10122 null
2025-09-12 Intrinsic disorder in the candidate quantum spin ice Pr$_2$Zr$_2$O$_7$ T. J. Hicken et.al. 2509.10101 null
2025-09-12 HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario Saeed Saadatnejad et.al. 2509.10096 null
2025-09-12 Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation Sung-Lin Tsai et.al. 2509.10058 null
2025-09-12 Approximate Graph Propagation Revisited: Dynamic Parameterized Queries, Tighter Bounds and Dynamic Updates Zhuowei Zhao et.al. 2509.10036 null
2025-09-12 Effects of harmonic magnetic field boundary conditions in mean-field solar dynamo V. V. Pipin et.al. 2509.09985 null
2025-09-12 Normalized solutions to a Choquard equation involving mixed local and nonlocal operators J. Giacomoni et.al. 2509.09968 null
2025-09-12 Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes Mingxuan Jiang et.al. 2509.09960 null
2025-09-12 Chord: Chain of Rendering Decomposition for PBR Material Estimation from Generated Texture Images Zhi Ying et.al. 2509.09952 null
2025-09-12 Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation Ee-Leng Tan et.al. 2509.09931 null
2025-09-12 A streamline upwind/Petrov-Galerkin method for the magnetic advection-diffusion problem Haochen Li et.al. 2509.09913 null
2025-09-11 Accelerating 3D Photoacoustic Computed Tomography with End-to-End Physics-Aware Neural Operators Jiayun Wang et.al. 2509.09894 null
2025-09-11 PeV particle acceleration and non-thermal emission in the ‘minimalist’ model of the extended jets in W50/SS433 A. M. Bykov et.al. 2509.09883 null
2025-09-11 Automated Tuning for Diffusion Inverse Problem Solvers without Generative Prior Retraining Yaşar Utku Alçalar et.al. 2509.09880 null
2025-09-11 Privacy-Preserving Automated Rosacea Detection Based on Medically Inspired Region of Interest Selection Chengyu Yang et.al. 2509.09844 null
2025-09-11 A risk-sensitive ergodic singular stochastic control problem Justin Gwee et.al. 2509.09835 null
2025-09-11 DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration Yanru Huo et.al. 2509.09748 null
2025-09-11 FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark Rongyao Fang et.al. 2509.09680 null
2025-09-11 Locality in Image Diffusion Models Emerges from Data Statistics Artem Lukoianov et.al. 2509.09672 null
2025-09-11 Geometric Neural Distance Fields for Learning Human Motion Priors Zhengdi Yu et.al. 2509.09667 null
2025-09-12 DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech Ngoc-Son Nguyen et.al. 2509.09631 null
2025-09-11 I Know Who Clones Your Code: Interpretable Smart Contract Similarity Detection Zhenguang Liu et.al. 2509.09630 null
2025-09-11 Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth Daria Laslo et.al. 2509.09610 null
2025-09-11 Constraints on Ultra-heavy DM from TeV-PeV gamma-ray diffuse measurements Manuel Rocamora et.al. 2509.09609 null
2025-09-11 Iterative energy reduction Galerkin methods and variational adaptivity Pascal Heid et.al. 2509.09600 null
2025-09-11 Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis Yikang Ding et.al. 2509.09595 null
2025-09-11 Exactly Solvable Model of Random Walks with Stochastic Exchange José Julian Díaz-Pérez et.al. 2509.09577 null
2025-09-11 Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders Dohun Lee et.al. 2509.09547 null
2025-09-11 Generative Diffusion Contrastive Network for Multi-View Clustering Jian Zhu et.al. 2509.09527 null
2025-09-11 Mapping of discrete range modulated proton radiograph to water-equivalent path length using machine learning Atiq Ur Rahman et.al. 2509.09514 null
2025-09-11 Explainable AI for Accelerated Microstructure Imaging: A SHAP-Guided Protocol on the Connectome 2.0 scanner Quentin Uhl et.al. 2509.09513 null
2025-09-11 Mixture of Semantics Transmission for Generative AI-Enabled Semantic Communication Systems Junjie Ni et.al. 2509.09499 null
2025-09-11 SEDM: Scalable Self-Evolving Distributed Memory for Agents Haoran Xu et.al. 2509.09498 null
2025-09-11 Prompt Pirates Need a Map: Stealing Seeds helps Stealing Prompts Felix Mächtle et.al. 2509.09488 null
2025-09-11 Vorticity Packing Effects on Turbulent Transport in Decaying 2D Incompressible Navier-Stokes Fluids Snehanshu Maiti et.al. 2509.09487 null
2025-09-11 Comprehensive Mapping of Tracer Diffusivities Across Composition Space in Ternary NiAlTi and Quinary NiCoFeAlTi High-Entropy Alloy Using Diffusion Couple Experiments and Physics Informed Neural Network Inversion Ismail Kamil Worke et.al. 2509.09486 null
2025-09-11 Bath-induced stabilization of classical non-linear response in two dimensional infrared spectroscopy Rajesh Dutta et.al. 2509.09476 null
2025-09-11 Axion-Photon Conversion in FLRW with Primordial Magnetic Fields: Explaining the Radio Excess Setabuddin et.al. 2509.09472 null
2025-09-11 FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model Yushen Xu et.al. 2509.09456 null
2025-09-11 Optimal Investment and Consumption in a Stochastic Factor Model Florian Gutekunst et.al. 2509.09452 null
2025-09-11 Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation Anjie Qiao et.al. 2509.09451 null
2025-09-11 Steady advection-diffusion in multiply-connected potential flows Kyle McKee et.al. 2509.09444 null
2025-09-11 Plug-and-play Diffusion Models for Image Compressive Sensing with Data Consistency Projection Xiaodong Wang et.al. 2509.09365 null
2025-09-11 Expressive Power of Deep Networks on Manifolds: Simultaneous Approximation Hanfei Zhou et.al. 2509.09362 null
2025-09-11 Turnpike properties for zero-sum stochastic linear quadratic differential games of Markovian regime switching system Xun Li et.al. 2509.09358 null
2025-09-11 Euler-type methods for Levy-driven McKean-Vlasov SDEs with super-linear coefficients: mean-square error analysis Jingtao Zhu et.al. 2509.09302 null
2025-09-11 A note on quantifying the contributions of incidence functions in spatio-temporal epidemic models Mohamed Mehdaoui et.al. 2509.09301 null
2025-09-11 Data Driven Discovery of Emergent Dynamics in Reaction Diffusion Systems from Sparse and Noisy Observations Saumitra Dwivedi et.al. 2509.09278 null
2025-09-11 Long time strong convergence analysis of one-step methods for McKean-Vlasov SDEs with superlinear growth coefficients Taiyuan Liu et.al. 2509.09274 null
2025-09-11 The role of communication delays in the optimal control of spatially invariant systems Luca Ballotta et.al. 2509.09269 null
2025-09-11 A novel method and dataset for depth-guided image deblurring from smartphone Lidar Antonio Montanaro et.al. 2509.09241 null
2025-09-11 MAPSS: Manifold-based Assessment of Perceptual Source Separation Amir Ivry et.al. 2509.09212 null
2025-09-11 ALL-PET: A Low-resource and Low-shot PET Foundation Model in the Projection Domain Bin Huang et.al. 2509.09130 null
2025-09-11 Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention Junhao Xing et.al. 2509.09116 null
2025-09-10 Integrating Anatomical Priors into a Causal Diffusion Model Binxu Li et.al. 2509.09054 null
2025-09-10 Noise-Activated Dopant Dynamics in Two-Dimensional Thermal Landscapes with Localized Cold Spots Mesfin Taye et.al. 2509.09046 null
2025-09-10 Cosmic Ray Spatial Distribution and the Galactic/Extragalactic Transition Paolo Lipari et.al. 2509.09028 null
2025-09-10 Complex dynamics and pattern formation in a diffusive epidemic model with an infection-dependent recovery rate Wael El Khateeb et.al. 2509.09000 null
2025-09-10 HARD: A Performance Portable Radiation Hydrodynamics Code based on FleCSI Framework Julien Loiseau et.al. 2509.08971 null
2025-09-10 Activity-driven clustering of jamming run-and-tumble particles: Exact three-body steady state by dynamical symmetry Leo Hahn et.al. 2509.08945 null
2025-09-10 Discovering Divergent Representations between Text-to-Image Models Lisa Dunlap et.al. 2509.08940 null
2025-09-10 Diffusion-Based Action Recognition Generalizes to Untrained Domains Rogerio Guimaraes et.al. 2509.08908 null
2025-09-10 Anomalously fast transport in non-integrable lattice gauge theories Devendra Singh Bhakuni et.al. 2509.08889 null
2025-09-10 RewardDance: Reward Scaling in Visual Generation Jie Wu et.al. 2509.08826 null
2025-09-10 GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts Jenna Kang et.al. 2509.08818 null
2025-09-10 Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles Eric Slyman et.al. 2509.08777 null
2025-09-11 Joint Model-based Model-free Diffusion for Planning with Constraints Wonsuhk Jung et.al. 2509.08775 null
2025-09-10 Sharp power concavity of two relevant free boundary problems of reaction-diffusion type Qingyou He et.al. 2509.08768 null
2025-09-10 Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction Vivek Oommen et.al. 2509.08752 null
2025-09-10 On the Lebesgue Constant of Extended-Domain Spectral Methods for Elliptic PDEs Po-Yi Wu et.al. 2509.08745 null
2025-09-10 Finite-temperature transport in the gapped spin-1/2 XXZ chain and one-dimensional lattice spinless fermion model J. M. P. Carmelo et.al. 2509.08741 null
2025-09-10 Data-driven generative simulation of SDEs using diffusion models Xuefeng Gao et.al. 2509.08731 null
2025-09-10 Accelerating Diffusion Transformer-Based Text-to-Speech with Transformer Layer Caching Siratish Sakpiboonchit et.al. 2509.08696 null
2025-09-10 The Small Magellanic Cloud through the lens of the James Webb Space Telescope: binaries and mass function within the galaxy outskirts M. V. Legnardi et.al. 2509.08687 null
2025-09-10 X-Part: high fidelity and structure coherent shape decomposition Xinhao Yan et.al. 2509.08643 null
2025-09-10 RoentMod: A Synthetic Chest X-Ray Modification Model to Identify and Correct Image Interpretation Model Shortcuts Lauren H. Cooke et.al. 2509.08640 null
2025-09-10 LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation Xuqin Wang et.al. 2509.08628 null
2025-09-10 Microstructural Control and Heat Transport Enhancement in Lanthanum Sulfate for Thermochemical Heat Storage Kunihiko Shizume et.al. 2509.08585 null
2025-09-10 EfficientIML: Efficient High-Resolution Image Manipulation Localization Jinhan Li et.al. 2509.08583 null
2025-09-10 Quenched and annealed heat kernel estimates for Brox’s diffusion Xin Chen et.al. 2509.08559 null
2025-09-10 PEHRT: A Common Pipeline for Harmonizing Electronic Health Record data for Translational Research Jessica Gronsbell et.al. 2509.08553 null
2025-09-10 System size and boundaries determine the patterning dynamics of attracting active particles Jan Rombouts et.al. 2509.08533 null
2025-09-10 RoboMatch: A Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Manipulation Hanyu Liu et.al. 2509.08522 null
2025-09-10 HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning Liyang Chen et.al. 2509.08519 null
2025-09-10 Search for a photon peak from keV-scale dark matter annihilation with NuSTAR: Constraints on $\langle σv \rangle$ after 11 years of observations E. I. Zakharov et.al. 2509.08506 null
2025-09-10 Prompt-Driven Image Analysis with Multimodal Generative AI: Detection, Segmentation, Inpainting, and Interpretation Kaleem Ahmad et.al. 2509.08489 null
2025-09-10 Audio Deepfake Verification Li Wang et.al. 2509.08476 null
2025-09-10 Spherical Brownian Bridge Diffusion Models for Conditional Cortical Thickness Forecasting Ivan Stoyanov et.al. 2509.08442 null
2025-09-10 PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching Lei Ye et.al. 2509.08435 null
2025-09-10 One-dimensional particle clouds with elastic collisions Mikhail Menshikov et.al. 2509.08430 null
2025-09-10 LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations Payal Varshney et.al. 2509.08422 null
2025-09-10 The Critical 9365 Å Diffuse Interstellar Band and C$_{60}^{+}$ Association Daniel Majaess et.al. 2509.08414 null
2025-09-10 Protoplanetary disks around magnetized young stars with large-scale magnetic fields I: Steady-state solutions D. Steiner et.al. 2509.08393 null
2025-09-11 VRAE: Vertical Residual Autoencoder for License Plate Denoising and Deblurring Cuong Nguyen et.al. 2509.08392 null
2025-09-10 LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models Hirokazu Kameoka et.al. 2509.08379 null
2025-09-10 Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video Xiao Li et.al. 2509.08376 null
2025-09-10 Stop using root-mean-square error as a precipitation target! Kieran M. R. Hunt et.al. 2509.08369 null
2025-09-10 Physics-Guided Rectified Flow for Low-light RAW Image Enhancement Juntai Zeng et.al. 2509.08330 null
2025-09-10 Trans-scale spin Seebeck effect in nanostructured bulk composites based on magnetic insulator Sang J. Park et.al. 2509.08327 null
2025-09-10 Controlling GaN nucleation via O$_2$-plasma-perforated graphene masks on c-plane sapphire Su Young An et.al. 2509.08275 null
2025-09-10 Generative Quasi-Continuum Modeling of Confined Fluids at the Nanoscale Bugra Yalcin et.al. 2509.08223 null
2025-09-10 Moiré excitons in generalized Wigner crystals Jing-Yang You et.al. 2509.08211 null
2025-09-09 ArtifactGen: Benchmarking WGAN-GP vs Diffusion for Label-Aware EEG Artifact Synthesis Hritik Arasu et.al. 2509.08188 null
2025-09-09 Modeling of convective cells, turbulence, and transport induced by a radio-frequency antenna in the tokamak boundary plasma M. V. Umansky et.al. 2509.08178 null
2025-09-09 A Linear Pricing Mechanism for Load Management in Day-Ahead Retail Energy Markets Phillippe K. Phanivong et.al. 2509.08166 null
2025-09-09 Diffusion-Guided Multi-Arm Motion Planning Viraj Parimi et.al. 2509.08160 null
2025-09-09 Electronic Fluctuations and Ionic Dynamics in Molten Silver Iodide Harender S. Dhattarwal et.al. 2509.08143 null
2025-09-09 Joint calibration of the volatility surface and variance term structure Jiwook Yoo et.al. 2509.08096 null
2025-09-09 DDNet: A Unified Physics-Informed Deep Learning Framework for Semiconductor Device Modeling Roberto Riganti et.al. 2509.08073 null
2025-09-09 Discovery of a $z \sim 0.8$ Ultra Steep Spectrum Radio Halo in the MeerKAT-South Pole Telescope Survey Isaac S. Magolego et.al. 2509.08062 null
2025-09-09 Acceleration of Heavy Ions at Non-Relativistic Collisionless Shocks Damiano Caprioli et.al. 2509.08061 null
2025-09-09 Breaking Dark: Hunting Heavy Decaying Dark Matter with Tibet AS$_γ$ and LHAASO-KM2A Abhishek Dubey et.al. 2509.08039 null
2025-09-09 PyPAS – Python package for Positron Annihilation Spectroscopy Doppler Broadening Analysis Achiya Yosef Amrusi et.al. 2509.08023 null
2025-09-08 CardioComposer: Flexible and Compositional Anatomical Structure Generation with Disentangled Geometric Guidance Karim Kadry et.al. 2509.08015 null
2025-09-08 Validation of a CT-brain analysis tool for measuring global cortical atrophy in older patient cohorts Sukhdeep Bal et.al. 2509.08012 null
2025-09-09 LHAASO Galactic Plane $γ$-rays Strongly Constrain Heavy Dark Matter Celine Boehm et.al. 2509.07982 null
2025-09-09 Edwards-Wilkinson limit for a stochastic advection-diffusion PDE Sotirios Kotitsas et.al. 2509.07956 null
2025-09-09 Feature Space Analysis by Guided Diffusion Model Kimiaki Shirahama et.al. 2509.07936 null
2025-09-09 ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion Ao Li et.al. 2509.07920 null
2025-09-09 Measurement of ion acceleration and diffusion in a laser-driven magnetized plasma J. T. Y. Chu et.al. 2509.07880 null
2025-09-09 Duality estimates for subdiffusion problems including time-fractional porous medium type equations Arlúcio Viana et.al. 2509.07862 null
2025-09-09 Convergence analysis for the Barrett-Garcke-Nurnberg method of transport type Genming Bai et.al. 2509.07834 null
2025-09-09 A Note on the failure of temporal regularity for stochastic PDEs Antonio Agresti et.al. 2509.07803 null
2025-09-09 Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey Minghan Li et.al. 2509.07794 null
2025-09-09 SN 2022xlp: The second-known well-observed, intermediate-luminosity Iax supernova D. Bánhidi et.al. 2509.07717 null
2025-09-09 A Generalisable Generative Model for Multi-Detector Calorimeter Simulation Piyush Raikwar et.al. 2509.07700 null
2025-09-09 Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity Sung Ju Lee et.al. 2509.07647 null
2025-09-09 An all-sky 3D dust map Based on Gaia and LAMOST Tao Wang et.al. 2509.07640 null
2025-09-10 LSMTCR: A Scalable Multi-Architecture Model for Epitope-Specific T Cell Receptor de novo Design Ruihao Zhang et.al. 2509.07627 null
2025-09-09 AgentX: Towards Orchestrating Robust Agentic Workflow Patterns with FaaS-hosted MCP Services Shiva Sai Krishna Anand Tokal et.al. 2509.07595 null
2025-09-09 Sorting of binary active-passive mixtures in designed microchannels Horacio Serna et.al. 2509.07582 null
2025-09-09 Atomic Layer Etching of Aluminum Nitride: Mechanistic Insights from First-Principles Studies of Chlorine Chemistry Sanjay Nayak et.al. 2509.07554 null
2025-09-09 PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image Peng Li et.al. 2509.07552 null
2025-09-09 Two-dimensional fractional Brownian motion: Analysis in time and frequency domains Michał Balcerek et.al. 2509.07537 null
2025-09-09 Universal Few-Shot Spatial Control for Diffusion Models Kiet T. Nguyen et.al. 2509.07530 null
2025-09-09 Emergence of continuously varying critical exponents in coupled map lattice as an effect of quenched disorder Priyanka D. Bhoyar et.al. 2509.07529 null
2025-09-09 Target matching based generative model for speech enhancement Taihui Wang et.al. 2509.07521 null
2025-09-09 Magnetic Resonance Imaging Virtual Liver Biopsy Using Radiomics Analysis for the Assessment of Chronic Liver Disease Jiqing Huang et.al. 2509.07516 null
2025-09-09 LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors Wenshuo Gao et.al. 2509.07484 null
2025-09-09 Uncertainty in Hadronic Diffuse $γ$-Ray Emission from the Temporal Stochasticity of Cosmic-Ray Sources Xing-Jian Lv et.al. 2509.07481 null
2025-09-09 ANYPORTAL: Zero-Shot Consistent Video Background Replacement Wenshuo Gao et.al. 2509.07472 null
2025-09-09 DepthVision: Robust Vision-Language Understanding through GAN-Based LiDAR-to-RGB Synthesis Sven Kirchner et.al. 2509.07463 null
2025-09-09 Unveiling Biological Models Through Turing Patterns Yuhan Li et.al. 2509.07458 null
2025-09-09 Node Position Estimation in Diffusion-Based Molecular Communications Using Multi-Layer Perceptron Sangjun Hwang et.al. 2509.07441 null
2025-09-09 GRASPion: an Open-Source, Programmable Brainbot for Active Matter Research F. Novkoski et.al. 2509.07437 null
2025-09-09 DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation Ze-Xin Yin et.al. 2509.07435 null
2025-09-11 Blow-up for a Nonlocal Diffusion Equation with Time Regularly Varying Nonlinearity and Forcing Rihab Ben Belgacem et.al. 2509.07405 null
2025-09-09 Time evolution of averaged limit shapes of random multiple Young diagrams Akihito Hora et.al. 2509.07393 null
2025-09-09 On the exponential convergence to equilibrium for ultrafast diffusion equations Yi C. Huang et.al. 2509.07382 null
2025-09-09 Knowledge Distillation Driven Semantic NOMA for Image Transmission with Diffusion Model Qifei Wang et.al. 2509.07363 null
2025-09-09 Distributed Frequency Control for Multi-Area Power Systems Considering Transient Frequency Safety Xiemin Mo et.al. 2509.07345 null
2025-09-09 SpecifyUI: Supporting Iterative UI Design Intent Expression through Structured Specifications and Generative AI Yunnong Chen et.al. 2509.07334 null
2025-09-09 Data-knowledge fusion driven frequency security assessment: A robust framework for renewable-dominated power grids Yurun Zhang et.al. 2509.07320 null
2025-09-08 Reconstruction Alignment Improves Unified Multimodal Models Ji Xie et.al. 2509.07295 null
2025-09-08 Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion Sepehr Salem et.al. 2509.07277 null
2025-09-08 Hybrid Galam–Bass Model for Technology Innovation Giulia Rotundo et.al. 2509.07275 null
2025-09-08 Thermodynamic Irreversibility in Underdamped Brownian Motion with Spatial Temperature Gradients Mesfin Taye et.al. 2509.07272 null
2025-09-08 Extended Version: Market-Driven Equilibria for Distributed Solar Panel Investment Mehdi Davoudi et.al. 2509.07203 null
2025-09-08 Realism to Deception: Investigating Deepfake Detectors Against Face Enhancement Muhammad Saad Saeed et.al. 2509.07178 null
2025-09-08 Ultrathin oxide freestanding membranes with large-scale continuity and structural perfection Yuhao Hong et.al. 2509.07176 null
2025-09-08 Unveiling the Impact of Cosmic Rays on the Disc Sizes and Outflows from Dwarf Scales to Galaxy Groups Rebekka Bieri et.al. 2509.07124 null
2025-09-08 Indirect detection of boosted light scalar dark matter Arindam Basu et.al. 2509.07110 null
2025-09-08 Constraining Baryon Fractions in Galaxy Groups and Clusters with the First CHIME/FRB Outrigger Adam E. Lanman et.al. 2509.07097 null
2025-09-08 Automated Evaluation of Gender Bias Across 13 Large Multimodal Models Juan Manuel Contreras et.al. 2509.07050 null
2025-09-07 The Impact of Artificial Intelligence on Traditional Art Forms: A Disruption or Enhancement Viswa Chaitanya Marella et.al. 2509.07029 null
2025-09-10 Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models Jisung Hwang et.al. 2509.07027 null
2025-09-08 Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data Nithin Gopalakrishnan Nair et.al. 2509.06950 null
2025-09-08 Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models Yinjie Wang et.al. 2509.06949 null
2025-09-09 Interleaving Reasoning for Better Text-to-Image Generation Wenxuan Huang et.al. 2509.06945 null
2025-09-09 Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference Xiangwei Shen et.al. 2509.06942 null
2025-09-10 LLaDA-VLA: Vision Language Diffusion Action Models Yuqing Wen et.al. 2509.06932 null
2025-09-08 BIR-Adapter: A Low-Complexity Diffusion Model Adapter for Blind Image Restoration Cem Eteke et.al. 2509.06904 null
2025-09-08 Nanobot Algorithms for Treatment of Diffuse Cancer Noble Harasha et.al. 2509.06893 null
2025-09-08 Homogenisation of a Passive Scalar Transported by Locally Supported White Noise Federico Butori et.al. 2509.06878 null
2025-09-08 Infinite Interacting Brownian Motions and EVI Gradient Flows Kohei Suzuki et.al. 2509.06869 null
2025-09-08 A New Hybrid Model of Generative Adversarial Network and You Only Look Once Algorithm for Automatic License-Plate Recognition Behnoud Shafiezadeh et.al. 2509.06868 null
2025-09-08 floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL Bhavya Agrawalla et.al. 2509.06863 null
2025-09-08 Stochastic modelling of cosmic-ray sources for Galactic diffuse emissions Anton Stall et.al. 2509.06857 null
2025-09-08 CRISP – Compliant ROS2 Controllers for Learning-Based Manipulation Policies and Teleoperation Daniel San José Pro et.al. 2509.06819 null
2025-09-08 UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward Yufeng Cheng et.al. 2509.06818 null
2025-09-08 Large eddy simulations in astrophysics Wolfram Schmidt-Brückner et.al. 2509.06801 null
2025-09-08 Image Encryption Scheme Based on Hyper-Chaotic Map and Self-Adaptive Diffusion Yiqi Tang et.al. 2509.06754 null
2025-09-08 Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training Ruicheng Zhang et.al. 2509.06723 null
2025-09-08 STAGE: Segmentation-oriented Industrial Anomaly Synthesis via Graded Diffusion with Explicit Mask Alignment Xichen Xu et.al. 2509.06693 null
2025-09-08 A Parallel Solver with Multiphysics Finite Element Method for Poroelasticity Coupled with Elasticity Model Zhihao Ge et.al. 2509.06673 null
2025-09-08 The complementary of CTAO, direct detection and collider searches for dark matter in Effective Field Theories and Simplified models Igor Reis et.al. 2509.06628 null
2025-09-08 Fisher entropic Fokker-Planck model of monatomic rarefied gases Veronica Montanaro et.al. 2509.06610 null
2025-09-08 Contrastive Anatomy-Contrast Disentanglement: A Domain-General MRI Harmonization Method Daniel Scholz et.al. 2509.06592 null
2025-09-08 CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis Xin Kong et.al. 2509.06579 null
2025-09-08 From Rigging to Waving: 3D-Guided Diffusion for Natural Animation of Hand-Drawn Characters Jie Zhou et.al. 2509.06573 null
2025-09-08 Interlayer Coupling and Exciton Dynamics in 2D Hybrid Structures based on an InGaN Quantum Well coupled to a MoSe2 Monolayer D. Chen et.al. 2509.06547 null
2025-09-08 A multiscale theory for network advection-reaction-diffusion Hadrien Oliveri et.al. 2509.06546 null
2025-09-08 Thermalization dynamics of finite-size quantum critical systems Li Li et.al. 2509.06523 null
2025-09-08 On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian data Yu-Jui Huang et.al. 2509.06505 null
2025-09-08 TIDE: Achieving Balanced Subject-Driven Image Generation via Target-Instructed Diffusion Enhancement Jibai Lin et.al. 2509.06499 null
2025-09-08 Phyllotaxis in a Keller-Segel model Michael F. Staddon et.al. 2509.06498 null
2025-09-08 Discovery of giant bubbles in the hot gaseous halo of the massive disk galaxy NGC 6286 Lin He et.al. 2509.06470 null
2025-09-08 VQualA 2025 Challenge on Image Super-Resolution Generated Content Quality Assessment: Methods and Results Yixiao Li et.al. 2509.06413 null
2025-09-08 Diffusion-Shock PDEs for Deep Learning on Position-Orientation Space Finn M. Sherry et.al. 2509.06405 null
2025-09-08 Non-Destructive Rail Monitoring for Defect Identification Elissa Akiki et.al. 2509.06394 null
2025-09-08 Hydrogen-induced fast fracture in a 1.5 GPa dual-phase steel Rama Srinivas Varanasi et.al. 2509.06323 null
2025-09-08 McKean-Vlasov limits of scaling-critical reaction-diffusion equations with random initial data Bryan Castillo et.al. 2509.06260 null
2025-09-07 Multi-Scale Modeling and Predictive Control of Active Brownian Particles Sadra Saremi et.al. 2509.06217 null
2025-09-07 Grasp-MPC: Closed-Loop Visual Grasping via Value-Guided Model Predictive Control Jun Yamada et.al. 2509.06201 null
2025-09-07 Forward and inverse problems of a semilinear transport equation Kui Ren et.al. 2509.06183 null
2025-09-07 The role of the initial distribution in population survival within a bounded habitat Rafael de la Rosa et.al. 2509.06179 null
2025-09-07 UniVerse-1: Unified Audio-Video Generation via Stitching of Experts Duomin Wang et.al. 2509.06155 null
2025-09-07 If generative AI is the answer, what is the question? Ambuj Tewari et.al. 2509.06120 null
2025-09-10 The Thermodynamic Limit of Extreme First-Passage Times Talia Baravi et.al. 2509.06098 null
2025-09-07 Home-made Diffusion Model from Scratch to Hatch Shih-Ying Yeh et.al. 2509.06068 null
2025-09-10 BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models Yuming Li et.al. 2509.06040 null
2025-09-07 DreamAudio: Customized Text-to-Audio Generation with Diffusion Models Yi Yuan et.al. 2509.06027 null
2025-09-07 The Gross-Pitaewsky equation with time and space dependent coefficients Federico Lai et.al. 2509.06001 null
2025-09-07 Multi-Strategy Guided Diffusion via Sparse Masking Temporal Reweighting Distribution Correction Zekun Zhou et.al. 2509.05992 null
2025-09-07 Simulation of Solar Surface Flux Transport Constrained by Magnetic Power Spectra. I. Flux Transport Parameter Yukun Luo et.al. 2509.05989 null
2025-09-07 Imagining Alternatives: Towards High-Resolution 3D Counterfactual Medical Image Generation via Language Guidance Mohamed Mohamed et.al. 2509.05978 null
2025-09-09 Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching Feng Wang et.al. 2509.05952 null
2025-09-06 Transformer-based Topology Optimization Aaron Lutheran et.al. 2509.05800 null
2025-09-06 Hybrid Fourier Neural Operator-Plasma Fluid Model for Fast and Accurate Multiscale Simulations of High Power Microwave Breakdown Kalp Pandya et.al. 2509.05799 null
2025-09-06 Discrete-Time Quantum Random Walk for Epidemiological Modeling Sayan Manna et.al. 2509.05795 null
2025-09-06 Depth Profiling of Oxygen Migration in Ta/HfO2 Stacks During Ionic Liquid Gating Beatrice Bednarz et.al. 2509.05748 null
2025-09-06 InterAct: A Large-Scale Dataset of Dynamic, Expressive and Interactive Activities between Two People in Daily Scenarios Leo Ho et.al. 2509.05747 null
2025-09-06 High-friction limit for bipolar Euler-Riesz systems Nuno J. Alves et.al. 2509.05742 null
2025-09-06 Polarization memory effect in a multimode fiber Gauri Arora et.al. 2509.05665 null
2025-09-06 EditIDv2: Editable ID Customization with Data-Lubricated ID Feature Integration for Text-to-Image Generation Guandong Li et.al. 2509.05659 null
2025-09-06 Well-posedness and regularity theory for the fractional diffusion-wave equation in Lebesgue spaces Bruno de Andrade et.al. 2509.05654 null
2025-09-06 SuMa: A Subspace Mapping Approach for Robust and Effective Concept Erasure in Text-to-Image Diffusion Models Kien Nguyen et.al. 2509.05625 null
2025-09-06 Large and moderate deviation principles for stochastic partial differential equation on graph Jianbo Cui et.al. 2509.05622 null
2025-09-05 Perpendicular ion heating in turbulence and reconnection: magnetic moment breaking by coherent fluctuations Alfred Mallet et.al. 2509.05518 null
2025-09-05 Chemotaxis Models with Nonlinear/Porous Medium Diffusion, Consumption, and Logistic source on $\mathbb{R}^N$: I. Global Solvability and Boundedness Zulaihat Hassan et.al. 2509.05494 null
2025-09-05 From Image Generation to Infrastructure Design: a Multi-agent Pipeline for Street Design Generation Chenguang Wang et.al. 2509.05469 null
2025-09-05 Newton to Einstein: Axiom-Based Discovery via Game Design Pingchuan Ma et.al. 2509.05448 null
2025-09-05 The MeerKAT Galaxy Cluster Legacy Survey – II. Catalogue of the diffuse radio emission in MeerKAT-GCLS clusters Konstantinos Kolokythas et.al. 2509.05442 null
2025-09-05 Diffusioosmosis of electrolyte solutions in uniformly charged channels Evgeny S. Asmolov et.al. 2509.05387 null
2025-09-05 Spin-transport characteristics in a Si-based spin metal-oxide-semiconductor field-effect transistor (spin MOSFET): Bias dependence of the spin polarization in Si and magnetoresistance in spin-valve signals Shoichi Sato et.al. 2509.05384 null
2025-09-05 Extreme Negative Polarisation of New Interstellar Comet 3I/ATLAS Zuri Gray et.al. 2509.05181 null
2025-09-05 Cheaper access to universal fluctuations in integrable spin chains from boundary effects Sylvain Prolhac et.al. 2509.05176 null
2025-09-05 Latest results from the searches for ultra-high-energy photons at the Pierre Auger Observatory Pierpaolo Savina et.al. 2509.05113 null
2025-09-05 Painting the market: generative diffusion models for financial limit order book simulation and forecasting Alfred Backhouse et.al. 2509.05107 null
2025-09-05 Physical interactions enable energy-efficient Turing patterns Cathelijne ter Burg et.al. 2509.05093 null
2025-09-05 MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading Yang Chen et.al. 2509.05080 null
2025-09-05 Masked Diffusion Language Models with Frequency-Informed Training Despoina Kosmopoulou et.al. 2509.05056 null
2025-09-05 Active thermodynamics of inertial chiral active gases: equation of state and edge currents Lorenzo Caprini et.al. 2509.05053 null
2025-09-05 QCA-MolGAN: Quantum Circuit Associative Molecular GAN with Multi-Agent Reinforcement Learning Aaron Mark Thomas et.al. 2509.05051 null
2025-09-05 LUIVITON: Learned Universal Interoperable VIrtual Try-ON Cong Cao et.al. 2509.05030 null
2025-09-05 Synthetic Acceleration Preconditioners for Parametric Radiative Transfer Equations based on Trajectory-Aware Reduced Order Models Ning Tang et.al. 2509.05001 null
2025-09-05 FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies Moritz Reuss et.al. 2509.04996 null
2025-09-05 Improving Spatial Resolution of Background Oriented Schlieren Based on Directional Rays Xiang Li et.al. 2509.04992 null
2025-09-05 Magnetorotational and convective instabilities in a thin layer of electrically conductive nanofluid under an external helical magnetic field M. I. Kopp et.al. 2509.04968 null
2025-09-05 Efficient estimation of jump parameters for stochastic differential equations driven by Lévy processes Elise Bayraktar et.al. 2509.04920 null
2025-09-05 Survey of Profile Parameters of the $6196 Å$ Diffuse Interstellar Band. From Uniform Profiles to Doppler Splitting and Blueshifts M. Piecka et.al. 2509.04915 null
2025-09-05 Off-lattice Microscopic Monte Carlo Modeling of Molecular Hydrogen Formation on Carbonaceous Dust Grains N. A. Satonkin et.al. 2509.04913 null
2025-09-05 Spectrum of slip dynamics, scaling & statistical laws emerge from simplified model of fault and damage zone architecture M. Almakari et.al. 2509.04909 null
2025-09-05 Plug-and-Play Latent Diffusion for Electromagnetic Inverse Scattering with Application to Brain Imaging Rui Guo et.al. 2509.04860 null
2025-09-05 A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing Chengkai Xu et.al. 2509.04853 null
2025-09-05 Stable and unstable spatially-periodic canards created in singular subcritical Turing bifurcations in the Brusselator system Robert Jencks et.al. 2509.04835 null
2025-09-05 SemSteDiff: Generative Diffusion Model-based Coverless Semantic Steganography Communication Song Gao et.al. 2509.04803 null
2025-09-05 Stability and Self-Organized Patterns in Coupled Ecohydrological–Fire Dynamics: A Model of Vegetation–Rainfall–Bushfire Interactions Serena Dipierro et.al. 2509.04766 null
2025-09-05 STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs Han Liang et.al. 2509.04719 null
2025-09-04 Transforming Fashion with AI: A Comparative Study of Large Language Models in Apparel Design Nusrat Jahan Lamia et.al. 2509.04705 null
2025-09-04 On convergence of upwinding Petrov-Galerkin methods for convection-diffusion Constantin Bacuta et.al. 2509.04703 null
2025-09-04 DarkStream: real-time speech anonymization with low latency Waris Quamer et.al. 2509.04667 null
2025-09-04 Mo Atom Rearrangement Drives Layer-Dependent Reactivity in Two-Dimensional MoS2 Zifan Wang et.al. 2509.04648 null
2025-09-04 Technical Developments of DA on $\mathbb{T}^3$ Hangyue Zhang et.al. 2509.04634 null
2025-09-04 $\mathcal{L}_1$-DRAC: Distributionally Robust Adaptive Control Aditya Gahlawat et.al. 2509.04619 null
2025-09-04 DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models Jin Ma et.al. 2509.04597 null
2025-09-04 An S-matrix Formalism for the Nonclassical Optical Response of Plasmonic Sphere Aggregates Xin Zheng et.al. 2509.04589 null
2025-09-04 Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model Hongyang Wei et.al. 2509.04548 null
2025-09-04 Spatial Patterning and Selection: How the Environment Shapes Molecular Complexity Alexandre Champagne-Ruel et.al. 2509.04547 null
2025-09-04 PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting Linqing Wang et.al. 2509.04545 null
2025-09-04 In-Context Policy Adaptation via Cross-Domain Skill Diffusion Minjong Yoo et.al. 2509.04535 null
2025-09-04 Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image – Technical Preview Jun-Kun Chen et.al. 2509.04450 null
2025-09-04 Plot’n Polish: Zero-shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models Kiymet Akdemir et.al. 2509.04446 null
2025-09-04 Durian: Dual Reference-guided Portrait Animation with Attribute Transfer Hyunsoo Cha et.al. 2509.04434 null
2025-09-04 Few-step Flow for 3D Generation via Marginal-Data Transport Distillation Zanwei Zhou et.al. 2509.04406 null
2025-09-04 Transition Models: Rethinking the Generative Learning Objective Zidong Wang et.al. 2509.04394 null
2025-09-04 SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer Jimin Xu et.al. 2509.04379 null
2025-09-04 Connections between reinforcement learning with feedback, test-time scaling, and diffusion guidance: An anthology Yuchen Jiao et.al. 2509.04372 null
2025-09-04 Sensitivities of time-dependent temperature profile predictions for NSTX with the Multi-Mode Model J. B. Lestz et.al. 2509.04360 null
2025-09-04 From Editor to Dense Geometry Estimator JiYuan Wang et.al. 2509.04338 null
2025-09-04 The limiting law of the Discrete Gaussian level-lines Joseph Chen et.al. 2509.04333 null
2025-09-04 Noisy Label Refinement with Semantically Reliable Synthetic Images Yingxuan Li et.al. 2509.04298 null
2025-09-04 TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models Yuxin Gong et.al. 2509.04269 null
2025-09-04 Thermal diffusivity measurement based on evaporative cryocooling excitation: Theory and experiments Pengfei Zhu et.al. 2509.04263 null
2025-09-04 Error analysis for learning the time-stepping operator of evolutionary PDEs Ke Chen et.al. 2509.04256 null
2025-09-04 Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models Chanon Puttanawarut et.al. 2509.04245 null
2025-09-04 Axion-Photon Conversion In Magnetized Universe: Impact On The Global 21-cm Signal Pravin Kumar Natwariya et.al. 2509.04237 null
2025-09-04 Cosmic-Ray Boosted Diffuse Supernova Neutrinos Alexander Sandrock et.al. 2509.04229 null
2025-09-04 Making neural networks understand internal heat transfer using Fourier-transformed thermal diffusion wave fields Pengfei Zhu et.al. 2509.04223 null
2025-09-04 Two-dimensional magnetic tunnel p-n junctions for low-power electronics Wenkai Zhu et.al. 2509.04206 null
2025-09-04 Laplacian Flows in Complex-valued Directed Networks: Analysis, Design, and Consensus Aditi Saxena et.al. 2509.04196 null
2025-09-04 DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval Ruohong Yang et.al. 2509.04193 null
2025-09-04 Set Block Decoding is a Language Model Inference Accelerator Itai Gat et.al. 2509.04185 null
2025-09-04 On Riordan groups involving formal semi-Laurent series and their Lie group structure Dariusz Bugajewski et.al. 2509.04160 null
2025-09-04 Hyper Diffusion Avatars: Dynamic Human Avatar Generation using Network Weight Space Diffusion Dongliang Cao et.al. 2509.04145 null
2025-09-04 MEPG: Multi-Expert Planning and Generation for Compositionally-Rich Image Generation Yuan Zhao et.al. 2509.04126 null
2025-09-04 A unified stabilized virtual element method for the generalized Oseen equation: stability and robustness Sudheer Mishra et.al. 2509.04113 null
2025-09-04 Depletion-Induced Interactions Modulate Nanoscale Protein Diffusion in Polymeric Crowder Solutions Michelle Dargasz et.al. 2509.04087 null
2025-09-04 Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot Lennart Clasmeier et.al. 2509.04076 null
2025-09-04 SMooGPT: Stylized Motion Generation using Large Language Models Lei Zhong et.al. 2509.04058 null
2025-09-04 CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning Zeyu Gan et.al. 2509.04027 null
2025-09-04 Electromechanical human heart modeling for predicting endocardial heart motion Milad Hasani et.al. 2509.04024 null
2025-09-04 Divergence-Kernel method for linear responses and diffusion models Angxiu Ni et.al. 2509.03992 null
2025-09-04 NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models Chuhan Zhang et.al. 2509.03985 null
2025-09-05 Improving Vessel Segmentation with Multi-Task Learning and Auxiliary Data Available Only During Model Training Daniel Sobotka et.al. 2509.03975 null
2025-09-04 ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection Zhu Wenjie et.al. 2509.03951 null
2025-09-04 Fluid boundary conditions in kinetic-diffusion Monte Carlo Thijs Steel et.al. 2509.03942 null
2025-09-04 Thickness-dependent magnon spin transport in antiferromagnetic insulators: Crossover from quasi-three-dimensional to quasi-two-dimensional regimes Mathias Åsan Myhre et.al. 2509.03941 null
2025-09-04 SwinSRGAN: Swin Transformer-based Generative Adversarial Network for High-Fidelity Speech Super-Resolution Jiajun Yuan et.al. 2509.03913 null
2025-09-04 A Generative Foundation Model for Chest Radiography Yuanfeng Ji et.al. 2509.03903 null
2025-09-04 Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series Zhengyi Guo et.al. 2509.03898 null
2025-09-04 Human Motion Video Generation: A Survey Haiwei Xue et.al. 2509.03883 null
2025-09-04 Demonstrating a family of X-ray dark-field retrieval approaches on a common set of samples Samantha J. Alloo et.al. 2509.03866 null
2025-09-04 A minimization principle behind the diffusion bridge of diurnal fish migration H. Yoshioka et.al. 2509.03824 null
2025-09-04 Machine Learning for LiDAR-Based Indoor Surface Classification in Intelligent Wireless Environments Parth Ashokbhai Shiroya et.al. 2509.03813 null
2025-09-04 Causality-guided Prompt Learning for Vision-language Models via Visual Granulation Mengyu Gao et.al. 2509.03803 null
2025-09-04 Universal Structure of Turbulent Radiative Mixing Layers Prateek Sharma et.al. 2509.03802 null
2025-09-04 A high-lying isomer in $^{92}$Zr with lifetime modulated by the atomic charge states: a proposed approach for a nuclear gamma-ray laser C. X. Jia et.al. 2509.03797 null
2025-09-04 Fitting Image Diffusion Models on Video Datasets Juhun Lee et.al. 2509.03794 null
2025-09-03 Learning functions through Diffusion Maps Alvaro Almeida Gomez et.al. 2509.03758 null
2025-09-03 Effects of Bethe-Heitler pair production in ultraluminous X-ray sources Gustavo Esteban Romero et.al. 2509.03735 null
2025-09-03 LuxDiT: Lighting Estimation with Video Diffusion Transformer Ruofan Liang et.al. 2509.03680 null
2025-09-03 Applying a Gaussian networking theory to model motor-driven transport along cytoskeletal filaments Nadine du Toit et.al. 2509.03671 null
2025-09-06 Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning Antonio Guillen-Perez et.al. 2509.03658 null
2025-09-05 Noise is All You Need: rethinking the value of noise on seismic denoising via diffusion models Donglin Zhu et.al. 2509.03629 null
2025-09-03 Statistical Analysis of PAHs as a Tracer of Anomalous Microwave Emission Using DIRBE Data Danielle Sponseller et.al. 2509.03611 null
2025-09-03 Breaking Down the $\textsf{CosmoGEMS}$: Toward Modeling and Understanding Globular Cluster Stellar Streams in a Fully Cosmological Context Nondh Panithanpaisal et.al. 2509.03599 null
2025-09-02 Diffusion-RL Based Air Traffic Conflict Detection and Resolution Method Tonghe Li et.al. 2509.03550 null
2025-09-03 Dynamically Controlled Transport of GeV Cosmic Rays in Diverse Galactic Environments Ronan Hix et.al. 2509.03519 null
2025-09-03 Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? Ouxiang Li et.al. 2509.03516 null
2025-09-03 OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation Han Li et.al. 2509.03498 null
2025-09-03 From Image Denoisers to Regularizing Imaging Inverse Problems: An Overview Hong Ye Tan et.al. 2509.03475 null
2025-09-03 Joint Training of Image Generator and Detector for Road Defect Detection Kuan-Chuan Peng et.al. 2509.03465 null
2025-09-03 Nitrogen chemistry of hycean worlds on the example of K2-18b Maja W. Radecka et.al. 2509.03455 null
2025-09-03 ANNIE: Be Careful of Your Robots Yiyang Huang et.al. 2509.03383 null
2025-09-03 Dynamics of Infection Spread and Hotspot Growth in Bi-Pathogen Networks Alyssa Yu et.al. 2509.03374 null
2025-09-03 Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner Yewen Li et.al. 2509.03348 null
2025-09-03 On the MIA Vulnerability Gap Between Private GANs and Diffusion Models Ilana Sebag et.al. 2509.03341 null
2025-09-03 Dynamical interface above a hard wall and reflected SPDE on the half-line Pierre Faugère et.al. 2509.03328 null
2025-09-03 Numerical Modeling of Galactic Cosmic Ray Modulation in the Heliosphere D. A. Shestakov et.al. 2509.03326 null
2025-09-03 InfraDiffusion: zero-shot depth map restoration with diffusion models and prompted segmentation from sparse infrastructure point clouds Yixiong Jing et.al. 2509.03324 null
2025-09-03 Noise resilience of two-dimensional Floquet topological phases Balaganchi A. Bhargava et.al. 2509.03296 null
2025-09-03 SynBT: High-quality Tumor Synthesis for Breast Tumor Segmentation by 3D Diffusion Model Hongxu Yang et.al. 2509.03267 null
2025-09-03 Estudio de la eficiencia en la escalabilidad de GPUs para el entrenamiento de Inteligencia Artificial David Cortes et.al. 2509.03263 null
2025-09-03 Evaluation of Stress Detection as Time Series Events – A Novel Window-Based F1-Metric Harald Vilhelm Skat-Rørdam et.al. 2509.03240 null
2025-09-03 Deep Learning for High Speed Optical Coherence Elastography with a Fiber Scanning Endoscope Maximilian Neidhardt et.al. 2509.03193 null
2025-09-03 Dissecting the Diffuse Emission of the Galaxy with the HAWC Observatory Georg Schwefer et.al. 2509.03189 null
2025-09-03 The slow evolution of dark matter halos from cusp to core naturally produces extended stellar core-like distributions Jorge Sanchez Almeida et.al. 2509.03167 null
2025-09-03 Temporally-Aware Diffusion Model for Brain Progression Modelling with Bidirectional Temporal Regularisation Mattia Litrico et.al. 2509.03141 null
2025-09-03 RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation Sashuai Zhou et.al. 2509.03131 null
2025-09-03 On the Smart Coordination of Flexibility Scheduling in Multi-carrier Integrated Energy Systems Christian Doh Dinga et.al. 2509.03126 null
2025-09-03 Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge Miao Xu et.al. 2509.03114 null
2025-09-03 Bounded imaginary powers of generalized diffusion operators Alexandre Thorel et.al. 2509.03105 null
2025-09-03 Collision operator for electron runaway in cold weakly-ionized plasmas Yeongsun Lee et.al. 2509.03092 null
2025-09-03 Diffusive shock acceleration: non-classical model of cosmic ray transport A. A. Lagutin et.al. 2509.03091 null
2025-09-03 High Cursive Complex Character Recognition using GAN External Classifier S M Rafiuddin et.al. 2509.03062 null
2025-09-03 DCDB: Dynamic Conditional Dual Diffusion Bridge for Ill-posed Multi-Tasks Chengjie Huang et.al. 2509.03044 null
2025-09-03 Boundary layer effects induced by the fluid in a chemotaxis-Navier-Stokes system Qianqian Hou et.al. 2509.03028 null
2025-09-03 Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers Tzuhsuan Huang et.al. 2509.03006 null
2025-09-03 DUViN: Diffusion-Based Underwater Visual Navigation via Knowledge-Transferred Depth Features Jinghe Yang et.al. 2509.02983 null
2025-09-03 InstaDA: Augmenting Instance Segmentation Data with Dual-Agent System Xianbao Hou et.al. 2509.02973 null
2025-09-03 Non-Linear and Meta-Stable Dynamics in Financial Markets: Evidence from High Frequency Crypto Currency Market Makers Igor Halperin et.al. 2509.02941 null
2025-09-03 The Role of Far-side Magnetic Structures in Modeling 2024 Solar Eclipse Guanglu Shi et.al. 2509.02911 null
2025-09-02 The Space Coronagraph Optical Bench (SCoOB): 8. end-to-end numerical modeling of the testbed to estimate the contrast limits Ramya M Anche et.al. 2509.02887 null
2025-09-02 Fluid Model of Schrodinger equation and derivation of the quantum potential Lachezar Simeonov et.al. 2509.02868 null
2025-09-02 Predicting Movie Success with Multi-Task Learning: A Hybrid Framework Combining GPT-Based Sentiment Analysis and SIR Propagation Wenlan Xie et.al. 2509.02809 null
2025-09-02 DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off Jusheng Zhang et.al. 2509.02785 null
2025-09-02 Synthetic generation of online social networks through homophily Alejandro Buitrago López et.al. 2509.02762 null
2025-09-02 Spacetime Wavelet Method for Linear Boundary-Value Problems in Sylvester Matrix Equation Form Cody D. Cochran et.al. 2509.02720 null
2025-09-02 Ultrafast anisotropic exciton transport in phosphorene Kai-Wei Chang et.al. 2509.02682 null
2025-09-02 Explosive Dispersal Outflows as a New Class of Fermi Gamma-Ray Sources: The Case of DR21 Paarmita Pandey et.al. 2509.02679 null
2025-09-02 Double-faced white dwarfs and the magnetic inhibition of convection Sivan Ginzburg et.al. 2509.02671 null
2025-09-02 Is RL fine-tuning harder than regression? A PDE learning approach for diffusion models Wenlong Mou et.al. 2509.02528 null
2025-09-02 Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework Nina Wiedemann et.al. 2509.02474 null
2025-09-02 TeRA: Rethinking Text-guided Realistic 3D Avatar Generation Yanwen Wang et.al. 2509.02466 null
2025-09-02 Fractional differential equations: non-constant coefficients, simulation and model reduction Ruben Aylwin et.al. 2509.02465 null
2025-09-02 GenCompositor: Generative Video Compositing with Diffusion Transformer Shuzhou Yang et.al. 2509.02460 null
2025-09-02 Quantitative positivity of transition densities for random perturbations of Hamiltonian systems Shimaa Elesaely et.al. 2509.02448 null
2025-09-02 Kelvin-Helmholtz instability in binary fluids with miscibility gap Anubhav Dubey et.al. 2509.02400 null
2025-09-02 Revisiting the diffusion equation derivation in Persson’s theory of contact Yang Xu et.al. 2509.02397 null
2025-09-02 Widely non-degenerate nonlinear frequency conversion in cryogenic titanium in-diffused lithium niobate waveguides Nina Amelie Lange et.al. 2509.02392 null
2025-09-02 Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion Zeren Xiong et.al. 2509.02357 null
2025-09-02 A recursive formula for the $n^\text{th}$ survival function and the $n^\text{th}$ first passage time distribution for jump and diffusion processes. Applications to the pricing of $n^\text{th}$-to-default CDS Alessio Lapolla et.al. 2509.02347 null
2025-09-02 Multi-stage PDE-based image processing techniques for noisy MRI scans Ksenia Slepova et.al. 2509.02342 null
2025-09-02 RDIT: Residual-based Diffusion Implicit Models for Probabilistic Time Series Forecasting Chih-Yu Lai et.al. 2509.02341 null
2025-09-02 Distribution estimation via Flow Matching with Lipschitz guarantees Lea Kunkel et.al. 2509.02337 null
2025-09-02 Exploring Diffusion Models for Generative Forecasting of Financial Charts Taegyeong Lee et.al. 2509.02308 null
2025-09-02 Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image Generation Sapir Esther Yiflach et.al. 2509.02295 null
2025-09-03 Sem-RaDiff: Diffusion-Based 3D Radar Semantic Perception in Cluttered Agricultural Environments Ruibin Zhang et.al. 2509.02283 null
2025-09-02 Think2Sing: Orchestrating Structured Motion Subtitles for Singing-Driven 3D Head Animation Zikai Huang et.al. 2509.02278 null
2025-09-02 Ergodicity of conditional McKean-Vlasov jump diffusions Jianhai Bao et.al. 2509.02249 null
2025-09-02 Spectrogram Patch Codec: A 2D Block-Quantized VQ-VAE and HiFi-GAN for Neural Speech Coding Luis Felipe Chary et.al. 2509.02244 null
2025-09-02 Improving atomic force microscopy structure discovery via style-translation Jie Huang et.al. 2509.02240 null
2025-09-02 Mechanical performance of hybrid polymer-lipid vesicles with leaflet asymmetry engineered using microfluidics Yuting Huang et.al. 2509.02194 null
2025-09-02 Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models Pablo Ayuso-Albizu et.al. 2509.02161 null
2025-09-02 Nuclear fusion plasma fuelling with ice pellets using a neuromorphic controller L. L. T. C. Jansen et.al. 2509.02147 null
2025-09-02 Differentiable Expectation-Maximisation and Applications to Gaussian Mixture Model Optimal Transport Samuel Boïté et.al. 2509.02109 null
2025-09-02 GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph Feng Yao et.al. 2509.02106 null
2025-09-02 A Data-Centric Approach to Pedestrian Attribute Recognition: Synthetic Augmentation via Prompt-driven Diffusion Models Alejandro Alonso et.al. 2509.02099 null
2025-09-02 Environment-Aware Channel Measurement and Modeling for Terahertz Monostatic Sensing Yejian Lyu et.al. 2509.02088 null
2025-09-02 Superexponential dissipation enhancement on $\mathbb{T}^d$ Keefer Rowan et.al. 2509.02081 null
2025-09-02 Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling Srinivas Anumasa et.al. 2509.02069 null
2025-09-02 Measuring metal sulfides in interstellar dust with PRIMA Izaskun Jiménez-Serra et.al. 2509.02067 null
2025-09-02 Enhanced Raman scattering by fast GaN phonon-polaritons Mayssoune Mina et.al. 2509.02057 null
2025-09-02 Palette Aligned Image Diffusion Elad Aharoni et.al. 2509.02000 null
2025-09-02 Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought Imagination Ziyun Zeng et.al. 2509.01986 null
2025-09-03 Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing Quan Dao et.al. 2509.01984 null
2025-09-02 Nonmonotonic change with energy of the mean logarithmic mass of cosmic rays in the knee region: the mechanism of formation of this feature and sources of particles A. A. Lagutin et.al. 2509.01974 null
2025-09-02 Efficient Bayesian Sampling with Langevin Birth-Death Dynamics Alex Leviyev et.al. 2509.01942 null
2025-09-02 A Diffusion-Based Framework for Configurable and Realistic Multi-Storage Trace Generation Seohyun Kim et.al. 2509.01919 null
2025-09-02 DroneSR: Rethinking Few-shot Thermal Image Super-Resolution from Drone-based Perspective Zhipeng Weng et.al. 2509.01898 null
2025-09-02 Far-infrared probing with PRIMA into particle acceleration associated with relativistic jets from active galactic nuclei Naoki Isobe et.al. 2509.01876 null
2025-09-04 RadioDiff-Loc: Diffusion Model Enhanced Scattering Congnition for NLoS Localization with Sparse Radio Map Estimation Xiucheng Wang et.al. 2509.01875 null
2025-09-02 Latent Gene Diffusion for Spatial Transcriptomics Completion Paula Cárdenas et.al. 2509.01864 null
2025-09-02 Does the high-energy AMS-02 positron flux originate from the dark matter density spikes around nearby black holes? Man Ho Chan et.al. 2509.01860 null
2025-09-01 PractiLight: Practical Light Control Using Foundational Diffusion Models Yotam Erel et.al. 2509.01837 null
2025-09-01 ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training Ge Yan et.al. 2509.01819 null
2025-09-03 Intermittent localization and fast spatial learning by non-Markov random walks with decaying memory Paulina R. Martín-Cornejo et.al. 2509.01806 null
2025-09-01 Mapping Magnetic Fields from Clouds to Cores with PRIMAger Kate Pattle et.al. 2509.01796 null
2025-09-01 High-Performance Trajectory Tracking MPC for Quadcopters with Coupled Time-Varying Constraints and Stability Proofs Maedeh Izadi et.al. 2509.01767 null
2025-09-01 Clinical Metadata Guided Limited-Angle CT Image Reconstruction Yu Shi et.al. 2509.01752 null
2025-09-01 Controllable Generation of Implied Volatility Surfaces with Variational Autoencoders Jing Wang et.al. 2509.01743 null
2025-09-01 Quadratic Growth Model with Discontinuity: A Link between Monostable and Bistable Traveling Waves Wonhyung Choi et.al. 2509.01715 null
2025-09-01 The PRIMA promise of deciphering interstellar dust evolution with observations of the nearby Universe Frédéric Galliano et.al. 2509.01692 null
2025-09-01 The Impact of Baryonic Effects on the Dynamical Masses Inferred Using Satellite Kinematics Josephine F. W. Baggen et.al. 2509.01690 null
2025-09-01 Preconditioned Regularized Wasserstein Proximal Sampling Hong Ye Tan et.al. 2509.01685 null
2025-09-01 Efficient Transformer-Inspired Variants of Physics-Informed Deep Operator Networks Zhi-Feng Wei et.al. 2509.01679 null
2025-09-01 Investigating the role of magnetic fields in the formation and evolution of striations in interstellar clouds with PRIMA Raphael Skalidis et.al. 2509.01678 null
2025-08-29 Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation Tobias Hyrup et.al. 2508.21815 null
2025-08-29 Tree-Guided Diffusion Planner Hyeonseong Jeon et.al. 2508.21800 null
2025-08-29 OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization Jiazheng Xing et.al. 2508.21727 null
2025-08-29 FLORA: Efficient Synthetic Data Generation for Object Detection in Low-Data Regimes via finetuning Flux LoRA Alvaro Patricio et.al. 2508.21712 null
2025-09-01 Infinite-Dimensional Stochastic Differential Equations and Diffusion Dynamics of Coulomb Random Point Fields Hirofumi Osada et.al. 2508.21658 null
2025-08-29 Deciphering the gamma-ray emission in the Cygnus region L. Haerer et.al. 2508.21644 null
2025-08-29 Conforming and discontinuous discretizations of non-isothermal Darcy-Forchheimer flows Stefano Bonetti et.al. 2508.21630 null
2025-09-02 Approximate calculation of multidimensional first passage times James F. Lutsko et.al. 2508.21607 null
2025-08-29 Condense to Conduct and Conduct to Condense Tomasz Kazana et.al. 2508.21602 null
2025-08-29 Fluid dynamics of charm quarks from heavy to light-ion collisions Federica Capellino et.al. 2508.21600 null
2025-08-29 OASIS: Harnessing Diffusion Adversarial Network for Ocean Salinity Imputation using Sparse Drifter Trajectories Bo Li et.al. 2508.21570 null
2025-08-29 ECHO: Ego-Centric modeling of Human-Object interactions Ilya A. Petrov et.al. 2508.21556 null
2025-08-29 Complete Gaussian Splats from a Single Image with Denoising Diffusion Models Ziwei Liao et.al. 2508.21542 null
2025-08-29 Molecular Beam Epitaxy of 2H-TaS$_2$ few-layers on GaN(0001) Constantin Hilbrunner et.al. 2508.21537 null
2025-08-29 Adaptive generative moment matching networks for improved learning of dependence structures Marius Hofert et.al. 2508.21531 null
2025-08-29 Few-Shot Neuro-Symbolic Imitation Learning for Long-Horizon Planning and Acting Pierrick Lorang et.al. 2508.21501 null
2025-08-29 Controllable 3D Molecular Generation for Structure-Based Drug Design Through Bayesian Flow Networks and Gradient Integration Seungyeon Choi et.al. 2508.21468 null
2025-08-29 Diffusion-based Multi-modal Synergy Interest Network for Click-through Rate Prediction Xiaoxi Cui et.al. 2508.21460 null
2025-09-01 Contrarian Motives in Social Learning: Information Cascades with Nonconformist Preferences Georgy Lukyanov et.al. 2508.21446 null
2025-08-29 Quantum enhanced ensemble GANs for anomaly detection in continuous biomanufacturing Rajiv Kailasanathan et.al. 2508.21438 null
2025-08-29 MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation Francisco Caetano et.al. 2508.21435 null
2025-08-29 Global Hot Gas Excess in (U)LIRGs: Replicating Galactic Nuclei Scaling Relations between Diffuse X-ray Emission and Star Formation on Galaxy-Wide Scales Chunyi Zhang et.al. 2508.21401 null
2025-08-29 Dynamics-Compliant Trajectory Diffusion for Super-Nominal Payload Manipulation Anuj Pasricha et.al. 2508.21375 null
2025-08-29 Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image Qingran Miao et.al. 2508.21371 null
2025-08-29 Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning Yuquan Bi et.al. 2508.21363 null
2025-08-29 QUAV: Quantum-Assisted Path Planning and Optimization for UAV Navigation with Obstacle Avoidance Nouhaila Innan et.al. 2508.21361 null
2025-08-29 DLGAN: Time Series Synthesis Based on Dual-Layer Generative Adversarial Networks Xuan Hou et.al. 2508.21340 null
2025-08-29 Quantum Monte Carlo Benchmarking of Molecular Adsorption on Graphene-Supported Single Pt Atom Jeonghwan Ahn et.al. 2508.21339 null
2025-08-29 Stage-Diff: Stage-wise Long-Term Time Series Generation Based on Diffusion Models Xuan Hou et.al. 2508.21330 null
2025-08-28 PHD: Personalized 3D Human Body Fitting with Point Diffusion Hsuan-I Ho et.al. 2508.21257 null
2025-08-28 Weighted Support Points from Random Measures: An Interpretable Alternative for Generative Modeling Peiqi Zhao et.al. 2508.21255 null
2025-08-28 Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation Yidong Zhao et.al. 2508.21254 null
2025-08-28 Mutual Information Rate – Linear Noise Approximation and Exact Computation Manuel Reinhardt et.al. 2508.21220 null
2025-08-28 WaveLLDM: Design and Development of a Lightweight Latent Diffusion Model for Speech Enhancement and Restoration Kevin Putra Santoso et.al. 2508.21153 null
2025-08-28 Propagation in the Fisher-KPP equation with Mixed Operator Begoña Barrios et.al. 2508.21151 null
2025-08-28 The COLIBRE project: cosmological hydrodynamical simulations of galaxy formation and evolution Joop Schaye et.al. 2508.21126 null
2025-08-28 Safe-Control: A Safety Patch for Mitigating Unsafe Content in Text-to-Image Generation Models Xiangtao Meng et.al. 2508.21099 null
2025-08-28 TrInk: Ink Generation with Transformer Network Zezhong Jin et.al. 2508.21098 null
2025-08-28 First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge Fahad Shamshad et.al. 2508.21072 null
2025-08-28 Dress&Dance: Dress up and Dance as You Like It - Technical Preview Jun-Kun Chen et.al. 2508.21070 null
2025-08-28 OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning Yuan Gong et.al. 2508.21066 null
2025-08-28 Mixture of Contexts for Long Video Generation Shengqu Cai et.al. 2508.21058 null
2025-08-28 HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning Zhi Su et.al. 2508.21043 null
2025-08-28 FW-GAN: Frequency-Driven Handwriting Synthesis with Wave-Modulated MLP Generator Huynh Tong Dang Khoa et.al. 2508.21040 null
2025-08-28 Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets Dale Decatur et.al. 2508.21032 null
2025-08-28 System size and event shape dependence of particle-identified balance functions in proton-proton collisions at $\sqrt{s}=13$ TeV Subash Chandra Behera et.al. 2508.21030 null
2025-08-28 POSE: Phased One-Step Adversarial Equilibrium for Video Diffusion Models Jiaxiang Cheng et.al. 2508.21019 null
2025-08-28 Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance Luozhijie Jin et.al. 2508.21016 null
2025-08-28 Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees Yaniv Hassidof et.al. 2508.21001 null
2025-08-28 RANGAN: GAN-empowered Anomaly Detection in 5G Cloud RAN Douglas Liao et.al. 2508.20985 null
2025-08-28 Random attractors and nonergodic attractors for diffusions with degeneracies Yuri Bakhtin et.al. 2508.20968 null
2025-08-28 Very high-energy gamma-ray and neutrino emission from hadronic interaction in compact binary millisecond pulsars Vittoria Vecchiotti et.al. 2508.20952 null
2025-08-28 Lattice Random Walk Discretisations of Stochastic Differential Equations Samuel Duffield et.al. 2508.20883 null
2025-08-28 Understanding and evaluating computer vision models through the lens of counterfactuals Pushkar Shukla et.al. 2508.20881 null
2025-08-28 Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement Shrishti Saha Shetu et.al. 2508.20859 null
2025-08-28 Uniform error analysis of a rectangular Morley finite element method on a Shishkin mesh for a 4th-order singularly perturbed boundary value problem Xiangyun Meng et.al. 2508.20857 null
2025-08-28 Learning Primitive Embodied World Models: Towards Scalable Robotic Learning Qiao Sun et.al. 2508.20840 null
2025-08-28 High-Resolution Atomic Magnetometer-Based Imaging of Integrated Circuits and Batteries Dominic Hunter et.al. 2508.20834 null
2025-08-28 Distinct Spatiotemporal Dynamics of Thermoelectric Transport Across Superconducting Transition Rajae Malek et.al. 2508.20792 null
2025-08-28 Prediction of sulphate hazes in the lower Venus atmosphere Peter Woitke et.al. 2508.20790 null
2025-08-28 Evaluating Compositional Generalisation in VLMs and Diffusion Models Beth Pearson et.al. 2508.20783 null
2025-08-28 Unleashing Uncertainty: Efficient Machine Unlearning for Generative AI Christoforos N. Spartalis et.al. 2508.20773 null
2025-08-28 Anomalous diffusion and run-and-tumble motion of a chemotactic particle in low dimensions Jacopo Romano et.al. 2508.20756 null
2025-08-28 Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning Yibin Wang et.al. 2508.20751 null
2025-08-29 A two-state generalisation of the strong collision model Ola Kenji Forslund et.al. 2508.20727 null
2025-08-28 EEGDM: Learning EEG Representation with Latent Diffusion Model Shaocong Wang et.al. 2508.20705 null
2025-08-28 Agent-based model of information diffusion in the limit order book trading Mateusz Wilinski et.al. 2508.20672 null
2025-08-28 “Humor, Art, or Misinformation?”: A Multimodal Dataset for Intent-Aware Synthetic Image Detection Anastasios Skoularikis et.al. 2508.20670 null
2025-08-28 Amadeus: Autoregressive Model with Bidirectional Attribute Modelling for Symbolic Music Hongju Su et.al. 2508.20665 null
2025-08-28 VarDiU: A Variational Diffusive Upper Bound for One-Step Diffusion Distillation Leyang Wang et.al. 2508.20646 null
2025-08-28 CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models Ayan Banerjee et.al. 2508.20640 null
2025-08-28 EmoCAST: Emotional Talking Portrait via Emotive Text Description Yiguo Jiang et.al. 2508.20615 null
2025-08-28 Revisiting the Privacy Risks of Split Inference: A GAN-Based Data Reconstruction Attack via Progressive Feature Optimization Yixiang Qiu et.al. 2508.20613 null
2025-08-28 Physics Informed Generative Models for Magnetic Field Images Aye Phyu Phyu Aung et.al. 2508.20612 null
2025-08-28 GENRE-CMR: Generalizable Deep Learning for Diverse Multi-Domain Cardiac MRI Reconstruction Kian Anvari Hamedani et.al. 2508.20600 null
2025-08-28 Disruptive Attacks on Face Swapping via Low-Frequency Perceptual Perturbations Mengxiao Huang et.al. 2508.20595 null
2025-08-28 FastFit: Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models Zheng Chong et.al. 2508.20586 null
2025-08-28 Persode: Personalized Visual Journaling with Episodic Memory-Aware AI Agent Seokho Jin et.al. 2508.20585 null
2025-08-28 SimShear: Sim-to-Real Shear-based Tactile Servoing Kipp McAdam Freud et.al. 2508.20561 null
2025-08-28 Equilibria of aggregation-diffusion models with nonlinear potentials Francesco Bozzola et.al. 2508.20523 null
2025-08-28 Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent En Ci et.al. 2508.20505 null
2025-08-28 Run-and-tumble particle with diffusion: boundary local times and the zero-diffusion limit Paul C Bressloff et.al. 2508.20473 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-28 Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models Desen Sun et.al. 2508.20424 null
2025-09-01 AWorld: Orchestrating the Training Recipe for Agentic AI Chengyue Yu et.al. 2508.20404 null
2025-08-28 Mean Field Game with Reflected Jump Diffusion Dynamics: A Linear Programming Approach Zongxia Liang et.al. 2508.20388 null
2025-08-28 Do triangles matter? Replicating hypergraph disease dynamics with lower-order interactions Eugene Tan et.al. 2508.20380 null
2025-08-28 Audio-Guided Visual Editing with Complex Multi-Modal Prompts Hyeonyu Kim et.al. 2508.20379 null
2025-08-28 Numerical Method for Space-Time Fractional Diffusion: A Stochastic Approach Tengteng Cui et.al. 2508.20361 null
2025-08-28 Artificial neural network solver for Fokker-Planck and Koopman eigenfunctions Max Kreider et.al. 2508.20339 null
2025-08-27 Score-Based Diffusion Models in Infinite Dimensions: A Malliavin Calculus Perspective Ehsan Mirafzali et.al. 2508.20316 null
2025-08-27 Efficient ion re-acceleration in laboratory-produced interpenetrating collisionless shocks W. Yao et.al. 2508.20303 null
2025-08-27 Out-of-time-order correlators bridge classical transport and quantum dynamics Sophia N. Fricke et.al. 2508.20235 null
2025-08-27 Velocity Spectrum Imaging using velocity encoding preparation pulses Luis Hernandez-Garcia et.al. 2508.20218 null
2025-08-27 InfinityHuman: Towards Long-Term Audio-Driven Human Xiaodi Li et.al. 2508.20210 null
2025-08-27 The structure of the giant radio fossil in the Ophiuchus galaxy cluster Simona Giacintucci et.al. 2508.20190 null
2025-08-27 SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization Yang Su et.al. 2508.20182 null
2025-08-27 Nonlinear diffusion in relativistic kinetic theory Simone Calogero et.al. 2508.20147 null
2025-08-27 MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation Kang-Hyun Lee et.al. 2508.20138 null
2025-08-27 Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning Jinhao Liang et.al. 2508.20095 null
2025-08-27 AudioStory: Generating Long-Form Narrative Audio with Large Language Models Yuxin Guo et.al. 2508.20088 null
2025-08-27 Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies Zhixuan Liang et.al. 2508.20072 null
2025-08-27 A unique solution to overcome the barriers to planetesimal formation at low dust-to-gas ratio H. Meheut et.al. 2508.20070 null
2025-08-27 Neural Conditional Simulation for Complex Spatial Processes Julia Walchessen et.al. 2508.20067 null
2025-08-27 Joint Analysis of HI Absorption Zeeman Measurements and the Morphology of Filamentary HI Emission Marta Nowotka et.al. 2508.20065 null
2025-08-27 Wave coarsening drives time crystallization in active solids Jonas Veenstra et.al. 2508.20052 null
2025-08-27 GS: Generative Segmentation via Label Diffusion Yuhao Chen et.al. 2508.20020 null
2025-08-27 Diffusion Language Models Know the Answer Before Decoding Pengxiang Li et.al. 2508.19982 null
2025-08-27 The Information Dynamics of Generative Diffusion Luca Ambrogioni et.al. 2508.19897 null
2025-08-27 Quantum latent distributions in deep generative models Omar Bacarreza et.al. 2508.19857 null
2025-08-28 Ego-centric Predictive Model Conditioned on Hand Trajectories Binjie Zhang et.al. 2508.19852 null
2025-08-27 Physics-Informed DeepONet Coupled with FEM for Convective Transport in Porous Media with Sharp Gaussian Sources Erdi Kara et.al. 2508.19847 null
2025-08-27 Exotic rheology of materials with active rearrangements Aondoyima Ioratim-Uba et.al. 2508.19844 null
2025-08-27 Not Every Gift Comes in Gold Paper or with a Red Ribbon: Exploring Color Perception in Text-to-Image Models Shay Shomer Chai et.al. 2508.19791 null
2025-08-27 StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation Xiuchao Wu et.al. 2508.19789 null
2025-08-27 Fast 3D Diffusion for Scalable Granular Media Synthesis Muhammad Moeeze Hassan et.al. 2508.19752 null
2025-08-27 Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy Binhui Zhang et.al. 2508.19750 null
2025-08-27 MC for Gastroretentive Drug Delivery Sebastian Lotter et.al. 2508.19739 null
2025-08-27 Synthetic Image Detection via Spectral Gaps of QC-RBIM Nishimori Bethe-Hessian Operators V. S. Usatyuk et.al. 2508.19698 null
2025-08-27 MnBr$_2$ on the graphene on Ir(110) substrate: growth, structure, and super-moiré Affan Safeer et.al. 2508.19694 null
2025-08-27 Atomistic insights into hydrogen migration in IGZO from machine-learning interatomic potential: linking atomic diffusion to device performance Hyunsung Cho et.al. 2508.19674 null
2025-08-27 Multi-value Probabilistic Computing with current-controlled Skyrmion Diffusion Thomas B. Winkler et.al. 2508.19623 null
2025-08-27 IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation Qizhe Fan et.al. 2508.19604 null
2025-08-27 Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction Dat Nguyen Cong et.al. 2508.19581 null
2025-08-28 Interact-Custom: Customized Human Object Interaction Image Generation Zhu Xu et.al. 2508.19575 null
2025-08-27 Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era Dawei Li et.al. 2508.19570 null
2025-08-27 MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery Yu-Wei Zhang et.al. 2508.19555 null
2025-08-27 Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding Bowen Sun et.al. 2508.19529 null
2025-08-27 MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment Zhiting Gao et.al. 2508.19527 null
2025-08-27 Functionally-graded drug delivery systems with binding reactions: analytical and stochastic approaches for the fraction of drug released Obi A. Carwood et.al. 2508.19510 null
2025-08-27 DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View Tian Qiu et.al. 2508.19508 null
2025-08-27 Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery Xiangxu Wang et.al. 2508.19499 null
2025-08-27 Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks Muhammad Ahmed Mohsin et.al. 2508.19495 null
2025-08-26 MRExtrap: Longitudinal Aging of Brain MRIs using Linear Modeling in Latent Space Jaivardhan Kapoor et.al. 2508.19482 null
2025-08-26 Shining light on degeneracies and uncertainties in quantifying both exchange and restriction with time-dependent diffusion MRI using Bayesian inference Maëliss Jallais et.al. 2508.19478 null
2025-08-26 Hydrodynamic Limit of the Symmetric Zero-Range Process with Slow Boundary Oslenne Araújo et.al. 2508.19447 null
2025-08-26 On Surjectivity of Neural Networks: Can you elicit any behavior from your model? Haozhe Jiang et.al. 2508.19445 null
2025-08-26 Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization Paimon Goulart et.al. 2508.19443 null
2025-08-26 Quantification of mobile ions in perovskite solar cells with thermally activated ion current measurements Moritz C. Schmidt et.al. 2508.19403 null
2025-08-26 DETNO: A Diffusion-Enhanced Transformer Neural Operator for Long-Term Traffic Forecasting Owais Ahmad et.al. 2508.19389 null
2025-08-26 Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs Supratik Sarkar et.al. 2508.19366 null
2025-08-28 MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation Ming Chen et.al. 2508.19320 null
2025-08-26 Disorder-induced proximate quantum spin ice phase in Pr$_2$Sn$_2$O$_7$ Yi Luo et.al. 2508.19248 null
2025-08-26 Articulate3D: Zero-Shot Text-Driven 3D Object Posing Oishi Deb et.al. 2508.19244 null
2025-08-26 MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation Hao Shi et.al. 2508.19236 null
2025-08-26 VibeVoice Technical Report Zhiliang Peng et.al. 2508.19205 null
2025-08-26 LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding Julian Ost et.al. 2508.19204 null
2025-08-26 Planning-Query-Guided Model Generation for Model-Based Deformable Object Manipulation Alex LaGrassa et.al. 2508.19199 null
2025-08-26 All-in-One Slider for Attribute Manipulation in Diffusion Models Weixin Ye et.al. 2508.19195 null
2025-08-26 MDD: a Mask Diffusion Detector to Protect Speaker Verification Systems from Adversarial Perturbations Yibo Bai et.al. 2508.19180 null
2025-08-26 Stoch-IDENT: New Method and Mathematical Analysis for Identifying SPDEs from Data Jianbo Cui et.al. 2508.19177 null
2025-08-26 RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration Yan Chen et.al. 2508.19154 null
2025-08-26 Saddle Hierarchy in Dense Associative Memory Robin Thériault et.al. 2508.19151 null
2025-08-26 Alloyed cementite (Fe-Ni-Cr)$_3$C: structure and hyperfine field from DFT calculations and experimental comparison Lyudmila V. Dobysheva et.al. 2508.19148 null
2025-08-26 Lattice vacancy migration barriers in Fe-Ni alloys, and why Ni atoms diffuse slowly: An ab initio study Adam M. Fisher et.al. 2508.19124 null
2025-08-26 Composition and Alignment of Diffusion Models using Constrained Learning Shervin Khalafi et.al. 2508.19104 null
2025-08-26 Evaluation of in vitro antibacterial activity and phytochemical profile of aqueous leaf extract of Asystasia variabilis R Wijerathna et.al. 2508.19049 null
2025-08-26 In-vitro Anti-bacterial Activity of Methanol and Aqueous Crude Extracts of Horsfieldia iryaghedhi RMHKK Rajapaksha et.al. 2508.19025 null
2025-08-28 STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems Gary Simethy et.al. 2508.19011 null
2025-08-26 Detection of Diffuse Radio Emission inside the Supernova Remnant G338.3-0.0 associated with the Gamma-ray Source HESS J1640-465 Moaz Abdelmaguid et.al. 2508.18999 null
2025-08-26 Krylov-Veretennikov decomposition for measure-valued processes induced by SDEs with interaction on Riemannian manifolds Andrey Dorogovtsev et.al. 2508.18995 null
2025-08-26 Junctional-Fluctuation-Mediated Fluidisation of Multi-Phase Field Epithelial Monolayers James N. Graham et.al. 2508.18987 null
2025-08-26 Vanishing Angular Viscosity Limit For Micropolar Fluid Model In $\mathbb{R}_+^2$: Boundary Layer And Optimal Convergence Rate Yinghui Wang et.al. 2508.18980 null
2025-08-26 Linear approximations of large deviations: Cubic diffusion test Pelerine Tsobgni Nyawo et.al. 2508.18977 null
2025-08-26 Generative AI in Map-Making: A Technical Exploration and Its Implications for Cartographers Claudio Affolter et.al. 2508.18959 null
2025-08-26 Energy-Based Flow Matching for Generating 3D Molecular Structure Wenyin Zhou et.al. 2508.18949 null
2025-08-26 Stochastic Forces Enhance Tracer Diffusion in Non-motile Active Matter Henry Alston et.al. 2508.18882 null
2025-08-26 Experimental investigation of turbulence and turbulent thermal diffusion in strongly inhomogeneous and anisotropic forced convection E. Zarbib et.al. 2508.18865 null
2025-08-26 Super and Weak Poincaré Inequalities for Sticky-Reflected Diffusion Processes Feng-Yu Wang et.al. 2508.18846 null
2025-08-26 Single-Photon Detection in Few-Layer NbSe$_2$ Superconducting Nanowires Lucio Zugliani et.al. 2508.18843 null
2025-08-26 Quantum-Circuit-Based Visual Fractal Image Generation in Qiskit and Analytics Hillol Biswas et.al. 2508.18835 null
2025-08-26 On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation Adrian Meise et.al. 2508.18833 null
2025-08-26 Asymptotic limit of a vector-valued Allen-Cahn equation for phase transition dynamics Huan Dong et.al. 2508.18754 null
2025-08-26 Joint Time-Position Statistics and Fisher Information in Drift-Diffusion Molecular Channels Yun-Feng Lo et.al. 2508.18680 null
2025-08-26 ROSE: Remove Objects with Side Effects in Videos Chenxuan Miao et.al. 2508.18633 null
2025-08-26 Wan-S2V: Audio-Driven Cinematic Video Generation Xin Gao et.al. 2508.18621 null
2025-08-26 SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis Xiaohao Sun et.al. 2508.18597 null
2025-08-26 Search for the radiative decay of the cosmic neutrino background through spectral measurements of the cosmic infrared background using PRIMA Yuji Takeuchi et.al. 2508.18590 null
2025-08-25 Controllable Single-shot Animation Blending with Temporal Conditioning Eleni Tselepi et.al. 2508.18525 null
2025-08-25 VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results Sizhuo Ma et.al. 2508.18445 null
2025-08-25 Phase-Field Model of Freeze Casting Kaihua Ji et.al. 2508.18416 null
2025-08-25 Hillas meets Eddington: the case for blazars as ultra-high-energy neutrino sources Xavier Rodrigues et.al. 2508.18345 null
2025-08-25 ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models Haitang Feng et.al. 2508.18271 null
2025-08-25 SafeBimanual: Diffusion-based Trajectory Optimization for Safe Bimanual Manipulation Haoyuan Deng et.al. 2508.18268 null
2025-08-25 Diffusiophoretic corner flows Dobromir Nowak et.al. 2508.18233 null
2025-08-25 Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance Ayce Idil Aytekin et.al. 2508.18213 null
2025-08-25 New shell-model calculations of the $\delta_C$ correction to superallowed $0^+\rightarrow0^+$ nuclear $\beta$ decay and standard-model implications L. Xayavong et.al. 2508.18189 null
2025-08-25 SpotEdit: Evaluating Visually-Guided Image Editing Methods Sara Ghazanfari et.al. 2508.18159 null
2025-08-25 Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation Haijian Ma et.al. 2508.18148 null
2025-08-25 Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem Zhicong Tang et.al. 2508.18095 null
2025-08-26 Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation Yaqi Li et.al. 2508.18032 null
2025-08-25 HD 28471: a near-resonant compact multiplanet system with a possible cold giant planet A. T. Stevenson et.al. 2508.18000 null
2025-08-26 Solute dispersion in axially strained tube flows: Large-time asymptotics and Ornstein-Uhlenbeck Gaussian profiles Prabakaran Rajamanickam et.al. 2508.17982 null
2025-08-25 Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech Dimme de Groot et.al. 2508.17980 null
2025-08-26 Generative Feature Imputing – A Technique for Error-resilient Semantic Communication Jianhao Huang et.al. 2508.17957 null
2025-08-25 Nodal error behind discrepancies between coupled cluster and diffusion Monte Carlo: AcOH dimer case study S. Lambie et.al. 2508.17937 null
2025-08-25 Parallel Nodal Interior-Penalty Discontinuous Galerkin Methods for the Subsonic Compressible Navier-Stokes Equations: Applications to Vortical Flows and VIV Problems Spiros Zafeiris et.al. 2508.17917 null
2025-08-25 Quasi-likelihood inference for SDE with mixed-effects observed at high frequency Maud Delattre et.al. 2508.17910 null
2025-08-25 Local Well-Posedness of the Cahn-Hilliard-Biot System Helmut Abels et.al. 2508.17893 null
2025-08-27 Vocoder-Projected Feature Discriminator Takuhiro Kaneko et.al. 2508.17874 null
2025-08-25 FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation Takuhiro Kaneko et.al. 2508.17868 null
2025-08-25 Diffusion-Based Data Augmentation for Medical Image Segmentation Maham Nazir et.al. 2508.17844 null
2025-08-25 Threshold Diffusions Lina Ji et.al. 2508.17812 null
2025-08-25 CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation Mingyue Yang et.al. 2508.17760 null
2025-08-25 SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling Fanjiang Ye et.al. 2508.17756 null
2025-08-25 DiffusionGS: Generative Search with Query Conditioned Diffusion in Kuaishou Qinyao Li et.al. 2508.17754 null
2025-08-25 Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework Koichiro Kamide et.al. 2508.17726 null
2025-08-25 Instant Preference Alignment for Text-to-Image Diffusion Models Yang Li et.al. 2508.17718 null
2025-08-25 CATformer: Contrastive Adversarial Transformer for Image Super-Resolution Qinyi Tian et.al. 2508.17708 null
2025-08-25 On the Edge of Memorization in Diffusion Models Sam Buchanan et.al. 2508.17689 null
2025-08-25 Calculating the power spectrum in stochastic inflation by Monte Carlo simulation and least squares curve fitting Koichi Miyamoto et.al. 2508.17654 null
2025-08-27 ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion Nima Kondori et.al. 2508.17631 null
2025-08-25 Effects of Near-Field Hydrodynamic Interactions on Bacterial Dynamics Near a Solid Surface Baopi Liu et.al. 2508.17626 null
2025-08-25 Steering When Necessary: Flexible Steering Large Language Models with Backtracking Jinwei Gan et.al. 2508.17621 null
2025-08-25 Preference Trajectory Modeling via Flow Matching for Sequential Recommendation Li Li et.al. 2508.17618 null
2025-08-25 JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on Aowen Wang et.al. 2508.17614 null
2025-08-25 HotSpotter - Patterned Species Instance Recognition Jonathan P. Crall et.al. 2508.17605 null
2025-08-25 GWM: Towards Scalable Gaussian World Models for Robotic Manipulation Guanxing Lu et.al. 2508.17600 null
2025-08-25 HERO: Hierarchical Extrapolation and Refresh for Efficient World Models Quanjian Song et.al. 2508.17588 null
2025-08-24 Controllability of a system of non-autonomous degenerate coupled parabolic equations Alfredo S. Gamboa et.al. 2508.17546 null
2025-08-24 Universal scaling of higher-order cumulants in quantum isotropic spin chains Shixian Jiang et.al. 2508.17535 null
2025-08-24 Learning Reaction-Diffusion Kinetics from Mechanical Information Royal C. Ihuaenyi et.al. 2508.17523 null
2025-08-24 Variational Shape Inference for Grasp Diffusion on SE(3) S. Talha Bukhari et.al. 2508.17482 null
2025-08-24 T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation Kaiyue Sun et.al. 2508.17472 null
2025-08-24 A Synthetic Dataset for Manometry Recognition in Robotic Applications Pedro Antonio Rabelo Saraiva et.al. 2508.17468 null
2025-08-24 Bias Amplification in Stable Diffusion’s Representation of Stigma Through Skin Tones and Their Homogeneity Kyra Wilson et.al. 2508.17465 null
2025-08-24 Disentangled Geometry and Appearance for Efficient Multi-View Surface Reconstruction and Rendering Qitong Zhang et.al. 2508.17436 null
2025-08-24 An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing Zihan Liang et.al. 2508.17435 null
2025-08-24 TinySR: Pruning Diffusion for Real-World Image Super-Resolution Linwei Dong et.al. 2508.17434 null
2025-08-24 Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling Haochen You et.al. 2508.17426 null
2025-08-24 Asteroid Rotation Periods: Statistical Analysis in the Diameter-Spin Distribution Maryam Nastaran et.al. 2508.17415 null
2025-08-24 MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling Haoyu Wang et.al. 2508.17404 null
2025-08-24 Stability and uniqueness of bounded weak solutions to triangular degenerate cross-diffusion systems Xiuqing Chen et.al. 2508.17379 null
2025-08-24 ShaLa: Multimodal Shared Latent Space Modelling Jiali Cui et.al. 2508.17376 null
2025-08-24 Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation Guoqing Zhang et.al. 2508.17364 null
2025-08-24 DiCache: Let Diffusion Model Determine Its Own Cache Jiazi Bu et.al. 2508.17356 null
2025-08-24 ShortListing Model: A Streamlined Simplex Diffusion for Discrete Variable Generation Yuxuan Song et.al. 2508.17345 null
2025-08-24 Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing Tristan S. W. Stevens et.al. 2508.17326 null
2025-08-24 An improved nonlocal electron heat transport model for magnetized plasmas Z. H. Chen et.al. 2508.17309 null
2025-08-24 PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing Peilin Xiong et.al. 2508.17302 null
2025-08-24 FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising Zhihao Chen et.al. 2508.17299 null
2025-08-24 4D Visual Pre-training for Robot Learning Chengkai Hou et.al. 2508.17230 null
2025-08-24 Multi-Metric Preference Alignment for Generative Speech Restoration Junan Zhang et.al. 2508.17229 null
2025-08-24 Effects of Geometric configuration in relativistic isobaric collisions at $\sqrt{s_{NN}}=200$ GeV Akash Das et.al. 2508.17227 null
2025-08-24 MMCIG: Multimodal Cover Image Generation for Text-only Documents and Its Dataset Construction via Pseudo-labeling Hyeyeon Kim et.al. 2508.17199 null
2025-08-23 Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities Yili Jin et.al. 2508.17163 null
2025-08-23 SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks Zhenliang Gan et.al. 2508.17121 null
2025-08-23 CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference Luben M. C. Cabezas et.al. 2508.17077 null
2025-08-23 LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening Halid Abdulrahim Kadi et.al. 2508.17070 null
2025-08-23 SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation Peng Hu et.al. 2508.17062 null
2025-08-23 PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models Xianjing Cheng et.al. 2508.17050 null
2025-08-23 Styleclone: Face Stylization with Diffusion Based Data Augmentation Neeraj Matiyali et.al. 2508.17045 null
2025-08-23 A Novel Local Focusing Mechanism for Deepfake Detection Generalization Mingliang Li et.al. 2508.17029 null
2025-08-23 Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation Konstantina Nikolaidou et.al. 2508.17017 null
2025-08-23 An improved lattice Boltzmann method with a novel conservative boundary scheme for viscoelastic fluid flows Yuan Yu et.al. 2508.16997 null
2025-08-23 Score Matching on Large Geometric Graphs for Cosmology Generation Diana-Alexandra Onutu et.al. 2508.16990 null
2025-08-23 HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching Liang Feng et.al. 2508.16984 null
2025-08-23 Shape optimization problems with random coefficients via the penalty method Xiaowei Pang et.al. 2508.16961 null
2025-08-23 RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze Ruicheng Zhang et.al. 2508.16956 null
2025-08-23 Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model Fan Ding et.al. 2508.16947 null
2025-08-23 Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter Lei Jiang et.al. 2508.16939 null
2025-08-23 HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation Sizhe Shan et.al. 2508.16930 null
2025-08-23 Structural Energy-Guided Sampling for View-Consistent Text-to-3D Qing Zhang et.al. 2508.16917 null
2025-08-23 Remarks on the three-dimensional Navier-Stokes equations with Lions’ exponent forced by space-time white noise Kazuo Yamazaki et.al. 2508.16906 null
2025-08-23 Enhanced shape recovery in advection–diffusion problems via a novel ADMM-based CCBM optimization Elmehdi Cherrat et.al. 2508.16898 null
2025-08-23 Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network Pouya Shiri et.al. 2508.16897 null
2025-08-23 Delta-SVD: Efficient Compression for Personalized Text-to-Image Models Tangyuan Zhang et.al. 2508.16863 null
2025-08-23 Subtleties of UV-crosslinking in microfluidic particle fabrication: UV dosage and intensity matter Sabrina Marnoto et.al. 2508.16862 null
2025-08-23 Intelligent Shanghai Typhoon Model (ISTM): A generative probabilistic emulator for typhoon hybrid modeling Zeyi Niu et.al. 2508.16851 null
2025-08-23 NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows Denis Tarasov et.al. 2508.16845 null
2025-08-22 A Fluctuating Hydrodynamics Model for Nanoscale Surfactant-laden Interfaces John B. Bell et.al. 2508.16820 null
2025-08-22 Two-Step Bose-Einstein Condensation of an ideal Magnetized Charged Bosonic gas under neutron star-like conditions Amanda Castillo Ayon et.al. 2508.16799 null
2025-08-22 TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling Yuancheng Wang et.al. 2508.16790 null
2025-08-22 Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data Stefania L. Moroianu et.al. 2508.16783 null
2025-08-26 Characterising the short-orbital period X-ray transient Swift J1910.2-0546 J. M. Corral-Santana et.al. 2508.16775 null
2025-08-22 Spontaneous spiral patterns etched on Germanium Yilin Wong et.al. 2508.16764 null
2025-08-22 A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers Marco N. Bochernitsan et.al. 2508.16752 null
2025-08-22 Hamiltonian Simulation for Advection-Diffusion Equation with arbitrary transport field Niladri Gomes et.al. 2508.16728 null
2025-08-22 MV-RAG: Retrieval Augmented Multiview Diffusion Yosef Dayani et.al. 2508.16577 null
2025-08-22 Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution Tainyi Zhang et.al. 2508.16557 null
2025-08-22 Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning Xuan Zhang et.al. 2508.16524 null
2025-08-22 Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation Zhijian Zhou et.al. 2508.16521 null
2025-08-22 ARSP: Automated Repair of Verilog Designs via Semantic Partitioning Bingkun Yao et.al. 2508.16517 null
2025-08-22 Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation Chun-Peng Chang et.al. 2508.16512 null
2025-08-22 Underdamped Langevin MCMC with third order convergence Maximilian Scott et.al. 2508.16485 null
2025-08-22 Large-scale concentration and relaxation for mean-field Langevin particle systems Songbo Wang et.al. 2508.16428 null
2025-08-22 Multiscale Growth Kinetics of Model Biomolecular Condensates Under Passive and Active Conditions Tamizhmalar Sundararajan et.al. 2508.16398 null
2025-08-22 Parrondo paradox in quantum image encryption Łukasz Pawela et.al. 2508.16382 null
2025-08-22 Observation of negative orbital torque from Vanadium Nikhil Vijayan et.al. 2508.16339 null
2025-08-22 A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions Nishant Jain et.al. 2508.16306 null
2025-08-22 Towards Diagnostic Quality Flat-Panel Detector CT Imaging Using Diffusion Models Hélène Corbaz et.al. 2508.16252 null
2025-08-22 Numerical solution of the time fractional nonlinear Fisher-KPP diffusion-reaction equation using the local domain boundary element method Theodore V. Gortsas et.al. 2508.16241 null
2025-08-22 UniEM-3M: A Universal Electron Micrograph Dataset for Microstructural Segmentation and Generation Nan Wang et.al. 2508.16239 null
2025-08-22 PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting Hohyun Na et.al. 2508.16217 null
2025-08-22 OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models Huanpeng Chu et.al. 2508.16212 null
2025-08-22 Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers Shikang Zheng et.al. 2508.16211 null
2025-08-22 Competition and Attraction Improve Model Fusion João Abrantes et.al. 2508.16204 null
2025-08-22 FuXi-TC: A generative framework integrating deep learning and physics-based models for improved tropical cyclone forecasts Shan Guo et.al. 2508.16168 null
2025-08-22 Transport Properties of QGP within a Bayesian Holographic QCD Model Bing Chen et.al. 2508.16167 null
2025-08-22 RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution Haodong He et.al. 2508.16158 null
2025-08-22 On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models Yi Zhang et.al. 2508.16154 null
2025-08-22 Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design Ayyüce Begüm Bektaş et.al. 2508.16097 null
2025-08-22 Two-flow Feedback Multi-scale Progressive Generative Adversarial Network Sun Weikai et.al. 2508.16089 null
2025-08-22 A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection Qifeng Liu et.al. 2508.16069 null
2025-08-21 Clinically-Informed Preprocessing Improves Stroke Segmentation in Low-Resource Settings Juampablo E. Heras Rivera et.al. 2508.16004 null
2025-08-21 Multiscale Analysis of a Kinetic Model of Confined Suspensions of Self-Propelled Rods Leonid Berlyand et.al. 2508.16003 null
2025-08-21 Universal Fluctuations in the Tail Probability for d=2 Random Walks in Space-Time Random Environments Franscesca Ark et.al. 2508.15999 null
2025-08-21 Diverse Signer Avatars with Manual and Non-Manual Feature Modelling for Sign Language Production Mohamed Ilyes Lakhal et.al. 2508.15988 null
2025-08-21 UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation Zhaodong Jiang et.al. 2508.15972 null
2025-08-21 Physical blowups via buffered time change in a mean-field neural network Nikolaos Papadopoulos et.al. 2508.15961 null
2025-08-21 Structure-Preserving Medical Image Generation from a Latent Graph Representation Kevin Arias et.al. 2508.15920 null
2025-08-21 Text-Driven 3D Hand Motion Generation from Sign Language Data Léore Bensabath et.al. 2508.15902 null
2025-08-21 Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning Yijun Liu et.al. 2508.15874 null
2025-08-21 CineScale: Free Lunch in High-Resolution Cinematic Visual Generation Haonan Qiu et.al. 2508.15774 null
2025-08-21 Scaling Group Inference for Diverse and High-Quality Generation Gaurav Parmar et.al. 2508.15773 null
2025-08-21 Visual Autoregressive Modeling for Instruction-Guided Image Editing Qingyang Mao et.al. 2508.15772 null
2025-08-21 Waver: Wave Your Way to Lifelike Video Generation Yifu Zhang et.al. 2508.15761 null
2025-08-21 Skyrmion Lattice Order Controlled by Confinement Geometry Raphael Gruber et.al. 2508.15758 null
2025-08-21 Spatial Super-Infection and Co-Infection Dynamics in Networks Alyssa Yu et.al. 2508.15740 null
2025-08-21 Probability Density from Latent Diffusion Models for Out-of-Distribution Detection Joonas Järve et.al. 2508.15737 null
2025-08-21 The Status of the Astrophysical Parameters of Upper Main Sequence Stars Lukas Kueß et.al. 2508.15722 null
2025-08-21 WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception Zhiheng Liu et.al. 2508.15720 null
2025-08-21 Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation Nikita Kachaev et.al. 2508.15663 null
2025-08-21 When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding Pengcheng Fang et.al. 2508.15641 null
2025-08-21 Are Virtual DES Images a Valid Alternative to the Real Ones? Ana C. Perre et.al. 2508.15594 null
2025-08-21 Lattice distortions and non-sluggish diffusion in BCC refractory high entropy alloys Jingfeng Zhang et.al. 2508.15558 null
2025-08-21 Dream 7B: Diffusion Large Language Models Jiacheng Ye et.al. 2508.15487 null
2025-08-21 Reevaluating Anomalous Electric Fields at the Air-Water Interface: A Surface-Specific Spectroscopic Survey Joseph C. Shirley et.al. 2508.15422 null
2025-08-21 Speckle suppression in digital in-line holographic microscopy through liquid crystal dynamic scattering Emilia Wdowiak et.al. 2508.15419 null
2025-08-21 Numerical Analysis of Unsupervised Learning Approaches for Parameter Identification in PDEs Siyu Cen et.al. 2508.15381 null
2025-08-21 Diffusion-driven pattern formation in an opinion dynamical network model Tim Mauch et.al. 2508.15377 null
2025-08-21 Performance Analysis of RIS-Aided High-Mobility Wireless Systems Hanwen Hu et.al. 2508.15375 null
2025-08-22 Analytical Theory of Chiral Active Particle Transport in a Fluctuating Density Field Jayam Joshi et.al. 2508.15366 null
2025-08-21 The effect of multi-occupancy traps on the diffusion and retention of multiple hydrogen isotopes in irradiated tungsten and vanadium Sanjeet Kaur et.al. 2508.15341 null
2025-08-21 Discovering correlations between metal foam thermal characteristics and non-Fourier behavior Anna Fehér et.al. 2508.15340 null
2025-08-21 Interface fluctuations for $1$D stochastic Allen-Cahn equation – singular regime Weijun Xu et.al. 2508.15319 null
2025-08-21 VideoEraser: Concept Erasure in Text-to-Video Diffusion Models Naen Xu et.al. 2508.15314 null
2025-08-21 HIP: Model-Agnostic Hypergraph Influence Prediction via Distance-Centrality Fusion and Neural ODEs Su-Su Zhang et.al. 2508.15312 null
2025-08-21 Modeling Long-term User Behaviors with Diffusion-driven Multi-interest Network for CTR Prediction Weijiang Lai et.al. 2508.15311 null
2025-08-21 Contribution of Globular Clusters to Diffuse Gamma-ray Emission from Galactic Plane Jiayin He et.al. 2508.15295 null
2025-08-21 Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing Ruilin Zhou et.al. 2508.15267 null
2025-08-21 Pathology-Informed Latent Diffusion Model for Anomaly Detection in Lymph Node Metastasis Jiamu Wang et.al. 2508.15236 null
2025-08-21 Pretrained Diffusion Models Are Inherently Skipped-Step Samplers Wenju Xu et.al. 2508.15233 null
2025-08-21 Collaborative Multi-Modal Coding for High-Quality 3D Generation Ziang Cao et.al. 2508.15228 null
2025-08-21 GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design Wen-Fan Wang et.al. 2508.15227 null
2025-08-21 A rutile-based homologous series Na(PtO$_2$)$_{2n+1}$ discovered by computationally assisted high-pressure synthesis Yasuhito Kobayashi et.al. 2508.15223 null
2025-08-21 See it. Say it. Sorted: Agentic System for Compositional Diagram Generation Hantao Zhang et.al. 2508.15222 null
2025-08-21 Obstacle-tuned transition from chaotic to coherent vortex flows and odd diffusion in chiral active fluids Joscha Mecke et.al. 2508.15210 null
2025-08-21 Quantum Differential Equation Solvers with Low State Preparation Cost: Eliminating the Time Dependence in Dissipative Equations Gengzhi Yang et.al. 2508.15170 null
2025-08-21 MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion Xuyang Chen et.al. 2508.15169 null
2025-08-21 Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors Jeonghyun Noh et.al. 2508.15151 null
2025-08-21 Electron-Ion Equilibration in the Merging Galaxy Cluster Abell 665 Christian Norseth et.al. 2508.15138 null
2025-08-24 Side Effects of Erasing Concepts from Diffusion Models Shaswati Saha et.al. 2508.15124 null
2025-08-20 Microstructural and preliminary optical and microwave characterization of erbium doped CaMoO$_4$ thin films Ignas Masiulionis et.al. 2508.15122 null
2025-08-24 CurveFlow: Curvature-Guided Flow Matching for Image Generation Yan Luo et.al. 2508.15093 null
2025-08-20 Sampling by averaging: A multiscale approach to score estimation Paula Cordero-Encinar et.al. 2508.15069 null
2025-08-20 Asymptotic analysis on narrow tubes: narrow escape problems and diffusion processes Wen-Tai Hsu et.al. 2508.15060 null
2025-08-20 Correlating Particle Acceleration Rates with Plasma Conditions in Colliding Wind Binaries Gislaine B Cordeiro et.al. 2508.15059 null
2025-08-20 An MRI Atlas of the Human Fetal Brain: Reference and Segmentation Tools for Fetal Brain MRI Analysis Mahdi Bagheri et.al. 2508.15034 null
2025-08-20 Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement Chunming He et.al. 2508.15027 null
2025-08-20 TAIGen: Training-Free Adversarial Image Generation via Diffusion Models Susim Roy et.al. 2508.15020 null
2025-08-20 Probing Magnetic Properties of RuO$_{2}$ Heterostructures Through the Ferromagnetic Layer Frank M. Abel et.al. 2508.15004 null
2025-08-20 LyLA-Therm: Lyapunov-based Langevin Adaptive Thermodynamic Neural Network Controller Saiedeh Akbari et.al. 2508.14989 null
2025-08-20 Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System Joydeep Chandra et.al. 2508.14976 null
2025-08-20 Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI Oliver Welin Odeback et.al. 2508.14950 null
2025-08-19 Inference Time Debiasing Concepts in Diffusion Models Lucas S. Kupssinskü et.al. 2508.14933 null
2025-08-19 TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation Jiacheng Xie et.al. 2508.14932 null
2025-08-20 Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs Haokun Lin et.al. 2508.14896 null
2025-08-20 Virtual Community: An Open World for Humans, Robots, and Society Qinhong Zhou et.al. 2508.14893 null
2025-08-20 Squeezed Diffusion Models Jyotirmai Singh et.al. 2508.14871 null
2025-08-20 Critical trajectories in kinetic geometry Helge Dietert et.al. 2508.14868 null
2025-08-20 Universal winding properties of chiral active motion Ion Santra et.al. 2508.14862 null
2025-08-20 Physics-Informed ML Exploration of Structure-Transport Relationships in Hard Carbon Nikhil Rampal et.al. 2508.14849 null
2025-08-20 TransLight: Image-Guided Customized Lighting Control with Generative Decoupling Zongming Li et.al. 2508.14814 null
2025-08-20 Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization Canyu Zhao et.al. 2508.14811 null
2025-08-20 Cross-Modality Controlled Molecule Generation with Diffusion Language Model Yunzhe Zhang et.al. 2508.14748 null
2025-08-20 Modeling the impact of temperature and bird migration on the spread of West Nile virus Pride Duve et.al. 2508.14740 null
2025-08-20 GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting Jiaxin Wei et.al. 2508.14717 null
2025-08-20 The heating and cooling of 2D electrons at low temperatures A. K. Jain et.al. 2508.14694 null
2025-08-20 Virtual Multiplex Staining for Histological Images using a Marker-wise Conditioned Diffusion Model Hyun-Jic Oh et.al. 2508.14681 null
2025-08-21 Phase space transport, quasilinear diffusion and locality in phase velocity Didier Bénisti et.al. 2508.14657 null
2025-08-20 AnchorSync: Global Consistency Optimization for Long Video Editing Zichi Liu et.al. 2508.14609 null
2025-08-20 Call Option Price using Pearson Diffusion Processes Tapan Kar et.al. 2508.14577 null
2025-08-20 Minimizing Task-Oriented Age of Information for Remote Monitoring with Pre-Identification Shuying Gan et.al. 2508.14575 null
2025-08-20 EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement Bin Wen et.al. 2508.14525 null
2025-08-20 SATURN: Autoregressive Image Generation Guided by Scene Graphs Thanh-Nhan Vo et.al. 2508.14502 null
2025-08-20 Multimode Fiber Imaging Based on Hydrogel Fiber Lele He et.al. 2508.14501 null
2025-08-20 DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion Moyu Zhang et.al. 2508.14500 null
2025-08-20 Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration Haoran Bai et.al. 2508.14483 null
2025-08-20 DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing Weitao Wang et.al. 2508.14465 null
2025-08-20 Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering Shanlin Sun et.al. 2508.14461 null
2025-08-20 Early Evolution of the Cavity and Core of a Coronal Mass Ejection in the Inner Corona Shuting Li et.al. 2508.14455 null
2025-08-20 FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy Yijin Chen et.al. 2508.14441 null
2025-08-20 MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion Fei Peng et.al. 2508.14440 null
2025-08-20 Weakly-Convex Regularization for Magnetic Resonance Image Denoising Akash Prabakar et.al. 2508.14438 null
2025-08-20 FOCUS: Frequency-Optimized Conditioning of DiffUSion Models for mitigating catastrophic forgetting during Test-Time Adaptation Gabriel Tjio et.al. 2508.14437 null
2025-08-20 HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation Bing Han et.al. 2508.14431 null
2025-08-20 Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states Samarth Gupta et.al. 2508.14413 null
2025-08-20 A Real-world Display Inverse Rendering Dataset Seokjun Choi et.al. 2508.14411 null
2025-08-20 CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities Yue Gong et.al. 2508.14405 null
2025-08-20 Img2ST-Net: Efficient High-Resolution Spatial Omics Prediction from Whole Slide Histology Images via Fully Convolutional Image-to-Image Learning Junchao Zhu et.al. 2508.14393 null
2025-08-20 Physics-Constrained Diffusion Reconstruction with Posterior Correction for Quantitative and Fast PET Imaging Yucun Hou et.al. 2508.14364 null
2025-08-20 Organ-Agents: Virtual Human Physiology Simulator via LLMs Rihao Chang et.al. 2508.14357 null
2025-08-20 SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion Junwei Su et.al. 2508.14352 null
2025-08-20 A Non-Asymptotic Convergent Analysis for Scored-Based Graph Generative Model via a System of Stochastic Differential Equations Junwei Su et.al. 2508.14351 null
2025-08-20 Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation Lingkai Kong et.al. 2508.14342 null
2025-08-20 Modeling oxygen-void interactions in uranium nitride Mohamed AbdulHameed et.al. 2508.14329 null
2025-08-20 MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation Guile Wu et.al. 2508.14327 null
2025-08-20 Modeling of silver transport in cubic SiC: Integrating molecular dynamics, bounds averaging, and uncertainty quantification Mohamed AbdulHameed et.al. 2508.14325 null
2025-08-19 Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning Said Djafar Said et.al. 2508.14276 null
2025-08-19 Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation Minyi Huang et.al. 2508.14236 null
2025-08-19 CO Adsorption Sites on Interstellar Water Ices Explored with Machine Learning Potentials. Binding energy distributions and snowline Giulia M. Bovolenta et.al. 2508.14219 null
2025-08-19 A well-balanced gas-kinetic scheme with adaptive mesh refinement for shallow water equations Gaocheng Liu et.al. 2508.14216 null
2025-08-19 Nonadiabatic force matching for alchemical free-energy estimation Jorge L. Rosa-Raíces et.al. 2508.14179 null
2025-08-19 DPad: Efficient Diffusion Language Models with Suffix Dropout Xinhua Chen et.al. 2508.14148 null
2025-08-18 3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models Jolanta Mozyrska et.al. 2508.14122 null
2025-08-19 InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing Shaoshu Yang et.al. 2508.14033 null
2025-08-19 Electrochemical response of biological membranes to localized currents and external electric fields Joshua B. Fernandes et.al. 2508.14001 null
2025-08-19 Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment Samuel Seligardi et.al. 2508.13989 null
2025-08-20 Towards a general diffusion-based information quality assessment model Anthony Lopes Temporao et.al. 2508.13927 null
2025-08-19 Learning to See Through Flare Xiaopeng Peng et.al. 2508.13907 null
2025-08-19 Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation Thanh Nguyen et.al. 2508.13904 null
2025-08-19 Diffusion-Driven High-Dimensional Variable Selection Minjie Wang et.al. 2508.13890 null
2025-08-19 Toward Deployable Multi-Robot Collaboration via a Symbolically-Guided Decision Transformer Rathnam Vidushika Rasanji et.al. 2508.13877 null
2025-08-19 SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation Paul Grimal et.al. 2508.13866 null
2025-08-19 Stochastic synaptic dynamics under learning Jakob Stubenrauch et.al. 2508.13846 null
2025-08-19 UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion Zihan Liang et.al. 2508.13843 null
2025-08-20 Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction Niklas Bubeck et.al. 2508.13826 null
2025-08-19 COCO: Cognitive Operating System with Continuous Oversight for Multi-Agent Workflow Reliability Churong Liang et.al. 2508.13815 null
2025-08-19 Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs Juncheng Xie et.al. 2508.13805 null
2025-08-19 Elementary Monte Carlo model of the anisotropic recrystallization and antiripening under intensive stirring and high supersaturations Serhii Abakumov et.al. 2508.13799 null
2025-08-19 Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing Feng-Lin Liu et.al. 2508.13797 null
2025-08-19 DegDiT: Controllable Audio Generation with Dynamic Event Graph Guided Diffusion Transformer Yisu Liu et.al. 2508.13786 null
2025-08-19 Comparing Conditional Diffusion Models for Synthesizing Contrast-Enhanced Breast MRI from Pre-Contrast Images Sebastian Ibarra et.al. 2508.13776 null
2025-08-19 Eliminating Rasterization: Direct Vector Floor Plan Generation with DiffPlanner Shidong Wang et.al. 2508.13738 null
2025-08-19 Simulation of Impact-induced seismic shaking on asteroid (25143) Itokawa to address its resurfacing process Sunho Jin et.al. 2508.13727 null
2025-08-19 Unravelling disorder in kagome Yb$_{0.5}$Co$_3$Ge$_3$ A. Korshunov et.al. 2508.13719 null
2025-08-19 Diffuse-Layer Capacitance at the Potential of Zero Charge in Binary Mixtures Yuki Uematsu et.al. 2508.13691 null
2025-08-19 PHECT: A lightweight computation tool for pulsar halo emission Kun Fang et.al. 2508.13667 null
2025-08-19 Calibrated Semantic Diffusion: A p-Laplacian Synthesis with Learnable Dissipation, Quantified Constants, and Graph-Aware Calibration Faruk Alpay et.al. 2508.13658 null
2025-08-19 Personalized Subgraph Federated Learning with Sheaf Collaboration Wenfei Liang et.al. 2508.13642 null
2025-08-19 V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task Jikai Chen et.al. 2508.13634 null
2025-08-19 Text2Weight: Bridging Natural Language and Neural Network Weight Spaces Bowen Tian et.al. 2508.13633 null
2025-08-20 DiffIER: Optimizing Diffusion Models with Iterative Error Reduction Ao Chen et.al. 2508.13628 null
2025-08-19 Bridging Clear and Adverse Driving Conditions Yoel Shapiro et.al. 2508.13592 null
2025-08-19 Temporal-Conditional Referring Video Object Segmentation with Noise-Free Text-to-Video Diffusion Model Ruixin Zhang et.al. 2508.13584 null
2025-08-19 Overcoming Quantum Resistivity Scaling in Nanoscale Interconnects Using Delafossite PdCoO2 Seoung-Hun Kang et.al. 2508.13573 null
2025-08-19 A stability-enhanced nonstandard finite difference framework for solving one and two-dimensional nonlocal differential equations Shweta Kumari et.al. 2508.13542 null
2025-08-20 2D Gaussians Meet Visual Tokenizer Yiang Shi et.al. 2508.13515 null
2025-08-19 A Monte Carlo simulation on the scattering coefficients of solar radio wave propagation Jiazhen Gan et.al. 2508.13494 null
2025-08-19 The Lévy flight foraging hypothesis: comparison between stationary distributions and anomalous diffusion Serena Dipierro et.al. 2508.13487 null
2025-08-19 EventTSF: Event-Aware Non-Stationary Time Series Forecasting Yunfeng Ge et.al. 2508.13434 null
2025-08-19 Hyperactive Magnetar Eruptions: Giant Flares, Baryon Ejections, and FRBs Ashley Bransgrove et.al. 2508.13419 null
2025-08-18 Counterfactual Probabilistic Diffusion with Expert Models Wenhao Mu et.al. 2508.13355 null
2025-08-18 Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction Sedigheh Dargahi et.al. 2508.13340 null
2025-08-18 Resistive diffusion and radiative cooling effects in magnetized oblique shocks R. Datta et.al. 2508.13310 null
2025-08-18 GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis Sirshapan Mitra et.al. 2508.13300 null
2025-08-18 Field-level Reconstruction from Foreground-Contaminated 21-cm Maps Shu-Fan Chen et.al. 2508.13265 null
2025-08-18 4DNeX: Feed-Forward 4D Generative Modeling Made Easy Zhaoxi Chen et.al. 2508.13154 null
2025-08-18 MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models Haoyu He et.al. 2508.13148 null
2025-08-18 Some semi-decoupled algorithms with optimal convergence for a four-field linear thermo-poroelastic model Ziliang Li et.al. 2508.13109 null
2025-08-18 Precise Action-to-Video Generation Through Visual Action Prompts Yuang Wang et.al. 2508.13104 null
2025-08-18 Denoising diffusion models for inverse design of inflatable structures with programmable deformations Sara Karimi et.al. 2508.13097 null
2025-08-18 DMS: Diffusion-Based Multi-Baseline Stereo Generation for Improving Self-Supervised Depth Estimation Zihua Liu et.al. 2508.13091 null
2025-08-18 ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset Qingwen Zeng et.al. 2508.13078 null
2025-08-18 From Transthoracic to Transesophageal: Cross-Modality Generation using LoRA Diffusion Emmanuel Oladokun et.al. 2508.13077 null
2025-08-18 Reinforced Context Order Recovery for Adaptive Reasoning and Planning Long Ma et.al. 2508.13070 null
2025-08-18 Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping Siddharth Khandelwal et.al. 2508.13065 null
2025-08-19 PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models Pengcheng Huang et.al. 2508.13021 null
2025-08-18 EgoTwin: Dreaming Body and View in First Person Jingqiao Xiu et.al. 2508.13013 null
2025-08-18 Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Xianglong He et.al. 2508.13009 null
2025-08-18 Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs Jose L. Bonilla et.al. 2508.12987 null
2025-08-18 The Leibenson process Viorel Barbu et.al. 2508.12979 null
2025-08-18 Compact Attention: Exploiting Structured Spatio-Temporal Sparsity for Fast Video Generation Qirui Li et.al. 2508.12969 null
2025-08-18 Self-Consistent Heating of the Magnetically Closed Solar Corona: Generation of Nanoflares, Thermodynamic Response of the Plasma and Observational Signatures Craig D. Johnston et.al. 2508.12952 null
2025-08-18 Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models Jianshu Zeng et.al. 2508.12945 null
2025-08-19 Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data Kyriaki-Margarita Bintsi et.al. 2508.12942 null
2025-08-18 7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models Elena Izzo et.al. 2508.12919 null
2025-08-18 FoleySpace: Vision-Aligned Binaural Spatial Audio Generation Lei Zhao et.al. 2508.12918 null
2025-08-18 S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models Chubin Chen et.al. 2508.12880 null
2025-08-18 E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model Ronghao Lin et.al. 2508.12854 null
2025-08-18 Strongly correlated stochastic systems Marco Biroli et.al. 2508.12818 null
2025-08-18 Next Visual Granularity Generation Yikai Wang et.al. 2508.12811 null
2025-08-18 Wavy Transformer Satoshi Noguchi et.al. 2508.12787 null
2025-08-18 Right and Wrong Ansätze for Nonlinear Waves in Stochastic PDEs C. H. S. Hamster et.al. 2508.12786 null
2025-08-18 Leveraging Diffusion Models for Stylization using Multiple Style Images Dan Ruta et.al. 2508.12784 null
2025-08-18 TURB-Scalar. A large database of passive scalar fields advected by 2D Navier-Stokes in the turbulent inverse cascade regime Chiara Calascibetta et.al. 2508.12762 null
2025-08-18 Effects of Defects on Thermal Transport across Solid/Solid Heterogeneous Interfaces Ershuai Yin et.al. 2508.12744 null
2025-08-18 Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score Syed Muhmmad Israr et.al. 2508.12718 null
2025-08-18 Hyperparameter Optimization in the Estimation of PDE and Delay-PDE models from data Oliver Mai et.al. 2508.12715 null
2025-08-18 Asymmetric Diffusion Recommendation Model Yongchun Zhu et.al. 2508.12706 null
2025-08-18 Deadline-Aware Bandwidth Allocation for Semantic Generative Communication with Diffusion Models Jinhyuk Choi et.al. 2508.12701 null
2025-08-18 MixCache: Mixture-of-Cache for Video Diffusion Transformer Acceleration Yuanxin Wei et.al. 2508.12691 null
2025-08-18 WP-CLIP: Leveraging CLIP to Predict Wölfflin’s Principles in Visual Art Abhijay Ghildyal et.al. 2508.12668 null
2025-08-18 Stable Diffusion-Based Approach for Human De-Occlusion Seung Young Noh et.al. 2508.12663 null
2025-08-18 Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery Jiyeon Kang et.al. 2508.12650 null
2025-08-18 Cognitive Structure Generation: From Educational Priors to Policy Optimization Hengnian Gu et.al. 2508.12647 null
2025-08-18 ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving Can Cui et.al. 2508.12603 null
2025-08-19 A Tale of Two Sightlines: Comparison of Hydrocarbon Dust Absorption Bands toward Cygnus OB2-12 and the Galactic Center Yvonne J. Pendleton et.al. 2508.12601 null
2025-08-17 Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference Denis Blessing et.al. 2508.12511 null
2025-08-17 Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality Yanming Xiu et.al. 2508.12498 null
2025-08-19 Portable Laser-Pumped Rb Atomic Clock with Digital Circuits Qiang Hao et.al. 2508.12437 null
2025-08-17 Spin decoherence dynamics of Er$^{3+}$ in CeO$_2$ film Sagar Kumar Seth et.al. 2508.12429 null
2025-08-17 TiP4GEN: Text to Immersive Panorama 4D Scene Generation Ke Xing et.al. 2508.12415 null
2025-08-17 Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position Zhixin Xie et.al. 2508.12398 null
2025-08-17 DeCoT: Decomposing Complex Instructions for Enhanced Text-to-Image Generation with Large Language Models Xiaochuan Lin et.al. 2508.12396 null
2025-08-17 Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models Xun Su et.al. 2508.12361 null
2025-08-17 Topological Dissipation as the Missing Link in Multiscale Polymer Dynamics Xu-Ze Zhang et.al. 2508.12359 null
2025-08-17 Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data Ahmet H. Güzel et.al. 2508.12356 null
2025-08-17 Semantic Discrepancy-aware Detector for Image Forgery Identification Ziye Wang et.al. 2508.12341 null
2025-08-17 Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR Fatemeh Ghorbani Lohesara et.al. 2508.12336 null
2025-08-17 Sketchar: Supporting Character Design and Illustration Prototyping Using Generative AI Long Ling et.al. 2508.12333 null
2025-08-17 Steering chiral active Brownian motion via stochastic position-orientation resetting Amir Shee et.al. 2508.12223 null
2025-08-17 Distribution Matching via Generalized Consistency Models Sagar Shrestha et.al. 2508.12222 null
2025-08-17 Self-Guided Action Diffusion Rhea Malhotra et.al. 2508.12189 null
2025-08-16 Critical Importance of Grain Boundaries to the Conductivity of Polycrystalline Molecular Crystals Shujit Chandra Paul et.al. 2508.12172 null
2025-08-16 Belief-Conditioned One-Step Diffusion: Real-Time Trajectory Planning with Just-Enough Sensing Gokul Puthumanaillam et.al. 2508.12166 null
2025-08-16 A Systematic Particle Filter for Estimating Time-Varying Parameters in Advection-Diffusion Equations with Source Terms Andrea Arnold et.al. 2508.12155 null
2025-08-16 Demystifying Foreground-Background Memorization in Diffusion Models Jimmy Z. Di et.al. 2508.12148 null
2025-08-16 Relativistic quintuple-zeta basis sets for the s block Marten L. Reitsma et.al. 2508.12144 null
2025-08-16 DualFit: A Two-Stage Virtual Try-On via Warping and Synthesis Minh Tran et.al. 2508.12131 null
2025-08-16 Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion Songwei Liu et.al. 2508.12094 null
2025-08-16 Strong overlap of deterministic and stochastic dynamics in a super-diffusive regime Muhammad Tayyab et.al. 2508.12091 null
2025-08-16 Generic Event Boundary Detection via Denoising Diffusion Jaejun Hwang et.al. 2508.12084 null
2025-08-16 Content Accuracy and Quality Aware Resource Allocation Based on LP-Guided DRL for ISAC-Driven AIGC Networks Ningzhe Shi et.al. 2508.12079 null
2025-08-16 Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization Kousuke Nakano et.al. 2508.12033 null
2025-08-16 Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems Szymon Pawlonka et.al. 2508.12026 null
2025-08-16 Virtual Trading in Multi-Settlement Electricity Markets Agostino Capponi et.al. 2508.11979 null
2025-08-16 UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding Yueming Xu et.al. 2508.11952 null
2025-08-19 Assessment of Using Synthetic Data in Brain Tumor Segmentation Aditi Jahagirdar et.al. 2508.11922 null
2025-08-16 SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress Lingyun Zhang et.al. 2508.11904 null
2025-08-16 OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation Jilei Mao et.al. 2508.11898 null
2025-08-16 Simulation of heavy quarkonium equilibration in the quark-gluon plasma Shouxing Zhao et.al. 2508.11897 null
2025-08-16 SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System Truong Thanh Hung Nguyen et.al. 2508.11873 null
2025-08-15 Serendipitous discovery of a young cluster of galaxies at $z \sim 0.5$ projected next to the nearby tadpole galaxy KUG 1138+327 Q. Daniel Wang et.al. 2508.11819 null
2025-08-15 FairTabGen: Unifying Counterfactual and Causal Fairness in Synthetic Tabular Data Generation Nitish Nagesh et.al. 2508.11810 null
2025-08-15 LoRAtorio: An intrinsic approach to LoRA Skill Composition Niki Foteinopoulou et.al. 2508.11624 null
2025-08-15 Dataset Creation for Visual Entailment using Generative AI Rob Reijtenbach et.al. 2508.11605 null
2025-08-15 CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion Zhe Zhu et.al. 2508.11603 null
2025-08-15 Low barrier ZrO$_x$-based Josephson junctions Jaehong Choi et.al. 2508.11593 null
2025-08-15 Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model Zuo Zuo et.al. 2508.11550 null
2025-08-15 Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series Juhi Soni et.al. 2508.11528 null
2025-08-15 CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models Xiaoxue Wu et.al. 2508.11484 null
2025-08-15 SPG: Style-Prompting Guidance for Style-Specific Content Creation Qian Liang et.al. 2508.11476 null
2025-08-15 DPI-SPR: A Differentiable Physical Inversion for Shadow Profile Reconstruction Framework in Forward Scatter Radar ShuQi Lei et.al. 2508.11470 null
2025-08-15 Simulation-based inference using splitting schemes for partially observed diffusions in chemical reaction networks Petar Jovanovski et.al. 2508.11438 null
2025-08-15 MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation Qian Liang et.al. 2508.11433 null
2025-08-15 Wavelength dependence of laser pulse filamentation around atomic resonances Gabor Demeter et.al. 2508.11417 null
2025-08-15 The Effect of Flow Parameters and Wall Models on Gas-Surface Interactions: A Numerical Investigation of dsmcFoam M. B. Agir et.al. 2508.11403 null
2025-08-15 Pairwise correlations of global times in one-dimensional Brownian motion under stochastic resetting Yihao Wang et.al. 2508.11387 null
2025-08-15 AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis Zonglin Wu et.al. 2508.11375 null
2025-08-15 GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition Md Asgor Hossain Reaj et.al. 2508.11334 null
2025-08-15 Noise Matters: Optimizing Matching Noise for Diffusion Classifiers Yanghao Wang et.al. 2508.11330 null
2025-08-18 TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation Yilin Mi et.al. 2508.11284 null
2025-08-15 Probing the Representational Power of Sparse Autoencoders in Vision Models Matthew Lyle Olson et.al. 2508.11277 null
2025-08-15 Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception Junjie Wang et.al. 2508.11256 null
2025-08-15 FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation MengChao Wang et.al. 2508.11255 null
2025-08-15 Graph Neural Diffusion via Generalized Opinion Dynamics Asela Hevapathige et.al. 2508.11249 null
2025-08-15 Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering Changjian Wang et.al. 2508.11247 null
2025-08-15 Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension Zhenhao Li et.al. 2508.11211 null
2025-08-15 StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation Seungmi Lee et.al. 2508.11203 null
2025-08-15 NGC 2392 and NGC 4361: Spectroscopic Diagnostics of Planetary Nebula Evolution Atul Kumar Singh et.al. 2508.11202 null
2025-08-15 Statistical Properties of Current Noise Induced by Electron-Phonon Scattering in Metallic Carbon Nanotubes Aina Sumiyoshi et.al. 2508.11201 null
2025-08-15 Representation Quantization for Collaborative Filtering Augmentation Yunze Luo et.al. 2508.11194 null
2025-08-15 Semi-supervised Image Dehazing via Expectation-Maximization and Bidirectional Brownian Bridge Diffusion Models Bing Liu et.al. 2508.11165 null
2025-08-15 LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction Maoquan Zhang et.al. 2508.11153 null
2025-08-15 Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation Bing Liu et.al. 2508.11134 null
2025-08-15 SQ-A: A Collision Triggered Starburst in Intra-Group Medium of Stephan’s Quintet C. K. Xu et.al. 2508.11124 null
2025-08-14 Diffusion is a code repair operator and generator Mukul Singh et.al. 2508.11110 null
2025-08-14 HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing Xinjie Gao et.al. 2508.11106 null
2025-08-14 GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning Kelin Yu et.al. 2508.11049 null
2025-08-14 A porous medium equation with spatially inhomogeneous absorption. Part II: Large time behavior Razvan Gabriel Iagar et.al. 2508.11046 null
2025-08-14 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation Nikolaos Gkanatsios et.al. 2508.11002 null
2025-08-14 Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling Tejomay Kishor Padole et.al. 2508.10995 null
2025-08-14 Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models Basile Lewandowski et.al. 2508.10993 null
2025-08-14 The extended molecular gas of the Circinus galaxy and NGC 1097 as seen by APEX Akhil Lasrado et.al. 2508.10982 null
2025-08-14 EVCtrl: Efficient Control Adapter for Visual Generation Zixiang Yang et.al. 2508.10963 null
2025-08-13 From Promise to Practical Reality: Transforming Diffusion MRI Analysis with Fast Deep Learning Enhancement Xinyi Wang et.al. 2508.10950 null
2025-08-14 Exchange-driven self-diffusion of nanoscale crystalline parahydrogen clusters on graphite K. M. Kolevski et.al. 2508.10883 null
2025-08-14 A Survey on Diffusion Language Models Tianyi Li et.al. 2508.10875 null
2025-08-14 Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation Harold Haodong Chen et.al. 2508.10858 null
2025-08-16 Object Fidelity Diffusion for Remote Sensing Image Generation Ziqi Ye et.al. 2508.10801 null
2025-08-14 Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior Zhenning Shi et.al. 2508.10779 null
2025-08-14 Video-BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation Youping Gu et.al. 2508.10774 null
2025-08-14 AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences Jieyu Li et.al. 2508.10771 null
2025-08-14 Formation and protection of an Eu-Ir surface compound below hexagonal boron nitride Alaa Mohammed Idris Bakhit et.al. 2508.10746 null
2025-08-14 A Kinetic Theory Approach to Ordered Fluids José A. Carrillo et.al. 2508.10744 null
2025-08-14 Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs Xiangqi Jin et.al. 2508.10736 null
2025-08-14 Exploiting Discriminative Codebook Prior for Autoregressive Image Generation Longxiang Tang et.al. 2508.10719 null
2025-08-14 NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale NextStep Team et.al. 2508.10711 null
2025-08-14 CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation Joohyeon Lee et.al. 2508.10710 null
2025-08-14 Probabilistic Forecasting Method for Offshore Wind Farm Cluster under Typhoon Conditions: a Score-Based Conditional Diffusion Model Jinhua He et.al. 2508.10705 null
2025-08-14 Effective permeability conditions for diffusive transport through impermeable membranes with gaps Molly Brennan et.al. 2508.10694 null
2025-08-14 Novel View Synthesis using DDIM Inversion Sehajdeep SIngh et.al. 2508.10688 null
2025-08-14 MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control Yuchen Zhu et.al. 2508.10684 null
2025-08-14 Hybrid Generative Fusion for Efficient and Privacy-Preserving Face Recognition Dataset Generation Feiran Li et.al. 2508.10672 null
2025-08-14 Geospatial Diffusion for Land Cover Imperviousness Change Forecasting Debvrat Varshney et.al. 2508.10649 null
2025-08-14 Increasing the Utility of Synthetic Images through Chamfer Guidance Nicola Dall’Asen et.al. 2508.10631 null
2025-08-14 A Unified Framework from Boltzmann Transport to Proton Treatment Planning Andreas E. Kyprianou et.al. 2508.10596 null
2025-08-14 HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis Shiyu Liu et.al. 2508.10566 null
2025-08-14 Projected Coupled Diffusion for Test-Time Constrained Joint Generation Hao Luan et.al. 2508.10531 null
2025-08-14 EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba Quang Nguyen et.al. 2508.10522 null
2025-08-15 KDPE: A Kernel Density Estimation Strategy for Diffusion Policy Trajectory Selection Andrea Rosasco et.al. 2508.10511 null
2025-08-14 A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection Yangjie Xiao et.al. 2508.10509 null
2025-08-14 TweezeEdit: Consistent and Efficient Image Editing with Path Regularization Jianda Mao et.al. 2508.10498 null
2025-08-14 A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation Jiulin Li et.al. 2508.10494 null
2025-08-14 Jamming of active particles in narrow pores: Implications for ratchet effect and diffusion coefficient Šimon Pajger et.al. 2508.10483 null
2025-08-14 NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer Shanyuan Liu et.al. 2508.10424 null
2025-08-14 Extracting a stochastic model for predator-prey dynamic of turbulence and zonal flows with limited data J. C. Huang et.al. 2508.10408 null
2025-08-14 Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models Eunseo Koh et.al. 2508.10407 null
2025-08-14 PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection Haibin Sun et.al. 2508.10397 null
2025-08-14 EDIS: A Simulation Software for Dynamic Ion Intercalation/Deintercalation Processes in Electrode Materials Liqi Wang et.al. 2508.10384 null
2025-08-14 Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models Hyundo Lee et.al. 2508.10382 null
2025-08-14 A Semantic-Aware Framework for Safe and Intent-Integrative Assistance in Upper-Limb Exoskeletons Yu Chen et.al. 2508.10378 null
2025-08-14 Scalable Modeling of Nonlinear Network Dynamics in Neurodegenerative Disease Daniel Semchin et.al. 2508.10343 null
2025-08-14 ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver Wenxuan Song et.al. 2508.10333 null
2025-08-14 Cross-view Generalized Diffusion Model for Sparse-view CT Reconstruction Jixiang Chen et.al. 2508.10313 null
2025-08-14 DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space Exploration Arkapravo Ghosh et.al. 2508.10303 null
2025-08-14 Influence Maximization in Multi-layer Social Networks Based on Differentiated Graph Embeddings Ronghua Lin et.al. 2508.10289 null
2025-08-14 High Fidelity Text to Image Generation with Contrastive Alignment and Structural Guidance Danyi Gao et.al. 2508.10280 null
2025-08-14 A Spectral Solver to Capture Unsteady Dynamics in the Aerospike Nozzle Wake Zachary Pyle et.al. 2508.10275 null
2025-08-14 Non-Decaying Solutions to the 2D Dissipative Quasi-Geostrophic Equations David M. Ambrose et.al. 2508.10254 null
2025-08-13 Run-and-tumble dynamics with non-reciprocal transitions between three velocity states Julio C. R. Romo-Cruz et.al. 2508.10213 null
2025-08-13 Diffusive Braking of Penetrative Convection in Stably-Stratified Fluids Bradley W. Hindman et.al. 2508.10174 null
2025-08-13 Predicting First-Passage Dynamics in Disordered Systems Exactly: Application to Sparse Networks Daniel Marris et.al. 2508.10140 null
2025-08-13 The Perturbation Theory Approach to Stability in the Scattered Disk Matthew Belyakov et.al. 2508.10119 null
2025-08-13 Constrained Decoding of Diffusion LLMs with Context-Free Grammars Niels Mündler et.al. 2508.10111 null
2025-08-13 Quantum circuit simulation with a local time-dependent variational principle Aaron Sander et.al. 2508.10096 null
2025-08-13 Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design Yuhao Sun et.al. 2508.10065 null
2025-08-13 Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation Junyan Ye et.al. 2508.09987 null
2025-08-13 Story2Board: A Training-Free Approach for Expressive Storyboard Generation David Dinkevich et.al. 2508.09983 null
2025-08-13 Masquerade: Learning from In-the-wild Human Videos using Data-Editing Marion Lepert et.al. 2508.09976 null
2025-08-13 PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image Geonhee Sim et.al. 2508.09973 null
2025-08-13 Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models Luca Eyring et.al. 2508.09968 null
2025-08-13 Stable Diffusion Models are Secretly Good at Visual In-Context Learning Trevine Oorloff et.al. 2508.09949 null
2025-08-13 AST-n: A Fast Sampling Approach for Low-Dose CT Reconstruction using Diffusion Models Tomás de la Sotta et.al. 2508.09943 null
2025-08-13 Quo Vadis Handwritten Text Generation for Handwritten Text Recognition? Vittorio Pippi et.al. 2508.09936 null
2025-08-13 Active Particle Diffusion in Convection Roll Arrays Pulak Kumar Ghosh et.al. 2508.09924 null
2025-08-14 Prototype-Guided Diffusion: Visual Conditioning without External Memory Bilal Faye et.al. 2508.09922 null
2025-08-13 Hybrid Quantum-Classical Latent Diffusion Models for Medical Image Generation Kübra Yeter-Aydeniz et.al. 2508.09903 null
2025-08-13 Binary Mixtures in Linear Convection Arrays Pulak Kumar Ghosh et.al. 2508.09902 null
2025-08-13 Exploring the Physics of the Plasma Liner Experiment: A Multi-dimensional Study with FLASH, OSIRIS, and HELIOS E. C. Hansen et.al. 2508.09895 null
2025-08-13 Marketron Through the Looking Glass: From Equity Dynamics to Option Pricing in Incomplete Markets Igor Halperin et.al. 2508.09863 null
2025-08-13 HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics Weiqi Li et.al. 2508.09858 null
2025-08-13 Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance Dhruvraj Singh Rawat et.al. 2508.09847 null
2025-08-13 On the Generalization Limits of Quantum Generative Adversarial Networks with Pure State Generators Jasmin Frkatovic et.al. 2508.09844 null
2025-08-13 Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Weigao Sun et.al. 2508.09834 null
2025-08-13 Physical Autoregressive Model for Robotic Manipulation without Action Pretraining Zijian Song et.al. 2508.09822 null
2025-08-13 Feature Impact Analysis on Top Long-Jump Performances with Quantile Random Forest and Explainable AI Techniques Qi Gan et.al. 2508.09810 null
2025-08-13 Condition number for finite element discretisation of nonlocal PDE systems with applications to biology Olusegun E. Adebayo et.al. 2508.09781 null
2025-08-13 Impacts of the duration and intensity of grazing cycle on vegetation population dynamics in semi-arid ecosystems with seasonal succession Junhong Gan et.al. 2508.09760 null
2025-08-13 Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection Zhiqiu Zhang et.al. 2508.09746 null
2025-08-13 MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers Qianru Qiu et.al. 2508.09709 null
2025-08-13 Hydrodynamic approximations for driven dense colloidal mixtures in narrow pores Frantisek Slanina et.al. 2508.09686 null
2025-08-13 Anomalous Transport of Elongated Particles in Oscillatory Vortical Flows Shiyuan Hu et.al. 2508.09677 null
2025-08-13 GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors Xingyilang Yin et.al. 2508.09667 null
2025-08-13 NegFaceDiff: The Power of Negative Context in Identity-Conditioned Diffusion for Synthetic Face Generation Eduarda Caldeira et.al. 2508.09661 null
2025-08-13 Asymptotic-analysis-inspired boundary conditions aiming at eliminating polymer diffusive instability Ming Dong et.al. 2508.09635 null
2025-08-15 Preacher: Paper-to-Video Agentic System Jingwei Liu et.al. 2508.09632 null
2025-08-13 MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography Daniel Barco et.al. 2508.09616 null
2025-08-13 Global uniform regularity for the 3D incompressible MHD equations with slip boundary condition near a background magnetic field Jincheng Gao et.al. 2508.09609 null
2025-08-13 Images Speak Louder Than Scores: Failure Mode Escape for Enhancing Generative Quality Jie Shao et.al. 2508.09598 null
2025-08-13 Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion Jiwon Kim et.al. 2508.09575 null
2025-08-13 Zeolitic imidazolate framework glasses emit white light Zhencai Li et.al. 2508.09552 null
2025-08-13 Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification Haowen Wang et.al. 2508.09550 null
2025-08-13 Boron Clusters for Metal-Free Water Splitting Masaya Fujioka et.al. 2508.09538 null
2025-08-13 Ehrenfest Dynamics with Spontaneous Localization Anderson A. Tomaz et.al. 2508.09526 null
2025-08-13 Generation of Indian Sign Language Letters, Numbers, and Words Ajeet Kumar Yadav et.al. 2508.09522 null
2025-08-13 A hyperbolic finite difference scheme for anisotropic diffusion equations: preserving the discrete maximum principle Tokuhiro Eto et.al. 2508.09509 null
2025-08-13 Stingrays in the radio sky: Two unusual diffuse radio relic sources in the direction of the Magellanic Stream Zachary J Smeaton et.al. 2508.09495 null
2025-08-13 SARE: Semantic-Aware Reconstruction Error for Generalizable Diffusion-Generated Image Detection Ju Yeon Kang et.al. 2508.09487 null
2025-08-13 CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection Zhipeng Yuan et.al. 2508.09477 null
2025-08-14 From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts Yuji Wang et.al. 2508.09476 null
2025-08-13 Leveraging Failed Samples: A Few-Shot and Training-Free Framework for Generalized Deepfake Detection Shibo Yao et.al. 2508.09475 null
2025-08-13 Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy Hao Yu et.al. 2508.09461 null
2025-08-13 RASR: Retrieval-Augmented Super Resolution for Practical Reference-based Image Restoration Jiaqi Yan et.al. 2508.09449 null
2025-08-13 DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation Haoxiang Shi et.al. 2508.09444 null
2025-08-13 Scaling behaviour of rotating convection in a spherical shell with different Prandtl numbers Wei Fan et.al. 2508.09416 null
2025-08-13 Dynamos driven by top-heavy double-diffusive convection in the strong-field regime Wei Fan et.al. 2508.09410 null
2025-08-12 Understanding Dementia Speech Alignment with Diffusion-Based Image Generation Mansi et.al. 2508.09385 null
2025-08-12 X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents Guoxian Song et.al. 2508.09383 null
2025-08-12 UltraLight Med-Vision Mamba for Classification of Neoplastic Progression in Tubular Adenomas Aqsa Sultana et.al. 2508.09339 null
2025-08-12 Lung-DDPM+: Efficient Thoracic CT Image Synthesis using Diffusion Probabilistic Model Yifan Jiang et.al. 2508.09327 null
2025-08-12 Quantum correction to the Langevin cross section in resonant-exchange processes I. Simbotin et.al. 2508.09302 null
2025-08-12 Evolution of a Long-Lived Deep-Seated Main-Sequence Magnetic Field During White Dwarf Cooling Matias Castro-Tapia et.al. 2508.09268 null
2025-08-12 TFZ: Topology-Preserving Compression of 2D Symmetric and Asymmetric Second-Order Tensor Fields Nathaniel Gorski et.al. 2508.09235 null
2025-08-12 GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction Fan Ding et.al. 2508.09227 null
2025-08-12 Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models Wen Wang et.al. 2508.09138 null
2025-08-12 Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices Ya Zou et.al. 2508.09136 null
2025-08-13 Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer Zixin Yin et.al. 2508.09131 null
2025-08-13 Robust quantum computational advantage with programmable 3050-photon Gaussian boson sampling Hua-Liang Liu et.al. 2508.09092 null
2025-08-13 Direct Measurement of Electron Heating in Electron-Only Reconnection in a Laboratory Mini-Magnetosphere Lucas Rovige et.al. 2508.09086 null
2025-08-12 Rankin-Selberg integrals for $\mathrm{GSpin}$ groups with application to the global Gan-Gross-Prasad conjecture Pan Yan et.al. 2508.09066 null
2025-08-12 Per-Query Visual Concept Learning Ori Malca et.al. 2508.09045 null
2025-08-12 Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks Maxim Divilkovskiy et.al. 2508.09029 null
2025-08-12 Envisioning Generative Artificial Intelligence in Cartography and Mapmaking Yuhao Kang et.al. 2508.09028 null
2025-08-12 TaoCache: Structure-Maintained Video Generation Acceleration Zhentao Fan et.al. 2508.08978 null
2025-08-12 Urban-STA4CLC: Urban Theory-Informed Spatio-Temporal Attention Model for Predicting Post-Disaster Commercial Land Use Change Ziyi Guo et.al. 2508.08976 null
2025-08-12 Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation Soo-Whan Chung et.al. 2508.08953 null
2025-08-12 Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation Ao Ma et.al. 2508.08949 null
2025-08-12 EGGCodec: A Robust Neural Encodec Framework for EGG Reconstruction and F0 Extraction Rui Feng et.al. 2508.08924 null
2025-08-12 When and How Ultrasound Enhances Nanoparticle Diffusion in Hydrogels: A Stick-and-Release Mechanism Pablo M. Blanco et.al. 2508.08918 null
2025-08-12 Sound Signal Synthesis with Auxiliary Classifier GAN, COVID-19 cough as an example Yahya Sherif Solayman Mohamed Saleh et.al. 2508.08892 null
2025-08-12 Transient Noise Removal via Diffusion-based Speech Inpainting Mordehay Moradi et.al. 2508.08890 null
2025-08-12 DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI Bo-Hsun Chen et.al. 2508.08831 null
2025-08-12 Geometry-Aware Global Feature Aggregation for Real-Time Indirect Illumination Meng Gai et.al. 2508.08826 null
2025-08-12 TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models Yuqi Peng et.al. 2508.08812 null
2025-08-12 Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space Luis S. Luevano et.al. 2508.08808 null
2025-08-12 Anomalous Sodium Insertion in Highly Oriented Graphite: Thermodynamics, Kinetics and Evidence for Two-Sided Intercalation Chuanhai Gan et.al. 2508.08806 null
2025-08-14 Measurement-Based Quantum Diffusion Models Xinyu Liu et.al. 2508.08799 null
2025-08-12 DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation Tianyu Xiong et.al. 2508.08783 null
2025-08-12 Patient-Adaptive Focused Transmit Beamforming using Cognitive Ultrasound Wessel L. van Nierop et.al. 2508.08782 null
2025-08-12 Exploring Palette based Color Guidance in Diffusion Models Qianru Qiu et.al. 2508.08754 null
2025-08-12 Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models Ruofeng Yang et.al. 2508.08735 null
2025-08-13 A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models Lingzhe Zhang et.al. 2508.08712 null
2025-08-12 Towards Safe Imitation Learning via Potential Field-Guided Flow Matching Haoran Ding et.al. 2508.08707 null
2025-08-12 SafeFix: Targeted Model Repair via Controlled Image Generation Ouyang Xu et.al. 2508.08701 null
2025-08-12 Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos Qi Zheng et.al. 2508.08700 null
2025-08-12 DiffVolume: Diffusion Models for Volume Generation in Limit Order Books Zhuohan Wang et.al. 2508.08698 null
2025-08-12 Detecting Sterile Neutrino Dark Matter at MeV Gamma-Ray Observatories Subaru Fujisawa et.al. 2508.08695 null
2025-08-12 Expert-Guided Diffusion Planner for Auto-bidding Yunshan Peng et.al. 2508.08687 null
2025-08-12 In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality Chenrui Liu et.al. 2508.08673 null
2025-08-12 Nonlinear dynamics of reaction-diffusion wave trains under large and fully nonlocalized modulations Joannis Alexopoulos et.al. 2508.08637 null
2025-08-14 Yan: Foundational Interactive Video Generation Deheng Ye et.al. 2508.08601 null
2025-08-12 RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space Jingyun Liang et.al. 2508.08588 null
2025-08-12 Unlocking the Potential of Diffusion Priors in Blind Face Restoration Yunqi Miao et.al. 2508.08556 null
2025-08-12 UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction Dahai Yu et.al. 2508.08551 null
2025-08-12 Fluorescence time profile measurement of LAB based liquid scintillator in response to medium relativistic ion particles Xiaojie Luo et.al. 2508.08546 null
2025-08-12 Transition to Petschek Reconnection in Subrelativistic Pair Plasmas: Implications for Particle Acceleration Adam Robbins et.al. 2508.08533 null
2025-08-11 SynLLM: A Comparative Analysis of Large Language Models for Medical Tabular Synthetic Data Generation via Prompt Engineering Arshia Ilaty et.al. 2508.08529 null
2025-08-11 Control-affine Schrödinger Bridge and Generalized Bohm Potential Alexis M. H. Teter et.al. 2508.08511 null
2025-08-11 CObL: Toward Zero-Shot Ordinal Layering without User Prompting Aneel Damaraju et.al. 2508.08498 null
2025-08-11 MuGa-VTON: Multi-Garment Virtual Try-On via Diffusion Transformers with Prompt Customization Ankan Deria et.al. 2508.08488 null
2025-08-11 MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling Qian Wang et.al. 2508.08487 null
2025-08-11 Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features Pallabee Das et.al. 2508.08458 null
2025-08-11 Hot Jupiter formation in dense stellar clusters: A Monte Carlo model applied to 47 Tucanae J. A. Wirth et.al. 2508.08406 null
2025-08-11 Wave Propagation Dynamics via Lattice Difference Equations Eddy Kwessi et.al. 2508.08387 null
2025-08-11 Spatiotemporally Consistent Indoor Lighting Estimation with Diffusion Priors Mutian Tong et.al. 2508.08384 null
2025-08-11 Exponentially Improved Constant in Quantum Solution Extraction Gumaro Rendon et.al. 2508.08375 null
2025-08-11 StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation Shuyuan Tu et.al. 2508.08248 null
2025-08-12 Cut2Next: Generating Next Shot via In-Context Tuning Jingwen He et.al. 2508.08244 null
2025-08-13 BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion Qiayuan Liao et.al. 2508.08241 null
2025-08-11 OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution Zhiqiang Wu et.al. 2508.08227 null
2025-08-11 Learning User Preferences for Image Generation Model Wenyi Mo et.al. 2508.08220 null
2025-08-11 Reinforcement Learning in Vision: A Survey Weijia Wu et.al. 2508.08189 null
2025-08-13 CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data Chongke Bi et.al. 2508.08173 null
2025-08-11 ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction Chaojun Ni et.al. 2508.08170 null
2025-08-11 An effective potential for generative modelling with active matter Adrian Baule et.al. 2508.08146 null
2025-08-11 Reproducing and Extending Brownian Motion in Optical Trap: A Computational Reimplementation of Volpe and Volpe (2013) Eyad I. B Hamid et.al. 2508.08138 null
2025-08-11 FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting Yitong Yang et.al. 2508.08136 null
2025-08-11 Optimal Dividend, Reinsurance, and Capital Injection Strategies for an Insurer with Two Collaborating Business Lines Tim J. Boonen et.al. 2508.08130 null
2025-08-11 Learned Regularization for Microwave Tomography Bowen Tong et.al. 2508.08114 null
2025-08-11 TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning Junzhe Xu et.al. 2508.08098 null
2025-08-11 Fast and Generalizable parameter-embedded Neural Operators for Lithium-Ion Battery Simulation Amir Ali Panahi et.al. 2508.08087 null
2025-08-11 Matrix-3D: Omnidirectional Explorable 3D World Generation Zhongqi Yang et.al. 2508.08086 null
2025-08-12 Why Bohmian velocity might not be the only quantum velocity and the role of quantum diffusion flux is super-luminal wave packets Charalampos Antonakos et.al. 2508.08065 null
2025-08-11 S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix Peng Dai et.al. 2508.08048 null
2025-08-12 Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation Fangyuan Mao et.al. 2508.07981 null
2025-08-11 Well-posedness for a fourth-order nonisothermal tumor growth model of Caginalp type Giulia Cavalleri et.al. 2508.07979 null
2025-08-12 Adaptive Multiple Access and Service Placement for Generative Diffusion Models Hamidreza Mazandarani et.al. 2508.07978 null
2025-08-11 Deep imaging of the galaxy Malin 2 shows new faint structures and a candidate satellite dwarf galaxy Junais et.al. 2508.07930 null
2025-08-11 Score Augmentation for Diffusion Models Liang Hou et.al. 2508.07926 null
2025-08-11 Generative Video Matting Yongtao Ge et.al. 2508.07905 null
2025-08-11 Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models Johanna P. Müller et.al. 2508.07903 null
2025-08-12 Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation Bowen Xue et.al. 2508.07901 null
2025-08-11 NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction Tianle Zeng et.al. 2508.07897 null
2025-08-11 Deep Learning-Based Desikan-Killiany Parcellation of the Brain Using Diffusion MRI Yousef Sadegheih et.al. 2508.07815 null
2025-08-11 DiTVR: Zero-Shot Diffusion Transformer for Video Restoration Sicheng Gao et.al. 2508.07811 null
2025-08-11 MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks Yushen Xu et.al. 2508.07803 null
2025-08-11 Generative Inversion for Property-Targeted Materials Design: Application to Shape Memory Alloys Cheng Li et.al. 2508.07798 null
2025-08-11 Feynman-Kac formula for general time dependent stochastic parabolic equation on a bounded domain and applications Yaozhong Hu et.al. 2508.07793 null
2025-08-13 AgentWorld: An Interactive Simulation Platform for Scene Construction and Mobile Robotic Manipulation Yizheng Zhang et.al. 2508.07770 null
2025-08-11 Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation Xiaoyan Liu et.al. 2508.07769 null
2025-08-11 Sea-Undistort: A Dataset for Through-Water Image Restoration in High Resolution Airborne Bathymetric Mapping Maximilian Kromer et.al. 2508.07760 null
2025-08-11 Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild Haoran Wang et.al. 2508.07759 null
2025-08-11 Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion Minseo Kim et.al. 2508.07755 null
2025-08-11 Grouped Speculative Decoding for Autoregressive Image Generation Junhyuk So et.al. 2508.07747 null
2025-08-11 Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder? Hui-Peng Du et.al. 2508.07711 null
2025-08-11 Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing Weitao Wang et.al. 2508.07700 null
2025-08-11 DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework Wenzhuo Ma et.al. 2508.07682 null
2025-08-11 LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering Xiaohang Zhan et.al. 2508.07647 null
2025-08-11 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning Jian Ma et.al. 2508.07607 null
2025-08-11 LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation Wenhui Song et.al. 2508.07603 null
2025-08-11 ShoulderShot: Generating Over-the-Shoulder Dialogue Videos Yuang Zhang et.al. 2508.07597 null
2025-08-11 Procedural Mixture Sets Hendrik Rommeswinkel et.al. 2508.07588 null
2025-08-12 From Platform Migration to Cultural Integration: the Ingress and Diffusion of #wlw from TikTok to RedNote in Queer Women Communities Ziqi Pan et.al. 2508.07579 null
2025-08-11 UniFlow: Unifying Speech Front-End Tasks via Continuous Generative Modeling Ziqian Wang et.al. 2508.07558 null
2025-08-11 Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation Minghao Yin et.al. 2508.07557 null
2025-08-11 Physics-informed Multiresolution Wavelet Neural Network Method for Solving Partial Differential Equations Feng Han et.al. 2508.07546 null
2025-08-11 Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing Joonghyuk Shin et.al. 2508.07519 null
2025-08-10 Forecasting solar power output in Ibadan: A machine learning approach leveraging weather data and system specifications Obarotu Peter Urhuerhi et.al. 2508.07462 null
2025-08-10 Unified Semiclassical Theory of Nonlinear Hall Effect: Bridging Ballistic and Diffusive Transport Regime Xinyu Liu et.al. 2508.07445 null
2025-08-10 Robust, fast, and adaptive splitting schemes for nonlinear doubly-degenerate diffusion equations Ayesha Javed et.al. 2508.07420 null
2025-08-10 CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization Youqi Wang et.al. 2508.07413 null
2025-08-10 Conditional splitting probabilities for hidden-state inference in drift-diffusive processes Emir Sezik et.al. 2508.07386 null
2025-08-10 Supercritical fluids as a distinct state of matter characterized by sub-short-range structural order Sha Jin et.al. 2508.07385 null
2025-08-10 SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal Tingyu Yang et.al. 2508.07346 null
2025-08-10 CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation Fangtai Wu et.al. 2508.07341 null
2025-08-10 Linear-Quadratic Mean Field Games with Common Noise: A Direct Approach Wenyu Cong et.al. 2508.07271 null
2025-08-10 Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers Xin Ma et.al. 2508.07246 null
2025-08-10 Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation Chu Zhao et.al. 2508.07243 null
2025-08-10 HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation Xuepeng Liu et.al. 2508.07225 null
2025-08-10 Neural Bridge Processes Jian Xu et.al. 2508.07220 null
2025-08-10 Explainability-in-Action: Enabling Expressive Manipulation and Tacit Understanding by Bending Diffusion Models in ComfyUI Ahmed M. Abuzuraiq et.al. 2508.07183 null
2025-08-10 CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion Xiaotong Lin et.al. 2508.07162 null
2025-08-10 SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models Ruolin Yang et.al. 2508.07149 null
2025-08-10 Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction Yu Liu et.al. 2508.07146 null
2025-08-10 SketchConcept: Sketching-based Concept Recomposition for Product Design using Generative AI Runlin Duan et.al. 2508.07141 null
2025-08-10 Canvas3D: Empowering Precise Spatial Control for Image Generation with Constraints from a 3D Virtual Canvas Runlin Duan et.al. 2508.07135 null
2025-08-10 On the geometric Brownian motion with state-dependent variable exponent diffusion term Mustafa Avci et.al. 2508.07130 null
2025-08-10 Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays Gregory Schuit et.al. 2508.07128 null
2025-08-10 Modelling Human Skin Morphology and Simulating Transdermal Transport of 50 Chemicals Milana Tesfamarian et.al. 2508.07123 null
2025-08-09 DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit Aiden Swann et.al. 2508.07118 null
2025-08-09 Whisfusion: Parallel ASR Decoding via a Diffusion Transformer Taeyoun Kwon et.al. 2508.07048 null
2025-08-09 A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling Tiantian He et.al. 2508.07032 null
2025-08-09 Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities Anindya Bijoy Das et.al. 2508.07031 null
2025-08-09 Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings Mao Li et.al. 2508.07017 null
2025-08-12 HiMat: DiT-based Ultra-High Resolution SVBRDF Generation Zixiong Wang et.al. 2508.07011 null
2025-08-09 Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments Gian Mario Favero et.al. 2508.07006 null
2025-08-09 Mechanism of Anisotropic Crystallization and Phase Transitions under Van der Waals Squeezing Yuxiang Gao et.al. 2508.06992 null
2025-08-09 WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering Yixin Zhu et.al. 2508.06982 null
2025-08-09 Structure-Preserving Digital Twins via Conditional Neural Whitney Forms Brooks Kinch et.al. 2508.06981 null
2025-08-09 CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing Weiyan Xie et.al. 2508.06937 null
2025-08-09 Unveiling the Puzzle of Brittleness in Single Crystal Iridium Qing Cheng et.al. 2508.06929 null
2025-08-09 AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning Shihao Yuan et.al. 2508.06924 null
2025-08-09 Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing Shichao Ma et.al. 2508.06916 null
2025-08-09 MultiRef: Controllable Image Generation with Multiple Visual References Ruoxi Chen et.al. 2508.06905 null
2025-08-09 Text to Speech System for Meitei Mayek Script Gangular Singh Irengbam et.al. 2508.06870 null
2025-08-09 Speech Enhancement based on cascaded two flow Seonggyu Lee et.al. 2508.06842 null
2025-08-09 FlowSE: Flow Matching-based Speech Enhancement Seonggyu Lee et.al. 2508.06840 null
2025-08-09 Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models Shiqian Zhao et.al. 2508.06837 null
2025-08-09 A Score-based Diffusion Model Approach for Adaptive Learning of Stochastic Partial Differential Equation Solutions Toan Huynh et.al. 2508.06834 null
2025-08-09 Efficient data-driven regression for reduced-order modeling of spatial pattern formation Alessandro Alla et.al. 2508.06833 null
2025-08-09 Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation Xiao Huang et.al. 2508.06806 null
2025-08-09 D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning Shu-Ang Yu et.al. 2508.06804 null
2025-08-09 GaN/InN HEMT based UV photodetector on SiC with hexagonal boron nitride passivation Mustafa Kilin et.al. 2508.06782 null
2025-08-08 Topology Generation of UAV Covert Communication Networks: A Graph Diffusion Approach with Incentive Mechanism Xin Tang et.al. 2508.06746 null
2025-08-08 Design of high-mobility p-type GaN via the piezomobility tensor Jie-Cheng Chen et.al. 2508.06723 null
2025-08-08 Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video Jixuan He et.al. 2508.06715 null
2025-08-08 LightSwitch: Multi-view Relighting with Material-guided Diffusion Yehonathan Litman et.al. 2508.06494 null
2025-08-08 Weak approximation of stochastic differential equations with sticky boundary conditions Akash Sharma et.al. 2508.06487 null
2025-08-08 SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning Lingkun Long et.al. 2508.06447 null
2025-08-08 SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation Guido Manni et.al. 2508.06429 null
2025-08-08 4D operando X-ray nano-holo-tomography reveals multiscale chemomechanics in Silicon-Graphite anode Victor Vanpeene et.al. 2508.06413 null
2025-08-08 FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation Wenbin Teng et.al. 2508.06392 null
2025-08-08 Diffuse measures and nonlinear parabolic equations Francesco Petitta et.al. 2508.06384 null
2025-08-08 ActivityDiff: A diffusion model with Positive and Negative Activity Guidance for De Novo Drug Design Renyi Zhou et.al. 2508.06364 null
2025-08-08 Quantum Algorithm for Estimating Intrinsic Geometry Nhat A. Nghiem et.al. 2508.06355 null
2025-08-08 Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? Xin Ci Wong et.al. 2508.06327 null
2025-08-08 OM2P: Offline Multi-Agent Mean-Flow Policy Zhuoran Li et.al. 2508.06269 null
2025-08-08 ADPro: a Test-time Adaptive Diffusion Policy for Robot Manipulation via Manifold and Initial Noise Constraints Zezeng Li et.al. 2508.06266 null
2025-08-08 Tanaka formula for SDEs driven by fractional Brownian motion Tommi Sottinen et.al. 2508.06261 null
2025-08-08 Low dimensional dynamics of a sparse balanced synaptic network of quadratic integrate-and-fire neurons Maria V. Ageeva et.al. 2508.06253 null
2025-08-08 Light-Addressable Smart Nanostructures via Resonant Nanoheating Victor Tabouillot et.al. 2508.06215 null
2025-08-08 Inverse Source Problems for the Time-Fractional Evolution Equation Rahmonov Askar Ahmadovich et.al. 2508.06209 null
2025-08-08 Clinically-guided Data Synthesis for Laryngeal Lesion Detection Chiara Baldini et.al. 2508.06182 null
2025-08-08 Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation Ojonugwa Oluwafemi Ejiga Peter et.al. 2508.06170 null
2025-08-08 Sharp non-existence threshold for a parabolic Hardy-Hénon equation with quasilinear diffusion Razvan Gabriel Iagar et.al. 2508.06164 null
2025-08-08 Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment Zhenbang Du et.al. 2508.06160 null
2025-08-08 Revealing the Staging Structural Evolution and Li (De)Intercalation Kinetics in Graphite Anodes via Machine Learning Potential Liqi Wang et.al. 2508.06156 null
2025-08-08 VISTAR: A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation Kaiyuan Jiang et.al. 2508.06152 null
2025-08-08 Improving Diagnostic Accuracy for Oral Cancer with inpainting Synthesis Lesions Generated Using Diffusion Models Yong Oh Lee et.al. 2508.06151 null
2025-08-08 DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera Shaohua Pan et.al. 2508.06139 null
2025-08-08 GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving Jian Wang et.al. 2508.06113 null
2025-08-08 MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment Gui Zou et.al. 2508.06104 null
2025-08-08 UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization Yachun Mi et.al. 2508.06101 null
2025-08-08 MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows Xiquan Li et.al. 2508.06098 null
2025-08-08 E-React: Towards Emotionally Controlled Synthesis of Human Reactions Chen Zhu et.al. 2508.06093 null
2025-08-08 SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment Yanxiao Sun et.al. 2508.06082 null
2025-08-08 DreamVE: Unified Instruction-based Image and Video Editing Bin Xia et.al. 2508.06080 null
2025-08-08 Towards MR-Based Trochleoplasty Planning Michael Wehrli et.al. 2508.06076 null
2025-08-08 Radio continuum and HI 21-cm line observations of a nearby luminous infrared galaxy IRAS 17526+3253 Jianfeng Wu et.al. 2508.06075 null
2025-08-08 Real-time physics-informed reconstruction of transient fields using sensor guidance and higher-order time differentiation Hong-Kyun Noh et.al. 2508.06070 null
2025-08-08 ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Generation Daniel Lee et.al. 2508.06065 null
2025-08-08 NEP: Autoregressive Image Editing via Next Editing Token Prediction Huimin Wu et.al. 2508.06044 null
2025-08-08 Bayesian Radio Map Estimation: Fundamentals and Implementation via Diffusion Models Tien Ngoc Ha et.al. 2508.06037 null
2025-08-08 InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow Yiming Gong et.al. 2508.06033 null
2025-08-08 Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts Kiran Chhatre et.al. 2508.06032 null
2025-08-08 Improved Sub-Visible Particle Classification in Flow Imaging Microscopy via Generative AI-Based Image Synthesis Utku Ozbulak et.al. 2508.06021 null
2025-08-08 Vacuum Dealloyed Brass as Li-Metal Battery Current Collector: Effect of Zinc and Porosity Eric V Woods et.al. 2508.06015 null
2025-08-08 ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors Minsu Kim et.al. 2508.06014 null
2025-08-08 KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training Kai Zhang et.al. 2508.06001 null
2025-08-08 Global solutions in $L^{p}_{v}L^{\infty}_{x}$ for the Boltzmann equation in bounded domains Dingqun Deng et.al. 2508.05985 null
2025-08-08 Revisiting $μ$SR Studies of Ion Dynamics in the Light of Extended Kubo-Toyabe Model Takashi U. Ito et.al. 2508.05968 null
2025-08-08 Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents Han Lin et.al. 2508.05954 null
2025-08-08 A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image Yanxing Liang et.al. 2508.05950 null
2025-08-08 Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution Zhanyi Sun et.al. 2508.05941 null
2025-08-08 Reverse Diffusion Sequential Monte Carlo Samplers Luhuan Wu et.al. 2508.05926 null
2025-08-08 Fast, Convex and Conditioned Network for Multi-Fidelity Vectors and Stiff Univariate Differential Equations Siddharth Rout et.al. 2508.05921 null
2025-08-07 Measurement of All Flavor PeV Neutrino Flux using Combined Datasets from IceCube Emre Yildizci et.al. 2508.05886 null
2025-08-07 Emerging ultra-wide band gap semiconductors for future high-frequency electronics Emily M. Garrity et.al. 2508.05823 null
2025-08-07 FineDialFact: A benchmark for Fine-grained Dialogue Fact Verification Xiangyan Chen et.al. 2508.05782 null
2025-08-07 MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss Can Zhao et.al. 2508.05772 null
2025-08-07 UnGuide: Learning to Forget with LoRA-Guided Diffusion Models Agnieszka Polowczyk et.al. 2508.05755 null
2025-08-07 Quantum Reservoir GAN Hikaru Wakaura et.al. 2508.05716 null
2025-08-07 High multiplicity and global structure of coexistence states in a predator-prey model with saturation Kousuke Kuto et.al. 2508.05714 null
2025-08-07 Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Yue Liao et.al. 2508.05635 null
2025-08-07 GAP: Gaussianize Any Point Clouds with Text Guidance Weiqi Zhang et.al. 2508.05631 null
2025-08-07 Latent Space Diffusion for Topology Optimization Aaron Lutheran et.al. 2508.05624 null
2025-08-07 Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision Luozheng Qin et.al. 2508.05606 null
2025-08-07 Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations Hanzeng Guo et.al. 2508.05598 null
2025-08-07 Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis Yifan Wang et.al. 2508.05572 null
2025-08-07 MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Shibo Wang et.al. 2508.05506 null
2025-08-07 Heat and super-diffusive melting fronts in unsaturated porous media Eirik G. Flekkøy et.al. 2508.05451 null
2025-08-07 Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI Krzysztof Janowicz et.al. 2508.05432 null
2025-08-07 MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow Md Atik Ahamed et.al. 2508.05411 null
2025-08-07 UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Wonjun Kang et.al. 2508.05399 null
2025-08-07 Real-Time Iteration Scheme for Diffusion Policy Yufei Duan et.al. 2508.05396 null
2025-08-09 Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms Jie Xiao et.al. 2508.05387 null
2025-08-07 Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising Xiaoxi Cui et.al. 2508.05352 null
2025-08-07 Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties Susmita Chowdhury et.al. 2508.05330 null
2025-08-07 Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting Frank Ruis et.al. 2508.05323 null
2025-08-07 Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces Mathias Rose Bjare et.al. 2508.05306 null
2025-08-07 SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens Nikita Dragunov et.al. 2508.05305 null
2025-08-07 An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods Emil Løvbak et.al. 2508.05303 null
2025-08-07 Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection Xiaoyang Zhang et.al. 2508.05271 null
2025-08-07 B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding Changho Choi et.al. 2508.05269 null
2025-08-07 SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion Xiaoyang Zhang et.al. 2508.05264 null
2025-08-07 ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models Yatong Lan et.al. 2508.05236 null
2025-08-07 Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces Joly Romain et.al. 2508.05220 null
2025-08-07 An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling Junming Duan et.al. 2508.05166 null
2025-08-07 RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer Fangyu Du et.al. 2508.05115 null
2025-08-07 PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation Jingxuan He et.al. 2508.05091 null
2025-08-07 MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design Hao Li et.al. 2508.05076 null
2025-08-07 Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation Yongfu Zha et.al. 2508.05074 null
2025-08-07 FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer Jian Zhu et.al. 2508.05069 null
2025-08-07 DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion Yifeng Huang et.al. 2508.05060 null
2025-08-07 Observation of Super-ballistic Brownian Motion in Liquid Jason Boynewicz et.al. 2508.05031 null
2025-08-07 Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere Jeehyun Yang et.al. 2508.05007 null
2025-08-07 Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity Fubao Xi et.al. 2508.04997 null
2025-08-08 REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers Yuepeng Jiang et.al. 2508.04996 null
2025-08-07 Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression Zheng Chen et.al. 2508.04979 null
2025-08-06 Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids Cal J. Rising et.al. 2508.04930 null
2025-08-06 Taxonomy of Faults in Attention-Based Neural Networks Sigma Jahan et.al. 2508.04925 null
2025-08-08 Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model Luis Morales-Navarro et.al. 2508.04902 null
2025-08-06 The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models Leo Zhang et.al. 2508.04884 null
2025-08-06 Unified Flow Matching for Long Horizon Event Forecasting Xiao Shou et.al. 2508.04843 null
2025-08-06 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off Seungyong Lee et.al. 2508.04825 null
2025-08-06 Delay-constrained re-entry governs large-scale brain seizures and other network pathologies Paul Triebkorn et.al. 2508.04824 null
2025-08-06 Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models Mehrdad Moradi et.al. 2508.04818 null
2025-08-06 Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach Anderson O. Calixto et.al. 2508.04809 null
2025-08-06 Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture Bernard Parent et.al. 2508.04806 null
2025-08-06 ACM Multimedia Grand Challenge on ENT Endoscopy Analysis Trong-Thuan Nguyen et.al. 2508.04801 null
2025-08-08 Quantum-impurity sensing of altermagnetic order V. A. S. V. Bittencourt et.al. 2508.04788 null
2025-08-06 Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC) Nan Li et.al. 2508.04745 null
2025-08-06 A colossal dielectric response of HfxZr1-xO2 nanoparticles Oleksandr S. Pylypchuk et.al. 2508.04697 null
2025-08-06 Diffusion in a $d$-dimensional rough potential Jacob Jeffries et.al. 2508.04674 null
2025-08-06 HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models Young D. Kwon et.al. 2508.04663 null
2025-08-06 Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics Lars Torbjørn Stutzer et.al. 2508.04647 null
2025-08-06 A unified model for linear responses of physical networks José M. Ortiz-Tavárez et.al. 2508.04616 null
2025-08-06 Multitask Learning with Stochastic Interpolants Hugo Negrel et.al. 2508.04605 null
2025-08-07 A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI Nicola Casali et.al. 2508.04588 null
2025-08-06 Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming A. Tarik Leblebici et.al. 2508.04570 null
2025-08-06 DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling Yijie Li et.al. 2508.04568 null
2025-08-06 TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning Yunbi Liu et.al. 2508.04565 null
2025-08-06 Drone Detection with Event Cameras Gabriele Magrini et.al. 2508.04564 null
2025-08-06 One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose Jinxi Liu et.al. 2508.04559 null
2025-08-06 Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis Angang Zhang et.al. 2508.04551 null
2025-08-06 MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning Quang-Trung Truong et.al. 2508.04549 null
2025-08-06 X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids P. G. Heighway et.al. 2508.04525 null
2025-08-06 $β$-Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes José A. S. Laranjeira et.al. 2508.04506 null
2025-08-06 QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution Bowen Chai et.al. 2508.04485 null
2025-08-06 Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model Hongxu Chen et.al. 2508.04472 null
2025-08-06 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation Shuzhou Yang et.al. 2508.04467 null
2025-08-06 Case Studies of Generative Machine Learning Models for Dynamical Systems Nachiket U. Bapat et.al. 2508.04459 null
2025-08-06 Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach Alvaro Garrido Perez et.al. 2508.04435 null
2025-08-06 Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis Ethan Dack et.al. 2508.04429 null
2025-08-06 Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations Nick Vogeley et.al. 2508.04364 null
2025-08-06 Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting Eberhard Bänsch et.al. 2508.04360 null
2025-08-06 From Split to Share: Private Inference with Distributed Feature Sharing Zihan Liu et.al. 2508.04346 null
2025-08-06 Performative Market Making Charalampos Kleitsikas et.al. 2508.04344 null
2025-08-06 TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Xiaoxuan He et.al. 2508.04324 null
2025-08-06 Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation Miquel Cantallops et.al. 2508.04319 null
2025-08-06 Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations Margaux Boxho et.al. 2508.04318 null
2025-08-06 Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions Yuga Iguchi et.al. 2508.04287 null
2025-08-06 S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge JinYi Yoon et.al. 2508.04271 null
2025-08-06 Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications Vladislav Pimanov et.al. 2508.04261 null
2025-08-06 High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting Zhiren Ma et.al. 2508.04259 null
2025-08-06 Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions Nikolaos A. Burger et.al. 2508.04244 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification Saifullah Saifullah et.al. 2508.04233 null
2025-08-06 Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction Yu Liu et.al. 2508.04229 null
2025-08-06 LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation Kangrui Cen et.al. 2508.04228 null
2025-08-06 DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models Saifullah Saifullah et.al. 2508.04208 null
2025-08-06 A background-free signal of jet-induced diffusion wake in quark-gluon plasma Zhong Yang et.al. 2508.04194 null
2025-08-06 Deeper Inside Deep ViT Sungrae Hong et.al. 2508.04181 null
2025-08-06 Quasi-Clique Discovery via Energy Diffusion Yu Zhang et.al. 2508.04174 null
2025-08-06 Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles Mathis Guéneau et.al. 2508.04154 null
2025-08-06 IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control Lijuan Liu et.al. 2508.04147 null
2025-08-06 Polynomial-time sampling despite disorder chaos Eric Ma et.al. 2508.04133 null
2025-08-06 Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation Maximilian Ulmer et.al. 2508.04122 null
2025-08-06 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Yi-Ting Chen et.al. 2508.04090 null
2025-08-06 Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes Pierre Collet et.al. 2508.04089 null
2025-08-06 Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows Murray Cutforth et.al. 2508.04084 null
2025-08-06 POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model Huipeng Gu et.al. 2508.04082 null
2025-08-06 Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion Fangmin Zhao et.al. 2508.04055 null
2025-08-06 Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation Jiayi He et.al. 2508.04049 null
2025-08-06 Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws L. Miguel Rodrigues et.al. 2508.04023 null
2025-08-07 S$^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation Weilun Feng et.al. 2508.04016 null
2025-08-06 Constructing Generalized Sample Transition Probabilities with Biased Simulations Yanbin Wang et.al. 2508.03977 null
2025-08-05 Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm Lin Zhang et.al. 2508.03955 null
2025-08-05 Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model Shen Zhu et.al. 2508.03925 null
2025-08-05 Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations R. R. Ashurov et.al. 2508.03859 null
2025-08-05 VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations Yifei Zong et.al. 2508.03839 null
2025-08-05 HPSv3: Towards Wide-Spectrum Human Preference Score Yuhang Ma et.al. 2508.03789 null
2025-08-05 LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Jianxiong Gao et.al. 2508.03694 null
2025-08-05 LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences Ao Liang et.al. 2508.03692 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World Katherine Liu et.al. 2508.03669 null
2025-08-05 Rigidity for graph product von Neumann algebras Camille Horbez et.al. 2508.03662 null
2025-08-05 DiWA: Diffusion Policy Adaptation with World Models Akshay L Chandra et.al. 2508.03645 null
2025-08-05 Likelihood Matching for Diffusion Models Lei Qian et.al. 2508.03636 null
2025-08-05 Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion Shoji Mori et.al. 2508.03624 null
2025-08-05 Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions Robert Richardson et.al. 2508.03617 null
2025-08-05 CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models Ana Lawry Aguila et.al. 2508.03594 null
2025-08-05 Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection Long Qian et.al. 2508.03539 null
2025-08-05 X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations Silvia Pellegrini et.al. 2508.03536 null
2025-08-05 CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation Kaishen Yuan et.al. 2508.03535 null
2025-08-05 LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation Lianwei Yang et.al. 2508.03485 null
2025-08-05 When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models Dasol Choi Jihwan Lee et.al. 2508.03483 null
2025-08-05 Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models Hyungjin Kim et.al. 2508.03481 null
2025-08-05 VideoGuard: Protecting Video Content from Unauthorized Editing Junjie Cao et.al. 2508.03480 null
2025-08-05 Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation Zijun Zhan et.al. 2508.03464 null
2025-08-06 READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation Haotian Wang et.al. 2508.03457 null
2025-08-05 Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws Haruki Takemura et.al. 2508.03455 null
2025-08-05 RAAG: Ratio Aware Adaptive Guidance Shangwen Zhu et.al. 2508.03442 null
2025-08-05 Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN Shivangi Nigam et.al. 2508.03415 null
2025-08-05 SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models Pingchuan Ma et.al. 2508.03402 null
2025-08-05 Delay-facilitated self-assembly in compartmentalized systems Severin Angerpointner et.al. 2508.03383 null
2025-08-05 Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration Ni Tang et.al. 2508.03373 null
2025-08-05 A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design Xinyu Jin et.al. 2508.03370 null
2025-08-05 GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images Yifei Sun et.al. 2508.03357 null
2025-08-05 Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises Nikos I. Kavallaris et.al. 2508.03354 null
2025-08-06 Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation Xunzhi Xiang et.al. 2508.03334 null
2025-08-05 Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Peiyu Wang et.al. 2508.03320 null
2025-08-05 Thermal Metamaterials for Enhanced Non-Fourier Heat Transport Harry Mclean et.al. 2508.03316 null
2025-08-05 The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations Xinqiu Chen et.al. 2508.03311 null
2025-08-05 Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation Jun Luo et.al. 2508.03300 null
2025-08-05 Investigation on deep learning-based galaxy image translation models Hengxin Ruan et.al. 2508.03291 null
2025-08-07 Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the Lp Lq Maximal Regularity Setting Ken Furukawa et.al. 2508.03288 null
2025-08-07 Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension Bao-Ngoc Tran et.al. 2508.03268 null
2025-08-05 Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation Gang Dai et.al. 2508.03256 null
2025-08-05 V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models Jisoo Kim et.al. 2508.03254 null
2025-08-05 Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion Wentao Qu et.al. 2508.03252 null
2025-08-06 FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles Xingchao Yang et.al. 2508.03241 null
2025-08-05 BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models Yu Pan et.al. 2508.03221 null
2025-08-05 Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level Amir Seginer et.al. 2508.03220 null
2025-08-05 Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance Eliot Beyler et.al. 2508.03210 null
2025-08-05 Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models Muhammed Saeed et.al. 2508.03199 null
2025-08-05 An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys Qianxi Zhu et.al. 2508.03163 null
2025-08-05 SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance Yanshu Wang et.al. 2508.03143 null
2025-08-05 UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying Chengyu Bai et.al. 2508.03142 null
2025-08-05 Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations Igor G. Vladimirov et.al. 2508.03135 null
2025-08-05 Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback Jingyi Chen et.al. 2508.03123 null
2025-08-05 Power System Voltage Stability Boundary: Computational Results and Applications Zhenyao Li et.al. 2508.03119 null
2025-08-05 T2UE: Generating Unlearnable Examples from Text Descriptions Xingjun Ma et.al. 2508.03091 null
2025-08-05 MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation Youran Zhou et.al. 2508.03083 null
2025-08-05 Multi-human Interactive Talking Dataset Zeyu Zhu et.al. 2508.03050 null
2025-08-05 Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling Ruixing Zhang et.al. 2508.03042 null
2025-08-05 Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations Dimitri Breda et.al. 2508.03040 null
2025-08-05 MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention Qi Xie et.al. 2508.03034 null
2025-08-05 LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning Jie Lin et.al. 2508.03024 null
2025-08-05 Generating Light-based Fingerprints for Indoor Localization Hsun-Yu Lee et.al. 2508.03011 null
2025-08-05 Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models Fan Yang et.al. 2508.03006 null
2025-08-05 Diffusion Models with Adaptive Negative Sampling Without External Resources Alakh Desai et.al. 2508.02973 null
2025-08-05 Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver Jonathan Patsenker et.al. 2508.02964 null
2025-08-04 X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio Chenxu Zhang et.al. 2508.02944 null
2025-08-04 Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators Sourojit Ghosh et.al. 2508.02937 null
2025-08-06 A nonstandard finite difference scheme for an SEIQR epidemiological PDE model Achraf Zinihi et.al. 2508.02928 null
2025-08-04 Goal-Oriented Adaptive Finite Element Multilevel Quasi-Monte Carlo Joakim Beck et.al. 2508.02925 null
2025-08-04 How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution Minh-Hai Nguyen et.al. 2508.02923 null
2025-08-04 RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation Mehrdad Moradi et.al. 2508.02903 null
2025-08-04 REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport Farzad Beizaee et.al. 2508.02889 null
2025-08-04 Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters Tara Dacunha et.al. 2508.02837 null
2025-08-04 DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework Tongchun Zuo et.al. 2508.02807 null
2025-08-04 NASIM: Revealing the low surface brightness Universe from legacy VISTA data Elham Saremi et.al. 2508.02780 null
2025-08-04 D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss Guowei Zou et.al. 2508.02644 null
2025-08-04 CAK: Emergent Audio Effects from Minimal Deep Learning Austin Rockman et.al. 2508.02643 null
2025-08-04 Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters Pranshu Maan et.al. 2508.02638 null
2025-08-04 ReMoMask: Retrieval-Augmented Masked Motion Generation Zhengdao Li et.al. 2508.02605 null
2025-08-04 Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction Yuerong Song et.al. 2508.02558 null
2025-08-04 From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC Jingsong Liu et.al. 2508.02528 null
2025-08-06 xDeepServe: Model-as-a-Service on Huawei CloudMatrix384 Ao Xiao et.al. 2508.02520 null
2025-08-04 QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots Sheng Wu et.al. 2508.02512 null
2025-08-04 Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference Lars Dingeldein et.al. 2508.02509 null
2025-08-04 Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation Khoa Tuan Nguyen et.al. 2508.02482 null
2025-08-04 PoseGuard: Pose-Guided Generation with Safety Guardrails Kongxin Wang et.al. 2508.02476 null
2025-08-04 Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn$_3$Sn(0001) noncollinear antiferromagnetic films Surya N. Panda et.al. 2508.02415 null
2025-08-04 Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion Yimeng Liu et.al. 2508.02409 null
2025-08-04 Inference-time Scaling for Diffusion-based Audio Super-resolution Yizhu Jin et.al. 2508.02391 null
2025-08-04 Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction Matus Krajcovic et.al. 2508.02376 null
2025-08-04 Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory Marian Lupascu et.al. 2508.02363 null
2025-08-04 Qwen-Image Technical Report Chenfei Wu et.al. 2508.02324 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-05 LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training Sikui Zhang et.al. 2508.02308 null
2025-08-05 Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor Xiaoliu Guan et.al. 2508.02240 null
2025-08-04 Abstract Formulation of Mean-Field Models and Propagation of Chaos Tau Shean Lim et.al. 2508.02224 null
2025-08-04 A theory of strange metals Simone Fratini et.al. 2508.02221 null
2025-08-04 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Yuxuan Song et.al. 2508.02193 null
2025-08-04 DreamPainter: Image Background Inpainting for E-commerce Scenarios Sijie Zhao et.al. 2508.02155 null
2025-08-04 AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models Die Chen et.al. 2508.02151 null
2025-08-04 VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling Yuru Xiao et.al. 2508.02129 null
2025-08-04 AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation Zhiwen Li et.al. 2508.02107 null
2025-08-04 Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis Kaiyang Ji et.al. 2508.02106 null
2025-08-04 “Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch Yiqing Xu et.al. 2508.02093 null
2025-08-04 Unsupervised Multi-channel Speech Dereverberation via Diffusion Yulun Wu et.al. 2508.02071 null
2025-08-04 “Set It Up”: Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2508.02068 null
2025-08-04 StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion Haoxin Yang et.al. 2508.02056 null
2025-08-04 Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation Yuli Liu et.al. 2508.02050 null
2025-08-04 Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction Hui Xie et.al. 2508.02043 null
2025-08-04 Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging XuHao Yu et.al. 2508.02025 null
2025-08-04 Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths Le Tri Dat et.al. 2508.02024 null
2025-08-05 Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type Pierluigi Colli et.al. 2508.02021 null
2025-08-04 Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention Kyungmin Jo et.al. 2508.02004 null
2025-08-04 Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization Yu Lei et.al. 2508.02002 null
2025-08-04 Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids Toma Yoneya et.al. 2508.01991 null
2025-08-04 Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion Shutong Qiao et.al. 2508.01987 null
2025-08-04 Diffusion models for inverse problems Hyungjin Chung et.al. 2508.01975 null
2025-08-03 Distributed games with jumps: An $α$-potential game approach Xin Guo et.al. 2508.01929 null
2025-08-03 On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis Siamak Kazemzadeh Hannani et.al. 2508.01890 null
2025-08-03 DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization Siran Peng et.al. 2508.01873 null
2025-08-05 Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures Fanze Kong et.al. 2508.01854 null
2025-08-03 Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Yufei Zhang et.al. 2508.01835 null
2025-08-03 Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder Runxuan Yang et.al. 2508.01796 null
2025-08-03 Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus Peng Gao et.al. 2508.01794 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting Rui Ding et.al. 2508.01761 null
2025-08-03 Dynamic Coupling of Infiltration-Soil Moisture Feedback: Emergent Vegetation Patterns in a Water-Vegetation Model Juan Yan et.al. 2508.01755 null
2025-08-03 Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design Xiangwang Hou et.al. 2508.01745 null
2025-08-05 Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization Xin Ding et.al. 2508.01725 null
2025-08-03 ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models Haoyue Tan et.al. 2508.01719 null
2025-08-03 Versatile Transition Generation with Image-to-Video Diffusion Zuhao Yang et.al. 2508.01698 null
2025-08-03 DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing Yufeng Chi et.al. 2508.01684 null
2025-08-03 DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding Hanqing Wang et.al. 2508.01651 null
2025-08-03 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Na Zhang et.al. 2508.01650 null
2025-08-03 Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization Shoya Sasaki et.al. 2508.01640 null
2025-08-03 VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation Xuanran Zhai et.al. 2508.01622 null
2025-08-03 LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding Xuanzhao Dong et.al. 2508.01617 null
2025-08-03 TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data Yandong Yan et.al. 2508.01615 null
2025-08-03 Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models Haoran Dai et.al. 2508.01605 null
2025-08-03 Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment Lubin Gan et.al. 2508.01602 null
2025-08-03 CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation Sung-Wook Lee et.al. 2508.01600 null
2025-08-03 Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching Juyan Zhang et.al. 2508.01597 null
2025-08-03 A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation Hua Yu et.al. 2508.01590 null
2025-08-03 Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences Euihyun Kim et.al. 2508.01589 null
2025-08-03 Diffusion Models for Future Networks and Communications: A Comprehensive Survey Nguyen Cong Luong et.al. 2508.01586 null
2025-08-03 Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation Lei Xie et.al. 2508.01577 null
2025-08-03 Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature Xiao-Jie Wang et.al. 2508.01567 null
2025-08-03 MGCR-Net: Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection Chengming Wang et.al. 2508.01555 null
2025-08-02 A Reward-Directed Diffusion Framework for Generative Design Optimization Hadi Keramati et.al. 2508.01509 null
2025-08-02 Instruction-based Time Series Editing Jiaxing Qiu et.al. 2508.01504 null
2025-08-02 The role of zealots in the spread of linguistic traits Vivian Dornelas et.al. 2508.01500 null
2025-08-02 TreeDiff: AST-Guided Code Generation with Diffusion LLMs Yiming Zeng et.al. 2508.01473 null
2025-08-02 Regression Augmentation With Data-Driven Segmentation Shayan Alahyari et.al. 2508.01455 null
2025-08-02 Physically-based Lighting Augmentation for Robotic Manipulation Shutong Jin et.al. 2508.01442 null
2025-08-02 Viscosity Stabilized Plug-and-Play Reconstruction Arghya Sinha et.al. 2508.01441 null
2025-08-02 Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling Le Trong Thanh Bui et.al. 2508.01436 null
2025-08-02 Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas? Tarian Fu et.al. 2508.01408 null
2025-08-02 StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints Lingxiao Chen et.al. 2508.01335 null
2025-08-05 Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion Konstantinos Moutselos et.al. 2508.01334 null
2025-08-02 LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points Xuemiao Zhang et.al. 2508.01317 null
2025-08-02 CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis Alec Sargood et.al. 2508.01292 null
2025-08-02 PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation Zonglei Jing et.al. 2508.01272 null
2025-08-02 Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling Lexiao Zou et.al. 2508.01264 null
2025-08-02 NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection Jiazhen Yan et.al. 2508.01248 null
2025-08-02 Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model Jing Gao et.al. 2508.01246 null
2025-08-02 Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal Xiangqi Liu et.al. 2508.01241 null
2025-08-02 SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches Cheng Tan et.al. 2508.01237 null
2025-08-02 Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system Jiyong Kim et.al. 2508.01230 null
2025-08-02 StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling Yuanlin Yang et.al. 2508.01215 null
2025-08-02 Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory Nabin Upadhya Dhakal et.al. 2508.01194 null
2025-08-02 DELTAv2: Accelerating Dense 3D Tracking Tuan Duc Ngo et.al. 2508.01170 null
2025-08-02 RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots Jing Tang et.al. 2508.01165 null
2025-08-02 LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation Xinyu Yan et.al. 2508.01152 null
2025-08-02 Personalized Safety Alignment for Text-to-Image Diffusion Models Yu Lei et.al. 2508.01151 null
2025-08-02 Dataset Condensation with Color Compensation Huyu Wu et.al. 2508.01139 null
2025-08-01 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models Jinsong Li et.al. 2508.00819 null
2025-08-01 Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding Rui Chen et.al. 2508.00800 null
2025-08-01 Video Generators are Robot Policies Junbang Liang et.al. 2508.00795 null
2025-08-01 SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation Kien T. Pham et.al. 2508.00782 null
2025-08-01 Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data Timur Sattarov et.al. 2508.00758 null
2025-08-01 LeakyCLIP: Extracting Training Data from CLIP Yunhao Chen et.al. 2508.00756 null
2025-08-01 SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation Prerana Ramkumar et.al. 2508.00750 null
2025-08-01 AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation Le Wang et.al. 2508.00733 null
2025-08-01 YOLO-Count: Differentiable Object Counting for Text-to-Image Generation Guanning Zeng et.al. 2508.00728 null
2025-08-01 Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls Elisa Affili et.al. 2508.00713 null
2025-08-01 D3: Training-Free AI-Generated Video Detection Using Second-Order Features Chende Zheng et.al. 2508.00701 null
2025-08-01 On-Device Diffusion Transformer Policy for Efficient Robot Manipulation Yiming Wu et.al. 2508.00697 null
2025-08-01 Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network Young-ho Cho et.al. 2508.00692 null
2025-08-01 Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators Albert Matveev et.al. 2508.00643 null
2025-08-01 Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification Luisa Gallée et.al. 2508.00639 null
2025-08-01 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Junzhe Lu et.al. 2508.00599 null
2025-08-01 Wukong Framework for Not Safe For Work Detection in Text-to-Image systems Mingrui Liu et.al. 2508.00591 null
2025-08-01 Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints Jens U. Kreber et.al. 2508.00558 null
2025-08-01 DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification Chihan Huang et.al. 2508.00552 null
2025-08-01 Video Color Grading via Look-Up Table Generation Seunghyun Shin et.al. 2508.00548 null
2025-08-01 HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning Carlo Alessi et.al. 2508.00491 null
2025-08-01 LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer Yuzhuo Chen et.al. 2508.00477 null
2025-08-01 A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces Leonidas Akritidis et.al. 2508.00472 null
2025-08-01 Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution Yiwen Wang et.al. 2508.00471 null
2025-08-01 AutoDebias: Automated Framework for Debiasing Text-to-Image Models Hongyi Cai et.al. 2508.00445 null
2025-08-01 SDMatte: Grafting Diffusion Models for Interactive Matting Longfei Huang et.al. 2508.00443 null
2025-08-01 Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection Sumin Seo et.al. 2508.00438 null
2025-08-01 Accurate Latent Inversion for Generative Image Steganography via Rectified Flow Yuqi Qian et.al. 2508.00434 null
2025-08-01 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Nan Xiang et.al. 2508.00428 null
2025-08-01 Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Seunggeun Chi et.al. 2508.00427 null
2025-08-01 Collimated QED Cascades with Curved Plasma Mirror Xuesong Geng et.al. 2508.00417 null
2025-08-01 DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space Junyu Chen et.al. 2508.00413 null
2025-08-01 Sortblock: Similarity-Aware Feature Reuse for Diffusion Model Hanqi Chen et.al. 2508.00412 null
2025-08-01 Predictive information criterion for jump diffusion processes Yuma Uehara et.al. 2508.00411 null
2025-08-01 Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency Xi Xue et.al. 2508.00397 null
2025-08-01 Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization Yoonhyuk Choi et.al. 2508.00357 null
2025-08-01 BOOD: Boundary-based Out-Of-Distribution Data Generation Qilin Liao et.al. 2508.00350 null
2025-08-01 Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak SK Injamul Hoque et.al. 2508.00339 null
2025-08-01 Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems Surya Narayan Maharana et.al. 2508.00329 null
2025-08-01 Steering Guidance for Personalized Text-to-Image Diffusion Models Sunghyun Park et.al. 2508.00319 null
2025-08-01 GV-VAD: Exploring Video Generation for Weakly-Supervised Video Anomaly Detection Suhang Cai et.al. 2508.00312 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-08-01 Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence Danzhen Fu et.al. 2508.00299 null
2025-08-01 AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer Jin Lyu et.al. 2508.00298 null
2025-08-01 TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models Christian Simon et.al. 2508.00289 null
2025-08-01 UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents Jianqiang Xiao et.al. 2508.00288 null
2025-08-01 Towards Robust Semantic Correspondence: A Benchmark and Insights Wenyue Chong et.al. 2508.00272 null
2025-08-01 Jet Image Generation in High Energy Physics Using Diffusion Models Victor D. Martinez et.al. 2508.00250 null
2025-07-31 Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b Thomas Konings et.al. 2508.00177 null
2025-07-31 DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission Fupei Guo et.al. 2508.00172 null
2025-07-31 World Consistency Score: A Unified Metric for Video Generation Quality Akshat Rakheja et.al. 2508.00144 null
2025-07-31 Entanglement spreading and emergent locality in Brownian SYK chains Onkar Parrikar et.al. 2508.00060 null
2025-07-31 Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion Tong Nie et.al. 2508.00037 null
2025-07-31 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang et.al. 2507.23785 null
2025-07-31 SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions Jessica Bader et.al. 2507.23784 null
2025-07-31 General diffusions on metric graphs as limits of time-space Markov Chains Alexis Anagnostakis et.al. 2507.23724 null
2025-07-31 DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching Emery Pierson et.al. 2507.23715 null
2025-07-31 CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation Zhaoyue Xu et.al. 2507.23693 null
2025-07-31 UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration Zihan Cheng et.al. 2507.23685 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics Alexis Béjar-López et.al. 2507.23680 null
2025-07-31 DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data Rabeya Tus Sadia et.al. 2507.23676 null
2025-07-31 One-Step Flow Policy Mirror Descent Tianyi Chen et.al. 2507.23675 null
2025-07-31 Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis Kunpeng Qiu et.al. 2507.23652 null
2025-07-31 A stochastic heat equation with non-locally Lipschitz coefficients Le Chen et.al. 2507.23637 null
2025-07-31 DivControl: Knowledge Diversion for Controllable Image Generation Yucheng Xie et.al. 2507.23620 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization Michael L. Li et.al. 2507.23576 null
2025-08-01 H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation Hongzhe Bi et.al. 2507.23523 null
2025-07-31 Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings K. V. Nikolaev et.al. 2507.23513 null
2025-07-31 Emergence of long-range non-equilibrium correlations in free liquid diffusion Marco Bussoletti et.al. 2507.23507 null
2025-07-31 Digital literacy interventions can boost humans in discerning deepfakes Dominique Geissler et.al. 2507.23492 null
2025-07-31 Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Mutian Xu et.al. 2507.23483 null
2025-07-31 Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models Long Chen et.al. 2507.23443 null
2025-07-31 Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories Lemar Abdi et.al. 2507.23411 null
2025-07-31 An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients Yuan-Yuan Huang et.al. 2507.23408 null
2025-07-31 UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries Yijie Zhu et.al. 2507.23372 null
2025-07-31 IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025 Radu-Andrei Bourceanu et.al. 2507.23357 null
2025-07-31 Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads Yingjie Zhou et.al. 2507.23343 null
2025-07-31 EMU and the DRAGNs I: A Catalogue of DRAGNs Ray P. Norris et.al. 2507.23337 null
2025-07-31 Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions Kristen C. Dage et.al. 2507.23332 null
2025-07-31 The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models Alfio Ferrara et.al. 2507.23313 null
2025-07-31 PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving Xuewei Tang et.al. 2507.23309 null
2025-08-01 Training-free Geometric Image Editing on Diffusion Models Hanshen Zhu et.al. 2507.23300 null
2025-07-31 UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing Hao Tang et.al. 2507.23278 null
2025-07-31 PixNerd: Pixel Neural Field Diffusion Shuai Wang et.al. 2507.23268 null
2025-07-31 Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas Lei Xie et.al. 2507.23245 null
2025-07-31 BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks Zhuoyin Dai et.al. 2507.23236 null
2025-07-31 Adversarial-Guided Diffusion for Multimodal LLM Attacks Chengwei Xia et.al. 2507.23202 null
2025-07-30 X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention Xiaochen Zhao et.al. 2507.23143 null
2025-07-30 Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations Jin Kunwoo Lee et.al. 2507.23102 null
2025-07-30 Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems Jonathan Monsalve et.al. 2507.23065 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-07-30 Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube Alejandra Granados et.al. 2507.23040 null
2025-07-30 Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction Giuseppe Cartella et.al. 2507.23021 null
2025-07-30 Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods Siwoo Park et.al. 2507.23010 null
2025-07-30 LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis Jamil Fayyad et.al. 2507.23001 null
2025-07-29 Neural Autoregressive Modeling of Brain Aging Ridvan Yesiloglu et.al. 2507.22954 null
2025-07-30 AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS Hai Ling et.al. 2507.22880 null
2025-07-30 Robust Contract with Career Concerns Tan Gan et.al. 2507.22852 null
2025-07-30 Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication Yidong Ren et.al. 2507.22851 null
2025-07-30 DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Qingcheng Zhao et.al. 2507.22825 null
2025-07-30 Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit Md. Sad Abdullah Sami et.al. 2507.22803 null
2025-07-31 G-Core: A Simple, Scalable and Balanced RLHF Trainer Junyu Wu et.al. 2507.22789 null
2025-07-30 DO-EM: Density Operator Expectation Maximization Adit Vishnu et.al. 2507.22786 null
2025-08-01 Next Tokens Denoising for Speech Synthesis Yanqing Liu et.al. 2507.22746 null
2025-07-30 Zero-Shot Image Anomaly Detection Using Generative Foundation Models Lemar Abdi et.al. 2507.22692 null
2025-07-30 LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing Federico Girella et.al. 2507.22627 null
2025-07-30 Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions Yiting Qu et.al. 2507.22617 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning Xiefan Guo et.al. 2507.22604 null
2025-07-30 Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice Aaqib Zahoor et.al. 2507.22589 null
2025-07-30 DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement Chang Huang et.al. 2507.22501 null
2025-07-30 LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning Xiang Li et.al. 2507.22499 null
2025-07-30 Visual Language Models as Zero-Shot Deepfake Detectors Viacheslav Pirogov et.al. 2507.22469 null
2025-07-30 TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation Jiuming Liu et.al. 2507.22454 null
2025-07-30 GVD: Guiding Video Diffusion Model for Scalable Video Distillation Kunyang Li et.al. 2507.22360 null
2025-07-29 Trade-offs in Image Generation: How Do Different Dimensions Interact? Sicheng Zhang et.al. 2507.22100 null
2025-07-29 X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Zigang Geng et.al. 2507.22058 null
2025-07-30 See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs Ziyun Dai et.al. 2507.22003 null
2025-07-29 Enhancing Generalization in Data-free Quantization via Mixup-class Prompting Jiwoong Park et.al. 2507.21947 null
2025-07-29 Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is Ahmed B Mustafa et.al. 2507.21820 null
2025-07-29 Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection Yanxing Liu et.al. 2507.21816 null
2025-07-29 MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE Junzhe Li et.al. 2507.21802 null
2025-07-29 APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing Sangmin Han et.al. 2507.21690 null
2025-07-29 GuidPaint: Class-Guided Image Inpainting with Diffusion Models Qimin Wang et.al. 2507.21627 null
2025-07-29 Locally Controlled Face Aging with Latent Diffusion Models Lais Isabelle Alves dos Santos et.al. 2507.21600 null
2025-07-29 Neural network enabled wide field-of-view imaging with hyperbolic metalenses Joel Yeo et.al. 2507.21562 null
2025-07-29 Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance Mengling Xu et.al. 2507.21529 null
2025-07-29 BANG: Dividing 3D Assets via Generative Exploded Dynamics Longwen Zhang et.al. 2507.21493 null
2025-07-29 Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training Sodtavilan Odonchimed et.al. 2507.21452 null
2025-07-30 Multimodal LLMs as Customized Reward Models for Text-to-Image Generation Shijie Zhou et.al. 2507.21391 null
2025-07-28 Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation I-Hsiang Chen et.al. 2507.21367 null
2025-07-28 A Contrastive Diffusion-based Network (CDNet) for Time Series Classification Yaoyu Zhang et.al. 2507.21357 null
2025-07-28 HDR Environment Map Estimation with Latent Diffusion Models Jack Hilliard et.al. 2507.21261 null
2025-07-28 Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors Amartya Banerjee et.al. 2507.21260 null
2025-07-28 Learning from Limited and Imperfect Data Harsh Rangwani et.al. 2507.21205 null
2025-08-01 Flow Matching Policy Gradients David McAllister et.al. 2507.21053 null
2025-07-29 JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1 Xinhan Di et.al. 2507.20987 null
2025-07-28 Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision Xiao Fang et.al. 2507.20976 null

Industry

Publish Date Title Authors PDF Code
2025-10-07 MadNCL: A GPU Implementation of Algorithm NCL for Large-Scale, Degenerate Nonlinear Programs Alexis Montoison et.al. 2510.05885 null
2025-10-07 TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation Adam Filipek et.al. 2510.05485 null
2025-10-06 Mixed-precision ab initio tensor network state methods adapted for NVIDIA Blackwell technology via emulated FP64 arithmetic Cole Brower et.al. 2510.04795 null
2025-10-06 Bio-Inspired Robotic Houbara: From Development to Field Deployment for Behavioral Studies Lyes Saad Saoud et.al. 2510.04692 null
2025-10-06 Fast Witness Persistence for MRI Volumes via Hybrid Landmarking Jorge Leonardo Ruiz Williams et.al. 2510.04553 null
2025-10-05 RAP: 3D Rasterization Augmented End-to-End Planning Lan Feng et.al. 2510.04333 null
2025-10-05 ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation Jay Zhangjie Wu et.al. 2510.04290 null
2025-10-05 Replacing Softmax Similarity with a Sharpened Angular Similarity: Theory and Practice of Scaling To Billion-Context Attention Sahil Joshi et.al. 2510.04008 null
2025-10-04 Datacenter Energy Optimized Power Profiles Sreedhar Narayanaswamy et.al. 2510.03872 null
2025-09-29 Convolutional Neural Nets vs Vision Transformers: A SpaceNet Case Study with Balanced vs Imbalanced Regimes Akshar Gothi et.al. 2510.03297 null
2025-09-28 MACE: A Hybrid LLM Serving System with Colocated SLO-aware Continuous Retraining Alignment Yufei Li et.al. 2510.03283 null
2025-10-03 On the energy efficiency of sparse matrix computations on multi-GPU clusters Massimo Bernaschi et.al. 2510.02878 null
2025-10-03 Accelerating cosmological simulations on GPUs: a portable approach using OpenMP M. D. Lepinzan et.al. 2510.02873 null
2025-10-03 Sequence-Preserving Dual-FoV Defense for Traffic Sign and Light Recognition in Autonomous Vehicles Abhishek Joshi et.al. 2510.02642 null
2025-10-03 microJAX: A Differentiable Framework for Microlensing Modeling with GPU-Accelerated Image-Centered Ray Shooting Shota Miyazaki et.al. 2510.02639 null
2025-10-02 SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting Sung-Yeon Park et.al. 2510.02469 null
2025-10-02 Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities Mario Medrano-Paredes et.al. 2510.02264 null
2025-10-02 Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving Haibo Hu et.al. 2510.01795 null
2025-10-02 Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis Ashiyana Abdul Majeed et.al. 2510.01730 null
2025-10-02 MMGaP: Multi-User MIMO Detection and Precoding using GPU-assisted Physics-inspired Computation Abhishek Kumar Singh et.al. 2510.01579 null
2025-10-02 NVIDIA AI Aerial: AI-Native Wireless Communications Kobi Cohen-Arazi et.al. 2510.01533 null
2025-10-01 ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models Akshat Ramachandran et.al. 2510.01290 null
2025-10-01 Sentry: Authenticating Machine Learning Artifacts on the Fly Andrew Gan et.al. 2510.00554 null
2025-10-01 A Deep Learning Pipeline for Epilepsy Genomic Analysis Using GPT-2 XL and NVIDIA H100 Muhammad Omer Latif et.al. 2510.00392 null
2025-09-30 TASP: Topology-aware Sequence Parallelism Yida Wang et.al. 2509.26541 null
2025-09-30 Benchmarking Deep Learning Convolutions on Energy-constrained CPUs Enrique Galvez et.al. 2509.26217 null
2025-09-30 NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving Yuan Gao et.al. 2509.25944 null
2025-09-30 SAIL: SRAM-Accelerated LLM Inference System with Lookup-Table-based GEMV Jingyao Zhang et.al. 2509.25853 null
2025-09-24 AMLA: MUL by ADD in FlashAttention Rescaling Qichen Liao et.al. 2509.25224 null
2025-09-29 DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder Junyu Chen et.al. 2509.25182 null
2025-10-01 DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space Wenkun He et.al. 2509.25180 null
2025-09-30 YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection Ranjan Sapkota et.al. 2509.25164 null
2025-09-29 Pretraining Large Language Models with NVFP4 NVIDIA et.al. 2509.25149 null
2025-09-29 ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation Jiuhong Xiao et.al. 2509.24878 null
2025-09-29 SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Junsong Chen et.al. 2509.24695 null
2025-09-28 Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning Muleilan Pei et.al. 2509.23993 null
2025-09-28 VFSI: Validity First Spatial Intelligence for Constraint-Guided Traffic Diffusion Kargi Chauhan et.al. 2509.23971 null
2025-09-28 Learning Adaptive Pseudo-Label Selection for Semi-Supervised 3D Object Detection Taehun Kong et.al. 2509.23880 null
2025-09-28 FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention Hangtian Zhao et.al. 2509.23733 null
2025-09-28 Performance and Numerical Aspects of Decompositional Factorizations with FP64 Floating-Point Emulation in INT8 Piotr Luszczek et.al. 2509.23565 null
2025-09-27 Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization Vage Egiazarian et.al. 2509.23202 null
2025-09-26 Tiny-QMoE Jack Cashman et.al. 2509.22951 null
2025-09-26 Self-driving cars: Are we there yet? Merve Atasever et.al. 2509.22754 null
2025-09-18 VIRTUS-FPP: Virtual Sensor Modeling for Fringe Projection Profilometry in NVIDIA Isaac Sim Adam Haroon et.al. 2509.22685 null
2025-09-17 FLAME: A Serving System Optimized for Large-Scale Generative Recommendation with Efficiency Xianwen Guo et.al. 2509.22681 null
2025-09-26 LongLive: Real-time Interactive Long Video Generation Shuai Yang et.al. 2509.22622 null
2025-09-26 Lightweight error mitigation strategies for post-training N:M activation sparsity in LLMs Shirin Alanova et.al. 2509.22166 null
2025-09-25 XenoFlow: How Fast Can a SmartNIC-Based DNS Load Balancer Run? Max Schrötter et.al. 2509.21656 null
2025-09-25 SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchips Xinyu Lian et.al. 2509.21271 null
2025-09-25 Mojo: MLIR-Based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem William F. Godoy et.al. 2509.21039 null
2025-09-24 FlyTrap: Physical Distance-Pulling Attack Towards Camera-based Autonomous Target Tracking Systems Shaoyuan Xie et.al. 2509.20362 null
2025-09-24 A Comprehensive Evaluation of YOLO-based Deer Detection Performance on Edge Devices Bishal Adhikari et.al. 2509.20318 null
2025-09-24 Fulcrum: Optimizing Concurrent DNN Training and Inferencing on Edge Accelerators Prashanthi S. K. et.al. 2509.20205 null
2025-09-24 Pagoda: An Energy and Time Roofline Study for DNN Workloads on Edge Accelerators Prashanthi S. K. et.al. 2509.20189 null
2025-09-24 Characterizing the Performance of Accelerated Jetson Edge Devices for Training Deep Learning Models Prashanthi S. K. et.al. 2509.20160 null
2025-09-24 Games Are Not Equal: Classifying Cloud Gaming Contexts for Effective User Experience Measurement Yifan Wang et.al. 2509.19669 null
2025-09-23 Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Sherwin Bahmani et.al. 2509.19296 null
2025-09-23 Scheduler-Driven Job Atomization Michal Konopa et.al. 2509.19086 null
2025-09-23 Beyond Backpropagation: Exploring Innovative Algorithms for Energy-Efficient Deep Neural Network Training Przemysław Spyra et.al. 2509.19063 null
2025-09-23 3D Blocking for Matrix-free Smoothers in 2D Variable-Viscosity Stokes Equations with Applications to Geodynamics Marcel Ferrari et.al. 2509.19061 null
2025-09-23 Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs Marcin Chrapek et.al. 2509.18886 null
2025-09-26 APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation Yuzhen Zhou et.al. 2509.18521 null
2025-09-22 RL-augmented Adaptive Model Predictive Control for Bipedal Locomotion over Challenging Terrain Junnosuke Kamohara et.al. 2509.18466 null
2025-09-22 Robotic Skill Diversification via Active Mutation of Reward Functions in Reinforcement Learning During a Liquid Pouring Task Jannick van Buuren et.al. 2509.18463 null
2025-09-19 TinyEcoWeedNet: Edge Efficient Real-Time Aerial Agricultural Weed Detection Omar H. Khater et.al. 2509.18193 null
2025-09-22 AERO-MPPI: Anchor-Guided Ensemble Trajectory Optimization for Agile Mapless Drone Navigation Xin Chen et.al. 2509.17340 null
2025-09-21 PMRT: A Training Recipe for Fast, 3D High-Resolution Aerodynamic Prediction Sam Jacob Jacob et.al. 2509.17182 null
2025-09-19 WarpSpeed: A High-Performance Library for Concurrent GPU Hash Tables Hunter McCoy et.al. 2509.16407 null
2025-09-19 Neural Atlas Graphs for Dynamic Scene Decomposition and Editing Jan Philipp Schneider et.al. 2509.16336 null
2025-09-24 The Role of High-Performance GPU Resources in Large Language Model Based Radiology Imaging Diagnosis Jyun-Ping Kao et.al. 2509.16328 null
2025-09-17 GraphMend: Code Transformations for Fixing Graph Breaks in PyTorch 2 Savini Kashmira et.al. 2509.16248 null
2025-09-19 A Memory Efficient Adjoint Method to Enable Billion Parameter Optimization on a Single GPU in Dynamic Problems Leon Herrmann et.al. 2509.15744 null
2025-09-19 KoopCast: Trajectory Forecasting via Koopman Operators Jungjin Lee et.al. 2509.15513 null
2025-09-18 Accelerating Garfield++ with CUDA T. Neep et.al. 2509.15377 null
2025-09-18 Efficient 3D Perception on Embedded Systems via Interpolation-Free Tri-Plane Lifting and Volume Fusion Sibaek Lee et.al. 2509.14641 null
2025-09-17 An RDMA-First Object Storage System with SmartNIC Offload Yu Zhu et.al. 2509.13997 null
2025-09-17 SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation Jiayi Pan et.al. 2509.13848 null
2025-09-16 Testing and benchmarking emerging supercomputers via the MFC flow solver Benjamin Wilfong et.al. 2509.13575 null
2025-09-16 Real-Time Detection and Tracking of Foreign Object Intrusions in Power Systems via Feature-Based Edge Intelligence Xinan Wang et.al. 2509.13396 null
2025-09-16 HPIM: Heterogeneous Processing-In-Memory-based Accelerator for Large Language Models Inference Cenlin Duan et.al. 2509.12993 null
2025-09-07 Profiling LoRA/QLoRA Fine-Tuning Efficiency on Consumer GPUs: An RTX 4060 Case Study MSR Avinash et.al. 2509.12229 null
2025-09-15 Advanced Layout Analysis Models for Docling Nikolaos Livathinos et.al. 2509.11720 null
2025-09-15 HeLoFusion: An Efficient and Scalable Encoder for Modeling Heterogeneous and Multi-Scale Interactions in Trajectory Prediction Bingqing Wei et.al. 2509.11719 null
2025-09-13 PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint Bhoomit Vasani et.al. 2509.10971 null
2025-09-19 Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions Sajjad Abdoli et.al. 2509.10707 null
2025-09-12 MCBP: A Memory-Compute Efficient LLM Inference Accelerator Leveraging Bit-Slice-enabled Sparsity and Repetitiveness Huizheng Wang et.al. 2509.10372 null
2025-09-19 Characterizing the Efficiency of Distributed Training: A Power, Performance, and Thermal Perspective Seokjin Go et.al. 2509.10371 null
2025-09-12 Ruggedized Ultrasound Sensing in Harsh Conditions: eRTIS in the wild Dennis Laurijssen et.al. 2509.10029 null
2025-09-10 Rapid Manufacturing of Lightweight Drone Frames Using Single-Tow Architected Composites Md Habib Ullah Khan et.al. 2509.09024 null
2025-09-03 Silent Until Sparse: Backdoor Attacks on Semi-Structured Sparsity Wei Guo et.al. 2509.08747 null
2025-09-10 Compressing CNN models for resource-constrained systems by channel and layer pruning Ahmed Sadaqa et.al. 2509.08714 null
2025-09-09 Attribute-based Object Grounding and Robot Grasp Detection with Spatial Reasoning Houjian Yu et.al. 2509.08126 null
2025-09-09 MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection Saad Lahlali et.al. 2509.07507 null
2025-09-09 Network-accelerated Active Messages Md Ashfaqur Rahaman et.al. 2509.07431 null
2025-09-06 3DPillars: Pillar-based two-stage 3D object detection Jongyoun Noh et.al. 2509.05780 null
2025-09-06 SpecPrune-VLA: Accelerating Vision-Language-Action Models via Action-Aware Self-Speculative Pruning Hanzhen Wang et.al. 2509.05614 null
2025-09-05 Characterizing and Optimizing Realistic Workloads on a Commercial Compute-in-SRAM Device Niansong Zhang et.al. 2509.05451 null
2025-09-05 SpikingBrain Technical Report: Spiking Brain-inspired Large Models Yuqi Pan et.al. 2509.05276 null
2025-09-04 Guideline-Consistent Segmentation via Multi-Agent Refinement Vanshika Vats et.al. 2509.04687 null
2025-09-04 A Highly Scalable TDMA for GPUs and Its Application to Flow Solver Optimization Seungchan Kim et.al. 2509.03933 null
2025-09-04 Real-Time Buoyancy Estimation for AUV Simulations Using Convex Hull-Based Submerged Volume Calculation Ad-Deen Mahbub et.al. 2509.03804 null
2025-09-03 LuxDiT: Lighting Estimation with Video Diffusion Transformer Ruofan Liang et.al. 2509.03680 null
2025-09-06 Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning Antonio Guillen-Perez et.al. 2509.03658 null
2025-09-03 Combining Performance and Productivity: Accelerating the Network Sensing Graph Challenge with GPUs and Commodity Data Science Software Siddharth Samsi et.al. 2509.03653 null
2025-09-03 Can the Waymo Open Motion Dataset Support Realistic Behavioral Modeling? A Validation Study with Naturalistic Trajectories Yanlin Zhang et.al. 2509.03515 null
2025-09-03 Harnessing Batched BLAS/LAPACK Kernels on GPUs for Parallel Solutions of Block Tridiagonal Systems David Jin et.al. 2509.03015 null
2025-09-02 Do LLM Modules Generalize? A Study on Motion Generation for Autonomous Driving Mingyi Wang et.al. 2509.02754 null
2025-09-02 LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference Krishna Teja Chitty-Venkata et.al. 2509.02753 null
2025-09-02 HydroGAT: Distributed Heterogeneous Graph Attention Transformer for Spatiotemporal Flood Prediction Aishwarya Sarkar et.al. 2509.02481 null
2025-09-02 AutoDrive-R$^2$: Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving Zhenlong Yuan et.al. 2509.01944 null
2025-09-01 PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point Clouds Liu Qifeng et.al. 2509.01487 null
2025-09-01 LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving Huanqi Hu et.al. 2509.01229 null
2025-09-30 Metis: Training LLMs with FP4 Quantization Hengjie Cao et.al. 2509.00404 null
2025-08-27 More than Carbon: Cradle-to-Grave environmental impacts of GenAI training on the Nvidia A100 GPU Sophia Falk et.al. 2509.00093 null
2025-08-29 FLORA: Efficient Synthetic Data Generation for Object Detection in Low-Data Regimes via finetuning Flux LoRA Alvaro Patricio et.al. 2508.21712 null
2025-09-01 $\Delta$-Motif: Subgraph Isomorphism at Scale via Data-Centric Parallelism Yulun Wang et.al. 2508.21287 null
2025-09-21 GPU-acceleration of the Discontinuous Galerkin Shallow Water Equations Model (DG-SWEM) with OpenACC Chayanon Wichitrnithed et.al. 2508.21208 null
2025-08-28 Leveraging Generative Models for Real-Time Query-Driven Text Summarization in Large-Scale Web Search Zeyu Xiong et.al. 2508.20559 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-28 MedFoundationHub: A Lightweight and Secure Toolkit for Deploying Medical Vision Language Foundation Models Xiao Li et.al. 2508.20345 null
2025-08-26 APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration Shaobo Ma et.al. 2508.19087 null
2025-08-26 TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency Qianpeng Li et.al. 2508.18961 null
2025-08-26 ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive Xinhao Luo et.al. 2508.18850 null
2025-08-26 Strata: Hierarchical Context Caching for Long Context Language Model Serving Zhiqiang Xie et.al. 2508.18572 null
2025-08-25 Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators Ritvik Chaturvedi et.al. 2508.18206 null
2025-08-24 A Synthetic Dataset for Manometry Recognition in Robotic Applications Pedro Antonio Rabelo Saraiva et.al. 2508.17468 null
2025-08-24 MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models Krishna Teja Chitty-Venkata et.al. 2508.17467 null
2025-08-23 DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method Qingwen Zhang et.al. 2508.17054 null
2025-08-23 A Novel Local Focusing Mechanism for Deepfake Detection Generalization Mingliang Li et.al. 2508.17029 null
2025-08-31 GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI’s Open-Weight Mixture of Experts Model Deepak Kumar et.al. 2508.16700 null
2025-08-17 GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems Louie Sinadjan et.al. 2508.16639 null
2025-08-22 GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving Qunyou Liu et.al. 2508.16449 null
2025-08-22 Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars NVIDIA et.al. 2508.16401 null
2025-08-27 Hybrid Classical-Quantum Supercomputing: A demonstration of a multi-user, multi-QPU and multi-GPU environment Mateusz Slysz et.al. 2508.16297 null
2025-08-22 Bare-Metal RISC-V + NVDLA SoC for Efficient Deep Learning Inference Vineet Kumar et.al. 2508.16095 null
2025-08-22 A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection Qifeng Liu et.al. 2508.16069 null
2025-08-21 graph framework: A Domain Specific Compiler for Building Physics Applications M. Cianciosa et.al. 2508.15967 null
2025-08-17 Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations Mauro Belgiovine et.al. 2508.15816 null
2025-09-21 DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians Cong Wang et.al. 2508.15376 null
2025-08-20 Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds Jia Lu et.al. 2508.14892 null
2025-08-20 Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach Qiao Zhang et.al. 2508.14848 null
2025-09-10 Memory-Anchored Multimodal Reasoning for Explainable Video Forensics Chen Chen et.al. 2508.14581 null
2025-09-02 NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model NVIDIA et.al. 2508.14444 null
2025-08-19 The 9th AI City Challenge Zheng Tang et.al. 2508.13564 null
2025-08-18 Optimizing Allreduce Operations for Heterogeneous Architectures with Multiple Processes per GPU Michael Adams et.al. 2508.13397 null
2025-08-18 X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms Yueming Yuan et.al. 2508.13337 null
2025-07-28 Sustainable AI Training via Hardware-Software Co-Design on NVIDIA, AMD, and Emerging GPU Architectures Yashasvi Makin et.al. 2508.13163 null
2025-08-18 CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction Zhiwei Ning et.al. 2508.12917 null
2025-08-17 CarelessWhisper: Turning Whisper into a Causal Streaming Model Tomer Krichli et.al. 2508.12301 null
2025-08-17 TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform Jun Liu et.al. 2508.12279 null
2025-08-17 ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search Mauro Belgiovine et.al. 2508.12204 null
2025-08-16 Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization Kousuke Nakano et.al. 2508.12033 null
2025-08-18 Visual Perception Engine: Fast and Flexible Multi-Head Inference for Robotic Vision Tasks Jakub Łucki et.al. 2508.11584 null
2025-08-15 Efficient GPU-Centered Singular Value Decomposition Using the Divide-and-Conquer Method Shifang Liu et.al. 2508.11467 null
2025-08-15 Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking Haonan Zhang et.al. 2508.11323 null
2025-08-14 EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training Hasibul Jamil et.al. 2508.11035 null
2025-08-12 ViPE: Video Pose Engine for 3D Geometric Perception Jiahui Huang et.al. 2508.10934 null
2025-08-13 GPU accelerated MHD in the DISPATCH framework using directive-based programming Michael Haahr et.al. 2508.09568 null
2025-08-13 UWBa at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval Ladislav Lenc et.al. 2508.09517 null
2025-08-13 Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving Guangxun Zhu et.al. 2508.09404 null
2025-08-07 Camel: Energy-Aware LLM Inference on Resource-Constrained Devices Hao Xu et.al. 2508.09173 null
2025-08-12 Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective Afsara Benazir et.al. 2508.08531 null
2025-08-11 Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson – Extended Abhinaba Chakraborty et.al. 2508.08430 null
2025-08-10 Weather-Driven Agricultural Decision-Making Using Digital Twins Under Imperfect Conditions Tamim Ahmed et.al. 2508.08326 null
2025-08-11 Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions Bangsheng Tang et.al. 2508.08192 null
2025-08-11 TLV-HGNN: Thinking Like a Vertex for Memory-efficient HGNN Inference Dengke Han et.al. 2508.07796 null
2025-08-10 An Experimental Exploration of In-Memory Computing for Multi-Layer Perceptrons Pedro Carrinho et.al. 2508.07317 null
2025-09-06 The Fused Kernel Library: A C++ API to Develop Highly-Efficient GPU Libraries Oscar Amoros et.al. 2508.07071 null
2025-08-27 From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving Antonio Guillen-Perez et.al. 2508.07029 null
2025-08-09 A Portable Multi-GPU Solver for Collisional Plasmas with Coulombic Interactions James Almgren-Bell et.al. 2508.06771 null
2025-08-02 PiKV: KV Cache Management System for Mixture of Experts Dong Liu et.al. 2508.06526 null
2025-08-08 MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows Xiquan Li et.al. 2508.06098 null
2025-08-07 CleanUpBench: Embodied Sweeping and Grasping Benchmark Wenbo Li et.al. 2508.05543 null
2025-08-07 MedMambaLite: Hardware-Aware Mamba for Medical Image Classification Romina Aalishah et.al. 2508.05049 null
2025-08-07 CSRAP: Enhanced Canvas Attention Scheduling for Real-Time Mission Critical Perception Md Iftekharul Islam Sakib et.al. 2508.04976 null
2025-08-07 Real-Time Doppler and Ionospheric Dispersion Correction Techniques for Arbitrary Waveforms Utilizing GPU Compute Daniel J. Vickers et.al. 2508.04951 null
2025-08-05 AIC CTU@FEVER 8: On-premise fact checking through long context RAG Herbert Ullrich et.al. 2508.04390 null
2025-08-06 A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks Kun Gui et.al. 2508.04316 null
2025-08-11 Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems Luai Abuelsamen et.al. 2508.04146 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-09-04 Understanding the Landscape of Ampere GPU Memory Errors Zhu Zhu et.al. 2508.03513 null
2025-08-05 Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning Osama Mohammed et.al. 2508.03251 null
2025-08-04 MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models Wenyuan Liu et.al. 2508.02343 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis Yuzhuang Xu et.al. 2508.02322 null
2025-08-04 GPU in the Blind Spot: Overlooked Security Risks in Transportation Sefatun-Noor Puspa et.al. 2508.01995 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-02 A Parallel Algorithm for Finding Robust Spanners in Large Social Networks Arindam Khanda et.al. 2508.01485 null
2025-08-01 Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection Cheng-You Lu et.al. 2508.01014 null
2025-08-01 Optimal Scheduling Algorithms for LLM Inference: Theory and Practice Agrim Bari et.al. 2508.01002 null
2025-07-29 Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling Rajeev Patwari et.al. 2508.00904 null
2025-08-12 Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving Stefan Englmeier et.al. 2508.00589 null
2025-08-09 DGEMM without FP64 Arithmetic – Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme Daichi Mukunoki et.al. 2508.00441 null
2025-08-01 On Learning Closed-Loop Probabilistic Multi-Agent Simulator Juanwu Lu et.al. 2508.00384 null
2025-08-01 Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization Belman Jahir Rodriguez et.al. 2508.00307 null
2025-07-31 FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Donghyun Lee et.al. 2507.23480 null
2025-07-31 InterfO-RAN: Real-Time In-band Cellular Uplink Interference Detection with GPU-Accelerated dApps Neagin Neasamoni Santhi et.al. 2507.23177 null
2025-07-30 On the Sustainability of AI Inferences in the Edge Ghazal Sobhani et.al. 2507.23093 null
2025-07-30 Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving Santosh Patapati et.al. 2507.23042 null
2025-07-28 Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery Deepak Joshi et.al. 2507.20680 null
2025-07-27 SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer Pansharpening Zeyu Xia et.al. 2507.20311 null
2025-07-26 Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures Mufakir Qamar Ansari et.al. 2507.20063 null
2025-07-26 A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling Louis Sugy et.al. 2507.19926 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-07-25 TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability Mohammad Aflah Khan et.al. 2507.19419 null
2025-07-25 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences Yusuke Hirota et.al. 2507.19362 null
2025-07-25 SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models Zhen Wan et.al. 2507.19361 null
2025-07-25 High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins Lorenzo Cazzella et.al. 2507.19173 null
2025-07-24 SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time Yun Chen et.al. 2507.18713 null
2025-07-24 Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping Chong Cheng et.al. 2507.18541 null
2025-07-24 Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++ Giulio Malenza et.al. 2507.18268 null
2025-07-26 MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation Zhongzhen Wen et.al. 2507.17773 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-24 Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners Kostas Karakontis et.al. 2507.17519 null
2025-07-25 HuNavSim 2.0: An Enhanced Human Navigation Simulator for Human-Aware Robot Navigation Miguel Escudero-Jiménez et.al. 2507.17317 null
2025-07-23 GPU Benchmark through QPE Emulator with cuQuantum for Practical Quantum Applications Takaki Akiba et.al. 2507.17175 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 Model Compression Engine for Wearable Devices Skin Cancer Diagnosis Jacob M. Delgado-López et.al. 2507.17125 null
2025-07-23 Computer Vision for Real-Time Monkeypox Diagnosis on Embedded Systems Jacob M. Delgado-López et.al. 2507.17123 null
2025-07-22 Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems Imran Latif et.al. 2507.16781 null
2025-07-22 AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase Andrei-Leonard Nicusan et.al. 2507.16710 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-21 MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition Hanwen Liu et.al. 2507.15914 null
2025-07-30 GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis Guoxi Liu et.al. 2507.15230 null
2025-07-19 Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall Shayan Rokhva et.al. 2507.14662 null
2025-07-16 GPU-Accelerated Interpretable Generalization for Rapid Cyberattack Detection and Forensics Shu-Ting Huang et.al. 2507.14222 null
2025-08-12 CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Xiaoya Li et.al. 2507.14111 null
2025-07-23 Photonic Fabric Platform for AI Accelerators Jing Ding et.al. 2507.14000 null
2025-07-18 Leveraging Multi-Instance GPUs through moldable task scheduling Jorge Villarrubia et.al. 2507.13601 null
2025-07-17 Performance Portable Gradient Computations Using Source Transformation Kim Liegeois et.al. 2507.13204 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 HyDRA: A Hybrid Dual-Mode Network for Closed- and Open-Set RFFI with Optimized VMD Hanwen Liu et.al. 2507.12133 null
2025-07-16 PoTPTQ: A Two-step Power-of-Two Post-training for LLMs Xinyu Wang et.al. 2507.11959 null
2025-07-15 MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving Ruihao Li et.al. 2507.11507 null
2025-07-15 MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit Yinuo Wang et.al. 2507.11067 null
2025-07-15 Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems Sehyun Ryu et.al. 2507.11064 null
2025-07-15 Modernizing CNN-based Weather Forecast Model towards Higher Computational Efficiency Minjong Cheon et.al. 2507.10893 null
2025-07-21 Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks Aaron Jarmusch et.al. 2507.10789 null
2025-07-14 A Benchmarking Framework for AI models in Automotive Aerodynamics Kaustubh Tangsali et.al. 2507.10747 null
2025-07-14 Quantize-then-Rectify: Efficient VQ-VAE Training Borui Zhang et.al. 2507.10547 null
2025-07-30 Designing quantum chemistry algorithms with just-in-time compilation Xiaojie Wu et.al. 2507.09772 null
2025-07-13 GeoWarp: An automatically differentiable and GPU-accelerated implicit MPM framework for geomechanics based on NVIDIA Warp Yidong Zhao et.al. 2507.09435 null
2025-07-12 Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering Shucheng Kang et.al. 2507.09165 null
2025-07-10 Vidyut3d: a GPU accelerated fluid solver for non-equilibrium plasmas on adaptive grids Hariswaran Sitaraman et.al. 2507.08200 null
2025-07-10 GPUHammer: Rowhammer Attacks on GPU Memories are Practical Chris S. Lin et.al. 2507.08166 null
2025-07-03 Collective Communication Profiling of Modern-day Machine Learning Workloads Jit Gupta et.al. 2507.07117 null
2025-07-09 StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception Marcel Vosshans et.al. 2507.06687 null
2025-07-09 EA: An Event Autoencoder for High-Speed Vision Sensing Riadul Islam et.al. 2507.06459 null
2025-07-08 CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation Kushal Gajjar et.al. 2507.06013 null
2025-07-07 Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Mengyao Xu et.al. 2507.05513 null
2025-07-07 Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation Inayat Rasool et.al. 2507.05432 null
2025-07-23 Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms Zhiyi Hu et.al. 2507.04786 null
2025-07-05 ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments Guile Wu et.al. 2507.03886 null
2025-07-24 Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Chong Cheng et.al. 2507.03737 null
2025-07-03 NVIDIA GPU Confidential Computing Demystified Zhongshu Gu et.al. 2507.02770 null
2025-07-03 Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources Roopkatha Banerjee et.al. 2507.02295 null
2025-07-02 SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan Fumikazu Konishi et.al. 2507.02124 null
2025-07-02 Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization Giuseppe Ruggeri et.al. 2507.01676 null
2025-06-20 PyTorch-based Geometric Learning with Non-CUDA Processing Units: Experiences from Intel Gaudi-v2 HPUs Fanchen Bu et.al. 2507.01031 null
2025-07-01 Anatomy of High-Performance Column-Pivoted QR Decomposition Maksim Melnichenko et.al. 2507.00976 null
2025-07-01 Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms Zain Taufique et.al. 2507.00491 null
2025-07-01 Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs Mohammad Firas Sada et.al. 2507.00418 null
2025-07-01 Question Decomposition for Retrieval-Augmented Generation Paul J. L. Ammann et.al. 2507.00355 null
2025-06-24 AdaDeDup: Adaptive Hybrid Data Pruning for Efficient Large-Scale Object Detection Training Feiyang Kang et.al. 2507.00049 null
2025-06-30 Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model Mu-Chi Chen et.al. 2506.23635 null
2025-06-30 Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset Tim Puphal et.al. 2506.23433 null
2025-06-29 CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms Faaiq Waqar et.al. 2506.23405 null
2025-06-28 FF-INT8: Efficient Forward-Forward DNN Training on Edge Devices with INT8 Precision Jingxiao Ma et.al. 2506.22771 null
2025-06-27 Quantum-Classical Auxiliary Field Quantum Monte Carlo with Matchgate Shadows on Trapped Ion Quantum Computers Luning Zhao et.al. 2506.22408 null
2025-06-27 MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism Zheng Zhang et.al. 2506.22175 null
2025-06-27 MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators Zheng Zhang et.al. 2506.22169 null
2025-07-08 BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting Zipei Ma et.al. 2506.22099 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-06-23 TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge Zhiyuan Zhang et.al. 2506.21618 null
2025-06-26 SAM4D: Segment Anything in Camera and LiDAR Streams Jianyun Xu et.al. 2506.21547 null
2025-06-26 Portable High-Performance Kernel Generation for a Computational Fluid Dynamics Code with DaCe Måns I. Andersson et.al. 2506.20994 null
2025-06-25 Characterization and Mitigation of Training Instabilities in Microscaling Formats Huangyuan Su et.al. 2506.20752 null
2025-06-24 MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models Hoa La et.al. 2506.20686 null
2025-06-25 SuperSONIC: Cloud-Native Infrastructure for ML Inferencing Dmitry Kondratyev et.al. 2506.20657 null
2025-06-25 Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking Ben Kang et.al. 2506.20381 null
2025-06-24 Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification Minghao Qin et.al. 2506.19225 null
2025-06-23 Let Your Video Listen to Your Music! Xinyu Zhang et.al. 2506.18881 null
2025-06-23 Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano Berk Yilmaz et.al. 2506.18220 null
2025-06-22 AMD Versal Implementations of FAM and SSCA Estimators Carol Jingyi Li et.al. 2506.18003 null
2025-06-20 Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms Kaushik Kulkarni et.al. 2506.17471 null
2025-06-19 VideoGAN-based Trajectory Proposal for Automated Vehicles Annajoyce Mariani et.al. 2506.16209 null
2025-06-19 Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs Xun Wang et.al. 2506.16196 null
2025-06-19 HetGPU: The pursuit of making binary compatibility towards GPUs Yiwei Yang et.al. 2506.15993 null
2025-06-18 Early Attentive Sparsification Accelerates Neural Speech Transcription Zifei Xu et.al. 2506.15912 null
2025-06-18 UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting Kai He et.al. 2506.15673 null
2025-06-18 Engineering Supercomputing Platforms for Biomolecular Applications Robert Welch et.al. 2506.15585 null
2025-07-30 Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention Syed Haider Ali et.al. 2506.15562 null
2025-06-17 Align Your Flow: Scaling Continuous-Time Flow Map Distillation Amirmojtaba Sabour et.al. 2506.14603 null
2025-06-18 Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren et.al. 2506.09042 null
2025-06-10 Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions David Acuna et.al. 2506.08927 null
2025-07-18 Controllable Weather Synthesis and Removal with Video Diffusion Models Chih-Hao Lin et.al. 2505.00704 null
2025-04-21 LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception Yuan-Hong Liao et.al. 2504.15362 null
2025-04-15 PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond Minghua Liu et.al. 2504.11451 null
2025-04-17 VideoPanda: Video Panoramic Diffusion with Multi-view Attention Kevin Xie et.al. 2504.11389 null
2025-04-01 Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control NVIDIA et.al. 2503.14492 null
2025-03-05 GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Xuanchi Ren et.al. 2503.03751 null
2025-03-03 Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jay Zhangjie Wu et.al. 2503.01774 null
2025-03-22 DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models Ruofan Liang et.al. 2501.18590 null
2025-07-09 Cosmos World Foundation Model Platform for Physical AI NVIDIA et.al. 2501.03575 null
2025-06-26 InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models Yifan Lu et.al. 2412.03934 null
2025-04-01 Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos Hanxue Liang et.al. 2412.03526 null
2024-11-14 LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Zhengyi Wang et.al. 2411.09595 null
2025-02-28 ReMatching Dynamic Reconstruction Flow Sara Oblak et.al. 2411.00705 null
2024-10-26 SCube: Instant Large-Scale Scene Reconstruction using VoxSplats Xuanchi Ren et.al. 2410.20030 null
2025-02-11 SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes Tianchang Shen et.al. 2409.20562 null
2024-09-28 G3R: Gradient Guided Generalizable Reconstruction Yun Chen et.al. 2409.19405 null
2024-09-27 UniCal: Unified Neural Sensor Calibration Ze Yang et.al. 2409.18953 null
2024-09-26 Learning to Drive via Asymmetric Self-Play Chris Zhang et.al. 2409.18218 null
2024-09-15 Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models Yuan-Hong Liao et.al. 2409.09788 null
2025-04-19 OmniRe: Omni Urban Scene Reconstruction Ziyu Chen et.al. 2408.16760 null
2024-08-19 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering Ruofan Liang et.al. 2408.09702 null
2025-03-20 Wolf: Dense Video Captioning with a World Summarization Framework Boyi Li et.al. 2407.18908 null
2024-07-15 SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation Jordan Juravsky et.al. 2407.10481 null
2024-10-10 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes Nicolas Moenne-Loccoz et.al. 2407.07090 null
2024-07-01 fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence Francis Williams et.al. 2407.01781 null
2024-10-31 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-14 L4GM: Large 4D Gaussian Reconstruction Model Jiawei Ren et.al. 2406.10324 null
2024-06-12 UnO: Unsupervised Occupancy Fields for Perception and Forecasting Ben Agro et.al. 2406.08691 null
2024-06-12 Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata Dongsu Zhang et.al. 2406.08292 null
2024-06-13 DeTra: A Unified Model for Object Detection and Trajectory Forecasting Sergio Casas et.al. 2406.04426 null
2024-04-24 NeRF-XL: Scaling NeRFs with Multiple GPUs Ruilong Li et.al. 2404.16221 null
2024-04-22 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Amirmojtaba Sabour et.al. 2404.14507 null
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765 null
2025-05-26 Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves? Yuan-Hong Liao et.al. 2404.06510 null
2024-04-01 QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving Sourav Biswas et.al. 2404.01486 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385 null
2024-03-22 Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks Aqeel Anwar et.al. 2403.15370 null
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739 null
2023-12-28 Compact Neural Graphics Primitives with Learned Hash Probing Towaki Takikawa et.al. 2312.17241 null
2024-01-03 Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Huan Ling et.al. 2312.13763 null
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654 null
2024-04-14 Trajeglish: Traffic Modeling as Next-Token Prediction Jonah Philion et.al. 2312.04535 null
2024-06-25 XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies Xuanchi Ren et.al. 2312.03806 null
2024-04-12 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570 null
2023-11-16 Adaptive Shells for Efficient Neural Radiance Field Rendering Zian Wang et.al. 2311.10091 null
2023-11-09 Real-Time Neural Rasterization for Large Scenes Jeffrey Yunfan Liu et.al. 2311.05607 null
2023-11-09 Reconstructing Objects in-the-wild for Realistic Sensor Simulation Ze Yang et.al. 2311.05602 null
2023-11-07 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Chenfeng Xu et.al. 2311.04391 null
2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Jiawei Yang et.al. 2311.02077 null
2023-11-03 Towards Unsupervised Object Detection From LiDAR Point Clouds Lunjun Zhang et.al. 2311.02007 null
2023-11-02 MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory Enxu Li et.al. 2311.01556 null
2023-11-17 4D-Former: Multimodal 4D Panoptic Segmentation Ali Athar et.al. 2311.01520 null
2023-11-02 UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation Yuwen Xiong et.al. 2311.01448 null
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447 null
2023-11-02 Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation Jay Sarva et.al. 2311.01446 null
2023-11-02 LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds Anqi Joyce Yang et.al. 2311.01444 null
2023-11-02 Learning Realistic Traffic Agents in Closed-loop Chris Zhang et.al. 2311.01394 null
2024-04-01 Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion Lunjun Zhang et.al. 2311.01017 null
2024-01-26 ViR: Towards Efficient Vision Retention Backbones Ali Hatamizadeh et.al. 2310.19731 null
2023-10-20 TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models Tianshi Cao et.al. 2310.13772 null
2023-09-11 Towards Viewpoint Robustness in Bird’s Eye View Segmentation Tzofi Klinghoffer et.al. 2309.05192 null
2023-08-10 Flexible Isosurface Extraction for Gradient-Based Mesh Optimization Tianchang Shen et.al. 2308.05371 null
2023-08-03 UniSim: A Neural Closed-Loop Sensor Simulator Ze Yang et.al. 2308.01898 null
2023-08-02 Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving Ben Agro et.al. 2308.01471 null
2023-07-14 DreamTeacher: Pretraining Image Backbones with Deep Generative Models Daiqing Li et.al. 2307.07487 null
2023-06-27 Rethinking Closed-loop Training for Autonomous Driving Chris Zhang et.al. 2306.15713 null
2023-06-06 ATT3D: Amortized Text-to-3D Object Synthesis Jonathan Lorraine et.al. 2306.07349 null
2023-06-09 Neural Kernel Surface Reconstruction Jiahui Huang et.al. 2305.19590 null
2023-08-13 Neural LiDAR Fields for Novel View Synthesis Shengyu Huang et.al. 2305.01643 null
2023-04-19 NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models Seung Wook Kim et.al. 2304.09787 null
2023-12-28 Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models Andreas Blattmann et.al. 2304.08818 null
2023-04-06 Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes Zian Wang et.al. 2304.03266 null
2023-04-04 Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion Davis Rempe et.al. 2304.01893 null
2023-03-25 VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion Yiming Li et.al. 2302.12251 null
2023-02-09 Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting Viraj Prabhu et.al. 2302.04832 null
2023-02-02 Synthesizing Physical Character-Scene Interactions Mohamed Hassan et.al. 2302.00883 null
2023-01-31 PADL: Language-Directed Physics-Based Character Control Jordan Juravsky et.al. 2301.13868 null
2023-03-25 Magic3D: High-Resolution Text-to-3D Content Creation Chen-Hsuan Lin et.al. 2211.10440 null
2022-11-08 GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting Alexander Cui et.al. 2211.02545 null
2022-10-12 LION: Latent Point Diffusion Models for 3D Shape Generation Xiaohui Zeng et.al. 2210.06978 null
2022-10-06 XDGAN: Multi-Modal 3D Shape Generation in 2D Space Hassan Abu Alhaija et.al. 2210.03007 null
2022-10-03 Optimizing Data Collection for Machine Learning Rafid Mahmood et.al. 2210.01234 null
2022-09-26 EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations Ahmad Darkhalil et.al. 2209.13064 null
2022-09-22 GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images Jun Gao et.al. 2209.11163 null
2022-08-19 Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion Zian Wang et.al. 2208.09480 null
2022-08-18 MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation Gopal Sharma et.al. 2208.08580 null
2022-07-05 Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention Gary Leung et.al. 2207.02126 null
2022-07-13 How Much More Data Do I Need? Estimating Requirements for Downstream Tasks Rafid Mahmood et.al. 2207.01725 null
2022-06-19 Scalable Neural Data Server: A Data Recommender for Transfer Learning Tianshi Cao et.al. 2206.09386 null
2022-06-16 Virtual Correspondence: Humans as a Cue for Extreme-View Geometry Wei-Chiu Ma et.al. 2206.08365 null
2022-06-15 Variable Bitrate Neural Fields Towaki Takikawa et.al. 2206.07707 null
2022-06-06 Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps Seung Wook Kim et.al. 2206.02903 null
2022-05-05 ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters Xue Bin Peng et.al. 2205.01906 null
2022-04-19 M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation Enze Xie et.al. 2204.05088 null
2022-04-06 AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis Zhiqin Chen et.al. 2204.03105 null

Autonomous Driving

Publish Date Title Authors PDF Code
2025-10-07 Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models Jiahao Wang et.al. 2510.06209 null
2025-10-07 The Safety Challenge of World Models for Embodied AI Agents: A Review Lorenzo Baraldi et.al. 2510.05865 null
2025-10-07 ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving Yongxuan Lyu et.al. 2510.05752 null
2025-10-07 Precise and Efficient Collision Prediction under Uncertainty in Autonomous Driving Marc Kaufeld et.al. 2510.05729 null
2025-10-07 HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video Hongchi Xia et.al. 2510.05560 null
2025-10-06 Comparative Analysis of YOLOv5, Faster R-CNN, SSD, and RetinaNet for Motorbike Detection in Kigali Autonomous Driving Context Ngeyen Yinkfu et.al. 2510.04912 null
2025-10-06 Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction Chi Yan et.al. 2510.04759 null
2025-10-05 Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction Yuhao Luo et.al. 2510.04365 null
2025-10-04 From Filters to VLMs: Benchmarking Defogging Methods through Object Detection and Segmentation Performance Ardalan Aryashad et.al. 2510.03906 null
2025-10-04 Referring Expression Comprehension for Small Objects Kanoko Goto et.al. 2510.03701 null
2025-10-04 Safety-Oriented Dynamic Path Planning for Automated Vehicles Mostafa Emam et.al. 2510.03640 null
2025-10-03 Agile Tradespace Exploration for Space Rendezvous Mission Design via Transformers Yuji Takubo et.al. 2510.03544 null
2025-10-03 Training-Free Out-Of-Distribution Segmentation With Foundation Models Laith Nayal et.al. 2510.02909 null
2025-10-03 GS-Share: Enabling High-fidelity Map Sharing with Incremental Gaussian Splatting Xinran Zhang et.al. 2510.02884 null
2025-10-03 Action Deviation-Aware Inference for Low-Latency Wireless Robots Jeyoung Park et.al. 2510.02851 null
2025-10-03 Work Zones challenge VLM Trajectory Planning: Toward Mitigation and Robust Autonomous Driving Yifan Liao et.al. 2510.02803 null
2025-10-03 A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios Ruining Yang et.al. 2510.02627 null
2025-10-02 Calibrating the Full Predictive Class Distribution of 3D Object Detectors for Autonomous Driving Cornelius Schröder et.al. 2510.01829 null
2025-10-02 Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving Haibo Hu et.al. 2510.01795 null
2025-10-02 Predictive Preference Learning from Human Interventions Haoyuan Cai et.al. 2510.01545 null
2025-10-01 Strategic Fusion of Vision Language Models: Shapley-Credited Context-Aware Dawid-Skene for Multi-Label Tasks in Autonomous Driving Yuxiang Feng et.al. 2510.01126 null
2025-10-03 Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving Sheng Yang et.al. 2510.00060 null
2025-09-30 TTT3R: 3D Reconstruction as Test-Time Training Xingyu Chen et.al. 2509.26645 null
2025-09-30 PRISM: Progressive Rain removal with Integrated State-space Modeling Pengze Xue et.al. 2509.26413 null
2025-09-30 Beyond Pixels: Efficient Dataset Distillation via Sparse Gaussian Representation Chenyang Jiang et.al. 2509.26219 null
2025-09-30 Beyond Overall Accuracy: Pose- and Occlusion-driven Fairness Analysis in Pedestrian Detection for Autonomous Driving Mohammad Khoshkdahan et.al. 2509.26166 null
2025-09-30 Preemptive Spatiotemporal Trajectory Adjustment for Heterogeneous Vehicles in Highway Merging Zones Yuan Li et.al. 2509.25929 null
2025-09-30 MuSLR: Multimodal Symbolic Logical Reasoning Jundong Xu et.al. 2509.25851 null
2025-09-29 Online Mapping for Autonomous Driving: Addressing Sensor Generalization and Dynamic Map Updates in Campus Environments Zihan Zhang et.al. 2509.25542 null
2025-09-27 BEV-VLM: Trajectory Planning via Unified BEV Abstraction Guancheng Chen et.al. 2509.25249 null
2025-09-29 StreamForest: Efficient Online Video Understanding with Persistent Event Memory Xiangyu Zeng et.al. 2509.24871 null
2025-09-29 TACO-Net: Topological Signatures Triumph in 3D Object Classification Anirban Ghosh et.al. 2509.24802 null
2025-09-29 Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning Korbinian Moller et.al. 2509.24313 null
2025-09-29 Skeleton-based Robust Registration Framework for Corrupted 3D Point Clouds Yongqiang Wang et.al. 2509.24273 null
2025-09-28 Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning Muleilan Pei et.al. 2509.23993 null
2025-10-05 AutoPrune: Each Complexity Deserves a Pruning Policy Hanshi Wang et.al. 2509.23931 null
2025-09-28 DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation Haibao Yu et.al. 2509.23922 null
2025-09-28 Preserving Cross-Modal Stability for Visual Unlearning in Multimodal Scenarios Jinghan Xu et.al. 2509.23895 null
2025-09-28 From Static to Dynamic: a Survey of Topology-Aware Perception in Autonomous Driving Yixiao Chen et.al. 2509.23641 null
2025-09-28 BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous Driving Shu Liu et.al. 2509.23589 null
2025-09-28 OVSeg3R: Learn Open-vocabulary Instance Segmentation from 2D via 3D Reconstruction Hongyang Li et.al. 2509.23541 null
2025-09-27 WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving Ziyue Zhu et.al. 2509.23402 null
2025-09-27 Preventing Robotic Jailbreaking via Multimodal Domain Adaptation Francesco Marchiori et.al. 2509.23281 null
2025-09-26 Persistent Autoregressive Mapping with Traffic Rules for Autonomous Driving Shiyi Liang et.al. 2509.22756 null
2025-09-26 Self-driving cars: Are we there yet? Merve Atasever et.al. 2509.22754 null
2025-10-07 Robust Object Detection for Autonomous Driving via Curriculum-Guided Group Relative Policy Optimization Xu Jia et.al. 2509.22688 null
2025-10-03 An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment Xiaoyun Qiu et.al. 2509.22550 null
2025-09-26 EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model Andrii Litvynchuk et.al. 2509.22527 null
2025-09-29 A Multi-Modality Evaluation of the Reality Gap in Autonomous Driving Systems Stefano Carlo Lambertenghi et.al. 2509.22379 null
2025-09-26 UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal Data Yujian Yuan et.al. 2509.22262 null
2025-09-26 An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose Qifeng Wang et.al. 2509.22058 null
2025-09-25 PL-VIWO2: A Lightweight, Fast and Robust Visual-Inertial-Wheel Odometry Using Points and Lines Zhixin Zhang et.al. 2509.21563 null
2025-09-25 Human-like Navigation in a World Built for Humans Bhargav Chandaka et.al. 2509.21189 null
2025-09-25 Autoregressive End-to-End Planning with Time-Invariant Spatial Alignment and Multi-Objective Policy Refinement Jianbo Zhao et.al. 2509.20938 null
2025-09-25 MTRDrive: Memory-Tool Synergistic Reasoning for Robust Autonomous Driving in Corner Cases Ziang Luo et.al. 2509.20843 null
2025-09-25 DAC-LoRA: Dynamic Adversarial Curriculum for Efficient and Robust Few-Shot Adaptation Ved Umrajkar et.al. 2509.20792 null
2025-09-29 MASt3R-Fusion: Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM Yuxuan Zhou et.al. 2509.20757 null
2025-10-04 Cyber Racing Coach: A Haptic Shared Control Framework for Teaching Advanced Driving Skills Congkai Shen et.al. 2509.20653 null
2025-09-26 AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving Jinhao Chai et.al. 2509.20253 null
2025-09-24 Universal Camouflage Attack on Vision-Language Models for Autonomous Driving Dehong Kong et.al. 2509.20196 null
2025-09-24 Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving Pengxiang Li et.al. 2509.20109 null
2025-09-25 Hyperspectral Adapter for Semantic Segmentation with Vision Foundation Models Juana Valeria Hurtado et.al. 2509.20107 null
2025-09-25 OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving Pei Liu et.al. 2509.19973 null
2025-09-24 BiTAA: A Bi-Task Adversarial Attack for Object Detection and Depth Estimation via 3D Gaussian Splatting Yixun Zhang et.al. 2509.19793 null
2025-09-24 RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving Carlo Bosio et.al. 2509.19789 null
2025-09-24 EfficienT-HDR: An Efficient Transformer-Based Framework via Multi-Exposure Fusion for HDR Reconstruction Yu-Shen Huang et.al. 2509.19779 null
2025-09-23 The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar William L. Muckelroy III et.al. 2509.19644 null
2025-09-20 Vision-Based Perception for Autonomous Vehicles in Off-Road Environment Using Deep Learning Nelson Alves Ferreira Neto et.al. 2509.19378 null
2025-09-23 Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Sherwin Bahmani et.al. 2509.19296 null
2025-09-23 TriFusion-AE: Language-Guided Depth and LiDAR Fusion for Robust Point Cloud Processing Susmit Neogi et.al. 2509.18743 null
2025-09-23 The Case for Negative Data: From Crash Reports to Counterfactuals for Reasonable Driving Jay Patrikar et.al. 2509.18626 null
2025-09-23 MLF-4DRCNet: Multi-Level Fusion with 4D Radar and Camera for 3D Object Detection in Autonomous Driving Yuzhi Wu et.al. 2509.18613 null
2025-09-23 PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving Chengran Yuan et.al. 2509.18609 null
2025-09-23 Spatial Envelope MPC: High Performance Driving without a Reference Siyuan Yu et.al. 2509.18506 null
2025-09-22 AD-VF: LLM-Automatic Differentiation Enables Fine-Tuning-Free Robot Planning from Formal Methods Feedback Yunhao Yang et.al. 2509.18384 null
2025-09-19 MMCD: Multi-Modal Collaborative Decision-Making for Connected Autonomy with Knowledge Distillation Rui Liu et.al. 2509.18198 null
2025-09-25 V2V-GoT: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models and Graph-of-Thoughts Hsu-kuang Chiu et.al. 2509.18053 null
2025-09-22 Towards Seeing Bones at Radio Frequency Yiwen Song et.al. 2509.17979 null
2025-09-22 DriveDPO: Policy Learning via Safety DPO For End-to-End Autonomous Driving Shuyao Shang et.al. 2509.17940 null
2025-09-22 SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model Xiao Zhou et.al. 2509.17850 null
2025-09-22 Predicting Depth Maps from Single RGB Images and Addressing Missing Information in Depth Estimation Mohamad Mofeed Chaar et.al. 2509.17686 null
2025-09-22 Tensor-Based Self-Calibration of Cameras via the TrifocalCalib Method Gregory Schroeder et.al. 2509.17620 null
2025-09-22 Vision-Based Driver Drowsiness Monitoring: Comparative Analysis of YOLOv5-v11 Models Dilshara Herath et.al. 2509.17498 null
2025-09-22 FGGS-LiDAR: Ultra-Fast, GPU-Accelerated Simulation from General 3DGS Models to LiDAR Junzhe Wu et.al. 2509.17390 null
2025-09-21 CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving Ruiguo Zhong et.al. 2509.17080 null
2025-09-21 Orchestrate, Generate, Reflect: A VLM-Based Multi-Agent Collaboration Framework for Automated Driving Policy Learning Zengqi Peng et.al. 2509.17042 null
2025-09-21 SLAM-Former: Putting SLAM into One Transformer Yijun Yuan et.al. 2509.16909 null
2025-09-21 End2Race: Efficient End-to-End Imitation Learning for Real-Time F1Tenth Racing Zhijie Qiao et.al. 2509.16894 null
2025-09-20 Improve bounding box in Carla Simulator Mohamad Mofeed Chaar et.al. 2509.16773 null
2025-09-28 Are VLMs Ready for Lane Topology Awareness in Autonomous Driving? Xin Chen et.al. 2509.16654 null
2025-09-20 ADVEDM: Fine-grained Adversarial Attack against VLM-based Embodied Agents Yichen Wang et.al. 2509.16645 null
2025-09-20 SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving Haiming Zhang et.al. 2509.16588 null
2025-09-20 ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting Xiaoyang Yan et.al. 2509.16552 null
2025-09-20 RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation Tianyi Yan et.al. 2509.16500 null
2025-09-19 Neural Atlas Graphs for Dynamic Scene Decomposition and Editing Jan Philipp Schneider et.al. 2509.16336 null
2025-09-18 RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving Shuocheng Yang et.al. 2509.16261 null
2025-09-19 RadarGaussianDet3D: An Efficient and Effective Gaussian-based 3D Detector with 4D Automotive Radars Weiyi Xiong et.al. 2509.16119 null
2025-09-19 SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features Jinyuan Qu et.al. 2509.16098 null
2025-09-19 CoPAD: Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios Kangyu Wu et.al. 2509.15984 null
2025-09-19 CoReVLA: A Dual-Stage End-to-End Autonomous Driving Framework for Long-Tail Scenarios via Collect-and-Refine Shiyu Fang et.al. 2509.15968 null
2025-09-19 RangeSAM: Leveraging Visual Foundation Models for Range-View represented LiDAR segmentation Paul Julius Kühn et.al. 2509.15886 null
2025-09-19 CBPNet: A Continual Backpropagation Prompt Network for Alleviating Plasticity Loss on Edge Devices Runjie Shao et.al. 2509.15785 null
2025-09-19 Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution Chang Soo Lim et.al. 2509.15781 null
2025-09-22 Online Slip Detection and Friction Coefficient Estimation for Autonomous Racing Christopher Oeltjen et.al. 2509.15423 null
2025-09-18 Out-of-Sight Trajectories: Tracking, Fusion, and Prediction Haichao Zhang et.al. 2509.15219 null
2025-09-18 Feature-aligned Motion Transformation for Efficient Dynamic Point Cloud Compression Xuan Deng et.al. 2509.14591 null
2025-09-18 DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising Li Gao et.al. 2509.14565 null
2025-09-17 FlowDrive: Energy Flow Field for End-to-End Autonomous Driving Hao Jiang et.al. 2509.14303 null
2025-10-03 MCGS-SLAM: A Multi-Camera SLAM Framework Using Gaussian Splatting for High-Fidelity Mapping Zhihao Cao et.al. 2509.14191 null
2025-09-17 BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection Rongyu Zhang et.al. 2509.14151 null
2025-09-17 SEG-Parking: Towards Safe, Efficient, and Generalizable Autonomous Parking via End-to-End Offline Reinforcement Learning Zewei Yang et.al. 2509.13956 null
2025-09-17 MAP: End-to-End Autonomous Driving with Map-Assisted Planning Huilin Yin et.al. 2509.13926 null
2025-09-17 Data-Efficient Spectral Classification of Hyperspectral Data Using MiniROCKET and HDC-MiniROCKET Nick Theisen et.al. 2509.13809 null
2025-09-17 AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving Yuechen Luo et.al. 2509.13769 null
2025-09-17 UM-Depth: Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry Tae-Wook Um et.al. 2509.13713 null
2025-09-17 FishBEV: Distortion-Resilient Bird’s Eye View Segmentation with Surround-View Fisheye Cameras Hang Li et.al. 2509.13681 null
2025-09-28 TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning Momchil S. Tomov et.al. 2509.13579 null
2025-09-16 Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving Artem Savkin et.al. 2509.13507 null
2025-09-16 Road Obstacle Video Segmentation Shyam Nandan Rai et.al. 2509.13181 null
2025-09-17 TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving Jiawei Wang et.al. 2509.13164 null
2025-09-16 An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios Zhihao Zhang et.al. 2509.13132 null
2025-09-16 Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving Ruibo Li et.al. 2509.13116 null
2025-09-16 4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar Xiao Tang et.al. 2509.12931 null
2025-09-16 StereoCarla: A High-Fidelity Driving Dataset for Generalizable Stereo Xianda Guo et.al. 2509.12683 null
2025-09-16 Maps for Autonomous Driving: Full-process Survey and Frontiers Pengxin Chen et.al. 2509.12632 null
2025-09-16 DisorientLiDAR: Physical Attacks on LiDAR-based Localization Yizhen Lao et.al. 2509.12595 null
2025-08-26 UrgenGo: Urgency-Aware Transparent GPU Kernel Launching for Autonomous Driving Hanqi Zhu et.al. 2509.12207 null
2025-09-16 Embodied Navigation Foundation Model Jiazhao Zhang et.al. 2509.12129 null
2025-09-15 Probabilistic Robustness Analysis in High Dimensional Space: Application to Semantic Segmentation Network Navid Hashemi et.al. 2509.11838 null
2025-09-14 SAMP: Spatial Anchor-based Motion Policy for Collision-Aware Robotic Manipulators Kai Chen et.al. 2509.11185 null
2025-09-14 SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion Zhiwen Yang et.al. 2509.11171 null
2025-09-13 Point-Plane Projections for Accurate LiDAR Semantic Segmentation in Small Data Scenarios Simone Mosco et.al. 2509.10841 null
2025-09-11 Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey Wei Dai et.al. 2509.10570 null
2025-09-17 DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training Jianxin Shi et.al. 2509.10426 null
2025-09-12 Multimodal SAM-adapter for Semantic Segmentation Iacopo Curti et.al. 2509.10408 null
2025-09-12 CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion Santiago Montiel-Marín et.al. 2509.10139 null
2025-09-12 BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals Minsang Kong et.al. 2509.10080 null
2025-09-11 MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network Ge Sun et.al. 2509.09200 null
2025-09-23 LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations Payal Varshney et.al. 2509.08422 null
2025-09-10 Sparse BEV Fusion with Self-View Consistency for Multi-View Detection and Tracking Keisuke Toida et.al. 2509.08421 null
2025-09-10 InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection Zhongyu Xia et.al. 2509.08374 null
2025-09-10 Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities Rajendramayavan Sathyam et.al. 2509.08302 null
2025-09-10 A Comprehensive Review of Reinforcement Learning for Autonomous Driving in the CARLA Simulator Elahe Delavari et.al. 2509.08221 null
2025-09-09 Mean Field Game-Based Interactive Trajectory Planning Using Physics-Inspired Unified Potential Fields Zhen Tian et.al. 2509.08147 null
2025-09-09 TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models Zongzheng Zhang et.al. 2509.07962 null
2025-09-09 Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation Yusuke Hirota et.al. 2509.07596 null
2025-09-09 Bias-Aware Machine Unlearning: Towards Fairer Vision Models via Controllable Forgetting Sai Siddhartha Chary Aylapuram et.al. 2509.07456 null
2025-09-09 Attention and Risk-Aware Decision Framework for Safe Autonomous Driving Zhen Tian et.al. 2509.07412 null
2025-09-08 SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis Zhengqing Chen et.al. 2509.06798 null
2025-09-08 Adaptive Evolution Factor Risk Ellipse Framework for Reliable and Safe Autonomous Driving Fujiang Yuan et.al. 2509.06375 null
2025-09-06 Scenario-based Decision-making Using Game Theory for Interactive Autonomous Driving: A Survey Zhihao Lin et.al. 2509.05777 null
2025-09-06 Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh Ha Meem Hossain et.al. 2509.05652 null
2025-09-06 OccVLA: Vision-Language-Action Model with Implicit 3D Occupancy Supervision Ruixun Liu et.al. 2509.05578 null
2025-09-03 Unsupervised Instance Segmentation with Superpixels Cuong Manh Hoang et.al. 2509.05352 null
2025-09-08 LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation Yinglin Duan et.al. 2509.05263 null
2025-09-05 Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet Mohammad Saeid et.al. 2509.05198 null
2025-09-05 A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing Chengkai Xu et.al. 2509.04853 null
2025-09-05 Enhancing Self-Driving Segmentation in Adverse Weather Conditions: A Dual Uncertainty-Aware Training Approach to SAM Optimization Dharsan Ravindran et.al. 2509.04735 null
2025-09-04 Bootstrapping Reinforcement Learning with Sub-optimal Policies for Autonomous Driving Zhihao Zhang et.al. 2509.04712 null
2025-09-04 Domain Adaptation for Different Sensor Configurations in 3D Object Detection Satoshi Tanaka et.al. 2509.04711 null
2025-09-04 In-Context Policy Adaptation via Cross-Domain Skill Diffusion Minjong Yoo et.al. 2509.04535 null
2025-09-09 One Flight Over the Gap: A Survey from Perspective to Panoramic Vision Xin Lin et.al. 2509.04444 null
2025-09-04 TriLiteNet: Lightweight Model for Multi-Task Visual Perception Quang-Huy Che et.al. 2509.04092 null
2025-09-04 SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation Han Huang et.al. 2509.03999 null
2025-09-03 sam-llm: interpretable lane change trajectory prediction via parametric finetuning Zhuo Cao et.al. 2509.03462 null
2025-09-03 KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models Yujin Wang et.al. 2509.02966 null
2025-09-02 2nd Place Solution for CVPR2024 E2E Challenge: End-to-End Autonomous Driving Using Vision Language Model Zilong Guo et.al. 2509.02659 null
2025-09-02 Omnidirectional Spatial Modeling from Correlated Panoramas Xinshen Zhang et.al. 2509.02164 null
2025-09-02 Generalizing Unsupervised Lidar Odometry Model from Normal to Snowy Weather Conditions Beibei Zhou et.al. 2509.02011 null
2025-09-02 Explaining What Machines See: XAI Strategies in Deep Object Detection Models FatemehSadat Seyedmomeni et.al. 2509.01991 null
2025-09-02 AutoDrive-R$^2$: Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving Zhenlong Yuan et.al. 2509.01944 null
2025-09-01 PointSlice: Accurate and Efficient Slice-Based Representation for 3D Object Detection from Point Clouds Liu Qifeng et.al. 2509.01487 null
2025-09-01 Guided Model-based LiDAR Super-Resolution for Resource-Efficient Automotive scene Segmentation Alexandros Gkillas et.al. 2509.01317 null
2025-09-01 Toward a Holistic Multi-Criteria Trajectory Evaluation Framework for Autonomous Driving in Mixed Traffic Environment Nouhed Naidja et.al. 2509.01291 null
2025-09-04 Enhanced Mean Field Game for Interactive Decision-Making with Varied Stylish Multi-Vehicles Liancheng Zheng et.al. 2509.00981 null
2025-08-31 OmniReason: A Temporal-Guided Vision-Language-Action Framework for Autonomous Driving Pei Liu et.al. 2509.00789 null
2025-08-30 Vehicle-in-Virtual-Environment (VVE) Method for Developing and Evaluating VRU Safety of Connected and Autonomous Driving with Focus on Bicyclist Safety Haochong Chen et.al. 2509.00624 null
2025-08-30 Safe and Efficient Lane-Changing for Autonomous Vehicles: An Improved Double Quintic Polynomial Approach with Time-to-Collision Evaluation Rui Bai et.al. 2509.00582 null
2025-08-30 Galaxea Open-World Dataset and G0 Dual-System VLA Model Tao Jiang et.al. 2509.00576 null
2025-08-30 FLUID: A Fine-Grained Lightweight Urban Signalized-Intersection Dataset of Dense Conflict Trajectories Yiyang Chen et.al. 2509.00497 null
2025-08-30 Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation Jialiang Kang et.al. 2509.00379 null
2025-08-29 3D-LATTE: Latent Space 3D Editing from Textual Instructions Maria Parelli et.al. 2509.00269 null
2025-08-29 DriveQA: Passing the Driving Knowledge Test Maolin Wei et.al. 2508.21824 null
2025-08-29 Mini Autonomous Car Driving based on 3D Convolutional Neural Networks Pablo Moraes et.al. 2508.21271 null
2025-08-18 2COOOL: 2nd Workshop on the Challenge Of Out-Of-Label Hazards in Autonomous Driving Ali K. AlShami et.al. 2508.21080 null
2025-08-28 DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes Yajiao Xiong et.al. 2508.20965 null
2025-08-28 Surfel-based 3D Registration with Equivariant SE(3) Features Xueyang Kang et.al. 2508.20789 null
2025-08-28 SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer Fachri Najm Noer Kartiman et.al. 2508.20762 null
2025-08-28 UTA-Sign: Unsupervised Thermal Video Augmentation via Event-Assisted Traffic Signage Sketching Yuqi Han et.al. 2508.20594 null
2025-08-28 Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts Zixuan Hu et.al. 2508.20488 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-27 Streamlining the Development of Active Learning Methods in Real-World Object Detection Moussa Kassem Sbeyti et.al. 2508.19906 null
2025-08-27 Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities Imad Ali Shah et.al. 2508.19905 null
2025-08-27 Generalizing Monocular 3D Object Detection Abhinav Kumar et.al. 2508.19593 null
2025-08-25 Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation Alexandros Gkillas et.al. 2508.19290 null
2025-08-26 Interpretable Decision-Making for End-to-End Autonomous Driving Mona Mirzaie et.al. 2508.18898 null
2025-08-26 EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding Luqing Luo et.al. 2508.18785 null
2025-08-20 GM-Skip: Metric-Guided Transformer Block Skipping for Efficient Vision-Language Models Lianming Huang et.al. 2508.18227 null
2025-09-02 EventTracer: Fast Path Tracing-based Event Stream Rendering Zhenyang Li et.al. 2508.18071 null
2025-09-02 Integration of Computer Vision with Adaptive Control for Autonomous Driving Using ADORE Abu Shad Ahammed et.al. 2508.17985 null
2025-08-25 Enhanced Drift-Aware Computer Vision Architecture for Autonomous Driving Md Shahi Amran Hossain et.al. 2508.17975 null
2025-08-25 Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction Yunxiang Liu et.al. 2508.17797 null
2025-08-23 A Rapid Iterative Trajectory Planning Method for Automated Parking through Differential Flatness Zhouheng Li et.al. 2508.17038 null
2025-08-23 A Survey of Deep Learning-based Point Cloud Denoising Jinxi Wang et.al. 2508.17011 null
2025-08-23 Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model Fan Ding et.al. 2508.16947 null
2025-08-22 Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation Guangyu Sun et.al. 2508.16568 null
2025-08-22 Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation Chun-Peng Chang et.al. 2508.16512 null
2025-08-22 SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather Edoardo Palladin et.al. 2508.16408 null
2025-08-22 MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction Ziyang Yan et.al. 2508.15653 null
2025-08-23 ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors Kaiyuan Tan et.al. 2508.15529 null
2025-08-21 RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features Olga Matykina et.al. 2508.15353 null
2025-08-21 RATopo: Improving Lane Topology Reasoning via Redundancy Assignment Han Li et.al. 2508.15272 null
2025-08-21 Adversarial Agent Behavior Learning in Autonomous Driving Using Deep Reinforcement Learning Arjun Srinivasan et.al. 2508.15207 null
2025-08-25 MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion Xuyang Chen et.al. 2508.15169 null
2025-08-28 Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving Dianzhao Li et.al. 2508.14926 null
2025-08-20 Multiscale Video Transformers for Class Agnostic Segmentation in Autonomous Driving Leila Cheshmi et.al. 2508.14729 null
2025-08-20 MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation Guile Wu et.al. 2508.14327 null
2025-09-16 ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving Xianda Guo et.al. 2508.13977 null
2025-08-19 Unleashing Semantic and Geometric Priors for 3D Scene Completion Shiyuan Chen et.al. 2508.13601 null
2025-08-25 Bridging Clear and Adverse Driving Conditions Yoel Shapiro et.al. 2508.13592 null
2025-08-19 Generative Model-Based Feature Attention Module for Video Action Analysis Guiqin Wang et.al. 2508.13565 null
2025-08-19 CORENet: Cross-Modal 4D Radar Denoising Network with LiDAR Supervision for Autonomous Driving Fuyang Liu et.al. 2508.13485 null
2025-08-19 Structured Prompting and Multi-Agent Knowledge Distillation for Traffic Video Interpretation and Risk Inference Yunxiang Yang et.al. 2508.13439 null
2025-08-18 Incremental Generalized Hybrid A* Sidharth Talia et.al. 2508.13392 null
2025-08-18 Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving Minhao Xiong et.al. 2508.13305 null
2025-08-18 SpotVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer Chen Qian et.al. 2508.12638 null
2025-08-18 ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving Can Cui et.al. 2508.12603 null
2025-08-17 An Initial Study of Bird’s-Eye View Generation for Autonomous Vehicles using Cross-View Transformers Felipe Carlos dos Santos et.al. 2508.12520 null
2025-08-17 LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving Nan Song et.al. 2508.12404 null
2025-08-17 DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection Yuval Haitman et.al. 2508.12330 null
2025-08-17 TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform Jun Liu et.al. 2508.12279 null
2025-08-16 InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes Hongyuan Liu et.al. 2508.12015 null
2025-08-16 Saliency-Based Attention Shifting: A Framework for Improving Driver Situational Awareness of Out-of-Label Hazards Yousra Shleibik et.al. 2508.11887 null
2025-08-16 Data Shift of Object Detection in Autonomous Driving Lida Xu et.al. 2508.11868 null
2025-08-15 Relative Position Matters: Trajectory Prediction and Planning with Polar Representation Bozhou Zhang et.al. 2508.11492 null
2025-08-15 Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving Bozhou Zhang et.al. 2508.11488 null
2025-08-15 EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback Jiayue Jin et.al. 2508.11453 null
2025-08-15 ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving Jingyu Li et.al. 2508.11428 null
2025-08-15 Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking Haonan Zhang et.al. 2508.11323 null
2025-08-15 A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving Jialin Li et.al. 2508.11218 null
2025-08-14 CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving Jiarong Li et.al. 2508.10962 null
2025-08-18 HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffusion Model Qi Liu et.al. 2508.10935 null
2025-08-14 Towards Powerful and Practical Patch Attacks for 2D Object Detection in Autonomous Driving Yuxin Cao et.al. 2508.10600 null
2025-08-14 SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving Philipp Wolters et.al. 2508.10567 null
2025-08-14 Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies Ayushman Sarkar et.al. 2508.10523 null
2025-08-14 STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes Keishi Ishihara et.al. 2508.10427 null
2025-08-14 From Pixel to Mask: A Survey of Out-of-Distribution Segmentation Wenjie Zhao et.al. 2508.10309 null
2025-08-13 BridgeTA: Bridging the Representation Gap in Knowledge Distillation via Teacher Assistant for Bird’s Eye View Map Segmentation Beomjun Kim et.al. 2508.09599 null
2025-08-13 Offline Auto Labeling: BAAS Stefan Haag et.al. 2508.09585 null
2025-08-13 Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving Guangxun Zhu et.al. 2508.09404 null
2025-08-12 VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception Fuhao Chang et.al. 2508.09061 null
2025-08-12 A Pseudo Global Fusion Paradigm-Based Cross-View Network for LiDAR-Based Place Recognition Jintao Cheng et.al. 2508.08917 null
2025-08-21 ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction Chaojun Ni et.al. 2508.08170 null
2025-08-18 TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation Huawei Sun et.al. 2508.08038 null
2025-08-11 CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving Qi Xiang et.al. 2508.07838 null
2025-08-11 Risk Map As Middleware: Towards Interpretable Cooperative End-to-end Autonomous Driving for Risk-Aware Planning Mingyue Lei et.al. 2508.07686 null
2025-08-11 Progressive Bird’s Eye View Perception for Safety-Critical Autonomous Driving: A Comprehensive Survey Yan Gong et.al. 2508.07560 null
2025-08-12 Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring Ludan Zhang et.al. 2508.07552 null
2025-08-10 Noise-Aware Generative Microscopic Traffic Simulation Vindula Jayawardana et.al. 2508.07453 null
2025-08-09 An Evolutionary Game-Theoretic Merging Decision-Making Considering Social Acceptance for Autonomous Driving Haolin Liu et.al. 2508.07080 null
2025-08-27 From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving Antonio Guillen-Perez et.al. 2508.07029 null
2025-08-09 WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering Yixin Zhu et.al. 2508.06982 null
2025-08-08 Robust-Sub-Gaussian Model Predictive Control for Safe Ultrasound-Image-Guided Robotic Spinal Surgery Yunke Ao et.al. 2508.06744 null
2025-08-15 IRL-VLA: Training a Vision-Language-Action Policy via Reward World Model Anqing Jiang et.al. 2508.06571 null
2025-08-20 MetAdv: A Unified and Interactive Adversarial Testing Platform for Autonomous Driving Aishan Liu et.al. 2508.06534 null
2025-08-02 RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving Jiayuan Wang et.al. 2508.06529 null
2025-08-12 GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving Jian Wang et.al. 2508.06113 null
2025-08-08 ME$^3$-BEV: Mamba-Enhanced Deep Reinforcement Learning for End-to-End Autonomous Driving with BEV-Perception Siyi Lu et.al. 2508.06074 null
2025-08-07 VISTA: Vision-Language Imitation of Situational Thinking and Attention for Human-Like Driver Focus in Dynamic Environments Kaiser Hamid et.al. 2508.05852 null
2025-08-07 SMOL-MapSeg: Show Me One Label Yunshuang Yuan et.al. 2508.05501 null
2025-08-07 Physical Adversarial Camouflage through Gradient Calibration and Regularization Jiawei Liang et.al. 2508.05414 null
2025-08-07 DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model Rui Yu et.al. 2508.05402 null
2025-08-07 ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models Yatong Lan et.al. 2508.05236 null
2025-08-07 PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems Qi Guo et.al. 2508.05167 null
2025-08-07 AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics Stella Su et.al. 2508.04955 null
2025-08-06 Occupancy Learning with Spatiotemporal Memory Ziyang Leng et.al. 2508.04705 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case Baihui Xiao et.al. 2508.04642 null
2025-08-06 Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark Xiao Wang et.al. 2508.04260 null
2025-08-06 DRIVE: Dynamic Rule Inference and Verified Evaluation for Constraint-Aware Autonomous Driving Longling Geng et.al. 2508.04066 null
2025-08-05 LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences Ao Liang et.al. 2508.03692 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-13 MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention Qi Xie et.al. 2508.03034 null
2025-08-04 Context-aware Risk Assessment and Its Application in Autonomous Driving Boyang Tian et.al. 2508.02919 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera Byeonggyu Park et.al. 2508.02348 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 Test-Time Model Adaptation for Quantized Neural Networks Zeshuai Deng et.al. 2508.02180 null
2025-08-04 Beyond RGB and Events: Enhancing Object Detection under Adverse Lighting with Monocular Normal Maps Mingjie Liu et.al. 2508.02127 null
2025-08-04 Mapillary Vistas Validation for Fine-Grained Traffic Signs: A Benchmark Revealing Vision-Language Model Limitations Sparsh Garg et.al. 2508.02047 null
2025-08-20 Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving Tianyuan Zhang et.al. 2508.02028 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-03 StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding Haolin Yang et.al. 2508.01875 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving Luqi Cheng et.al. 2508.01704 null
2025-08-03 Adverse Weather-Independent Framework Towards Autonomous Driving Perception through Temporal Correlation and Unfolded Regularization Wei-Bin Kou et.al. 2508.01583 null
2025-08-02 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Zhan Shi et.al. 2508.01197 null
2025-08-01 CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception Chenyi Wang et.al. 2508.01062 null
2025-08-12 Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance Fengze Yang et.al. 2508.01057 null
2025-07-31 Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems Shiyao Sang et.al. 2508.00947 null
2025-08-01 Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR Adwait Chandorkar et.al. 2508.00744 null
2025-08-12 Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving Stefan Englmeier et.al. 2508.00589 null
2025-08-01 Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection Marc Hölle et.al. 2508.00587 null
2025-08-01 Pro2Guard: Proactive Runtime Enforcement of LLM Agent Safety via Probabilistic Model Checking Haoyu Wang et.al. 2508.00500 null
2025-08-01 Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence Danzhen Fu et.al. 2508.00299 null
2025-07-21 AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks Ahmet Melih Ince et.al. 2508.00011 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation Yuchen Zhou et.al. 2507.23599 null
2025-08-09 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving Yi Zhang et.al. 2507.23540 null
2025-07-31 MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting Xingyue Peng et.al. 2507.23340 null
2025-07-31 Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision Qiang Lu et.al. 2507.23331 null
2025-07-31 FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models Yiming Yang et.al. 2507.23325 null
2025-08-02 FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning Jiajun Cao et.al. 2507.23318 null
2025-08-04 PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving Xuewei Tang et.al. 2507.23309 null
2025-07-30 Causal-Inspired Multi-Agent Decision-Making via Graph Reinforcement Learning Jing Wang et.al. 2507.23080 null
2025-08-05 Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints Santosh Patapati et.al. 2507.23064 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-08-07 Bayesian Optimization applied for accelerated Virtual Validation of the Autonomous Driving Function Satyesh Shanker Awasthi et.al. 2507.22769 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation Jiuming Liu et.al. 2507.22454 null
2025-07-30 Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators Kaustav Chakraborty et.al. 2507.22389 null
2025-07-29 Hierarchical Game-Based Multi-Agent Decision-Making for Autonomous Vehicles Mushuang Liu et.al. 2507.21941 null
2025-07-31 MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors Shouyi Lu et.al. 2507.21872 null
2025-07-29 SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking Qianxiong Xu et.al. 2507.21732 null
2025-08-16 Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition Ruiyang Hao et.al. 2507.21610 null
2025-07-29 SafeDriveRAG: Towards Safe Autonomous Driving with Knowledge Graph-based Retrieval-Augmented Generation Hao Ye et.al. 2507.21585 null
2025-07-30 No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering Linye Wei et.al. 2507.21572 null
2025-07-29 RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors Tianhui Cai et.al. 2507.21567 null
2025-07-29 SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity Xingyang Li et.al. 2507.21499 null
2025-07-29 MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving Thomas Monninger et.al. 2507.21423 null
2025-08-03 Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy Jicheng Yuan et.al. 2507.21358 null
2025-07-25 Seeing Beyond Frames: Zero-Shot Pedestrian Intention Prediction with Raw Temporal Video and Multimodal Cues Pallavi Zambare et.al. 2507.21161 null
2025-07-28 GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction Tianhao Li et.al. 2507.20963 null
2025-07-25 Event-Based De-Snowing for Autonomous Driving Manasi Muglikar et.al. 2507.20901 null
2025-07-28 DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception Weicheng Zheng et.al. 2507.20879 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving Levente Tempfli et.al. 2507.20397 null
2025-07-27 Solving Scene Understanding for Autonomous Navigation in Unstructured Environments Naveen Mathews Renji et.al. 2507.20389 null
2025-07-27 VLMPlanner: Integrating Visual Language Models with Motion Planning Zhipeng Tang et.al. 2507.20342 null
2025-07-27 MambaMap: Online Vectorized HD Map Construction using State Space Model Ruizi Yang et.al. 2507.20224 null
2025-07-27 LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks Fei Kong et.al. 2507.20174 null
2025-07-27 Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning Ziyi Liang et.al. 2507.20089 null
2025-07-26 Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application Tongjie Li et.al. 2507.19974 null
2025-08-12 DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes Rishav Kumar et.al. 2507.19912 null
2025-07-26 Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA Ahmed Abouelazm et.al. 2507.19883 null
2025-07-26 FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving Tao Lian et.al. 2507.19881 null
2025-07-30 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-26 A 4D Radar Camera Extrinsic Calibration Tool Based on 3D Uncertainty Perspective N Points Chuan Cao et.al. 2507.19829 null
2025-07-25 PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction Haichuan Li et.al. 2507.19701 null
2025-07-25 Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing Haichuan Li et.al. 2507.19691 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-07-25 An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles Matthias Weiß et.al. 2507.19446 null
2025-07-25 SDVDiag: A Modular Platform for the Diagnosis of Connected Vehicle Functions Matthias Weiß et.al. 2507.19403 null
2025-07-25 BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving Felix Brandstaetter et.al. 2507.19370 null
2025-07-25 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences Yusuke Hirota et.al. 2507.19362 null
2025-07-25 SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence Viktar Dubovik et.al. 2507.19321 null
2025-07-25 CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception Jiaru Zhong et.al. 2507.19239 null
2025-07-25 VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions Haoang Lu et.al. 2507.19188 null
2025-07-25 Continual Learning-Based Unified Model for Unpaired Image Restoration Tasks Kotha Kartheek et.al. 2507.19184 null
2025-07-25 Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL Ahmed Abouelazm et.al. 2507.19146 null
2025-07-31 PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction Yanghong Liu et.al. 2507.19119 null
2025-07-25 Fine-Grained Traffic Inference from Road to Lane via Spatio-Temporal Graph Node Generation Shuhao Li et.al. 2507.19089 null
2025-07-25 HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback Elham Soltani Kazemi et.al. 2507.18921 null
2025-07-24 Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving Keshav Gupta et.al. 2507.18763 null
2025-07-24 Linear Memory SE(2) Invariant Attention Ethan Pronovost et.al. 2507.18597 null
2025-07-24 GaussianFusionOcc: A Seamless Sensor Fusion Approach for 3D Occupancy Prediction Using 3D Gaussians Tomislav Pavković et.al. 2507.18522 null
2025-07-24 Delving into Mapping Uncertainty for Mapless Trajectory Prediction Zongzheng Zhang et.al. 2507.18498 null
2025-07-24 Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments Xiao Yang et.al. 2507.18484 null
2025-07-24 CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting Haoran Xu et.al. 2507.18473 null
2025-07-24 LONG3R: Long Sequence Streaming 3D Reconstruction Zhuoguang Chen et.al. 2507.18255 null
2025-07-24 GenAI for Automotive Software Development: From Requirements to Wheels Nenad Petrovic et.al. 2507.18223 null
2025-07-24 Goal-based Trajectory Prediction for improved Cross-Dataset Generalization Daniel Grimm et.al. 2507.18196 null
2025-07-24 Policy Disruption in Reinforcement Learning: Adversarial Attack with Large Language Models and Critical State Identification Junyong Jiang et.al. 2507.18113 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-23 Reusing Attention for One-stage Lane Topology Understanding Yang Li et.al. 2507.17617 null
2025-07-23 InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling Xiaoxue Chen et.al. 2507.17613 null
2025-07-24 PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving Maciej K. Wozniak et.al. 2507.17596 null
2025-07-23 SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving Chuang Chen et.al. 2507.17479 null
2025-07-23 VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization Sania Waheed et.al. 2507.17455 null
2025-07-23 Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning Joobin Jin et.al. 2507.17418 null
2025-08-06 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study Mandar Pitale et.al. 2507.17118 null
2025-07-22 SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction Zaipeng Duan et.al. 2507.17083 null
2025-07-22 Few-Shot Learning in Video and 3D Object Detection: A Survey Md Meftahul Ferdaus et.al. 2507.17079 null
2025-07-22 Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach Adithya Mohan et.al. 2507.17070 null
2025-07-22 Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption Keneni W. Tesema et.al. 2507.16743 null
2025-07-22 Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control Zongzheng Zhang et.al. 2507.16645 null
2025-07-22 A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System Lorenzo Gentilini et.al. 2507.16621 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-22 A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization Yifan Zhang et.al. 2507.16177 null
2025-07-21 Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity Huiling Yang et.al. 2507.15601 null
2025-07-21 Robots for Kiwifruit Harvesting and Pollination Jamie Bell et.al. 2507.15484 null
2025-07-21 VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Haichao Liu et.al. 2507.15266 null
2025-07-20 CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning Pan Hu et.al. 2507.14903 null
2025-07-23 GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving Chi Wan et.al. 2507.14456 null
2025-07-18 Preference-based Multi-Objective Reinforcement Learning Ni Mu et.al. 2507.14066 null
2025-07-18 Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors Jochen Wulf et.al. 2507.14034 null
2025-07-18 Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection Yujian Mo et.al. 2507.13899 null
2025-07-18 Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation Max van den Hoven et.al. 2507.13857 null
2025-07-18 One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion Haoang Lu et.al. 2507.13801 null
2025-07-18 AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework Yu Yao et.al. 2507.13729 null
2025-07-17 CaSTFormer: Causal Spatio-Temporal Transformer for Driving Intention Prediction Sirui Wang et.al. 2507.13425 null
2025-07-16 From Binary to Semantic: Utilizing Large-Scale Binary Occupancy Data for 3D Semantic Occupancy Prediction Chihiro Noguchi et.al. 2507.13387 null
2025-07-17 Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models Arian Mousakhan et.al. 2507.13162 null
2025-07-17 Channel-wise Motion Features for Efficient Motion Segmentation Riku Inoue et.al. 2507.13082 null
2025-07-23 LaViPlan: Language-Guided Visual Path Planning with RLVR Hayeon Oh et.al. 2507.12911 null
2025-07-17 World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving Yanchen Guan et.al. 2507.12762 null
2025-07-17 Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation Yanchen Guan et.al. 2507.12755 null
2025-07-16 ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving Yuhang Lu et.al. 2507.12499 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models Santosh Vasa et.al. 2507.12414 null
2025-07-21 AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving Jiawei Xu et.al. 2507.12137 null
2025-07-16 LidarPainter: One-Step Away From Any Lidar View To Novel Guidance Yuzhou Ji et.al. 2507.12114 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers Mohammed Hassanin et.al. 2507.11852 null
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-15 A Survey on Interpretability in Visual Recognition Qiyang Wan et.al. 2507.11099 null
2025-07-14 RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding Benjamin Stoler et.al. 2507.10749 null
2025-07-14 Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance Kyungtae Han et.al. 2507.10500 null

Traffic Simulation

Publish Date Title Authors PDF Code
2025-10-07 Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion Vaughn Gzenda et.al. 2510.05957 null
2025-10-07 Stable Robot Motions on Manifolds: Learning Lyapunov-Constrained Neural Manifold ODEs David Boetius et.al. 2510.05707 null
2025-10-06 Efficient Probabilistic Planning with Maximum-Coverage Distributionally Robust Backward Reachable Trees Alex Rose et.al. 2510.04807 null
2025-10-06 Hands-Free Heritage: Automated 3D Scanning for Cultural Heritage Digitization Javed Ahmad et.al. 2510.04781 null
2025-10-06 Building Gradient by Gradient: Decentralised Energy Functions for Bimanual Robot Assembly Alexander L. Mitchell et.al. 2510.04696 null
2025-10-06 MobRT: A Digital Twin-Based Framework for Scalable Learning in Mobile Manipulation Yilin Mei et.al. 2510.04592 null
2025-10-05 Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction Yuhao Luo et.al. 2510.04365 null
2025-10-05 Integrated Planning and Control on Manifolds: Factor Graph Representation and Toolkit Peiwen Yang et.al. 2510.04278 null
2025-10-04 COVER: COverage-VErified Roadmaps for Fixed-time Motion Planning in Continuous Semi-Static Environments Niranjan Kumar Ilampooranan et.al. 2510.03875 null
2025-10-04 Trajectory prediction for heterogeneous agents: A performance analysis on small and imbalanced datasets Tiago Rodrigues de Almeida et.al. 2510.03776 null
2025-10-03 Shape-Space Graphs: Fast and Collision-Free Path Planning for Soft Robots Carina Veil et.al. 2510.03547 null
2025-10-03 Distributed Connectivity Maintenance and Recovery for Quadrotor Motion Planning Yutong Wang et.al. 2510.03504 null
2025-10-03 Warm-Starting Optimization-Based Motion Planning for Robotic Manipulators via Point Cloud-Conditioned Flow Matching Sibo Tian et.al. 2510.03460 null
2025-09-30 A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety Shucheng Zhang et.al. 2510.03314 null
2025-10-03 Long-Term Human Motion Prediction Using Spatio-Temporal Maps of Dynamics Yufei Zhu et.al. 2510.03031 null
2025-10-03 Point Cloud-Based Control Barrier Functions for Model Predictive Control in Safety-Critical Navigation of Autonomous Mobile Robots Faduo Liang et.al. 2510.02885 null
2025-10-03 A Trajectory Generator for High-Density Traffic and Diverse Agent-Interaction Scenarios Ruining Yang et.al. 2510.02627 null
2025-10-02 SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian Splatting Sung-Yeon Park et.al. 2510.02469 null
2025-10-02 ERUPT: An Open Toolkit for Interfacing with Robot Motion Planners in Extended Reality Isaac Ngui et.al. 2510.02464 null
2025-10-02 Symskill: Symbol and Skill Co-Invention for Data-Efficient and Real-Time Long-Horizon Manipulation Yifei Simon Shao et.al. 2510.01661 null
2025-10-01 Safe Motion Planning and Control Using Predictive and Adaptive Barrier Methods for Autonomous Surface Vessels Alejandro Gonzalez-Garcia et.al. 2510.01357 null
2025-10-01 From Seeing to Predicting: A Vision-Language Framework for Trajectory Forecasting and Controlled Video Generation Fan Yang et.al. 2510.00806 null
2025-10-01 From Human Hands to Robot Arms: Manipulation Skills Transfer via Trajectory Alignment Han Zhou et.al. 2510.00491 null
2025-10-01 Conflict-Based Search as a Protocol: A Multi-Agent Motion Planning Protocol for Heterogeneous Agents, Solvers, and Independent Tasks Rishi Veerapaneni et.al. 2510.00425 null
2025-10-01 EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy Observations Jiayi Liu et.al. 2510.00405 null
2025-09-30 A Systematic Study of Large Language Models for Task and Motion Planning With PDDLStream Jorge Mendez-Mendez et.al. 2510.00182 null
2025-10-03 Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving Sheng Yang et.al. 2510.00060 null
2025-09-30 The Trajectory Bundle Method: Unifying Sequential-Convex Programming and Sampling-Based Trajectory Optimization Kevin Tracy et.al. 2509.26575 null
2025-09-30 Learning from Hallucinating Critical Points for Navigation in Dynamic Environments Saad Abdul Ghani et.al. 2509.26513 null
2025-09-30 Kinodynamic Motion Planning for Mobile Robot Navigation across Inconsistent World Models Eric R. Damm et.al. 2509.26339 null
2025-09-30 Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors Amelie Minji Kim et.al. 2509.25685 null
2025-09-29 Parallel Heuristic Search as Inference for Actor-Critic Reinforcement Learning Models Hanlan Yang et.al. 2509.25402 null
2025-09-29 SRMP: Search-Based Robot Motion Planning Library Itamar Mishani et.al. 2509.25352 null
2025-09-29 Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator Da Saem Lee et.al. 2509.24995 null
2025-09-29 Trajectory Prediction via Bayesian Intention Inference under Unknown Goals and Kinematics Shunan Yin et.al. 2509.24928 null
2025-09-29 Learning to Sample: Reinforcement Learning-Guided Sampling for Autonomous Vehicle Motion Planning Korbinian Moller et.al. 2509.24313 null
2025-09-29 Towards Tighter Convex Relaxation of Mixed-integer Programs: Leveraging Logic Network Flow for Task and Motion Planning Xuan Lin et.al. 2509.24235 null
2025-09-29 ViReSkill: Vision-Grounded Replanning with Skill Memory for LLM-Based Planning in Lifelong Robot Learning Tomoyuki Kagaya et.al. 2509.24219 null
2025-09-29 Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse-view Videos Yingdong Hu et.al. 2509.24209 null
2025-09-29 A Novel Model for 3D Motion Planning for a Generalized Dubins Vehicle with Pitch and Yaw Rate Constraints Deepak Prakash Kumar et.al. 2509.24143 null
2025-09-28 Hazy Pedestrian Trajectory Prediction via Physical Priors and Graph-Mamba Jian Chen et.al. 2509.24020 null
2025-09-28 Advancing Multi-agent Traffic Simulation via R1-Style Reinforcement Fine-Tuning Muleilan Pei et.al. 2509.23993 null
2025-09-28 VFSI: Validity First Spatial Intelligence for Constraint-Guided Traffic Diffusion Kargi Chauhan et.al. 2509.23971 null
2025-09-28 DA-MMP: Learning Coordinated and Accurate Throwing with Dynamics-Aware Motion Manifold Primitives Chi Chu et.al. 2509.23721 null
2025-09-27 Distributed Multi-Robot Multi-Target Simultaneous Search and Tracking in an Unknown Non-convex Environment Jun Chen et.al. 2509.23308 null
2025-09-26 Empart: Interactive Convex Decomposition for Converting Meshes to Parts Brandon Vu et.al. 2509.22847 null
2025-09-26 Towards Developing Standards and Guidelines for Robot Grasping and Manipulation Pipelines in the COMPARE Ecosystem Huajing Zhao et.al. 2509.22801 null
2025-09-26 Self-driving cars: Are we there yet? Merve Atasever et.al. 2509.22754 null
2025-10-03 An Intention-driven Lane Change Framework Considering Heterogeneous Dynamic Cooperation in Mixed-traffic Environment Xiaoyun Qiu et.al. 2509.22550 null
2025-09-26 An Adaptive ICP LiDAR Odometry Based on Reliable Initial Pose Qifeng Wang et.al. 2509.22058 null
2025-09-25 DroneFL: Federated Learning for Multi-UAV Visual Target Tracking Xiaofan Yu et.al. 2509.21523 null
2025-09-25 Multi-Robot Vision-Based Task and Motion Planning for EV Battery Disassembly and Sorting Abdelaziz Shaarawy et.al. 2509.21020 null
2025-09-24 BBoE: Leveraging Bundle of Edges for Kinodynamic Bidirectional Motion Planning Srikrishna Bangalore Raghu et.al. 2509.20333 null
2025-09-24 Parse-Augment-Distill: Learning Generalizable Bimanual Visuomotor Policies from Single Human Video Georgios Tziafas et.al. 2509.20286 null
2025-09-23 Look as You Leap: Planning Simultaneous Motion and Perception for High-DOF Robots Qingxi Meng et.al. 2509.19610 null
2025-09-23 Agentic Scene Policies: Unifying Space, Semantics, and Affordances for Robot Action Sacha Morin et.al. 2509.19571 null
2025-09-23 Distributionally Robust Safe Motion Planning with Contextual Information Kaizer Rahaman et.al. 2509.18666 null
2025-09-23 PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving Chengran Yuan et.al. 2509.18609 null
2025-09-22 BlurBall: Joint Ball and Motion Blur Estimation for Table Tennis Ball Tracking Thomas Gossard et.al. 2509.18387 null
2025-09-22 Haptic Communication in Human-Human and Human-Robot Co-Manipulation Katherine H. Allen et.al. 2509.18327 null
2025-09-22 SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model Xiao Zhou et.al. 2509.17850 null
2025-09-22 Learning Dexterous Manipulation with Quantized Hand State Ying Feng et.al. 2509.17450 null
2025-09-22 Fast Trajectory Planner with a Reinforcement Learning-based Controller for Robotic Manipulators Yongliang Wang et.al. 2509.17381 null
2025-09-21 CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving Ruiguo Zhong et.al. 2509.17080 null
2025-09-19 Dynamic Objects Relocalization in Changing Environments with Flow Matching Francesco Argenziano et.al. 2509.16398 null
2025-09-19 AdaSports-Traj: Role- and Domain-Aware Adaptation for Multi-Agent Trajectory Modeling in Sports Yi Xu et.al. 2509.16095 null
2025-09-19 CoPAD: Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios Kangyu Wu et.al. 2509.15984 null
2025-09-19 Enriched Feature Representation and Motion Prediction Module for MOSEv2 Track of 7th LSVOS Challenge: 3rd Place Solution Chang Soo Lim et.al. 2509.15781 null
2025-09-19 ORB: Operating Room Bot, Automating Operating Room Logistics through Mobile Manipulation Jinkai Qiu et.al. 2509.15600 null
2025-09-18 Trust-Aware Embodied Bayesian Persuasion for Mixed-Autonomy Shaoting Peng et.al. 2509.15404 null
2025-09-18 Out-of-Sight Trajectories: Tracking, Fusion, and Prediction Haichao Zhang et.al. 2509.15219 null
2025-09-17 FlowDrive: Energy Flow Field for End-to-End Autonomous Driving Hao Jiang et.al. 2509.14303 null
2025-09-17 Language Conditioning Improves Accuracy of Aircraft Goal Prediction in Untowered Airspace Sundhar Vinodh Sangeetha et.al. 2509.14063 null
2025-09-17 Repulsive Trajectory Modification and Conflict Resolution for Efficient Multi-Manipulator Motion Planning Junhwa Hong et.al. 2509.13882 null
2025-09-17 CDFlow: Generative Gradient Flows for Configuration Space Distance Fields via Neural ODEs Mengzhu Li et.al. 2509.13771 null
2025-09-16 Dynamic Aware: Adaptive Multi-Mode Out-of-Distribution Detection for Trajectory Prediction in Autonomous Vehicles Tongfei Guo et.al. 2509.13577 null
2025-09-16 Trajectory Tracking with Reachability-Guided Quadratic Programming and Freeze-Resume Hossein Gholampour et.al. 2509.13501 null
2025-09-16 Weakly and Self-Supervised Class-Agnostic Motion Prediction for Autonomous Driving Ruibo Li et.al. 2509.13116 null
2025-09-16 Bridging Perception and Planning: Towards End-to-End Planning for Signal Temporal Logic Tasks Bowen Ye et.al. 2509.12813 null
2025-09-15 DYNAMO: Dependency-Aware Deep Learning Framework for Articulated Assembly Motion Prediction Mayank Patel et.al. 2509.12430 null
2025-09-15 Learning Contact Dynamics for Control with Action-conditioned Face Interaction Graph Networks Zongyao Yi et.al. 2509.12151 null
2025-09-14 Embodied Intelligence in Disassembly: Multimodal Perception Cross-validation and Continual Learning in Neuro-Symbolic TAMP Ziwen He et.al. 2509.11270 null
2025-09-14 SAMP: Spatial Anchor-based Motion Policy for Collision-Aware Robotic Manipulators Kai Chen et.al. 2509.11185 null
2025-09-14 End-to-End Visual Autonomous Parking via Control-Aided Attention Chao Chen et.al. 2509.11090 null
2025-09-28 Follow-Bench: A Unified Motion Planning Benchmark for Socially-Aware Robot Person Following Hanjing Ye et.al. 2509.10796 null
2025-09-12 STL-Based Motion Planning and Uncertainty-Aware Risk Analysis for Human-Robot Collaboration with a Multi-Rotor Aerial Vehicle Giuseppe Silano et.al. 2509.10692 null
2025-09-11 Large Foundation Models for Trajectory Prediction in Autonomous Driving: A Comprehensive Survey Wei Dai et.al. 2509.10570 null
2025-09-12 Coordinated Motion Planning of a Wearable Multi-Limb System for Enhanced Human-Robot Interaction Chaerim Moon et.al. 2509.10444 null
2025-09-17 DECAMP: Towards Scene-Consistent Multi-Agent Motion Prediction with Disentangled Context-Aware Pre-Training Jianxin Shi et.al. 2509.10426 null
2025-09-12 HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario Saeed Saadatnejad et.al. 2509.10096 null
2025-09-12 BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird’s-Eye View with Deformable Attention and Sparse Goal Proposals Minsang Kong et.al. 2509.10080 null
2025-09-11 BagIt! An Adaptive Dual-Arm Manipulation of Fabric Bags for Object Bagging Peng Zhou et.al. 2509.09484 null
2025-09-11 ProgD: Progressive Multi-scale Decoding with Dynamic Graphs for Joint Multi-agent Motion Forecasting Xing Gao et.al. 2509.09210 null
2025-09-11 MGTraj: Multi-Granularity Goal-Guided Human Trajectory Prediction with Recursive Refinement Network Ge Sun et.al. 2509.09200 null
2025-09-11 KoopMotion: Learning Almost Divergence Free Koopman Flow Fields for Motion Planning Alice Kate Li et.al. 2509.09074 null
2025-09-11 Joint Model-based Model-free Diffusion for Planning with Constraints Wonsuhk Jung et.al. 2509.08775 null
2025-09-10 Dual-Stage Safe Herding Framework for Adversarial Attacker in Dynamic Environment Wenqing Wang et.al. 2509.08460 null
2025-09-09 Diffusion-Guided Multi-Arm Motion Planning Viraj Parimi et.al. 2509.08160 null
2025-09-09 Decoding RobKiNet: Insights into Efficient Training of Robotic Kinematics Informed Neural Network Yanlong Peng et.al. 2509.07646 null
2025-09-09 Safe and Non-Conservative Contingency Planning for Autonomous Vehicles via Online Learning-Based Reachable Set Barriers Rui Yang et.al. 2509.07464 null
2025-09-08 First Plan Then Evaluate: Use a Vectorized Motion Planner for Grasping Martin Matak et.al. 2509.07162 null
2025-09-08 Deep Reactive Policy: Learning Reactive Manipulator Motion Planning for Dynamic Environments Jiahui Yang et.al. 2509.06953 null
2025-09-08 Safe Robust Predictive Control-based Motion Planning of Automated Surface Vessels in Inland Waterways Sajad Ahmadi et.al. 2509.06687 null
2025-09-05 RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning Matthew Lai et.al. 2509.05397 null
2025-09-02 INF-3DP: Implicit Neural Fields for Collision-Free Multi-Axis 3D Printing Jiasheng Qu et.al. 2509.05345 null
2025-09-01 Anticipatory Fall Detection in Humans with Hybrid Directed Graph Neural Networks and Long Short-Term Memory Younggeol Cho et.al. 2509.05337 null
2025-09-04 SAFE–MA–RRT: Multi-Agent Motion Planning with Data-Driven Safety Certificates Babak Esmaeili et.al. 2509.04413 null
2025-09-04 Lightweight Kinematic and Static Modeling of Cable-Driven Continuum Robots via Actuation-Space Energy Formulation Ke Wu et.al. 2509.04119 null
2025-09-16 Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot Lennart Clasmeier et.al. 2509.04076 null
2025-09-04 Human Motion Video Generation: A Survey Haiwei Xue et.al. 2509.03883 null
2025-09-03 SAM-LLM: Interpretable Lane Change Trajectory Prediction via Parametric Finetuning Zhuo Cao et.al. 2509.03462 null
2025-09-03 KEPT: Knowledge-Enhanced Prediction of Trajectories from Consecutive Driving Frames with Vision-Language Models Yujin Wang et.al. 2509.02966 null
2025-09-02 Systematic Evaluation of Trade-Offs in Motion Planning Algorithms for Optimal Industrial Robotic Work Cell Design G. de Mathelin et.al. 2509.02146 null
2025-09-01 Multi-vessel Interaction-Aware Trajectory Prediction and Collision Risk Assessment Md Mahbub Alam et.al. 2509.01836 null
2025-09-01 Articulated Object Estimation in the Wild Abdelrhman Werby et.al. 2509.01708 null
2025-09-01 MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation Zhenyu Wu et.al. 2509.01658 null
2025-09-01 A Hybrid Input based Deep Reinforcement Learning for Lane Change Decision-Making of Autonomous Vehicle Ziteng Gao et.al. 2509.01611 null
2025-09-01 Metamorphic Testing of Multimodal Human Trajectory Prediction Helge Spieker et.al. 2509.01294 null
2025-09-17 Hierarchical Reactive Grasping via Task-Space Velocity Fields and Joint-Space Quadratic Programming Yonghyeon Lee et.al. 2509.01044 null
2025-09-17 One-Step Model Predictive Path Integral for Manipulator Motion Planning Using Configuration Space Distance Fields Yulin Li et.al. 2509.00836 null
2025-09-06 An Effective Trajectory Planning and an Optimized Path Planning for a 6-Degree-of-Freedom Robot Manipulator Takumu Okazaki et.al. 2509.00828 null
2025-08-30 Vehicle-in-Virtual-Environment (VVE) Method for Developing and Evaluating VRU Safety of Connected and Autonomous Driving with Focus on Bicyclist Safety Haochong Chen et.al. 2509.00624 null
2025-08-30 NeuralSVCD for Efficient Swept Volume Collision Detection Dongwon Son et.al. 2509.00499 null
2025-08-30 A Framework for Task and Motion Planning based on Expanding AND/OR Graphs Fulvio Mastrogiovanni et.al. 2509.00317 null
2025-08-26 Hybrid Perception and Equivariant Diffusion for Robust Multi-Node Rebar Tying Zhitao Wang et.al. 2509.00065 null
2025-08-29 Robust Convex Model Predictive Control with collision avoidance guarantees for robot manipulators Bernhard Wullt et.al. 2508.21677 null
2025-08-29 Dynamics-Compliant Trajectory Diffusion for Super-Nominal Payload Manipulation Anuj Pasricha et.al. 2508.21375 null
2025-08-29 Multi-Modal Model Predictive Path Integral Control for Collision Avoidance Alberto Bertipaglia et.al. 2508.21364 null
2025-08-29 Learning to Assemble the Soma Cube with Legal-Action Masked DQN and Safe ZYZ Regrasp on a Doosan M0609 Jaehong Oh et.al. 2508.21272 null
2025-08-27 ScanMove: Motion Prediction and Transfer for Unregistered Body Meshes Thomas Besnier et.al. 2508.21095 null
2025-09-04 HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning Zhi Su et.al. 2508.21043 null
2025-09-05 Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees Yaniv Hassidof et.al. 2508.21001 null
2025-08-28 Deep Fuzzy Optimization for Batch-Size and Nearest Neighbors in Optimal Robot Motion Planning Liding Zhang et.al. 2508.20884 null
2025-08-28 Uncertainty Aware-Predictive Control Barrier Functions: Safer Human Robot Interaction through Probabilistic Motion Forecasting Lorenzo Busellato et.al. 2508.20812 null
2025-08-28 CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network Reza Akbari Movahed et.al. 2508.20734 null
2025-08-27 Regulation-Aware Game-Theoretic Motion Planning for Autonomous Racing Francesco Prignoli et.al. 2508.20203 null
2025-08-27 Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning Jinhao Liang et.al. 2508.20095 null
2025-08-27 APT*: Asymptotically Optimal Motion Planning via Adaptively Prolated Elliptical R-Nearest Neighbors Liding Zhang et.al. 2508.19790 null
2025-08-27 Tree-Based Grafting Approach for Bidirectional Motion Planning with Local Subsets Optimization Liding Zhang et.al. 2508.19776 null
2025-08-27 Elliptical K-Nearest Neighbors – Path Optimization via Coulomb’s Law and Invalid Vertices in C-space Obstacles Liding Zhang et.al. 2508.19771 null
2025-08-27 Autonomous Aerial Manipulation at Arbitrary Pose in SE(3) with Robust Control and Whole-body Planning Dongjae Lee et.al. 2508.19608 null
2025-09-16 Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning Antonio Guillen-Perez et.al. 2508.18397 null
2025-08-26 FlowVLA: Thinking in Motion with a Visual Chain of Thought Zhide Zhong et.al. 2508.18269 null
2025-08-25 Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction Yunxiang Liu et.al. 2508.17797 null
2025-08-23 LLM-based Human-like Traffic Simulation for Self-driving Tests Wendi Li et.al. 2508.16962 null
2025-08-23 Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model Fan Ding et.al. 2508.16947 null
2025-08-21 Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation Huy Hoang Nguyen et.al. 2508.15427 null
2025-08-20 TRUST-Planner: Topology-guided Robust Trajectory Planner for AAVs with Uncertain Obstacle Spatial-temporal Avoidance Junzhi Li et.al. 2508.14610 null
2025-08-20 FiReFly: Fair Distributed Receding Horizon Planning for Multiple UAVs Nicole Fronda et.al. 2508.14381 null
2025-08-16 Task and Motion Planning for Humanoid Loco-manipulation Michal Ciebielski et.al. 2508.14099 null
2025-08-20 Accelerating Signal-Temporal-Logic-Based Task and Motion Planning of Bipedal Navigation using Benders Decomposition Jiming Ren et.al. 2508.13407 null
2025-08-18 BOW: Bayesian Optimization over Windows for Motion Planning in Complex Environments Sourav Raxit et.al. 2508.13052 null
2025-08-28 On the complexity of constrained reconfiguration and motion planning Nicolas Bousquet et.al. 2508.13032 null
2025-08-31 SocialTrack: Multi-Object Tracking in Complex Urban Traffic Scenes Inspired by Social Behavior Wenguang Tao et.al. 2508.12777 null
2025-08-17 Autonomous Oil Spill Response Through Liquid Neural Trajectory Modeling and Coordinated Marine Robotics Hadas C. Kuzmenko et.al. 2508.12456 null
2025-08-17 EgoLoc: A Generalizable Solution for Temporal Interaction Localization in Egocentric Videos Junyi Ma et.al. 2508.12349 null
2025-08-15 A Comparative Study of Floating-Base Space Parameterizations for Agile Whole-Body Motion Planning Evangelos Tsiatsianas et.al. 2508.11520 null
2025-08-15 Relative Position Matters: Trajectory Prediction and Planning with Polar Representation Bozhou Zhang et.al. 2508.11492 null
2025-08-15 EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback Jiayue Jin et.al. 2508.11453 null
2025-08-15 ReachVox: Clutter-free Reachability Visualization for Robot Motion Planning in Virtual Reality Steffen Hauck et.al. 2508.11426 null
2025-08-15 Learning Differentiable Reachability Maps for Optimization-based Humanoid Motion Generation Masaki Murooka et.al. 2508.11275 null
2025-08-15 A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving Jialin Li et.al. 2508.11218 null
2025-08-20 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation Nikolaos Gkanatsios et.al. 2508.11002 null
2025-08-14 SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving Philipp Wolters et.al. 2508.10567 null
2025-08-14 STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes Keishi Ishihara et.al. 2508.10427 null
2025-08-12 CLF-RL: Control Lyapunov Function Guided Reinforcement Learning Kejun Li et.al. 2508.09354 null
2025-08-10 Whole-Body Coordination for Dynamic Object Grasping with Legged Manipulators Qiwei Liang et.al. 2508.08328 null
2025-08-11 Learning an Implicit Physics Model for Image-based Fluid Simulation Emily Yue-Ting Jia et.al. 2508.08254 null
2025-08-10 A Learning-Based Framework for Collision-Free Motion Planning Mateus Salomão et.al. 2508.07502 null
2025-08-10 Noise-Aware Generative Microscopic Traffic Simulation Vindula Jayawardana et.al. 2508.07453 null
2025-08-10 Bio-Inspired Topological Autonomous Navigation with Active Inference in Robotics Daria de Tinguy et.al. 2508.07267 null
2025-08-12 Understanding Dynamic Scenes in Ego Centric 4D Point Clouds Junsheng Huang et.al. 2508.07251 null
2025-08-10 CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion Xiaotong Lin et.al. 2508.07162 null
2025-08-10 Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction Yu Liu et.al. 2508.07146 null
2025-08-09 ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting Sandro Papais et.al. 2508.07089 null
2025-08-09 Model Predictive Control for Crowd Navigation via Learning-Based Trajectory Prediction Mohamed Parvez Aslam et.al. 2508.07079 null
2025-08-05 Historical Prediction Attention Mechanism based Trajectory Forecasting for Proactive Work Zone Safety in a Digital Twin Environment Minhaj Uddin Ahmad et.al. 2508.06544 null
2025-08-04 Symbolic Learning of Interpretable Reduced-Order Models for Jumping Quadruped Robots Gioele Buriani et.al. 2508.06538 null
2025-08-08 V*: An Efficient Motion Planning Algorithm for Autonomous Vehicles Abdullah Zareh Andaryan et.al. 2508.06404 null
2025-08-08 Incremental Language Understanding for Online Motion Planning of Robot Manipulators Mitchell Abrams et.al. 2508.06095 null
2025-08-08 Dynamical Trajectory Planning of Disturbance Consciousness for Air-Land Bimodal Unmanned Aerial Vehicles Shaoting Liu et.al. 2508.05972 null
2025-08-07 TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution Zhikai Zhao et.al. 2508.05616 null
2025-08-07 Benchmarking Shortcutting Techniques for Multi-Robot-Arm Motion Planning Philip Huang et.al. 2508.05027 null
2025-08-06 LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan et.al. 2508.04847 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 Incorporating Stochastic Models of Controller Behavior into Kinodynamic Efficiently Adaptive State Lattices for Mobile Robot Motion Planning in Off-Road Environments Eric R. Damm et.al. 2508.04384 null
2025-08-06 Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction Yu Liu et.al. 2508.04229 null
2025-08-11 Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems Luai Abuelsamen et.al. 2508.04146 null
2025-08-05 Constraint-Preserving Data Generation for Visuomotor Policy Learning Kevin Lin et.al. 2508.03944 null
2025-08-05 Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions Ergi Tushe et.al. 2508.03541 null
2025-08-04 X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio Chenxu Zhang et.al. 2508.02944 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering Xu Wang et.al. 2508.02362 null
2025-08-19 Adaptive Lattice-based Motion Planning Abhishek Dhar et.al. 2508.02350 null
2025-08-04 Framework for Robust Motion Planning of Tethered Multi-Robot Systems in Marine Environments Markus Buchholz et.al. 2508.02287 null
2025-08-04 AID4AD: Aerial Image Data for Automated Driving Perception Daniel Lengerer et.al. 2508.02140 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction Hua Yu et.al. 2508.01585 null
2025-07-29 A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles Jiayuan Wang et.al. 2508.00917 null
2025-08-01 On Learning Closed-Loop Probabilistic Multi-Agent Simulator Juanwu Lu et.al. 2508.00384 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-07-31 Data-Driven Motion Planning for Uncertain Nonlinear Systems Babak Esmaeili et.al. 2508.00154 null
2025-07-31 OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction Yang Gao et.al. 2507.23657 null
2025-07-31 A Framework for Ethical Decision-Making in Automated Vehicles through Human Reasons-based Supervision Lucas Elbert Suryana et.al. 2507.23308 null
2025-07-31 Simulation-based planning of Motion Sequences for Automated Procedure Optimization in Multi-Robot Assembly Cells Loris Schneider et.al. 2507.23270 null
2025-08-01 Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future Guoping Xu et.al. 2507.22792 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators Kaustav Chakraborty et.al. 2507.22389 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 VLMPlanner: Integrating Visual Language Models with Motion Planning Zhipeng Tang et.al. 2507.20342 null
2025-07-27 PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks Clinton Ansun Mo et.al. 2507.20170 null
2025-07-25 PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction Haichuan Li et.al. 2507.19701 null
2025-07-25 RAKOMO: Reachability-Aware K-Order Markov Path Optimization for Quadrupedal Loco-Manipulation Mattia Risiglione et.al. 2507.19652 null
2025-07-25 High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins Lorenzo Cazzella et.al. 2507.19173 null
2025-07-31 PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction Yanghong Liu et.al. 2507.19119 null
2025-07-24 Probabilistic Collision Risk Estimation through Gauss-Legendre Cubature and Non-Homogeneous Poisson Processes Trent Weiss et.al. 2507.18819 null
2025-07-24 Delving into Mapping Uncertainty for Mapless Trajectory Prediction Zongzheng Zhang et.al. 2507.18498 null
2025-07-24 Goal-based Trajectory Prediction for improved Cross-Dataset Generalization Daniel Grimm et.al. 2507.18196 null
2025-07-24 DanceGraph: A Complementary Architecture for Synchronous Dancing Online David Sinclair et.al. 2507.18052 null
2025-07-23 Safety Assurance for Quadrotor Kinodynamic Motion Planning Theodoros Tavoulareas et.al. 2507.17679 null
2025-07-23 IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception Haichuan Li et.al. 2507.17445 null
2025-08-06 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 Falconry-like palm landing by a flapping-wing drone based on the human gesture interaction and distance-aware flight planning Kazuki Numazato et.al. 2507.17144 null
2025-07-22 RAPTAR: Radar Radiation Pattern Acquisition through Automated Collaborative Robotics Maaz Qureshi et.al. 2507.16988 null
2025-07-21 Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection Zihao Chen et.al. 2507.16109 null
2025-07-21 Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction Shiyang Li et.al. 2507.15832 null
2025-07-21 Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs Ruochu Yang et.al. 2507.15782 null
2025-07-21 Selective Densification for Rapid Motion Planning in High Dimensions with Narrow Passages Lu Huang et.al. 2507.15710 null
2025-07-21 A Universal Vehicle-Trailer Navigation System with Neural Kinematics and Online Residual Learning Yanbo Chen et.al. 2507.15607 null
2025-07-21 VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Haichao Liu et.al. 2507.15266 null
2025-07-20 Search-Based Autonomous Vehicle Motion Planning Using Game Theory Pouya Panahandeh et.al. 2507.15088 null
2025-07-20 CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning Pan Hu et.al. 2507.14903 null
2025-07-18 Context-Aware Behavior Learning with Heuristic Motion Memory for Underwater Manipulation Markus Buchholz et.al. 2507.14099 null
2025-07-18 NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning Qingyi Chen et.al. 2507.13940 null
2025-07-18 Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification Sihang Wei et.al. 2507.13613 null
2025-08-08 Trustworthy Pedestrian Trajectory Prediction via Pattern-Aware Interaction Modeling Kaiyuan Zhai et.al. 2507.13397 null
2025-07-25 Signal Temporal Logic Compliant Co-design of Planning and Control Manas Sashank Juvvi et.al. 2507.13225 null
2025-07-22 Predictability-Aware Motion Prediction for Edge XR via High-Order Error-State Kalman Filtering Ziyu Zhong et.al. 2507.13179 null
2025-07-17 Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning Giwon Lee et.al. 2507.12977 null
2025-07-17 FFI-VTR: Lightweight and Robust Visual Teach and Repeat Navigation based on Feature Flow Indicator and Probabilistic Motion Planning Jikai Wang et.al. 2507.12800 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios Van-Hoang-Anh Phan et.al. 2507.12449 null
2025-07-16 Regrasp Maps for Sequential Manipulation Planning Svetlana Levit et.al. 2507.12407 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 A Fast Method for Planning All Optimal Homotopic Configurations for Tethered Robots and Its Extended Applications Jinyuan Liu et.al. 2507.11880 null
2025-07-15 MPC-based Coarse-to-Fine Motion Planning for Robotic Object Transportation in Cluttered Environments Chen Cai et.al. 2507.11211 null
2025-07-15 Enhancing Autonomous Manipulator Control with Human-in-loop for Uncertain Assembly Environments Ashutosh Mishra et.al. 2507.11006 null
2025-07-15 OffsetCrust: Variable-Radius Offset Approximation with Power Diagrams Zihan Zhao et.al. 2507.10924 null
2025-07-15 Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets Savva Morozov et.al. 2507.10878 null
2025-07-14 A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments Yuchen Wang et.al. 2507.10792 null
2025-07-23 Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis Yue Ding et.al. 2507.10382 null
2025-07-16 TOP: Trajectory Optimization via Parallel Optimization towards Constant Time Complexity Jiajun Yu et.al. 2507.10290 null
2025-07-14 MP-RBFN: Learning-based Vehicle Motion Primitives using Radial Basis Function Networks Marc Kaufeld et.al. 2507.10047 null
2025-07-22 Active Probing with Multimodal Predictions for Motion Planning Darshan Gadginmath et.al. 2507.09822 null
2025-07-13 Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions Yuanhong Zheng et.al. 2507.09446 null
2025-07-12 Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields Wondmgezahu Teshome et.al. 2507.09383 null
2025-07-19 Informed Hybrid Zonotope-based Motion Planning Algorithm Peng Xie et.al. 2507.09309 null
2025-07-12 Integrating Planning and Predictive Control Using the Path Feasibility Governor Shu Zhang et.al. 2507.09134 null
2025-07-09 Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination Xishun Liao et.al. 2507.08871 null
2025-07-14 STRAP: Spatial-Temporal Risk-Attentive Vehicle Trajectory Prediction for Autonomous Driving Xinyi Ning et.al. 2507.08563 null
2025-07-11 Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer Francesco De Cristofaro et.al. 2507.08365 null
2025-07-11 Neural Parameter-varying Data-enabled Predictive Control of Cold Atmospheric Pressure Plasma Jets Pegah GhafGhanbari et.al. 2507.08259 null
2025-07-10 GGMotion: Group Graph Dynamics-Kinematics Networks for Human Motion Prediction Shuaijin Wan et.al. 2507.07515 null
2025-07-10 Towards Safe Autonomous Driving: A Real-Time Safeguarding Concept for Motion Planning Algorithms Korbinian Moller et.al. 2507.07444 null
2025-07-09 When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior Chengyuan Zhang et.al. 2507.07012 null
2025-07-09 Robust signal decompositions on the circle Aral Kose et.al. 2507.07007 null
2025-07-09 ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture Mingjin Zeng et.al. 2507.06531 null
2025-07-08 AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization Deepak Raina et.al. 2507.05979 null
2025-07-08 DRO-EDL-MPC: Evidential Deep Learning-Based Distributionally Robust Model Predictive Control for Safe Autonomous Driving Hyeongchan Ham et.al. 2507.05710 null
2025-07-07 From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving Fabian Konstantinidis et.al. 2507.05254 null
2025-07-07 Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance Tobias Demmler et.al. 2507.05098 null
2025-07-07 Unifying Robot Optimization: Monte Carlo Tree Search with Tensor Factorization Teng Xue et.al. 2507.04949 null
2025-07-25 Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning Giwon Lee et.al. 2507.04790 null
2025-07-07 LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction Yixin Yan et.al. 2507.04634 null
2025-07-06 Free-Space Optical Communication-Driven NMPC Framework for Multi-Rotor Aerial Vehicles in Structured Inspection Scenarios Giuseppe Silano et.al. 2507.04443 null
2025-07-05 Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic Jianwei Tang et.al. 2507.04062 null
2025-07-05 Temporal Continual Learning with Prior Compensation for Human Motion Prediction Jianwei Tang et.al. 2507.04060 null
2025-07-05 DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments Qi Chen et.al. 2507.03878 null
2025-07-05 Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs Ishan Khurjekar et.al. 2507.03863 null
2025-07-04 Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues Hanfang Liang et.al. 2507.03365 null
2025-07-03 Trajectory Optimization for Differential Drive Mobile Manipulators via Topological Paths Search and Arc Length-Yaw Parameterization Long Xu et.al. 2507.02761 null
2025-07-03 Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization Caio Azevedo et.al. 2507.02406 null
2025-07-03 Path Planning using a One-shot-sampling Skeleton Map Gabriel O. Flores-Aquino et.al. 2507.02328 null
2025-07-02 GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters Wanjia Zhao et.al. 2507.02085 null
2025-07-09 Test-Time Scaling with Reflective Generative Model Zixiao Wang et.al. 2507.01951 null
2025-07-06 AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction Bin Rao et.al. 2507.01801 null
2025-07-02 Efficient Collision Detection for Long and Slender Robotic Links in Euclidean Distance Fields: Application to a Forestry Crane Marc-Philip Ecker et.al. 2507.01705 null
2025-07-02 LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction Muhammad Atta ur Rahman et.al. 2507.01308 null
2025-07-01 Search-Based Robot Motion Planning With Distance-Based Adaptive Motion Primitives Benjamin Kraljusic et.al. 2507.01198 null
2025-07-01 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations Ying Guo et.al. 2507.00472 null
2025-06-30 Rethink 3D Object Detection from Physical World Satoshi Tanaka et.al. 2507.00190 null
2025-06-30 Epona: Autoregressive Diffusion World Model for Autonomous Driving Kaiwen Zhang et.al. 2506.24113 null
2025-06-30 STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems Mingfei Cheng et.al. 2506.23995 null
2025-06-29 InfGen: Scenario Generation as Next Token Group Prediction Zhenghao Peng et.al. 2506.23316 null
2025-06-29 Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models Maarten Hugenholtz et.al. 2506.23164 null
2025-06-28 Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example Bei Zhou et.al. 2506.22894 null
2025-06-27 Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD Ruthvik Bokkasam et.al. 2506.22111 null
2025-06-27 A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments Akshay Jaitly et.al. 2506.21982 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-07-14 Ark: An Open-source Python-based Framework for Robot Learning Magnus Dierking et.al. 2506.21628 null
2025-06-26 GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction Muleilan Pei et.al. 2506.21121 null
2025-06-25 Near Time-Optimal Hybrid Motion Planning for Timber Cranes Marc-Philip Ecker et.al. 2506.20314 null
2025-06-24 Trajectory Prediction in Dynamic Object Tracking: A Critical Study Zhongping Dong et.al. 2506.19341 null
2025-06-25 AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation Ziyan Zhao et.al. 2506.19269 null
2025-08-04 Faster Motion Planning via Restarts Nancy Amato et.al. 2506.19016 null
2025-06-23 SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives Yizhou Chen et.al. 2506.18825 null
2025-06-23 Design, fabrication and control of a cable-driven parallel robot Dhruv Sorathiya et.al. 2506.18526 null
2025-06-23 Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances Zhe Zhang et.al. 2506.18410 null
2025-06-23 Selective Social-Interaction via Individual Importance for Fast Human Trajectory Prediction Yota Urano et.al. 2506.18291 null
2025-06-23 Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning Yue Li et.al. 2506.18234 null
2025-06-20 Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation Xiuyu Yang et.al. 2506.17213 null
2025-06-20 Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control Albert H. Li et.al. 2506.17184 null
2025-07-11 Experimental Setup and Software Pipeline to Evaluate Optimization based Autonomous Multi-Robot Search Algorithms Aditya Bhatt et.al. 2506.16710 null