Updated on 2025.08.29

This page is maintained by Leheng Li and lists papers he is interested in. The source code of this site is available here.

3D

Publish Date Title Authors PDF Code
2025-08-28 Multi-View 3D Point Tracking Frano Rajič et.al. 2508.21060 null
2025-08-28 ActLoc: Learning to Localize on the Move via Active Viewpoint Selection Jiajie Li et.al. 2508.20981 null
2025-08-28 DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes Yajiao Xiong et.al. 2508.20965 null
2025-08-28 PLUME: Procedural Layer Underground Modeling Engine Gabriel Manuel Garcia et.al. 2508.20926 null
2025-08-28 Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation Krit Duangprom et.al. 2508.20830 null
2025-08-28 Surfel-based 3D Registration with Equivariant SE(3) Features Xueyang Kang et.al. 2508.20789 null
2025-08-28 SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding Jiawen Lin et.al. 2508.20758 null
2025-08-28 CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network Reza Akbari Movahed et.al. 2508.20734 null
2025-08-28 Task-Oriented Edge-Assisted Cross-System Design for Real-Time Human-Robot Interaction in Industrial Metaverse Kan Chen et.al. 2508.20664 null
2025-08-28 AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images Shiqi Xin et.al. 2508.20623 null
2025-08-28 Optimization-Based Calibration for Intravascular Ultrasound Volume Reconstruction Karl-Philippe Beaudet et.al. 2508.20605 null
2025-08-28 Embracing Aleatoric Uncertainty: Generating Diverse 3D Human Motion Zheng Qin et.al. 2508.20604 null
2025-08-28 GLaRE: A Graph-based Landmark Region Embedding Network for Emotion Recognition Debasis Maji et.al. 2508.20579 null
2025-08-28 Enhancing Pseudo-Boxes via Data-Level LiDAR-Camera Fusion for Unsupervised 3D Object Detection Mingqian Ji et.al. 2508.20530 null
2025-08-28 Adam SLAM - the last mile of camera calibration with 3DGS Matthieu Gendrin et.al. 2508.20526 null
2025-08-28 IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection Xuanming Cao et.al. 2508.20492 null
2025-08-28 Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts Zixuan Hu et.al. 2508.20488 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-28 Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation Xiaochuan Li et.al. 2508.20470 null
2025-08-28 Prediction of Distant Metastasis for Head and Neck Cancer Patients Using Multi-Modal Tumor and Peritumoral Feature Fusion Network Zizhao Tang et.al. 2508.20469 null
2025-08-27 MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces Zhen Xuen Brandon Low et.al. 2508.20256 null
2025-08-27 Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study Max Torop et.al. 2508.20188 null
2025-08-27 Is the medical image segmentation problem solved? A survey of current developments and future directions Guoping Xu et.al. 2508.20139 null
2025-08-26 A Machine Learning Approach to Volumetric Computations of Solid Pulmonary Nodules Yihan Zhou et.al. 2508.20127 null
2025-08-27 Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images Changha Shin et.al. 2508.20080 null
2025-08-27 OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations Peng-Hao Hsu et.al. 2508.20063 null
2025-08-27 Visio-Verbal Teleimpedance Interface: Enabling Semi-Autonomous Control of Physical Interaction via Eye Tracking and Speech Henk H. A. Jekel et.al. 2508.20037 null
2025-08-27 Integrating SAM Supervision for 3D Weakly Supervised Point Cloud Segmentation Lechun You et.al. 2508.19909 null
2025-08-27 Multispectral LiDAR data for extracting tree points in urban and suburban areas Narges Takhtkeshha et.al. 2508.19881 null
2025-08-27 Multimodal Conditional MeshGAN for Personalized Aneurysm Growth Prediction Long Chen et.al. 2508.19862 null
2025-08-27 MAPo : Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction Han Jiao et.al. 2508.19786 null
2025-08-27 FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers Yue Wu et.al. 2508.19754 null
2025-08-27 LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation Yupeng Zhang et.al. 2508.19699 null
2025-08-27 SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction Gangjian Zhang et.al. 2508.19688 null
2025-08-27 Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception Yang Li et.al. 2508.19638 null
2025-08-27 Generalizing Monocular 3D Object Detection Abhinav Kumar et.al. 2508.19593 null
2025-08-27 DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View Tian Qiu et.al. 2508.19508 null
2025-08-25 2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks Utsav Ratna Tuladhar et.al. 2508.19303 null
2025-08-25 CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy Cunmin Zhao et.al. 2508.19300 null
2025-08-25 Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation Alexandros Gkillas et.al. 2508.19290 null
2025-08-26 VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space Lin Li et.al. 2508.19247 null
2025-08-26 Articulate3D: Zero-Shot Text-Driven 3D Object Posing Oishi Deb et.al. 2508.19244 null
2025-08-26 Style4D-Bench: A Benchmark Suite for 4D Stylization Beiqi Chen et.al. 2508.19243 null
2025-08-26 LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding Julian Ost et.al. 2508.19204 null
2025-08-26 Dual Enhancement on 3D Vision-Language Perception for Monocular 3D Visual Grounding Yuzhen Li et.al. 2508.19165 null
2025-08-26 Random forest-based out-of-distribution detection for robust lung cancer segmentation Aneesh Rangnekar et.al. 2508.19112 null
2025-08-26 GReAT: leveraging geometric artery data to improve wall shear stress assessment Julian Suk et.al. 2508.19030 null
2025-08-26 RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation Siyuan You et.al. 2508.19003 null
2025-08-26 Can we make NeRF-based visual localization privacy-preserving? Maxime Pietrantoni et.al. 2508.18971 null
2025-08-26 PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads Shashikant Verma et.al. 2508.18944 null
2025-08-26 ColorGS: High-fidelity Surgical Scene Reconstruction with Colored Gaussian Splatting Qun Ji et.al. 2508.18696 null
2025-08-26 AgriChrono: A Multi-modal Dataset Capturing Crop Growth and Lighting Variability with a Field Robot Jaehwan Jeong et.al. 2508.18694 null
2025-08-26 ROSE: Remove Objects with Side Effects in Videos Chenxuan Miao et.al. 2508.18633 null
2025-08-26 SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis Xiaohao Sun et.al. 2508.18597 null
2025-08-25 Real-time 3D Visualization of Radiance Fields on Light Field Displays Jonghyun Kim et.al. 2508.18540 null
2025-08-25 Adaptive Visual Navigation Assistant in 3D RPGs Kaijie Xu et.al. 2508.18539 null
2025-08-25 SAT-SKYLINES: 3D Building Generation from Satellite Imagery and Coarse Geometric Priors Zhangyu Jin et.al. 2508.18531 null
2025-08-25 DoGFlow: Self-Supervised LiDAR Scene Flow via Cross-Modal Doppler Guidance Ajinkya Khoche et.al. 2508.18506 null
2025-08-25 FastAvatar: Instant 3D Gaussian Splatting for Faces from Single Unconstrained Poses Hao Liang et.al. 2508.18389 null
2025-08-23 SERES: Semantic-aware neural reconstruction from sparse views Bo Xu et.al. 2508.18314 null
2025-08-22 Towards Training-Free Underwater 3D Object Detection from Sonar Point Clouds: A Comparison of Traditional and Deep Learning Approaches M. Salman Shaukat et.al. 2508.18293 null
2025-08-25 ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models Haitang Feng et.al. 2508.18271 null
2025-08-25 GSVisLoc: Generalizable Visual Localization for Gaussian Splatting Scene Representations Fadi Khatib et.al. 2508.18242 null
2025-08-21 PriorFormer: A Transformer for Real-time Monocular 3D Human Pose Estimation with Versatile Geometric Priors Mohamed Adjel et.al. 2508.18238 null
2025-08-25 Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance Ayce Idil Aytekin et.al. 2508.18213 null
2025-08-25 EventTracer: Fast Path Tracing-based Event Stream Rendering Zhenyang Li et.al. 2508.18071 null
2025-08-25 Topology Aware Neural Interpolation of Scalar Fields Mohamed Kissi et.al. 2508.17995 null
2025-08-25 SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization Junyuan Deng et.al. 2508.17972 null
2025-08-25 A holistic perception system of internal and external monitoring for ground autonomous vehicles: AutoTRUST paradigm Alexandros Gkillas et.al. 2508.17969 null
2025-08-25 Beam Geometry and Input Dimensionality: Impact on Sparse-Sampling Artifact Correction for Clinical CT with U-Nets Tina Dorosti et.al. 2508.17961 null
2025-08-25 EndoUFM: Utilizing Foundation Models for Monocular depth estimation of endoscopic images Xinning Yao et.al. 2508.17916 null
2025-08-25 Camera Pose Refinement via 3D Gaussian Splatting Lulu Hao et.al. 2508.17876 null
2025-08-25 HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation Xiping Wang et.al. 2508.17832 null
2025-08-25 CubeDN: Real-time Drone Detection in 3D Space from Dual mmWave Radar Cubes Yuan Fang et.al. 2508.17831 null
2025-08-25 MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting Hanzhi Chang et.al. 2508.17811 null
2025-08-25 DroneKey: Drone 3D Pose Estimation in Image Sequences using Gated Key-representation and Pose-adaptive Learning Seo-Bin Hwang et.al. 2508.17746 null
2025-08-25 MEVITA: Open-Source Bipedal Robot Assembled from E-Commerce Components via Sheet Metal Welding Kento Kawaharazuka et.al. 2508.17684 null
2025-08-28 Generating Human-AI Collaborative Design Sequence for 3D Assets via Differentiable Operation Graph Xiaoyang Huang et.al. 2508.17645 null
2025-08-25 Wound3DAssist: A Practical Framework for 3D Wound Assessment Remi Chierchia et.al. 2508.17635 null
2025-08-25 GWM: Towards Scalable Gaussian World Models for Robotic Manipulation Guanxing Lu et.al. 2508.17600 null
2025-08-25 TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints Vinh-Thuan Ly et.al. 2508.17595 null
2025-08-25 IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data Meida Chen et.al. 2508.17579 null
2025-08-24 Random-phase Gaussian Wave Splatting for Computer-generated Holography Brian Chao et.al. 2508.17480 null
2025-08-24 Investigating Domain Gaps for Indoor 3D Object Detection Zijing Zhao et.al. 2508.17439 null
2025-08-20 Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels Long Le et.al. 2508.17437 null
2025-08-24 MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling Haoyu Wang et.al. 2508.17404 null
2025-08-26 PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation Xiaoyang Hao et.al. 2508.17239 null
2025-08-24 4D Visual Pre-training for Robot Learning Chengkai Hou et.al. 2508.17230 null
2025-08-24 VROOM - Visual Reconstruction over Onboard Multiview Yajat Yadav et.al. 2508.17172 null
2025-08-23 DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method Qingwen Zhang et.al. 2508.17054 null
2025-08-23 PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models Xianjing Cheng et.al. 2508.17050 null
2025-08-23 M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments Dmitry Yudin et.al. 2508.17044 null
2025-08-23 DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration Jiayi Li et.al. 2508.17034 null
2025-08-23 Fiducial Marker Splatting for High-Fidelity Robotics Simulations Diram Tabaa et.al. 2508.17012 null
2025-08-23 A Survey of Deep Learning-based Point Cloud Denoising Jinxi Wang et.al. 2508.17011 null
2025-08-23 Align 3D Representation and Text Embedding for 3D Content Personalization Qi Song et.al. 2508.16932 null
2025-08-23 Structural Energy-Guided Sampling for View-Consistent Text-to-3D Qing Zhang et.al. 2508.16917 null
2025-08-23 MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation Prerit Gupta et.al. 2508.16911 null
2025-08-23 Relative Navigation and Dynamic Target Tracking for Autonomous Underwater Proximity Operations David Baxter et.al. 2508.16901 null
2025-08-23 Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network Pouya Shiri et.al. 2508.16897 null
2025-08-23 A Workflow for Map Creation in Autonomous Vehicle Simulations Zubair Islam et.al. 2508.16856 null
2025-08-22 Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes Xinhao Xiang et.al. 2508.16812 null
2025-08-21 BrainPath: Generating Subject-Specific Brain Aging Trajectories Yifan Li et.al. 2508.16667 null
2025-08-22 MV-RAG: Retrieval Augmented Multiview Diffusion Yosef Dayani et.al. 2508.16577 null
2025-08-22 Real-time 3D Light-field Viewing with Eye-tracking on Conventional Displays Trung Hieu Pham et.al. 2508.16535 null
2025-08-26 Comparative Analysis of UAV Path Planning Algorithms for Efficient Navigation in Urban 3D Environments Hichem Cheriet et.al. 2508.16515 null
2025-08-22 On Kinodynamic Global Planning in a Simplicial Complex Environment: A Mixed Integer Approach Otobong Jerome et.al. 2508.16511 null
2025-08-22 Arbitrary-Scale 3D Gaussian Super-Resolution Huimin Zeng et.al. 2508.16467 null
2025-08-25 HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images Anilkumar Swamy et.al. 2508.16465 null
2025-08-22 HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction Sara Rojas et.al. 2508.16433 null
2025-08-22 SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather Edoardo Palladin et.al. 2508.16408 null
2025-08-22 Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars NVIDIA et.al. 2508.16401 null
2025-08-22 Self-Validated Learning for Particle Separation: A Correctness-Based Self-Training Framework Without Human Labels Philipp D. Lösel et.al. 2508.16224 null
2025-08-22 4D Virtual Imaging Platform for Dynamic Joint Assessment via Uni-Plane X-ray and 2D-3D Registration Hao Tang et.al. 2508.16138 null
2025-08-22 Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables Wontae Kim et.al. 2508.16121 null
2025-08-22 A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection Qifeng Liu et.al. 2508.16069 null
2025-08-22 Advances and Trends in the 3D Reconstruction of the Shape and Motion of Animals Ziqi Li et.al. 2508.16062 null
2025-08-22 NeuralMeshing: Complete Object Mesh Extraction from Casual Captures Floris Erich et.al. 2508.16026 null
2025-08-21 Self-Aligning EPM Connector: A Versatile Solution for Adaptive and Multi-Modal Interfaces Bingchao Wang et.al. 2508.16008 null
2025-08-21 GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System Hung-Jui Huang et.al. 2508.15990 null
2025-08-21 UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation Zhaodong Jiang et.al. 2508.15972 null
2025-08-21 Text-Driven 3D Hand Motion Generation from Sign Language Data Léore Bensabath et.al. 2508.15902 null
2025-08-21 Active Prostate Phantom with Multiple Chambers Sizhe Tian et.al. 2508.15873 null
2025-08-21 SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass Yanxu Meng et.al. 2508.15769 null
2025-08-21 ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling Jinhyung Park et.al. 2508.15767 null
2025-08-21 CM2LoD3: Reconstructing LoD3 Building Models Using Semantic Conflict Maps Franz Hanke et.al. 2508.15672 null
2025-08-25 Hessian-Based Lightweight Neural Network HessNet for State-of-the-Art Brain Vessel Segmentation on a Minimal Training Dataset Alexandra Bernadotte et.al. 2508.15660 null
2025-08-21 Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance Shuchao Pang et.al. 2508.15650 null
2025-08-21 Fast globally optimal Truncated Least Squares point cloud registration with fixed rotation axis Ivo Ivanov et.al. 2508.15613 null
2025-08-21 Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising Jin Ye et.al. 2508.15553 null
2025-08-21 MExECON: Multi-view Extended Explicit Clothed humans Optimized via Normal integration Fulden Ece Uğur et.al. 2508.15500 null
2025-08-21 Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework Zongqi He et.al. 2508.15457 null
2025-08-25 DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians Cong Wang et.al. 2508.15376 null
2025-08-21 Image-Conditioned 3D Gaussian Splat Quantization Xinshuang Liu et.al. 2508.15372 null
2025-08-21 RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features Olga Matykina et.al. 2508.15353 null
2025-08-21 Mag-Match: Magnetic Vector Field Features for Map Matching and Registration William McDonald et.al. 2508.15300 null
2025-08-21 BasketLiDAR: The First LiDAR-Camera Multimodal Dataset for Professional Basketball MOT Ryunosuke Hayashi et.al. 2508.15299 null
2025-08-21 Collaborative Multi-Modal Coding for High-Quality 3D Generation Ziang Cao et.al. 2508.15228 null
2025-08-25 MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion Xuyang Chen et.al. 2508.15169 null
2025-08-21 Reliable Multi-view 3D Reconstruction for ‘Just-in-time’ Edge Environments Md. Nurul Absur et.al. 2508.15158 null
2025-08-21 Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors Jeonghyun Noh et.al. 2508.15151 null
2025-08-20 Virtual Community: An Open World for Humans, Robots, and Society Qinhong Zhou et.al. 2508.14893 null
2025-08-20 Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds Jia Lu et.al. 2508.14892 null
2025-08-20 GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects Licheng Shen et.al. 2508.14891 null
2025-08-22 MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Bingquan Dai et.al. 2508.14879 null
2025-08-20 Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization Canyu Zhao et.al. 2508.14811 null
2025-08-20 Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels Fabian Holst et.al. 2508.14767 null
2025-08-20 GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting Jiaxin Wei et.al. 2508.14717 null
2025-08-20 GeMS: Efficient Gaussian Splatting for Extreme Motion Blur Gopi Raju Matta et.al. 2508.14682 null
2025-08-20 UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling Peiming Li et.al. 2508.14604 null
2025-08-20 Safety-Critical Learning for Long-Tail Events: The TUM Traffic Accident Dataset Walter Zimmer et.al. 2508.14567 null
2025-08-20 GOGS: High-Fidelity Geometry and Relighting for Glossy Objects via Gaussian Surfels Xingyuan Yang et.al. 2508.14563 null
2025-08-20 Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization Sukhyun Jeong et.al. 2508.14561 null
2025-08-20 From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound Max Krähenmann et.al. 2508.14552 null
2025-08-20 LookOut: Real-World Humanoid Egocentric Navigation Boxiao Pan et.al. 2508.14466 null
2025-08-20 D^3-Talker: Dual-Branch Decoupled Deformation Fields for Few-Shot 3D Talking Head Synthesis Yuhang Guo et.al. 2508.14449 null
2025-08-20 Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting Gyusam Chang et.al. 2508.14443 null
2025-08-20 HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation Bing Han et.al. 2508.14431 null
2025-08-20 Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation Zhujun Li et.al. 2508.14358 null
2025-08-19 Pixels to Play: A Foundation Model for 3D Gameplay Yuguang Yue et.al. 2508.14295 null
2025-08-21 GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting Elena Alegret et.al. 2508.14278 null
2025-08-19 Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning Said Djafar Said et.al. 2508.14276 null
2025-08-19 SLAM-based Safe Indoor Exploration Strategy Omar Mostafa et.al. 2508.14235 null
2025-08-19 RynnEC: Bringing MLLMs into Embodied World Ronghao Dang et.al. 2508.14160 null
2025-08-19 Automated surgical planning with nnU-Net: delineation of the anatomy in hepatobiliary phase MRI Karin A. Olthof et.al. 2508.14133 null
2025-08-18 3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models Jolanta Mozyrska et.al. 2508.14122 null
2025-08-19 LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Chin-Yang Lin et.al. 2508.14041 null
2025-08-19 Distilled-3DGS:Distilled 3D Gaussian Splatting Lintao Xiang et.al. 2508.14037 null
2025-08-19 GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation Ken Deng et.al. 2508.14036 null
2025-08-19 Online 3D Gaussian Splatting Modeling with Novel View Selection Byeonggwon Lee et.al. 2508.14014 null
2025-08-19 ResPlan: A Large-Scale Vector-Graph Dataset of 17,000 Residential Floor Plans Mohamed Abouagour et.al. 2508.14006 null
2025-08-19 Self-Supervised Sparse Sensor Fusion for Long Range Perception Edoardo Palladin et.al. 2508.13995 null
2025-08-19 Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment Samuel Seligardi et.al. 2508.13989 null
2025-08-19 OmViD: Omni-supervised active learning for video action detection Aayush Rana et.al. 2508.13983 null
2025-08-19 ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving Xianda Guo et.al. 2508.13977 null
2025-08-19 Augmenting cobots for sheet-metal SMEs with 3D object recognition and localisation Martijn Cramer et.al. 2508.13964 null
2025-08-19 Real-Time, Population-Based Reconstruction of 3D Bone Models via Very-Low-Dose Protocols Yiqun Lin et.al. 2508.13947 null
2025-08-19 PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis Chunji Lv et.al. 2508.13911 null
2025-08-21 Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction Niklas Bubeck et.al. 2508.13826 null
2025-08-19 Is-NeRF: In-scattering Neural Radiance Field for Blurred Images Nan Luo et.al. 2508.13808 null
2025-08-19 Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing Feng-Lin Liu et.al. 2508.13797 null
2025-08-19 VisionLaw: Inferring Interpretable Intrinsic Dynamics from Visual Observations via Bilevel Optimization Jiajing Lin et.al. 2508.13792 null
2025-08-19 Shape-from-Template with Generalised Camera Agniva Sengupta et.al. 2508.13791 null
2025-08-19 Blast Hole Seeking and Dipping – The Navigation and Perception Framework in a Mine Site Inspection Robot Liyang Liu et.al. 2508.13785 null
2025-08-19 Deep Biomechanically-Guided Interpolation for Keypoint-Based Brain Shift Registration Tiago Assis et.al. 2508.13762 null
2025-08-19 Unleashing Semantic and Geometric Priors for 3D Scene Completion Shiyuan Chen et.al. 2508.13601 null
2025-08-19 The 9th AI City Challenge Zheng Tang et.al. 2508.13564 null
2025-08-19 Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics Yuchen Yang et.al. 2508.13562 null
2025-08-22 FLAIR: Frequency and Locality-Aware Implicit Neural Representations Sukhun Ko et.al. 2508.13544 null
2025-08-19 EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors Shikun Zhang et.al. 2508.13537 null
2025-08-19 FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention Liangyu Fu et.al. 2508.13483 null
2025-08-18 Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction Sedigheh Dargahi et.al. 2508.13340 null
2025-08-18 InnerGS: Internal Scenes Rendering via Factorized 3D Gaussian Splatting Shuxin Liang et.al. 2508.13287 null
2025-08-17 PreSem-Surf: RGB-D Surface Reconstruction with Progressive Semantic Modeling and SG-MLP Pre-Rendering Mechanism Yuyan Ye et.al. 2508.13228 null
2025-08-18 4DNeX: Feed-Forward 4D Generative Modeling Made Easy Zhaoxi Chen et.al. 2508.13154 null
2025-08-18 IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion Wenhao Hu et.al. 2508.13153 null
2025-08-24 Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping Siddharth Khandelwal et.al. 2508.13065 null
2025-08-18 IntelliCap: Intelligent Guidance for Consistent View Sampling Ayaka Yasunaga et.al. 2508.13043 null
2025-08-18 Multi-Phase Automated Segmentation of Dental Structures in CBCT Using a Lightweight Auto3DSeg and SegResNet Implementation Dominic LaBella et.al. 2508.12962 null
2025-08-18 MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation Wei Wei et.al. 2508.12948 null
2025-08-18 Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models Jianshu Zeng et.al. 2508.12945 null
2025-08-18 CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction Zhiwei Ning et.al. 2508.12917 null
2025-08-18 CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis Jiayi Wang et.al. 2508.12900 null
2025-08-18 MCTR: Midpoint Corrected Triangulation for Autonomous Racing via Digital Twin Simulation in CARLA Junhao Ye et.al. 2508.12729 null
2025-08-18 Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting Kangjie Chen et.al. 2508.12720 null
2025-08-18 Neural Rendering for Sensor Adaptation in 3D Object Detection Felix Embacher et.al. 2508.12695 null
2025-08-18 Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection Zhongyao Li et.al. 2508.12684 null
2025-08-18 Stable Diffusion-Based Approach for Human De-Occlusion Seung Young Noh et.al. 2508.12663 null
2025-08-18 DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video Hao Wen et.al. 2508.12644 null
2025-08-18 Synthesizing Accurate and Realistic T1-weighted Contrast-Enhanced MR Images using Posterior-Mean Rectified Flow Bastian Brandstötter et.al. 2508.12640 null
2025-08-19 WIPES: Wavelet-based Visual Primitives Wenhao Zhang et.al. 2508.12615 null
2025-08-17 Segmenting Thalamic Nuclei: T1 Maps Provide a Reliable and Efficient Solution Anqi Feng et.al. 2508.12508 null
2025-08-17 FractMorph: A Fractional Fourier-Based Multi-Domain Transformer for Deformable Image Registration Shayan Kebriti et.al. 2508.12445 null
2025-08-21 TiP4GEN: Text to Immersive Panorama 4D Scene Generation Ke Xing et.al. 2508.12415 null
2025-08-19 SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes Jun Zeng et.al. 2508.12410 null
2025-08-17 Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR Fatemeh Ghorbani Lohesara et.al. 2508.12336 null
2025-08-17 Semi-Infinite Programming for Collision-Avoidance in Optimal and Model Predictive Control Yunfan Gao et.al. 2508.12335 null
2025-08-17 Improving Densification in 3D Gaussian Splatting for High-Fidelity Rendering Xiaobin Deng et.al. 2508.12313 null
2025-08-17 In vivo 3D ultrasound computed tomography of musculoskeletal tissues with generative neural physics Zhijun Zeng et.al. 2508.12226 null
2025-08-17 Splat Feature Solver Butian Xiong et.al. 2508.12216 null
2025-08-16 RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis Wenqing Wang et.al. 2508.12163 null
2025-08-16 VELVET-Med: Vision and Efficient Language Pre-training for Volumetric Imaging Tasks in Medicine Ziyang Zhang et.al. 2508.12108 null
2025-08-16 Enhancing 3D point accuracy of laser scanner through multi-stage convolutional neural network for applications in construction Qinyuan Fan et.al. 2508.12089 null
2025-08-16 VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models Haidong Xu et.al. 2508.12081 null
2025-08-16 OASIS: Real-Time Opti-Acoustic Sensing for Intervention Systems in Unstructured Environments Amy Phung et.al. 2508.12071 null
2025-08-16 InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes Hongyuan Liu et.al. 2508.12015 null
2025-08-16 UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding Yueming Xu et.al. 2508.11952 null
2025-08-16 Transferable Class Statistics and Multi-scale Feature Approximation for 3D Object Detection Hao Peng et.al. 2508.11951 null
2025-08-16 OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation Jilei Mao et.al. 2508.11898 null
2025-08-16 ComplicitSplat: Downstream Models are Vulnerable to Blackbox Attacks by 3D Gaussian Splat Camouflages Matthew Hull et.al. 2508.11854 null
2025-08-15 Towards Understanding 3D Vision: the Role of Gaussian Curvature Sherlon Almeida da Silva et.al. 2508.11825 null
2025-08-15 CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion Zhe Zhu et.al. 2508.11603 null
2025-08-15 Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting Simona Kocour et.al. 2508.11431 null
2025-08-15 RMFAT: Recurrent Multi-scale Feature Atmospheric Turbulence Mitigator Zhiming Liu et.al. 2508.11409 null
2025-08-15 G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration Ramil Khafizov et.al. 2508.11379 null
2025-08-15 AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis Zonglin Wu et.al. 2508.11375 null
2025-08-15 HOID-R1: Reinforcement Learning for Open-World Human-Object Interaction Detection Reasoning with Multimodal Large Language Model Zhenhao Zhang et.al. 2508.11350 null
2025-08-15 Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking Haonan Zhang et.al. 2508.11323 null
2025-08-15 Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction Muzammil Khan et.al. 2508.11282 null
2025-08-15 Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds Pei He et.al. 2508.11265 null
2025-08-15 Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception Junjie Wang et.al. 2508.11256 null
2025-08-15 StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation Seungmi Lee et.al. 2508.11203 null
2025-08-15 CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector Abhinav Kumar et.al. 2508.11185 null
2025-08-14 HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing Xinjie Gao et.al. 2508.11106 null
2025-08-14 Data-Driven Abdominal Phenotypes of Type 2 Diabetes in Lean, Overweight, and Obese Cohorts Lucas W. Remedios et.al. 2508.11063 null
2025-08-14 Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset Wentao Mo et.al. 2508.11058 null
2025-08-20 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation Nikolaos Gkanatsios et.al. 2508.11002 null
2025-08-12 Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction Cheng Chen et.al. 2508.10936 null
2025-08-18 HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffision Model Qi Liu et.al. 2508.10935 null
2025-08-12 ViPE: Video Pose Engine for 3D Geometric Perception Jiahui Huang et.al. 2508.10934 null
2025-08-14 Quantum Visual Fields with Neural Amplitude Encoding Shuteng Wang et.al. 2508.10900 null
2025-08-14 Puppeteer: Rig and Animate Your 3D Models Chaoyue Song et.al. 2508.10898 null
2025-08-14 Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning Mengyuan Liu et.al. 2508.10897 null
2025-08-14 STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer Yushi Lan et.al. 2508.10893 null
2025-08-14 TexVerse: A Universe of 3D Objects with High-Resolution Textures Yibo Zhang et.al. 2508.10868 null
2025-08-14 An Efficient Model-Driven Groupwise Approach for Atlas Construction Ziwei Zou et.al. 2508.10743 null
2025-08-14 Novel View Synthesis using DDIM Inversion Sehajdeep Singh et.al. 2508.10688 null
2025-08-14 Physics-Informed Joint Multi-TE Super-Resolution with Implicit Neural Representation for Robust Fetal T2 Mapping Busra Bulut et.al. 2508.10680 null
2025-08-14 DIVA-VQA: Detecting Inter-frame Variations in UGC Video Quality Xinyi Wang et.al. 2508.10605 null
2025-08-14 SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving Philipp Wolters et.al. 2508.10567 null
2025-08-15 PTQAT: A Hybrid Parameter-Efficient Quantization Algorithm for 3D Perception Tasks Xinhao Wang et.al. 2508.10557 null
2025-08-14 Multi-Sample Anti-Aliasing and Constrained Optimization for 3D Gaussian Splatting Zheng Zhou et.al. 2508.10507 null
2025-08-14 STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes Keishi Ishihara et.al. 2508.10427 null
2025-08-14 SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection Chaesong Park et.al. 2508.10411 null
2025-08-14 Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models Hyundo Lee et.al. 2508.10382 null
2025-08-14 VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation Ryota Tanaka et.al. 2508.10281 null
2025-08-14 Deep Learning for Crack Detection: A Review of Learning Paradigms, Generalizability, and Datasets Xinan Zhang et.al. 2508.10256 null
2025-08-13 EntropyGS: An Efficient Entropy Coding on 3D Gaussian Splatting Yuning Huang et.al. 2508.10227 null
2025-08-13 B-repLer: Semantic B-rep Latent Editor using Large Language Models Yilin Liu et.al. 2508.10201 null
2025-08-18 From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation Ke Niu et.al. 2508.10118 null
2025-08-13 A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation Shuting He et.al. 2508.09977 null
2025-08-13 PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image Geonhee Sim et.al. 2508.09973 null
2025-08-13 LIA-X: Interpretable Latent Portrait Animator Yaohui Wang et.al. 2508.09959 null
2025-08-13 E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras Chaoran Feng et.al. 2508.09912 null
2025-08-13 HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics Weiqi Li et.al. 2508.09858 null
2025-08-13 Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes Yuekun Wu et.al. 2508.09855 null
2025-08-13 ARI3D: A Software for Interactive Quantification of Regions in X-Ray CT 3D Images Jan Phillipp Albrecht et.al. 2508.09849 null
2025-08-13 RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians Shenxing Wei et.al. 2508.09830 null
2025-08-13 TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos Jinxi Li et.al. 2508.09811 null
2025-08-13 Automated Segmentation of Coronal Brain Tissue Slabs for 3D Neuropathology Jonathan Williams Ramirez et.al. 2508.09805 null
2025-08-13 MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention Xin Du et.al. 2508.09802 null
2025-08-13 Surg-InvNeRF: Invertible NeRF for 3D tracking and reconstruction in surgical vision Gerardo Loza et.al. 2508.09681 null
2025-08-13 GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors Xingyilang Yin et.al. 2508.09667 null
2025-08-13 Noise-adapted Neural Operator for Robust Non-Line-of-Sight Imaging Lianfang Wang et.al. 2508.09655 null
2025-08-13 TOTNet: Occlusion-Aware Temporal Tracking for Robust Ball Detection in Sports Videos Hao Xu et.al. 2508.09650 null
2025-08-13 The Brain Resection Multimodal Image Registration (ReMIND2Reg) 2025 Challenge Reuben Dorent et.al. 2508.09649 null
2025-08-13 Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors Giorgos Karvounas et.al. 2508.09629 null
2025-08-14 Semantic-aware DropSplat: Adaptive Pruning of Redundant Gaussians for 3D Aerial-View Segmentation Xu Tang et.al. 2508.09626 null
2025-08-13 MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography Daniel Barco et.al. 2508.09616 null
2025-08-13 DualPhys-GS: Dual Physically-Guided 3D Gaussian Splatting for Underwater Scene Reconstruction Jiachen Li et.al. 2508.09610 null
2025-08-15 SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing Heyi Sun et.al. 2508.09597 null
2025-08-13 CaRoBio: 3D Cable Routing with a Bio-inspired Gripper Fingernail Jiahui Zuo et.al. 2508.09558 null
2025-08-14 Iterative Volume Fusion for Asymmetric Stereo Matching Yuanting Gao et.al. 2508.09543 null
2025-08-13 SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images Xuejun Huang et.al. 2508.09479 null
2025-08-13 CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios Jialei Xu et.al. 2508.09470 null
2025-08-13 DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation Haoxiang Shi et.al. 2508.09444 null
2025-08-13 Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving Guangxun Zhu et.al. 2508.09404 null
2025-08-12 X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents Guoxian Song et.al. 2508.09383 null
2025-08-12 Gradient-Direction-Aware Density Control for 3D Gaussian Splatting Zheng Zhou et.al. 2508.09239 null
2025-08-12 Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices Ya Zou et.al. 2508.09136 null
2025-08-13 GeoVLA: Empowering 3D Representations in Vision-Language-Action Models Lin Sun et.al. 2508.09071 null
2025-08-12 A new dataset and comparison for multi-camera frame synthesis Conall Daly et.al. 2508.09068 null
2025-08-12 VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception Fuhao Chang et.al. 2508.09061 null
2025-08-12 DASC: Depth-of-Field Aware Scene Complexity Metric for 3D Visualization on Light Field Display Kamran Akbar et.al. 2508.08928 null
2025-08-12 Masked Clustering Prediction for Unsupervised Point Cloud Pre-training Bin Ren et.al. 2508.08910 null
2025-08-12 GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments Lin Zeng et.al. 2508.08867 null
2025-08-12 DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI Bo-Hsun Chen et.al. 2508.08831 null
2025-08-12 3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs Noor Ahmed et.al. 2508.08821 null
2025-08-12 MonoPartNeRF: Human Reconstruction from Monocular Video via Part-Based Neural Radiance Fields Yao Lu et.al. 2508.08798 null
2025-08-12 SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA) Trong-Thuan Nguyen et.al. 2508.08781 null
2025-08-12 ROD: RGB-Only Fast and Efficient Off-road Freespace Detection Tong Sun et.al. 2508.08697 null
2025-08-14 Yan: Foundational Interactive Video Generation Deheng Ye et.al. 2508.08601 null
2025-08-12 RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space Jingyun Liang et.al. 2508.08588 null
2025-08-12 Bio-Generative Design Morphology with Radiolaria: An application of a Nature-Based Generative Shape Grammar for Geometrical Design of Space Frames Michael Kleiss et.al. 2508.08572 null
2025-08-12 Revisiting the City Tower Project: Geometric Principles and Structural Morphology in the Works of Louis I. Kahn and Anne Tyng Aysan Mokhtarimousavi et.al. 2508.08561 null
2025-08-11 Empowering Children to Create AI-Enabled Augmented Reality Experiences Lei Zhang et.al. 2508.08467 null
2025-08-11 Enhanced Liver Tumor Detection in CT Images Using 3D U-Net and Bat Algorithm for Hyperparameter Optimization Nastaran Ghorbani et.al. 2508.08452 null
2025-08-11 ImageDDI: Image-enhanced Molecular Motif Sequence Representation for Drug-Drug Interaction Prediction Yuqin He et.al. 2508.08338 null
2025-08-11 Learning an Implicit Physics Model for Image-based Fluid Simulation Emily Yue-Ting Jia et.al. 2508.08254 null
2025-08-11 ReferSplat: Referring Segmentation in 3D Gaussian Splatting Shuting He et.al. 2508.08252 null
2025-08-11 LL3M: Large Language 3D Modelers Sining Lu et.al. 2508.08228 null
2025-08-11 SAGOnline: Segment Any Gaussians Online Wentao Sun et.al. 2508.08219 null
2025-08-11 Spatial-ORMLLM: Improve Spatial Relation Understanding in the Operating Room with Multimodal Large Language Model Peiqi He et.al. 2508.08199 null
2025-08-11 Emergent morphogenesis via planar fabrication enabled by a reduced model of composites Yupeng Zhang et.al. 2508.08198 null
2025-08-12 3D Human Mesh Estimation from Single View RGBD Ozhan Suat et.al. 2508.08178 null
2025-08-13 CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data Chongke Bi et.al. 2508.08173 null
2025-08-11 FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting Yitong Yang et.al. 2508.08136 null
2025-08-11 GRASPTrack: Geometry-Reasoned Association via Segmentation and Projection for Multi-Object Tracking Xudong Han et.al. 2508.08117 null
2025-08-11 3D Plant Root Skeleton Detection and Extraction Jiakai Lin et.al. 2508.08094 null
2025-08-11 Matrix-3D: Omnidirectional Explorable 3D World Generation Zhongqi Yang et.al. 2508.08086 null
2025-08-11 S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix Peng Dai et.al. 2508.08048 null
2025-08-11 Aerial Target Encirclement and Interception with Noisy Range Observations Fen Liu et.al. 2508.08046 null
2025-08-11 TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation Huawei Sun et.al. 2508.08038 null
2025-08-11 Mitigating Biases in Surgical Operating Rooms with Geometry Tony Danjun Wang et.al. 2508.08028 null
2025-08-11 TrackOR: Towards Personalized Intelligent Operating Rooms Through Robust Tracking Tony Danjun Wang et.al. 2508.07968 null
2025-08-11 Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection Jakub Binda et.al. 2508.07923 null
2025-08-11 Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models Johanna P. Müller et.al. 2508.07903 null
2025-08-11 NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction Tianle Zeng et.al. 2508.07897 null
2025-08-11 Autonomous Navigation of Cloud-Controlled Quadcopters in Confined Spaces Using Multi-Modal Perception and LLM-Driven High Semantic Reasoning Shoaib Ahmmad et.al. 2508.07885 null
2025-08-11 Vertex Features for Neural Global Illumination Rui Su et.al. 2508.07852 null
2025-08-11 Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images Konrad Reuter et.al. 2508.07851 null
2025-08-11 CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving Qi Xiang et.al. 2508.07838 null
2025-08-11 DiTVR: Zero-Shot Diffusion Transformer for Video Restoration Sicheng Gao et.al. 2508.07811 null
2025-08-11 Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning Bao Li et.al. 2508.07804 null
2025-08-11 MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks Yushen Xu et.al. 2508.07803 null
2025-08-11 Forecasting Continuous Non-Conservative Dynamical Systems in SO(3) Lennart Bastian et.al. 2508.07775 null
2025-08-13 Multi-view Normal and Distance Guidance Gaussian Splatting for Surface Reconstruction Bo Jia et.al. 2508.07701 null
2025-08-11 Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing Weitao Wang et.al. 2508.07700 null
2025-08-11 GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions Helong Huang et.al. 2508.07650 null
2025-08-11 Breaking Down and Building Up: Mixture of Skill-Based Vision-and-Language Navigation Agents Tianyi Ma et.al. 2508.07642 null
2025-08-11 End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy Zifan Wang et.al. 2508.07611 null
2025-08-12 Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring Ludan Zhang et.al. 2508.07552 null
2025-08-11 CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts Junuk Cha et.al. 2508.07540 null
2025-08-10 Novel View Synthesis with Gaussian Splatting: Impact on Photogrammetry Model Accuracy and Resolution Pranav Chougule et.al. 2508.07483 null
2025-08-10 CharacterShot: Controllable and Consistent 4D Character Animation Junyao Gao et.al. 2508.07409 null
2025-08-10 DIP-GS: Deep Image Prior For Gaussian Splatting Sparse View Recovery Rajaei Khatib et.al. 2508.07372 null
2025-08-10 GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction Qilin Zhang et.al. 2508.07355 null
2025-08-10 Navigation and Exploration with Active Inference: from Biology to Industry Daria de Tinguy et.al. 2508.07269 null
2025-08-10 Fading the Digital Ink: A Universal Black-Box Attack Framework for 3DGS Watermarking Systems Qingyuan Zeng et.al. 2508.07263 null
2025-08-12 Understanding Dynamic Scenes in Ego Centric 4D Point Clouds Junsheng Huang et.al. 2508.07251 null
2025-08-10 3D Gaussian Representations with Motion Trajectory Field for Dynamic Scene Reconstruction Xuesong Li et.al. 2508.07182 null
2025-08-10 CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion Xiaotong Lin et.al. 2508.07162 null
2025-08-09 DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit Aiden Swann et.al. 2508.07118 null
2025-08-09 AugLift: Boosting Generalization in Lifting-based 3D Human Pose Estimation Nikolai Warner et.al. 2508.07112 null
2025-08-09 Communication-Efficient Multi-Agent 3D Detection via Hybrid Collaboration Yue Hu et.al. 2508.07092 null
2025-08-09 ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting Sandro Papais et.al. 2508.07089 null
2025-08-09 TeSO: Representing and Compressing 3D Point Cloud Scenes with Textured Surfel Octree Yueyu Hu et.al. 2508.07083 null
2025-08-09 SAGCNet: Spatial-Aware Graph Completion Network for Missing Slice Imputation in Population CMR Imaging Junkai Liu et.al. 2508.07041 null
2025-08-09 3DGS-VBench: A Comprehensive Video Quality Evaluation Benchmark for 3DGS Compression Yuke Xing et.al. 2508.07038 null
2025-08-12 HiMat: DiT-based Ultra-High Resolution SVBRDF Generation Zixiong Wang et.al. 2508.07011 null
2025-08-09 Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments Gian Mario Favero et.al. 2508.07006 null
2025-08-09 EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events Siyu Chen et.al. 2508.07003 null
2025-08-09 Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View Ulas Gunes et.al. 2508.06968 null
2025-08-09 Hybrid Machine Learning Framework for Predicting Geometric Deviations from 3D Surface Metrology Hamidreza Samadi et.al. 2508.06845 null
2025-08-09 Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling Aarav Mehta et.al. 2508.06805 null
2025-08-09 DiffUS: Differentiable Ultrasound Rendering from Volumetric Imaging Noe Bertramo et.al. 2508.06768 null
2025-08-09 VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions Yash Garg et.al. 2508.06757 null
2025-08-08 Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video Jixuan He et.al. 2508.06715 null
2025-08-08 Fourier Optics and Deep Learning Methods for Fast 3D Reconstruction in Digital Holography Justin London et.al. 2508.06703 null
2025-08-08 CoDe-NeRF: Neural Rendering via Dynamic Coefficient Decomposition Wenpeng Xing et.al. 2508.06632 null
2025-08-08 LightSwitch: Multi-view Relighting with Material-guided Diffusion Yehonathan Litman et.al. 2508.06494 null
2025-08-08 MotionSwap Om Patil et.al. 2508.06430 null
2025-08-08 FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation Wenbin Teng et.al. 2508.06392 null
2025-08-08 ViPro-2: Unsupervised State Estimation via Integrated Dynamics for Guiding Video Prediction Patrick Takenaka et.al. 2508.06335 null
2025-08-08 L2Calib: $SE(3)$-Manifold Reinforcement Learning for Robust Extrinsic Calibration with Degenerate Motion Resilience Baorun Li et.al. 2508.06330 null
2025-08-08 Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? Xin Ci Wong et.al. 2508.06327 null
2025-08-08 Real-Time 3D Vision-Language Embedding Mapping Christian Rauch et.al. 2508.06291 null
2025-08-08 Situationally-aware Path Planning Exploiting 3D Scene Graphs Saad Ejaz et.al. 2508.06283 null
2025-08-08 XAG-Net: A Cross-Slice Attention and Skip Gating Network for 2.5D Femur MRI Segmentation Byunghyun Ko et.al. 2508.06258 null
2025-08-08 PA-HOI: A Physics-Aware Human and Object Interaction Dataset Ruiyan Wang et.al. 2508.06205 null
2025-08-08 AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection Zhaopeng Gu et.al. 2508.06203 null
2025-08-08 UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting Wenpeng Xing et.al. 2508.06169 null
2025-08-08 Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation YoungChan Choi et.al. 2508.06136 null
2025-08-12 GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving Jian Wang et.al. 2508.06113 null
2025-08-08 MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment Gui Zou et.al. 2508.06104 null
2025-08-08 Towards MR-Based Trochleoplasty Planning Michael Wehrli et.al. 2508.06076 null
2025-08-08 LV-Net: Anatomy-aware lateral ventricle shape modeling with a case study on Alzheimer’s disease, the Australian Imaging Biomarkers and Lifestyle flagship study of ageing Wonjung Park et.al. 2508.06055 null
2025-08-08 Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts Kiran Chhatre et.al. 2508.06032 null
2025-08-08 ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors Minsu Kim et.al. 2508.06014 null
2025-08-08 AnimateScene: Camera-controllable Animation in Any Scene Qingyang Liu et.al. 2508.05982 null
2025-08-08 A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image Yanxing Liang et.al. 2508.05950 null
2025-08-08 Enhancing Construction Site Analysis and Understanding with 3D Segmentation Sri Ramana Saketh Vasanthawada et.al. 2508.05922 null
2025-08-07 HOLODECK 2.0: Vision-Language-Guided 3D World Generation with Editing Zixuan Bian et.al. 2508.05899 null
2025-08-07 MZEN: Multi-Zoom Enhanced NeRF for 3-D Reconstruction with Unknown Camera Poses Jong-Ik Park et.al. 2508.05819 null
2025-08-07 Optimization-Free Style Transfer for 3D Gaussian Splats Raphael Du Sablon et.al. 2508.05813 null
2025-08-07 MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss Can Zhao et.al. 2508.05772 null
2025-08-07 GAP: Gaussianize Any Point Clouds with Text Guidance Weiqi Zhang et.al. 2508.05631 null
2025-08-07 Physically Controllable Relighting of Photographs Chris Careaga et.al. 2508.05626 null
2025-08-07 Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity Yuhan Zhang et.al. 2508.05609 null
2025-08-07 Robust adaptive fuzzy sliding mode control for trajectory tracking for of cylindrical manipulator Van Cuong Pham et.al. 2508.05584 null
2025-08-07 Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis Kunyu Feng et.al. 2508.05580 null
2025-08-07 Point cloud segmentation for 3D Clothed Human Layering Davide Garavaso et.al. 2508.05531 null
2025-08-07 Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking Zewei Wu et.al. 2508.05514 null
2025-08-07 MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Shibo Wang et.al. 2508.05506 null
2025-08-07 Symmetry Understanding of 3D Shapes via Chirality Disentanglement Weikang Wang et.al. 2508.05505 null
2025-08-07 Computational Design and Fabrication of Modular Robots with Untethered Control Manas Bhargava et.al. 2508.05410 null
2025-08-07 CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation Hamza Kalisch et.al. 2508.05375 null
2025-08-07 3DGabSplat: 3D Gabor Splatting for Frequency-adaptive Radiance Field Rendering Junyu Zhou et.al. 2508.05343 null
2025-08-08 CF3: Compact and Fast 3D Feature Fields Hyunjoon Lee et.al. 2508.05254 null
2025-08-07 Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer Junyi Wang et.al. 2508.05240 null
2025-08-07 EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery Bingyu Yang et.al. 2508.05205 null
2025-08-07 Refining Gaussian Splatting: A Volumetric Densification Approach Mohamed Abdul Gafoor et.al. 2508.05187 null
2025-08-07 Learning to See and Act: Task-Aware View Planning for Robotic Manipulation Yongjie Bai et.al. 2508.05186 null
2025-08-07 FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction Mohammed Daba et.al. 2508.05153 null
2025-08-07 FedGIN: Federated Learning with Dynamic Global Intensity Non-linear Augmentation for Organ Segmentation using Multi-modal Images Sachin Dudda Nagaraju et.al. 2508.05137 null
2025-08-07 A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding Mahmoud Chick Zaouali et.al. 2508.05064 null
2025-08-07 DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion Yifeng Huang et.al. 2508.05060 null
2025-08-07 MAG-Nav: Language-Driven Object Navigation Leveraging Memory-Reserved Active Grounding Weifan Zhang et.al. 2508.05021 null
2025-08-07 Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion Shenglun Chen et.al. 2508.04984 null
2025-08-07 UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS Zhihao Guo et.al. 2508.04968 null
2025-08-07 Laplacian Analysis Meets Dynamics Modelling: Gaussian Splatting for 4D Reconstruction Yifan Zhou et.al. 2508.04966 null
2025-08-07 Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting Zijian Wang et.al. 2508.04965 null
2025-08-06 CryoGS: Gaussian Splatting for Cryo-EM Homogeneous Reconstruction Suyi Chen et.al. 2508.04929 null
2025-08-06 LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan et.al. 2508.04847 null
2025-08-06 Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models Mehrdad Moradi et.al. 2508.04818 null
2025-08-05 Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy Shuo Chen et.al. 2508.04728 null
2025-08-06 Occupancy Learning with Spatiotemporal Memory Ziyang Leng et.al. 2508.04705 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics Ye Pan et.al. 2508.04687 null
2025-08-06 PixCuboid: Room Layout Estimation from Multi-view Featuremetric Alignment Gustav Hanning et.al. 2508.04659 null
2025-08-06 OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment Tongfan Guan et.al. 2508.04611 null
2025-08-06 $NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything Lingfeng Zhang et.al. 2508.04598 null
2025-08-06 Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline Linqing Zhao et.al. 2508.04597 null
2025-08-06 LA-CaRe-CNN: Cascading Refinement CNN for Left Atrial Scar Segmentation Franz Thaler et.al. 2508.04553 null
2025-08-06 Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds Haodong Zhu et.al. 2508.04508 null
2025-08-06 MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos Daisheng Jin et.al. 2508.04505 null
2025-08-06 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation Shuzhou Yang et.al. 2508.04467 null
2025-08-06 Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models Yinan Yu et.al. 2508.04406 null
2025-08-06 RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization Yanyan Li et.al. 2508.04335 null
2025-08-07 Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research Ke Li et.al. 2508.04326 null
2025-08-06 MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction Yaopeng Lou et.al. 2508.04297 null
2025-08-06 PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space Chenlei Lv et.al. 2508.04286 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition Jiahui Li et.al. 2508.04224 null
2025-08-06 Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification Jianxun Yu et.al. 2508.04205 null
2025-08-06 IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control Lijuan Liu et.al. 2508.04147 null
2025-08-06 DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting Zexu Huang et.al. 2508.04099 null
2025-08-06 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Yi-Ting Chen et.al. 2508.04090 null
2025-08-06 RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting Zhan Li et.al. 2508.04078 null
2025-08-06 Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation Jiayi He et.al. 2508.04049 null
2025-08-06 JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation Zheng Zhang et.al. 2508.03997 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 Inland-LOAM: Voxel-Based Structural Semantic Mapping for Inland Waterways Zhongbi Luo et.al. 2508.03672 null
2025-08-05 OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World Katherine Liu et.al. 2508.03669 null
2025-08-06 Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images Xiangyu Sun et.al. 2508.03643 null
2025-08-05 FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation Nassim Ali Ousalah et.al. 2508.03618 null
2025-08-05 CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models Ana Lawry Aguila et.al. 2508.03594 null
2025-08-05 Spatial Imputation Drives Cross-Domain Alignment for EEG Classification Hongjun Liu et.al. 2508.03437 null
2025-08-05 WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval Junlong Ren et.al. 2508.03343 null
2025-08-05 Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion Wentao Qu et.al. 2508.03252 null
2025-08-05 Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing Hongyu Shen et.al. 2508.03227 null
2025-08-05 Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling Heng Wu et.al. 2508.03186 null
2025-08-05 Duplex-GS: Proxy-Guided Weighted Blending for Real-Time Order-Independent Gaussian Splatting Weihang Liu et.al. 2508.03180 null
2025-08-05 H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction Heng Jia et.al. 2508.03118 null
2025-08-05 Point2Act: Efficient 3D Distillation of Multimodal LLMs for Zero-Shot Context-Aware Grasping Sang Min Kim et.al. 2508.03099 null
2025-08-05 RobustGS: Unified Boosting of Feedforward 3D Gaussian Splatting under Low-Quality Conditions Anran Wu et.al. 2508.03077 null
2025-08-05 SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation Bo Zhang et.al. 2508.03069 null
2025-08-05 A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation Tongxu Zhang et.al. 2508.03057 null
2025-08-05 SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting Liheng Zhang et.al. 2508.03017 null
2025-08-05 ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion Meng Zhou et.al. 2508.03008 null
2025-08-05 GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring Linji Wang et.al. 2508.02988 null
2025-08-04 Evaluation of 3D Counterfactual Brain MRI Generation Pengwei Sun et.al. 2508.02880 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing Mikołaj Zieliński et.al. 2508.02831 null
2025-08-04 PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation Zongyou Yang et.al. 2508.02806 null
2025-08-04 PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting Yijun Xu et.al. 2508.02660 null
2025-08-04 RL-U$^2$Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart Segmentation Jierui Qu et.al. 2508.02557 null
2025-08-04 Uncertainty-Aware Perception-Based Control for Autonomous Racing Jelena Trisovic et.al. 2508.02494 null
2025-08-05 Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting Jianchao Wang et.al. 2508.02493 null
2025-08-06 GR-Gaussian: Graph-Based Radiative Gaussian Splatting for Sparse-View CT Reconstruction Yikuang Yuluo et.al. 2508.02408 null
2025-08-04 Correspondence-Free Fast and Robust Spherical Point Pattern Registration Anik Sarker et.al. 2508.02339 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering Fangxin Liu et.al. 2508.02304 null
2025-08-04 Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection Jae-Young Kang et.al. 2508.02288 null
2025-08-04 SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion Rui Qian et.al. 2508.02261 null
2025-08-04 GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting Lei Yao et.al. 2508.02172 null
2025-08-04 Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes Tom Fischer et.al. 2508.02157 null
2025-08-04 ScrewSplat: An End-to-End Method for Articulated Object Recognition Seungyeon Kim et.al. 2508.02146 null
2025-08-04 VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling Yuru Xiao et.al. 2508.02129 null
2025-08-04 REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification Hongzhao Chen et.al. 2508.02104 null
2025-08-04 StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion Haoxin Yang et.al. 2508.02056 null
2025-08-04 Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure Ziling Wang et.al. 2508.02034 null
2025-08-04 On-the-Fly Object-aware Representative Point Selection in Point Cloud Xiaoyu Zhang et.al. 2508.01980 null
2025-08-04 From Photons to Physics: Autonomous Indoor Drones and the Future of Objective Property Assessment Petteri Teikari et.al. 2508.01965 null
2025-08-03 Less is More: AMBER-AFNO – a New Benchmark for Lightweight 3D Medical Image Segmentation Andrea Dosi et.al. 2508.01941 null
2025-08-03 MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning Akash Venkateshwaran et.al. 2508.01907 null
2025-08-03 Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems Zhongliang Guo et.al. 2508.01845 null
2025-08-03 OmniEvent: Unified Event Representation Learning Weiqi Yan et.al. 2508.01842 null
2025-08-03 Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Yufei Zhang et.al. 2508.01835 null
2025-08-03 Skip priors and add graph-based anatomical information, for point-based Couinaud segmentation Xiaotong Zhang et.al. 2508.01785 null
2025-08-05 VPN: Visual Prompt Navigation Shuo Feng et.al. 2508.01766 null
2025-08-03 AG$^2$aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing Zhaonan Wang et.al. 2508.01740 null
2025-08-03 OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping Danyang Li et.al. 2508.01723 null
2025-08-03 LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving Luqi Cheng et.al. 2508.01704 null
2025-08-03 Register Anything: Estimating “Corresponding Prompts” for Segment Anything Model Shiqi Huang et.al. 2508.01697 null
2025-08-03 DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing Yufeng Chi et.al. 2508.01684 null
2025-08-03 DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding Hanqing Wang et.al. 2508.01651 null
2025-08-03 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Na Zhang et.al. 2508.01650 null
2025-08-03 Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection Hanxi Li et.al. 2508.01591 null
2025-08-03 A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction Hua Yu et.al. 2508.01585 null
2025-08-03 Deeply Supervised Multi-Task Autoencoder for Biological Brain Age estimation using three dimensional T$_1$-weighted magnetic resonance imaging Mehreen Kanwal et.al. 2508.01565 null
2025-08-03 Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion Sara Shoouri et.al. 2508.01562 null
2025-08-02 Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning Jack Zeng et.al. 2508.01522 null
2025-08-02 EfficientGFormer: Multimodal Brain Tumor Segmentation via Pruned Graph-Augmented Transformer Fatemeh Ziaeetabar et.al. 2508.01465 null
2025-08-02 Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians Quankai Gao et.al. 2508.01464 null
2025-08-02 Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation Sikha O K et.al. 2508.01460 null
2025-08-05 3DRot: 3D Rotation Augmentation for RGB-Based 3D Tasks Shitian Yang et.al. 2508.01423 null
2025-08-02 ReMu: Reconstructing Multi-layer 3D Clothed Human from Image Layers Onat Vuran et.al. 2508.01381 null
2025-08-02 P3P Made Easy Seong Hun Lee et.al. 2508.01312 null
2025-08-02 C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor Haoquan Lu et.al. 2508.01311 null
2025-08-02 CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis Alec Sargood et.al. 2508.01292 null
2025-08-02 Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching Chuang-Wei Liu et.al. 2508.01275 null
2025-08-05 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh Shuangkang Fang et.al. 2508.01242 null
2025-08-02 OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS Han Ling et.al. 2508.01239 null
2025-08-02 Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system Jiyong Kim et.al. 2508.01230 null
2025-08-02 MoGaFace: Momentum-Guided and Texture-Aware Gaussian Avatars for Consistent Facial Geometry Yujian Liu et.al. 2508.01218 null
2025-08-02 Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization? Bolei Chen et.al. 2508.01216 null
2025-08-02 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Zhan Shi et.al. 2508.01197 null
2025-08-02 Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning Xinhang Wan et.al. 2508.01184 null
2025-08-02 No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views Ranran Huang et.al. 2508.01171 null
2025-08-02 DELTAv2: Accelerating Dense 3D Tracking Tuan Duc Ngo et.al. 2508.01170 null
2025-08-02 OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding Dianyi Yang et.al. 2508.01150 null
2025-08-02 Design of Q8bot: A Miniature, Low-Cost, Dynamic Quadruped Built with Zero Wires Yufeng Wu et.al. 2508.01149 null
2025-08-02 UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation Chaitanya Patel et.al. 2508.01126 null
2025-08-01 DreamSat-2.0: Towards a General Single-View Asteroid 3D Reconstruction Santiago Diaz et.al. 2508.01079 null
2025-08-01 Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation Fenghe Tang et.al. 2508.01064 null
2025-08-01 Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans Theo Di Piazza et.al. 2508.01045 null
2025-08-01 3D Reconstruction via Incremental Structure From Motion Muhammad Zeeshan et.al. 2508.01019 null
2025-08-01 Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection Cheng-You Lu et.al. 2508.01014 null
2025-08-01 Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF Massoud Pourmandi et.al. 2508.00967 null
2025-07-31 Investigating Crossing Perception in 3D Graph Visualisation Ying Zhang et.al. 2508.00950 null
2025-08-01 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation Wenxuan Guo et.al. 2508.00823 null
2025-08-01 Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning Alexander Nikitas Dimopoulos et.al. 2508.00822 null
2025-08-01 GECO: Geometrically Consistent Embedding with Lightspeed Inference Regine Hartwig et.al. 2508.00746 null
2025-08-01 Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR Adwait Chandorkar et.al. 2508.00744 null
2025-08-04 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Junzhe Lu et.al. 2508.00599 null
2025-08-01 OmniUnet: A Multimodal Network for Unstructured Terrain Segmentation on Planetary Rovers Using RGB, Depth, and Thermal Imagery Raul Castilla-Arquillo et.al. 2508.00580 null
2025-08-04 LesiOnTime – Joint Temporal and Clinical Modeling for Small Breast Lesion Segmentation in Longitudinal DCE-MRI Mohammed Kamran et.al. 2508.00496 null
2025-08-01 HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection Jiaping Cao et.al. 2508.00473 null
2025-08-01 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Nan Xiang et.al. 2508.00428 null
2025-08-01 Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Seunggeun Chi et.al. 2508.00427 null
2025-08-01 Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents Janika Deborah Gajo et.al. 2508.00400 null
2025-08-01 Occlusion-robust Stylization for Drawing-based 3D Animation Sunjae Yoon et.al. 2508.00398 null
2025-08-01 SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies Liang Han et.al. 2508.00366 null
2025-08-01 Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering Yan Gong et.al. 2508.00358 null
2025-08-01 Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging Tianshuang Qiu et.al. 2508.00354 null
2025-08-01 AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer Jin Lyu et.al. 2508.00298 null
2025-08-01 Towards Robust Semantic Correspondence: A Benchmark and Insights Wenyue Chong et.al. 2508.00272 null
2025-08-05 Multimodal Referring Segmentation: A Survey Henghui Ding et.al. 2508.00265 null
2025-08-01 PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting Wentao Sun et.al. 2508.00259 null
2025-08-01 Weakly Supervised Intracranial Aneurysm Detection and Segmentation in MR angiography via Multi-task UNet with Vesselness Prior Erin Rainville et.al. 2508.00235 null
2025-07-31 Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs Bhavya Goyal et.al. 2508.00169 null
2025-07-31 GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation Tomasz Szczepański et.al. 2508.00155 null
2025-07-31 Stress-Aware Resilient Neural Training Ashkan Shakarami et.al. 2508.00098 null
2025-07-31 Punching Bag vs. Punching Person: Motion Transferability in Videos Raiyaan Abdullah et.al. 2508.00085 null
2025-07-31 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang et.al. 2507.23785 null
2025-07-31 Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions Li Siyao et.al. 2507.23778 null
2025-07-31 SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting Di Li et.al. 2507.23772 null
2025-08-05 Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic Liu Li et.al. 2507.23763 null
2025-07-31 Enhanced Velocity Field Modeling for Gaussian Video Reconstruction Zhenyang Li et.al. 2507.23704 null
2025-07-31 Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents Shaofei Cai et.al. 2507.23698 null
2025-07-31 High-resolution eikonal imaging and uncertainty quantification of the Kilauea caldera Angela F. Gao et.al. 2507.23692 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes Xiaohan Li et.al. 2507.23677 null
2025-07-31 DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation Yuchen Zhou et.al. 2507.23599 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization Maxime Pietrantoni et.al. 2507.23569 null
2025-07-31 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection Yung-Hsu Yang et.al. 2507.23567 null
2025-08-01 H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation Hongzhe Bi et.al. 2507.23523 null
2025-07-31 Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Mutian Xu et.al. 2507.23483 null
2025-07-31 FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Donghyun Lee et.al. 2507.23480 null
2025-07-31 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding Ting Huang et.al. 2507.23478 null
2025-07-31 NeRF Is a Valuable Assistant for 3D Gaussian Splatting Shuangkang Fang et.al. 2507.23374 null
2025-07-31 MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting Xingyue Peng et.al. 2507.23340 null
2025-08-01 Training-free Geometric Image Editing on Diffusion Models Hanshen Zhu et.al. 2507.23300 null
2025-07-31 iLRM: An Iterative Large 3D Reconstruction Model Gyeongjin Kang et.al. 2507.23277 null
2025-07-31 GSFusion: Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting Jaeseok Park et.al. 2507.23273 null
2025-07-31 Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2 Solha Kang et.al. 2507.23272 null
2025-07-30 Details Matter for Indoor Open-vocabulary 3D Instance Segmentation Sanghun Jung et.al. 2507.23134 null
2025-07-30 Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation Zheyuan Zhang et.al. 2507.23110 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-07-30 Adaptive Time-step Training for Enhancing Spike-Based Neural Radiance Fields Ranxi Lin et.al. 2507.23033 null
2025-07-30 Learning to Prune Branches in Modern Tree-Fruit Orchards Abhinav Jain et.al. 2507.23015 null
2025-07-30 Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction Zhensheng Yuan et.al. 2507.23006 null
2025-07-30 Viser: Imperative, Web-based 3D Visualization in Python Brent Yi et.al. 2507.22885 null
2025-07-30 DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Qingcheng Zhao et.al. 2507.22825 null
2025-07-30 Wall Shear Stress Estimation in Abdominal Aortic Aneurysms: Towards Generalisable Neural Surrogate Models Patryk Rygiel et.al. 2507.22817 null
2025-07-30 Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques Weide Liu et.al. 2507.22791 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks Hang Su et.al. 2507.22733 null
2025-07-30 Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints Thuy Tran et.al. 2507.22699 null
2025-07-30 Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation Hongbin Lin et.al. 2507.22668 null
2025-07-30 trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images MohammadAmin Alamalhoda et.al. 2507.22635 null
2025-07-30 Estimating 2D Camera Motion with Hybrid Motion Basis Haipeng Li et.al. 2507.22480 null
2025-07-30 UAVScenes: A Multi-Modal Dataset for UAVs Sijie Wang et.al. 2507.22412 null
2025-07-30 UFV-Splatter: Pose-Free Feed-Forward 3D Gaussian Splatting Adapted to Unfavorable Views Yuki Fujimura et.al. 2507.22342 null
2025-07-30 A Segmentation Framework for Accurate Diagnosis of Amyloid Positivity without Structural Images Penghan Zhu et.al. 2507.22336 null
2025-07-29 Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception Christian Ellis et.al. 2507.22194 null
2025-07-29 Enhancing efficiency in paediatric brain tumour segmentation using a pathologically diverse single-center clinical dataset A. Piffer et.al. 2507.22152 null
2025-07-29 Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos Ziren Gong et.al. 2507.22052 null
2025-07-29 ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports Mohammed Baharoon et.al. 2507.22030 null
2025-07-29 Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images Yutao Hu et.al. 2507.22024 null
2025-07-29 XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation Raju Ningappa Mulawade et.al. 2507.22020 null
2025-07-29 DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments Yufei Jia et.al. 2507.21981 null
2025-07-29 PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction Jiahui Ren et.al. 2507.21960 null
2025-07-31 MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors Shouyi Lu et.al. 2507.21872 null
2025-07-29 VidFuncta: Towards Generalizable Neural Representations for Ultrasound Videos Julia Wolleb et.al. 2507.21863 null
2025-07-29 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels HunyuanWorld Team et.al. 2507.21809 null
2025-07-29 AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion Zhishu Liu et.al. 2507.21778 null
2025-07-29 Multi-UAV Deployment in Obstacle-Cluttered Environments with LOS Connectivity Yuda Chen et.al. 2507.21772 null
2025-07-30 No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering Linye Wei et.al. 2507.21572 null
2025-07-29 Multi-View Reconstruction with Global Context for 3D Anomaly Detection Yihan Sun et.al. 2507.21555 null
2025-07-29 LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments Junhao Chen et.al. 2507.21517 null
2025-07-29 ST-DAI: Single-shot 2.5D Spatial Transcriptomics with Intra-Sample Domain Adaptive Imputation for Cost-efficient 3D Reconstruction Jiahe Qian et.al. 2507.21516 null
2025-07-29 BANG: Dividing 3D Assets via Generative Exploded Dynamics Longwen Zhang et.al. 2507.21493 null
2025-07-29 Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval Zhichuan Wang et.al. 2507.21489 null
2025-07-28 Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View Zitong Zhang et.al. 2507.21371 null
2025-08-03 Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy Jicheng Yuan et.al. 2507.21358 null
2025-07-28 DEM-NeRF: A Neuro-Symbolic Method for Scientific Discovery through Physics-Informed Simulation Wenkai Tan et.al. 2507.21350 null
2025-07-28 GLCP: Global-to-Local Connectivity Preservation for Tubular Structure Segmentation Feixiang Zhou et.al. 2507.21328 null
2025-07-28 VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction Martin de La Gorce et.al. 2507.21311 null
2025-07-28 Fluidically Innervated Lattices Make Versatile and Durable Tactile Sensors Annan Zhang et.al. 2507.21225 null
2025-08-03 Reconstructing 4D Spatial Intelligence: A Survey Yukang Cao et.al. 2507.21045 null
2025-07-28 GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction Tianhao Li et.al. 2507.20963 null
2025-07-28 $S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping Ruoyu Fan et.al. 2507.20854 null
2025-07-28 An Efficient Machine Learning Framework for Forest Height Estimation from Multi-Polarimetric Multi-Baseline SAR data Francesca Razzano et.al. 2507.20798 null
2025-07-28 KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video Zhuoer Yin et.al. 2507.20763 null
2025-07-28 Methods for the Segmentation of Reticular Structures Using 3D LiDAR Data: A Comparative Evaluation Francisco J. Soler Mora et.al. 2507.20589 null
2025-07-28 M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast Jiacheng Lu et.al. 2507.20582 null
2025-07-28 Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation Hyung Kyu Kim et.al. 2507.20568 null
2025-07-28 MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization Hyung Kyu Kim et.al. 2507.20562 null
2025-07-28 Uni-Mapper: Unified Mapping Framework for Multi-modal LiDARs in Complex and Dynamic Environments Gilhwan Kang et.al. 2507.20538 null
2025-07-28 Enhancing Spatial Reasoning through Visual and Textual Thinking Xun Liang et.al. 2507.20529 null
2025-07-28 GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections Haiyang Bai et.al. 2507.20512 null
2025-07-28 Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features Shiyang Liu et.al. 2507.20480 null
2025-07-29 From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos Chenjian Gao et.al. 2507.20331 null
2025-07-27 Decomposing Densification in Gaussian Splatting for Faster 3D Scene Reconstruction Binxiao Huang et.al. 2507.20239 null
2025-07-27 NeuroVoxel-LM: Language-Aligned 3D Perception via Dynamic Voxelization and Meta-Embedding Shiyu Liu et.al. 2507.20110 null
2025-07-26 High-Speed Event Vision-Based Tactile Roller Sensor for Large Surface Measurements Akram Khairi et.al. 2507.19914 null
2025-07-30 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-26 Taking Language Embedded 3D Gaussian Splatting into the Wild Yuze Wang et.al. 2507.19830 null
2025-07-25 GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting David Bauer et.al. 2507.19718 null
2025-07-25 DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations Ziren Gong et.al. 2507.19474 null
2025-07-25 Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization Pol Francesch Huc et.al. 2507.19459 null
2025-07-25 NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography Kirsten W. H. Maas et.al. 2507.19328 null
2025-07-25 3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering Wei-Hsing Huang et.al. 2507.19133 null
2025-07-25 Gaussian Set Surface Reconstruction through Per-Gaussian Optimization Zhentao Huang et.al. 2507.18923 null
2025-07-24 SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time Yun Chen et.al. 2507.18713 null
2025-07-24 Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping Chong Cheng et.al. 2507.18541 null
2025-07-24 G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM Gyuhyeon Pak et.al. 2507.18344 null
2025-07-24 LONG3R: Long Sequence Streaming 3D Reconstruction Zhuoguang Chen et.al. 2507.18255 null
2025-07-24 PS-GS: Gaussian Splatting for Multi-View Photometric Stereo Yixiao Chen et.al. 2507.18231 null
2025-07-24 High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details Jun Zhou et.al. 2507.18023 null
2025-07-24 Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners Kostas Karakontis et.al. 2507.17519 null
2025-07-23 Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field Yuzhe Zhu et.al. 2507.17351 null
2025-07-23 Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting Hyeongmin Lee et.al. 2507.17336 null
2025-07-24 PolarAnything: Diffusion-based Polarimetric Image Synthesis Kailong Zhang et.al. 2507.17268 null
2025-07-22 StreamME: Simplify 3D Gaussian Avatar within Live Stream Luchuan Song et.al. 2507.17029 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-22 Sparse-View 3D Reconstruction: Recent Advances and Open Challenges Tanveer Younis et.al. 2507.16406 null
2025-07-22 Dens3R: A Foundation Model for 3D Geometry Prediction Xianze Fang et.al. 2507.16290 null
2025-07-22 LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images Guichen Huang et.al. 2507.16144 null
2025-07-21 Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS Jisu Shin et.al. 2507.15748 null
2025-07-21 DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting Hung Nguyen et.al. 2507.15690 null
2025-07-21 Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing Boni Hu et.al. 2507.15683 null
2025-07-21 Gaussian Splatting with Discretized SDF for Relightable Assets Zuo-Liang Zhu et.al. 2507.15629 null
2025-07-28 SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting Zihui Gao et.al. 2507.15602 null
2025-07-21 ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting Ruijie Zhu et.al. 2507.15454 null
2025-07-25 GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing Minnan Pei et.al. 2507.15300 null
2025-07-20 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline Kaishva Chintan Shah et.al. 2507.14924 null
2025-07-20 Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction Xiufeng Huang et.al. 2507.14921 null
2025-07-20 An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks Xinyi Wu et.al. 2507.14798 null
2025-07-30 Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey Jiahui Zhang et.al. 2507.14501 null
2025-07-19 Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation Han Gong et.al. 2507.14454 null
2025-07-19 Adaptive 3D Gaussian Splatting Video Streaming Han Gong et.al. 2507.14432 null
2025-08-01 C-DOG: Multi-View Multi-instance Feature Association Using Connected δ-Overlap Graphs Yung-Hong Sun et.al. 2507.14095 null
2025-07-18 TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views Hsiang-Hui Hung et.al. 2507.13929 null
2025-07-18 Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading Efstratios Geronikolakis et.al. 2507.13917 null
2025-07-21 PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations Yu Wei et.al. 2507.13891 null
2025-07-18 EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation Seungjun Moon et.al. 2507.13648 null
2025-07-18 Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation Masahiro Ogawa et.al. 2507.13628 null
2025-07-19 AutoPartGen: Autogressive 3D Part Generation and Discovery Minghao Chen et.al. 2507.13346 null
2025-07-16 VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians Siyuan Yao et.al. 2507.12667 null
2025-07-16 NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting Kuangshi Ai et.al. 2507.12621 null
2025-07-21 Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition Beizhen Zhao et.al. 2507.12498 null
2025-07-19 SpatialTrackerV2: 3D Point Tracking Made Easy Yuxi Xiao et.al. 2507.12462 null
2025-07-16 Revealing the Ancient Beauty: Digital Reconstruction of Temple Tiles using Computer Vision Arkaprabha Basu et.al. 2507.12195 null
2025-07-16 DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi Navid Hasanzadeh et.al. 2507.12132 null
2025-07-16 BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images Davide Di Nucci et.al. 2507.12095 null
2025-07-16 SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation Beining Xu et.al. 2507.12027 null
2025-07-16 HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing Tielong Wang et.al. 2507.11971 null
2025-07-16 Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark Jingqian Wu et.al. 2507.11931 null
2025-07-16 CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning Peiwen Xia et.al. 2507.11834 null
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-21 Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Hayeon Kim et.al. 2507.11061 null
2025-07-14 ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions Shivangi Aneja et.al. 2507.10542 null
2025-07-14 Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry Geyou Zhang et.al. 2507.10009 null
2025-07-19 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. 2507.09993 null
2025-07-14 VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling Zihang Zeng et.al. 2507.09987 null
2025-07-11 From images to properties: a NeRF-driven framework for granular material parameter inversion Cheng-Hsi Hsiao et.al. 2507.09005 null
2025-07-11 An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan Mengyuan Liu et.al. 2507.08690 null
2025-07-11 Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance Gábor Baranyi et.al. 2507.08624 null
2025-07-11 Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT Wei Zhang et.al. 2507.08448 null
2025-07-11 RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting Ji Hyun Seo et.al. 2507.08434 null
2025-07-11 CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations Wenbo Cui et.al. 2507.08262 null
2025-07-10 Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction Hyungjun Doh et.al. 2507.08137 null
2025-07-18 RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Chong Cheng et.al. 2507.08136 null
2025-07-10 Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions Longfei Li et.al. 2507.07978 null
2025-07-10 RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection Yongyang Zhou et.al. 2507.07733 null

Diffusion

Publish Date Title Authors PDF Code
2025-08-28 First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge Fahad Shamshad et.al. 2508.21072 null
2025-08-28 Dress&Dance: Dress up and Dance as You Like It - Technical Preview Jun-Kun Chen et.al. 2508.21070 null
2025-08-28 OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning Yuan Gong et.al. 2508.21066 null
2025-08-28 Mixture of Contexts for Long Video Generation Shengqu Cai et.al. 2508.21058 null
2025-08-28 HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning Zhi Su et.al. 2508.21043 null
2025-08-28 FW-GAN: Frequency-Driven Handwriting Synthesis with Wave-Modulated MLP Generator Huynh Tong Dang Khoa et.al. 2508.21040 null
2025-08-28 Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets Dale Decatur et.al. 2508.21032 null
2025-08-28 System size and event shape dependence of particle-identified balance functions in proton-proton collisions at $\sqrt{s}=13$ TeV Subash Chandra Behera et.al. 2508.21030 null
2025-08-28 POSE: Phased One-Step Adversarial Equilibrium for Video Diffusion Models Jiaxiang Cheng et.al. 2508.21019 null
2025-08-28 Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance Luozhijie Jin et.al. 2508.21016 null
2025-08-28 Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees Yaniv Hassidof et.al. 2508.21001 null
2025-08-28 RANGAN: GAN-empowered Anomaly Detection in 5G Cloud RAN Douglas Liao et.al. 2508.20985 null
2025-08-28 Random attractors and nonergodic attractors for diffusions with degeneracies Yuri Bakhtin et.al. 2508.20968 null
2025-08-28 Very high-energy gamma-ray and neutrino emission from hadronic interaction in compact binary millisecond pulsars Vittoria Vecchiotti et.al. 2508.20952 null
2025-08-28 Lattice Random Walk Discretisations of Stochastic Differential Equations Samuel Duffield et.al. 2508.20883 null
2025-08-28 Understanding and evaluating computer vision models through the lens of counterfactuals Pushkar Shukla et.al. 2508.20881 null
2025-08-28 Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement Shrishti Saha Shetu et.al. 2508.20859 null
2025-08-28 Uniform error analysis of a rectangular Morley finite element method on a Shishkin mesh for a 4th-order singularly perturbed boundary value problem Xiangyun Meng et.al. 2508.20857 null
2025-08-28 Learning Primitive Embodied World Models: Towards Scalable Robotic Learning Qiao Sun et.al. 2508.20840 null
2025-08-28 High-Resolution Atomic Magnetometer-Based Imaging of Integrated Circuits and Batteries Dominic Hunter et.al. 2508.20834 null
2025-08-28 Distinct Spatiotemporal Dynamics of Thermoelectric Transport Across Superconducting Transition Rajae Malek et.al. 2508.20792 null
2025-08-28 Prediction of sulphate hazes in the lower Venus atmosphere Peter Woitke et.al. 2508.20790 null
2025-08-28 Evaluating Compositional Generalisation in VLMs and Diffusion Models Beth Pearson et.al. 2508.20783 null
2025-08-28 Unleashing Uncertainty: Efficient Machine Unlearning for Generative AI Christoforos N. Spartalis et.al. 2508.20773 null
2025-08-28 Anomalous diffusion and run-and-tumble motion of a chemotactic particle in low dimensions Jacopo Romano et.al. 2508.20756 null
2025-08-28 Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning Yibin Wang et.al. 2508.20751 null
2025-08-28 A two-state generalisation of the strong collision model Ola Kenji Forslund et.al. 2508.20727 null
2025-08-28 EEGDM: Learning EEG Representation with Latent Diffusion Model Shaocong Wang et.al. 2508.20705 null
2025-08-28 Agent-based model of information diffusion in the limit order book trading Mateusz Wilinski et.al. 2508.20672 null
2025-08-28 “Humor, Art, or Misinformation?”: A Multimodal Dataset for Intent-Aware Synthetic Image Detection Anastasios Skoularikis et.al. 2508.20670 null
2025-08-28 Amadeus: Autoregressive Model with Bidirectional Attribute Modelling for Symbolic Music Hongju Su et.al. 2508.20665 null
2025-08-28 VarDiU: A Variational Diffusive Upper Bound for One-Step Diffusion Distillation Leyang Wang et.al. 2508.20646 null
2025-08-28 CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models Ayan Banerjee et.al. 2508.20640 null
2025-08-28 EmoCAST: Emotional Talking Portrait via Emotive Text Description Yiguo Jiang et.al. 2508.20615 null
2025-08-28 Revisiting the Privacy Risks of Split Inference: A GAN-Based Data Reconstruction Attack via Progressive Feature Optimization Yixiang Qiu et.al. 2508.20613 null
2025-08-28 Physics Informed Generative Models for Magnetic Field Images Aye Phyu Phyu Aung et.al. 2508.20612 null
2025-08-28 GENRE-CMR: Generalizable Deep Learning for Diverse Multi-Domain Cardiac MRI Reconstruction Kian Anvari Hamedani et.al. 2508.20600 null
2025-08-28 Disruptive Attacks on Face Swapping via Low-Frequency Perceptual Perturbations Mengxiao Huang et.al. 2508.20595 null
2025-08-28 FastFit: Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models Zheng Chong et.al. 2508.20586 null
2025-08-28 Persode: Personalized Visual Journaling with Episodic Memory-Aware AI Agent Seokho Jin et.al. 2508.20585 null
2025-08-28 SimShear: Sim-to-Real Shear-based Tactile Servoing Kipp McAdam Freud et.al. 2508.20561 null
2025-08-28 Equilibria of aggregation-diffusion models with nonlinear potentials Francesco Bozzola et.al. 2508.20523 null
2025-08-28 Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent En Ci et.al. 2508.20505 null
2025-08-28 Run-and-tumble particle with diffusion: boundary local times and the zero-diffusion limit Paul C Bressloff et.al. 2508.20473 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-28 Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models Desen Sun et.al. 2508.20424 null
2025-08-28 AWorld: Orchestrating the Training Recipe for Agentic AI Chengyue Yu et.al. 2508.20404 null
2025-08-28 Mean Field Game with Reflected Jump Diffusion Dynamics: A Linear Programming Approach Zongxia Liang et.al. 2508.20388 null
2025-08-28 Do triangles matter? Replicating hypergraph disease dynamics with lower-order interactions Eugene Tan et.al. 2508.20380 null
2025-08-28 Audio-Guided Visual Editing with Complex Multi-Modal Prompts Hyeonyu Kim et.al. 2508.20379 null
2025-08-28 Numerical Method for Space-Time Fractional Diffusion: A Stochastic Approach Tengteng Cui et.al. 2508.20361 null
2025-08-28 Artificial neural network solver for Fokker-Planck and Koopman eigenfunctions Max Kreider et.al. 2508.20339 null
2025-08-27 Score-Based Diffusion Models in Infinite Dimensions: A Malliavin Calculus Perspective Ehsan Mirafzali et.al. 2508.20316 null
2025-08-27 Efficient ion re-acceleration in laboratory-produced interpenetrating collisionless shocks W. Yao et.al. 2508.20303 null
2025-08-27 Out-of-time-order correlators bridge classical transport and quantum dynamics Sophia N. Fricke et.al. 2508.20235 null
2025-08-27 Velocity Spectrum Imaging using velocity encoding preparation pulses Luis Hernandez-Garcia et.al. 2508.20218 null
2025-08-27 InfinityHuman: Towards Long-Term Audio-Driven Human Xiaodi Li et.al. 2508.20210 null
2025-08-27 The structure of the giant radio fossil in the Ophiuchus galaxy cluster Simona Giacintucci et.al. 2508.20190 null
2025-08-27 SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization Yang Su et.al. 2508.20182 null
2025-08-27 Nonlinear diffusion in relativistic kinetic theory Simone Calogero et.al. 2508.20147 null
2025-08-27 MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation Kang-Hyun Lee et.al. 2508.20138 null
2025-08-27 Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning Jinhao Liang et.al. 2508.20095 null
2025-08-27 AudioStory: Generating Long-Form Narrative Audio with Large Language Models Yuxin Guo et.al. 2508.20088 null
2025-08-27 Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies Zhixuan Liang et.al. 2508.20072 null
2025-08-27 A unique solution to overcome the barriers to planetesimal formation at low dust-to-gas ratio H. Meheut et.al. 2508.20070 null
2025-08-27 Neural Conditional Simulation for Complex Spatial Processes Julia Walchessen et.al. 2508.20067 null
2025-08-27 Joint Analysis of HI Absorption Zeeman Measurements and the Morphology of Filamentary HI Emission Marta Nowotka et.al. 2508.20065 null
2025-08-27 Wave coarsening drives time crystallization in active solids Jonas Veenstra et.al. 2508.20052 null
2025-08-27 GS: Generative Segmentation via Label Diffusion Yuhao Chen et.al. 2508.20020 null
2025-08-27 Diffusion Language Models Know the Answer Before Decoding Pengxiang Li et.al. 2508.19982 null
2025-08-27 The Information Dynamics of Generative Diffusion Luca Ambrogioni et.al. 2508.19897 null
2025-08-27 Quantum latent distributions in deep generative models Omar Bacarreza et.al. 2508.19857 null
2025-08-28 Ego-centric Predictive Model Conditioned on Hand Trajectories Binjie Zhang et.al. 2508.19852 null
2025-08-27 Physics-Informed DeepONet Coupled with FEM for Convective Transport in Porous Media with Sharp Gaussian Sources Erdi Kara et.al. 2508.19847 null
2025-08-27 Exotic rheology of materials with active rearrangements Aondoyima Ioratim-Uba et.al. 2508.19844 null
2025-08-27 Not Every Gift Comes in Gold Paper or with a Red Ribbon: Exploring Color Perception in Text-to-Image Models Shay Shomer Chai et.al. 2508.19791 null
2025-08-27 StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation Xiuchao Wu et.al. 2508.19789 null
2025-08-27 Fast 3D Diffusion for Scalable Granular Media Synthesis Muhammad Moeeze Hassan et.al. 2508.19752 null
2025-08-27 Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy Binhui Zhang et.al. 2508.19750 null
2025-08-27 MC for Gastroretentive Drug Delivery Sebastian Lotter et.al. 2508.19739 null
2025-08-27 Synthetic Image Detection via Spectral Gaps of QC-RBIM Nishimori Bethe-Hessian Operators V. S. Usatyuk et.al. 2508.19698 null
2025-08-27 MnBr$_2$ on the graphene on Ir(110) substrate: growth, structure, and super-moiré Affan Safeer et.al. 2508.19694 null
2025-08-27 Atomistic insights into hydrogen migration in IGZO from machine-learning interatomic potential: linking atomic diffusion to device performance Hyunsung Cho et.al. 2508.19674 null
2025-08-27 Multi-value Probabilistic Computing with current-controlled Skyrmion Diffusion Thomas B. Winkler et.al. 2508.19623 null
2025-08-27 IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation Qizhe Fan et.al. 2508.19604 null
2025-08-27 Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction Dat Nguyen Cong et.al. 2508.19581 null
2025-08-28 Interact-Custom: Customized Human Object Interaction Image Generation Zhu Xu et.al. 2508.19575 null
2025-08-27 Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era Dawei Li et.al. 2508.19570 null
2025-08-27 MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery Yu-Wei Zhang et.al. 2508.19555 null
2025-08-27 Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding Bowen Sun et.al. 2508.19529 null
2025-08-27 MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment Zhiting Gao et.al. 2508.19527 null
2025-08-27 Functionally-graded drug delivery systems with binding reactions: analytical and stochastic approaches for the fraction of drug released Obi A. Carwood et.al. 2508.19510 null
2025-08-27 DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View Tian Qiu et.al. 2508.19508 null
2025-08-27 Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery Xiangxu Wang et.al. 2508.19499 null
2025-08-27 Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks Muhammad Ahmed Mohsin et.al. 2508.19495 null
2025-08-26 MRExtrap: Longitudinal Aging of Brain MRIs using Linear Modeling in Latent Space Jaivardhan Kapoor et.al. 2508.19482 null
2025-08-26 Shining light on degeneracies and uncertainties in quantifying both exchange and restriction with time-dependent diffusion MRI using Bayesian inference Maëliss Jallais et.al. 2508.19478 null
2025-08-26 Hydrodynamic Limit of the Symmetric Zero-Range Process with Slow Boundary Oslenne Araújo et.al. 2508.19447 null
2025-08-26 On Surjectivity of Neural Networks: Can you elicit any behavior from your model? Haozhe Jiang et.al. 2508.19445 null
2025-08-26 Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization Paimon Goulart et.al. 2508.19443 null
2025-08-26 Quantification of mobile ions in perovskite solar cells with thermally activated ion current measurements Moritz C. Schmidt et.al. 2508.19403 null
2025-08-26 DETNO: A Diffusion-Enhanced Transformer Neural Operator for Long-Term Traffic Forecasting Owais Ahmad et.al. 2508.19389 null
2025-08-26 Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs Supratik Sarkar et.al. 2508.19366 null
2025-08-28 MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation Ming Chen et.al. 2508.19320 null
2025-08-26 Disorder-induced proximate quantum spin ice phase in Pr$_2$Sn$_2$O$_7$ Yi Luo et.al. 2508.19248 null
2025-08-26 Articulate3D: Zero-Shot Text-Driven 3D Object Posing Oishi Deb et.al. 2508.19244 null
2025-08-26 MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation Hao Shi et.al. 2508.19236 null
2025-08-26 VibeVoice Technical Report Zhiliang Peng et.al. 2508.19205 null
2025-08-26 LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding Julian Ost et.al. 2508.19204 null
2025-08-26 Planning-Query-Guided Model Generation for Model-Based Deformable Object Manipulation Alex LaGrassa et.al. 2508.19199 null
2025-08-26 All-in-One Slider for Attribute Manipulation in Diffusion Models Weixin Ye et.al. 2508.19195 null
2025-08-26 MDD: a Mask Diffusion Detector to Protect Speaker Verification Systems from Adversarial Perturbations Yibo Bai et.al. 2508.19180 null
2025-08-26 Stoch-IDENT: New Method and Mathematical Analysis for Identifying SPDEs from Data Jianbo Cui et.al. 2508.19177 null
2025-08-26 RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration Yan Chen et.al. 2508.19154 null
2025-08-26 Saddle Hierarchy in Dense Associative Memory Robin Thériault et.al. 2508.19151 null
2025-08-26 Alloyed cementite (Fe-Ni-Cr)$_3$C: structure and hyperfine field from DFT calculations and experimental comparison Lyudmila V. Dobysheva et.al. 2508.19148 null
2025-08-26 Lattice vacancy migration barriers in Fe-Ni alloys, and why Ni atoms diffuse slowly: An ab initio study Adam M. Fisher et.al. 2508.19124 null
2025-08-26 Composition and Alignment of Diffusion Models using Constrained Learning Shervin Khalafi et.al. 2508.19104 null
2025-08-26 Evaluation of in vitro antibacterial activity and phytochemical profile of aqueous leaf extract of Asystasia variabilis R Wijerathna et.al. 2508.19049 null
2025-08-26 In-vitro Anti-bacterial Activity of Methanol and Aqueous Crude Extracts of Horsfieldia iryaghedhi RMHKK Rajapaksha et.al. 2508.19025 null
2025-08-28 STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems Gary Simethy et.al. 2508.19011 null
2025-08-26 Detection of Diffuse Radio Emission inside the Supernova Remnant G338.3-0.0 associated with the Gamma-ray Source HESS J1640-465 Moaz Abdelmaguid et.al. 2508.18999 null
2025-08-26 Krylov-Veretennikov decomposition for measure-valued processes induced by SDEs with interaction on Riemannian manifolds Andrey Dorogovtsev et.al. 2508.18995 null
2025-08-26 Junctional-Fluctuation-Mediated Fluidisation of Multi-Phase Field Epithelial Monolayers James N. Graham et.al. 2508.18987 null
2025-08-26 Vanishing Angular Viscosity Limit For Micropolar Fluid Model In $\mathbb{R}_+^2$: Boundary Layer And Optimal Convergence Rate Yinghui Wang et.al. 2508.18980 null
2025-08-26 Linear approximations of large deviations: Cubic diffusion test Pelerine Tsobgni Nyawo et.al. 2508.18977 null
2025-08-26 Generative AI in Map-Making: A Technical Exploration and Its Implications for Cartographers Claudio Affolter et.al. 2508.18959 null
2025-08-26 Energy-Based Flow Matching for Generating 3D Molecular Structure Wenyin Zhou et.al. 2508.18949 null
2025-08-26 Stochastic Forces Enhance Tracer Diffusion in Non-motile Active Matter Henry Alston et.al. 2508.18882 null
2025-08-26 Experimental investigation of turbulence and turbulent thermal diffusion in strongly inhomogeneous and anisotropic forced convection E. Zarbib et.al. 2508.18865 null
2025-08-26 Super and Weak Poincaré Inequalities for Sticky-Reflected Diffusion Processes Feng-Yu Wang et.al. 2508.18846 null
2025-08-26 Single-Photon Detection in Few-Layer NbSe$_2$ Superconducting Nanowires Lucio Zugliani et.al. 2508.18843 null
2025-08-26 Quantum-Circuit-Based Visual Fractal Image Generation in Qiskit and Analytics Hillol Biswas et.al. 2508.18835 null
2025-08-26 On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation Adrian Meise et.al. 2508.18833 null
2025-08-26 Asymptotic limit of a vector-valued Allen-Cahn equation for phase transition dynamics Huan Dong et.al. 2508.18754 null
2025-08-26 Joint Time-Position Statistics and Fisher Information in Drift-Diffusion Molecular Channels Yun-Feng Lo et.al. 2508.18680 null
2025-08-26 ROSE: Remove Objects with Side Effects in Videos Chenxuan Miao et.al. 2508.18633 null
2025-08-26 Wan-S2V: Audio-Driven Cinematic Video Generation Xin Gao et.al. 2508.18621 null
2025-08-26 SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis Xiaohao Sun et.al. 2508.18597 null
2025-08-26 Search for the radiative decay of the cosmic neutrino background through spectral measurements of the cosmic infrared background using PRIMA Yuji Takeuchi et.al. 2508.18590 null
2025-08-25 Controllable Single-shot Animation Blending with Temporal Conditioning Eleni Tselepi et.al. 2508.18525 null
2025-08-25 VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results Sizhuo Ma et.al. 2508.18445 null
2025-08-25 Phase-Field Model of Freeze Casting Kaihua Ji et.al. 2508.18416 null
2025-08-25 Hillas meets Eddington: the case for blazars as ultra-high-energy neutrino sources Xavier Rodrigues et.al. 2508.18345 null
2025-08-25 ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models Haitang Feng et.al. 2508.18271 null
2025-08-25 SafeBimanual: Diffusion-based Trajectory Optimization for Safe Bimanual Manipulation Haoyuan Deng et.al. 2508.18268 null
2025-08-25 Diffusiophoretic corner flows Dobromir Nowak et.al. 2508.18233 null
2025-08-25 Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance Ayce Idil Aytekin et.al. 2508.18213 null
2025-08-25 New shell-model calculations of the $δ_C$ correction to superallowed $0^+\rightarrow0^+$ nuclear $β$ decay and standard-model implications L. Xayavong et.al. 2508.18189 null
2025-08-25 SpotEdit: Evaluating Visually-Guided Image Editing Methods Sara Ghazanfari et.al. 2508.18159 null
2025-08-25 Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation Haijian Ma et.al. 2508.18148 null
2025-08-25 Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem Zhicong Tang et.al. 2508.18095 null
2025-08-26 Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation Yaqi Li et.al. 2508.18032 null
2025-08-25 HD 28471: a near-resonant compact multiplanet system with a possible cold giant planet A. T. Stevenson et.al. 2508.18000 null
2025-08-26 Solute dispersion in axially strained tube flows: Large-time asymptotics and Ornstein-Uhlenbeck Gaussian profiles Prabakaran Rajamanickam et.al. 2508.17982 null
2025-08-25 Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech Dimme de Groot et.al. 2508.17980 null
2025-08-26 Generative Feature Imputing – A Technique for Error-resilient Semantic Communication Jianhao Huang et.al. 2508.17957 null
2025-08-25 Nodal error behind discrepancies between coupled cluster and diffusion Monte Carlo: AcOH dimer case study S. Lambie et.al. 2508.17937 null
2025-08-25 Parallel Nodal Interior-Penalty Discontinuous Galerkin Methods for the Subsonic Compressible Navier-Stokes Equations: Applications to Vortical Flows and VIV Problems Spiros Zafeiris et.al. 2508.17917 null
2025-08-25 Quasi-likelihood inference for SDE with mixed-effects observed at high frequency Maud Delattre et.al. 2508.17910 null
2025-08-25 Local Well-Posedness of the Cahn-Hilliard-Biot System Helmut Abels et.al. 2508.17893 null
2025-08-27 Vocoder-Projected Feature Discriminator Takuhiro Kaneko et.al. 2508.17874 null
2025-08-25 FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation Takuhiro Kaneko et.al. 2508.17868 null
2025-08-25 Diffusion-Based Data Augmentation for Medical Image Segmentation Maham Nazir et.al. 2508.17844 null
2025-08-25 Threshold Diffusions Lina Ji et.al. 2508.17812 null
2025-08-25 CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation Mingyue Yang et.al. 2508.17760 null
2025-08-25 SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling Fanjiang Ye et.al. 2508.17756 null
2025-08-25 DiffusionGS: Generative Search with Query Conditioned Diffusion in Kuaishou Qinyao Li et.al. 2508.17754 null
2025-08-25 Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework Koichiro Kamide et.al. 2508.17726 null
2025-08-25 Instant Preference Alignment for Text-to-Image Diffusion Models Yang Li et.al. 2508.17718 null
2025-08-25 CATformer: Contrastive Adversarial Transformer for Image Super-Resolution Qinyi Tian et.al. 2508.17708 null
2025-08-25 On the Edge of Memorization in Diffusion Models Sam Buchanan et.al. 2508.17689 null
2025-08-25 Calculating the power spectrum in stochastic inflation by Monte Carlo simulation and least squares curve fitting Koichi Miyamoto et.al. 2508.17654 null
2025-08-27 ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion Nima Kondori et.al. 2508.17631 null
2025-08-25 Effects of Near-Field Hydrodynamic Interactions on Bacterial Dynamics Near a Solid Surface Baopi Liu et.al. 2508.17626 null
2025-08-25 Steering When Necessary: Flexible Steering Large Language Models with Backtracking Jinwei Gan et.al. 2508.17621 null
2025-08-25 Preference Trajectory Modeling via Flow Matching for Sequential Recommendation Li Li et.al. 2508.17618 null
2025-08-25 JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on Aowen Wang et.al. 2508.17614 null
2025-08-25 HotSpotter - Patterned Species Instance Recognition Jonathan P. Crall et.al. 2508.17605 null
2025-08-25 GWM: Towards Scalable Gaussian World Models for Robotic Manipulation Guanxing Lu et.al. 2508.17600 null
2025-08-25 HERO: Hierarchical Extrapolation and Refresh for Efficient World Models Quanjian Song et.al. 2508.17588 null
2025-08-24 Controllability of a system of non-autonomous degenerate coupled parabolic equations Alfredo S. Gamboa et.al. 2508.17546 null
2025-08-24 Universal scaling of higher-order cumulants in quantum isotropic spin chains Shixian Jiang et.al. 2508.17535 null
2025-08-24 Learning Reaction-Diffusion Kinetics from Mechanical Information Royal C. Ihuaenyi et.al. 2508.17523 null
2025-08-24 Variational Shape Inference for Grasp Diffusion on SE(3) S. Talha Bukhari et.al. 2508.17482 null
2025-08-24 T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation Kaiyue Sun et.al. 2508.17472 null
2025-08-24 A Synthetic Dataset for Manometry Recognition in Robotic Applications Pedro Antonio Rabelo Saraiva et.al. 2508.17468 null
2025-08-24 Bias Amplification in Stable Diffusion’s Representation of Stigma Through Skin Tones and Their Homogeneity Kyra Wilson et.al. 2508.17465 null
2025-08-24 Disentangled Geometry and Appearance for Efficient Multi-View Surface Reconstruction and Rendering Qitong Zhang et.al. 2508.17436 null
2025-08-24 An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing Zihan Liang et.al. 2508.17435 null
2025-08-24 TinySR: Pruning Diffusion for Real-World Image Super-Resolution Linwei Dong et.al. 2508.17434 null
2025-08-24 Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling Haochen You et.al. 2508.17426 null
2025-08-24 Asteroid Rotation Periods: Statistical Analysis in the Diameter-Spin Distribution Maryam Nastaran et.al. 2508.17415 null
2025-08-24 MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling Haoyu Wang et.al. 2508.17404 null
2025-08-24 Stability and uniqueness of bounded weak solutions to triangular degenerate cross-diffusion systems Xiuqing Chen et.al. 2508.17379 null
2025-08-24 ShaLa: Multimodal Shared Latent Space Modelling Jiali Cui et.al. 2508.17376 null
2025-08-24 Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation Guoqing Zhang et.al. 2508.17364 null
2025-08-24 DiCache: Let Diffusion Model Determine Its Own Cache Jiazi Bu et.al. 2508.17356 null
2025-08-24 ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation Yuxuan Song et.al. 2508.17345 null
2025-08-24 Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing Tristan S. W. Stevens et.al. 2508.17326 null
2025-08-24 An improved nonlocal electron heat transport model for magnetized plasmas Z. H. Chen et.al. 2508.17309 null
2025-08-24 PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing Peilin Xiong et.al. 2508.17302 null
2025-08-24 FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising Zhihao Chen et.al. 2508.17299 null
2025-08-24 4D Visual Pre-training for Robot Learning Chengkai Hou et.al. 2508.17230 null
2025-08-24 Multi-Metric Preference Alignment for Generative Speech Restoration Junan Zhang et.al. 2508.17229 null
2025-08-24 Effects of Geometric configuration in relativistic isobaric collisions at $\sqrt{s_{NN}}=200$ GeV Akash Das et.al. 2508.17227 null
2025-08-24 MMCIG: Multimodal Cover Image Generation for Text-only Documents and Its Dataset Construction via Pseudo-labeling Hyeyeon Kim et.al. 2508.17199 null
2025-08-23 Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities Yili Jin et.al. 2508.17163 null
2025-08-23 SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks Zhenliang Gan et.al. 2508.17121 null
2025-08-23 CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference Luben M. C. Cabezas et.al. 2508.17077 null
2025-08-23 LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening Halid Abdulrahim Kadi et.al. 2508.17070 null
2025-08-23 SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation Peng Hu et.al. 2508.17062 null
2025-08-23 PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models Xianjing Cheng et.al. 2508.17050 null
2025-08-23 Styleclone: Face Stylization with Diffusion Based Data Augmentation Neeraj Matiyali et.al. 2508.17045 null
2025-08-23 A Novel Local Focusing Mechanism for Deepfake Detection Generalization Mingliang Li et.al. 2508.17029 null
2025-08-23 Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation Konstantina Nikolaidou et.al. 2508.17017 null
2025-08-23 An improved lattice Boltzmann method with a novel conservative boundary scheme for viscoelastic fluid flows Yuan Yu et.al. 2508.16997 null
2025-08-23 Score Matching on Large Geometric Graphs for Cosmology Generation Diana-Alexandra Onutu et.al. 2508.16990 null
2025-08-23 HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching Liang Feng et.al. 2508.16984 null
2025-08-23 Shape optimization problems with random coefficients via the penalty method Xiaowei Pang et.al. 2508.16961 null
2025-08-23 RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze Ruicheng Zhang et.al. 2508.16956 null
2025-08-23 Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model Fan Ding et.al. 2508.16947 null
2025-08-23 Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter Lei Jiang et.al. 2508.16939 null
2025-08-23 HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation Sizhe Shan et.al. 2508.16930 null
2025-08-23 Structural Energy-Guided Sampling for View-Consistent Text-to-3D Qing Zhang et.al. 2508.16917 null
2025-08-23 Remarks on the three-dimensional Navier-Stokes equations with Lions’ exponent forced by space-time white noise Kazuo Yamazaki et.al. 2508.16906 null
2025-08-23 Enhanced shape recovery in advection–diffusion problems via a novel ADMM-based CCBM optimization Elmehdi Cherrat et.al. 2508.16898 null
2025-08-23 Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network Pouya Shiri et.al. 2508.16897 null
2025-08-23 Delta-SVD: Efficient Compression for Personalized Text-to-Image Models Tangyuan Zhang et.al. 2508.16863 null
2025-08-23 Subtleties of UV-crosslinking in microfluidic particle fabrication: UV dosage and intensity matter Sabrina Marnoto et.al. 2508.16862 null
2025-08-23 Intelligent Shanghai Typhoon Model (ISTM): A generative probabilistic emulator for typhoon hybrid modeling Zeyi Niu et.al. 2508.16851 null
2025-08-23 NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows Denis Tarasov et.al. 2508.16845 null
2025-08-22 A Fluctuating Hydrodynamics Model for Nanoscale Surfactant-laden Interfaces John B. Bell et.al. 2508.16820 null
2025-08-22 Two-Step Bose-Einstein Condensation of an ideal Magnetized Charged Bosonic gas under neutron star-like conditions Amanda Castillo Ayon et.al. 2508.16799 null
2025-08-22 TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling Yuancheng Wang et.al. 2508.16790 null
2025-08-22 Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data Stefania L. Moroianu et.al. 2508.16783 null
2025-08-26 Characterising the short-orbital period X-ray transient Swift J1910.2-0546 J. M. Corral-Santana et.al. 2508.16775 null
2025-08-22 Spontaneous spiral patterns etched on Germanium Yilin Wong et.al. 2508.16764 null
2025-08-22 A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers Marco N. Bochernitsan et.al. 2508.16752 null
2025-08-22 Hamiltonian Simulation for Advection-Diffusion Equation with arbitrary transport field Niladri Gomes et.al. 2508.16728 null
2025-08-22 MV-RAG: Retrieval Augmented Multiview Diffusion Yosef Dayani et.al. 2508.16577 null
2025-08-22 Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution Tainyi Zhang et.al. 2508.16557 null
2025-08-22 Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning Xuan Zhang et.al. 2508.16524 null
2025-08-22 Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation Zhijian Zhou et.al. 2508.16521 null
2025-08-22 ARSP: Automated Repair of Verilog Designs via Semantic Partitioning Bingkun Yao et.al. 2508.16517 null
2025-08-22 Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation Chun-Peng Chang et.al. 2508.16512 null
2025-08-22 Underdamped Langevin MCMC with third order convergence Maximilian Scott et.al. 2508.16485 null
2025-08-22 Large-scale concentration and relaxation for mean-field Langevin particle systems Songbo Wang et.al. 2508.16428 null
2025-08-22 Multiscale Growth Kinetics of Model Biomolecular Condensates Under Passive and Active Conditions Tamizhmalar Sundararajan et.al. 2508.16398 null
2025-08-22 Parrondo paradox in quantum image encryption Łukasz Pawela et.al. 2508.16382 null
2025-08-22 Observation of negative orbital torque from Vanadium Nikhil Vijayan et.al. 2508.16339 null
2025-08-22 A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions Nishant Jain et.al. 2508.16306 null
2025-08-22 Towards Diagnostic Quality Flat-Panel Detector CT Imaging Using Diffusion Models Hélène Corbaz et.al. 2508.16252 null
2025-08-22 Numerical solution of the time fractional nonlinear Fisher-KPP diffusion-reaction equation using the local domain boundary element method Theodore V. Gortsas et.al. 2508.16241 null
2025-08-22 UniEM-3M: A Universal Electron Micrograph Dataset for Microstructural Segmentation and Generation Nan Wang et.al. 2508.16239 null
2025-08-22 PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting Hohyun Na et.al. 2508.16217 null
2025-08-22 OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models Huanpeng Chu et.al. 2508.16212 null
2025-08-22 Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers Shikang Zheng et.al. 2508.16211 null
2025-08-22 Competition and Attraction Improve Model Fusion João Abrantes et.al. 2508.16204 null
2025-08-22 FuXi-TC: A generative framework integrating deep learning and physics-based models for improved tropical cyclone forecasts Shan Guo et.al. 2508.16168 null
2025-08-22 Transport Properties of QGP within a Bayesian Holographic QCD Model Bing Chen et.al. 2508.16167 null
2025-08-22 RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution Haodong He et.al. 2508.16158 null
2025-08-22 On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models Yi Zhang et.al. 2508.16154 null
2025-08-22 Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design Ayyüce Begüm Bektaş et.al. 2508.16097 null
2025-08-22 Two-flow Feedback Multi-scale Progressive Generative Adversarial Network Sun Weikai et.al. 2508.16089 null
2025-08-22 A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection Qifeng Liu et.al. 2508.16069 null
2025-08-21 Clinically-Informed Preprocessing Improves Stroke Segmentation in Low-Resource Settings Juampablo E. Heras Rivera et.al. 2508.16004 null
2025-08-21 Multiscale Analysis of a Kinetic Model of Confined Suspensions of Self-Propelled Rods Leonid Berlyand et.al. 2508.16003 null
2025-08-21 Universal Fluctuations in the Tail Probability for d=2 Random Walks in Space-Time Random Environments Franscesca Ark et.al. 2508.15999 null
2025-08-21 Diverse Signer Avatars with Manual and Non-Manual Feature Modelling for Sign Language Production Mohamed Ilyes Lakhal et.al. 2508.15988 null
2025-08-21 UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation Zhaodong Jiang et.al. 2508.15972 null
2025-08-21 Physical blowups via buffered time change in a mean-field neural network Nikolaos Papadopoulos et.al. 2508.15961 null
2025-08-21 Structure-Preserving Medical Image Generation from a Latent Graph Representation Kevin Arias et.al. 2508.15920 null
2025-08-21 Text-Driven 3D Hand Motion Generation from Sign Language Data Léore Bensabath et.al. 2508.15902 null
2025-08-21 Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning Yijun Liu et.al. 2508.15874 null
2025-08-21 CineScale: Free Lunch in High-Resolution Cinematic Visual Generation Haonan Qiu et.al. 2508.15774 null
2025-08-21 Scaling Group Inference for Diverse and High-Quality Generation Gaurav Parmar et.al. 2508.15773 null
2025-08-21 Visual Autoregressive Modeling for Instruction-Guided Image Editing Qingyang Mao et.al. 2508.15772 null
2025-08-21 Waver: Wave Your Way to Lifelike Video Generation Yifu Zhang et.al. 2508.15761 null
2025-08-21 Skyrmion Lattice Order Controlled by Confinement Geometry Raphael Gruber et.al. 2508.15758 null
2025-08-21 Spatial Super-Infection and Co-Infection Dynamics in Networks Alyssa Yu et.al. 2508.15740 null
2025-08-21 Probability Density from Latent Diffusion Models for Out-of-Distribution Detection Joonas Järve et.al. 2508.15737 null
2025-08-21 The Status of the Astrophysical Parameters of Upper Main Sequence Stars Lukas Kueß et.al. 2508.15722 null
2025-08-21 WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception Zhiheng Liu et.al. 2508.15720 null
2025-08-21 Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation Nikita Kachaev et.al. 2508.15663 null
2025-08-21 When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding Pengcheng Fang et.al. 2508.15641 null
2025-08-21 Are Virtual DES Images a Valid Alternative to the Real Ones? Ana C. Perre et.al. 2508.15594 null
2025-08-21 Lattice distortions and non-sluggish diffusion in BCC refractory high entropy alloys Jingfeng Zhang et.al. 2508.15558 null
2025-08-21 Dream 7B: Diffusion Large Language Models Jiacheng Ye et.al. 2508.15487 null
2025-08-21 Reevaluating Anomalous Electric Fields at the Air-Water Interface: A Surface-Specific Spectroscopic Survey Joseph C. Shirley et.al. 2508.15422 null
2025-08-21 Speckle suppression in digital in-line holographic microscopy through liquid crystal dynamic scattering Emilia Wdowiak et.al. 2508.15419 null
2025-08-21 Numerical Analysis of Unsupervised Learning Approaches for Parameter Identification in PDEs Siyu Cen et.al. 2508.15381 null
2025-08-21 Diffusion-driven pattern formation in an opinion dynamical network model Tim Mauch et.al. 2508.15377 null
2025-08-21 Performance Analysis of RIS-Aided High-Mobility Wireless Systems Hanwen Hu et.al. 2508.15375 null
2025-08-22 Analytical Theory of Chiral Active Particle Transport in a Fluctuating Density Field Jayam Joshi et.al. 2508.15366 null
2025-08-21 The effect of multi-occupancy traps on the diffusion and retention of multiple hydrogen isotopes in irradiated tungsten and vanadium Sanjeet Kaur et.al. 2508.15341 null
2025-08-21 Discovering correlations between metal foam thermal characteristics and non-Fourier behavior Anna Fehér et.al. 2508.15340 null
2025-08-21 Interface fluctuations for $1$D stochastic Allen-Cahn equation – singular regime Weijun Xu et.al. 2508.15319 null
2025-08-21 VideoEraser: Concept Erasure in Text-to-Video Diffusion Models Naen Xu et.al. 2508.15314 null
2025-08-21 HIP: Model-Agnostic Hypergraph Influence Prediction via Distance-Centrality Fusion and Neural ODEs Su-Su Zhang et.al. 2508.15312 null
2025-08-21 Modeling Long-term User Behaviors with Diffusion-driven Multi-interest Network for CTR Prediction Weijiang Lai et.al. 2508.15311 null
2025-08-21 Contribution of Globular Clusters to Diffuse Gamma-ray Emission from Galactic Plane Jiayin He et.al. 2508.15295 null
2025-08-21 Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing Ruilin Zhou et.al. 2508.15267 null
2025-08-21 Pathology-Informed Latent Diffusion Model for Anomaly Detection in Lymph Node Metastasis Jiamu Wang et.al. 2508.15236 null
2025-08-21 Pretrained Diffusion Models Are Inherently Skipped-Step Samplers Wenju Xu et.al. 2508.15233 null
2025-08-21 Collaborative Multi-Modal Coding for High-Quality 3D Generation Ziang Cao et.al. 2508.15228 null
2025-08-21 GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design Wen-Fan Wang et.al. 2508.15227 null
2025-08-21 A rutile-based homologous series Na(PtO$_2$)$_{2n+1}$ discovered by computationally assisted high-pressure synthesis Yasuhito Kobayashi et.al. 2508.15223 null
2025-08-21 See it. Say it. Sorted: Agentic System for Compositional Diagram Generation Hantao Zhang et.al. 2508.15222 null
2025-08-21 Obstacle-tuned transition from chaotic to coherent vortex flows and odd diffusion in chiral active fluids Joscha Mecke et.al. 2508.15210 null
2025-08-21 Quantum Differential Equation Solvers with Low State Preparation Cost: Eliminating the Time Dependence in Dissipative Equations Gengzhi Yang et.al. 2508.15170 null
2025-08-21 MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion Xuyang Chen et.al. 2508.15169 null
2025-08-21 Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors Jeonghyun Noh et.al. 2508.15151 null
2025-08-21 Electron-Ion Equilibration in the Merging Galaxy Cluster Abell 665 Christian Norseth et.al. 2508.15138 null
2025-08-24 Side Effects of Erasing Concepts from Diffusion Models Shaswati Saha et.al. 2508.15124 null
2025-08-20 Microstructural and preliminary optical and microwave characterization of erbium doped CaMoO$_4$ thin films Ignas Masiulionis et.al. 2508.15122 null
2025-08-24 CurveFlow: Curvature-Guided Flow Matching for Image Generation Yan Luo et.al. 2508.15093 null
2025-08-20 Sampling by averaging: A multiscale approach to score estimation Paula Cordero-Encinar et.al. 2508.15069 null
2025-08-20 Asymptotic analysis on narrow tubes: narrow escape problems and diffusion processes Wen-Tai Hsu et.al. 2508.15060 null
2025-08-20 Correlating Particle Acceleration Rates with Plasma Conditions in Colliding Wind Binaries Gislaine B Cordeiro et.al. 2508.15059 null
2025-08-20 An MRI Atlas of the Human Fetal Brain: Reference and Segmentation Tools for Fetal Brain MRI Analysis Mahdi Bagheri et.al. 2508.15034 null
2025-08-20 Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement Chunming He et.al. 2508.15027 null
2025-08-20 TAIGen: Training-Free Adversarial Image Generation via Diffusion Models Susim Roy et.al. 2508.15020 null
2025-08-20 Probing Magnetic Properties of RuO$_{2}$ Heterostructures Through the Ferromagnetic Layer Frank M. Abel et.al. 2508.15004 null
2025-08-20 LyLA-Therm: Lyapunov-based Langevin Adaptive Thermodynamic Neural Network Controller Saiedeh Akbari et.al. 2508.14989 null
2025-08-20 Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System Joydeep Chandra et.al. 2508.14976 null
2025-08-20 Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI Oliver Welin Odeback et.al. 2508.14950 null
2025-08-19 Inference Time Debiasing Concepts in Diffusion Models Lucas S. Kupssinskü et.al. 2508.14933 null
2025-08-19 TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation Jiacheng Xie et.al. 2508.14932 null
2025-08-20 Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs Haokun Lin et.al. 2508.14896 null
2025-08-20 Virtual Community: An Open World for Humans, Robots, and Society Qinhong Zhou et.al. 2508.14893 null
2025-08-20 Squeezed Diffusion Models Jyotirmai Singh et.al. 2508.14871 null
2025-08-20 Critical trajectories in kinetic geometry Helge Dietert et.al. 2508.14868 null
2025-08-20 Universal winding properties of chiral active motion Ion Santra et.al. 2508.14862 null
2025-08-20 Physics-Informed ML Exploration of Structure-Transport Relationships in Hard Carbon Nikhil Rampal et.al. 2508.14849 null
2025-08-20 TransLight: Image-Guided Customized Lighting Control with Generative Decoupling Zongming Li et.al. 2508.14814 null
2025-08-20 Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization Canyu Zhao et.al. 2508.14811 null
2025-08-20 Cross-Modality Controlled Molecule Generation with Diffusion Language Model Yunzhe Zhang et.al. 2508.14748 null
2025-08-20 Modeling the impact of temperature and bird migration on the spread of West Nile virus Pride Duve et.al. 2508.14740 null
2025-08-20 GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting Jiaxin Wei et.al. 2508.14717 null
2025-08-20 The heating and cooling of 2D electrons at low temperatures A. K. Jain et.al. 2508.14694 null
2025-08-20 Virtual Multiplex Staining for Histological Images using a Marker-wise Conditioned Diffusion Model Hyun-Jic Oh et.al. 2508.14681 null
2025-08-21 Phase space transport, quasilinear diffusion and locality in phase velocity Didier Bénisti et.al. 2508.14657 null
2025-08-20 AnchorSync: Global Consistency Optimization for Long Video Editing Zichi Liu et.al. 2508.14609 null
2025-08-20 Call Option Price using Pearson Diffusion Processes Tapan Kar et.al. 2508.14577 null
2025-08-20 Minimizing Task-Oriented Age of Information for Remote Monitoring with Pre-Identification Shuying Gan et.al. 2508.14575 null
2025-08-20 EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement Bin Wen et.al. 2508.14525 null
2025-08-20 SATURN: Autoregressive Image Generation Guided by Scene Graphs Thanh-Nhan Vo et.al. 2508.14502 null
2025-08-20 Multimode Fiber Imaging Based on Hydrogel Fiber Lele He et.al. 2508.14501 null
2025-08-20 DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion Moyu Zhang et.al. 2508.14500 null
2025-08-20 Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration Haoran Bai et.al. 2508.14483 null
2025-08-20 DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing Weitao Wang et.al. 2508.14465 null
2025-08-20 Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering Shanlin Sun et.al. 2508.14461 null
2025-08-20 Early Evolution of the Cavity and Core of a Coronal Mass Ejection in the Inner Corona Shuting Li et.al. 2508.14455 null
2025-08-20 FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy Yijin Chen et.al. 2508.14441 null
2025-08-20 MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion Fei Peng et.al. 2508.14440 null
2025-08-20 Weakly-Convex Regularization for Magnetic Resonance Image Denoising Akash Prabakar et.al. 2508.14438 null
2025-08-20 FOCUS: Frequency-Optimized Conditioning of DiffUSion Models for mitigating catastrophic forgetting during Test-Time Adaptation Gabriel Tjio et.al. 2508.14437 null
2025-08-20 HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation Bing Han et.al. 2508.14431 null
2025-08-20 Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states Samarth Gupta et.al. 2508.14413 null
2025-08-20 A Real-world Display Inverse Rendering Dataset Seokjun Choi et.al. 2508.14411 null
2025-08-20 CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities Yue Gong et.al. 2508.14405 null
2025-08-20 Img2ST-Net: Efficient High-Resolution Spatial Omics Prediction from Whole Slide Histology Images via Fully Convolutional Image-to-Image Learning Junchao Zhu et.al. 2508.14393 null
2025-08-20 Physics-Constrained Diffusion Reconstruction with Posterior Correction for Quantitative and Fast PET Imaging Yucun Hou et.al. 2508.14364 null
2025-08-20 Organ-Agents: Virtual Human Physiology Simulator via LLMs Rihao Chang et.al. 2508.14357 null
2025-08-20 SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion Junwei Su et.al. 2508.14352 null
2025-08-20 A Non-Asymptotic Convergent Analysis for Scored-Based Graph Generative Model via a System of Stochastic Differential Equations Junwei Su et.al. 2508.14351 null
2025-08-20 Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation Lingkai Kong et.al. 2508.14342 null
2025-08-20 Modeling oxygen-void interactions in uranium nitride Mohamed AbdulHameed et.al. 2508.14329 null
2025-08-20 MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation Guile Wu et.al. 2508.14327 null
2025-08-20 Modeling of silver transport in cubic SiC: Integrating molecular dynamics, bounds averaging, and uncertainty quantification Mohamed AbdulHameed et.al. 2508.14325 null
2025-08-19 Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning Said Djafar Said et.al. 2508.14276 null
2025-08-19 Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation Minyi Huang et.al. 2508.14236 null
2025-08-19 CO Adsorption Sites on Interstellar Water Ices Explored with Machine Learning Potentials. Binding energy distributions and snowline Giulia M. Bovolenta et.al. 2508.14219 null
2025-08-19 A well-balanced gas-kinetic scheme with adaptive mesh refinement for shallow water equations Gaocheng Liu et.al. 2508.14216 null
2025-08-19 Nonadiabatic force matching for alchemical free-energy estimation Jorge L. Rosa-Raíces et.al. 2508.14179 null
2025-08-19 DPad: Efficient Diffusion Language Models with Suffix Dropout Xinhua Chen et.al. 2508.14148 null
2025-08-18 3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models Jolanta Mozyrska et.al. 2508.14122 null
2025-08-19 InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing Shaoshu Yang et.al. 2508.14033 null
2025-08-19 Electrochemical response of biological membranes to localized currents and external electric fields Joshua B. Fernandes et.al. 2508.14001 null
2025-08-19 Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment Samuel Seligardi et.al. 2508.13989 null
2025-08-20 Towards a general diffusion-based information quality assessment model Anthony Lopes Temporao et.al. 2508.13927 null
2025-08-19 Learning to See Through Flare Xiaopeng Peng et.al. 2508.13907 null
2025-08-19 Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation Thanh Nguyen et.al. 2508.13904 null
2025-08-19 Diffusion-Driven High-Dimensional Variable Selection Minjie Wang et.al. 2508.13890 null
2025-08-19 Toward Deployable Multi-Robot Collaboration via a Symbolically-Guided Decision Transformer Rathnam Vidushika Rasanji et.al. 2508.13877 null
2025-08-19 SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation Paul Grimal et.al. 2508.13866 null
2025-08-19 Stochastic synaptic dynamics under learning Jakob Stubenrauch et.al. 2508.13846 null
2025-08-19 UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion Zihan Liang et.al. 2508.13843 null
2025-08-20 Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction Niklas Bubeck et.al. 2508.13826 null
2025-08-19 COCO: Cognitive Operating System with Continuous Oversight for Multi-Agent Workflow Reliability Churong Liang et.al. 2508.13815 null
2025-08-19 Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs Juncheng Xie et.al. 2508.13805 null
2025-08-19 Elementary Monte Carlo model of the anisotropic recrystallization and antiripening under intensive stirring and high supersaturations Serhii Abakumov et.al. 2508.13799 null
2025-08-19 Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing Feng-Lin Liu et.al. 2508.13797 null
2025-08-19 DegDiT: Controllable Audio Generation with Dynamic Event Graph Guided Diffusion Transformer Yisu Liu et.al. 2508.13786 null
2025-08-19 Comparing Conditional Diffusion Models for Synthesizing Contrast-Enhanced Breast MRI from Pre-Contrast Images Sebastian Ibarra et.al. 2508.13776 null
2025-08-19 Eliminating Rasterization: Direct Vector Floor Plan Generation with DiffPlanner Shidong Wang et.al. 2508.13738 null
2025-08-19 Simulation of Impact-induced seismic shaking on asteroid (25143) Itokawa to address its resurfacing process Sunho Jin et.al. 2508.13727 null
2025-08-19 Unravelling disorder in kagome Yb$_{0.5}$Co$_3$Ge$_3$ A. Korshunov et.al. 2508.13719 null
2025-08-19 Diffuse-Layer Capacitance at the Potential of Zero Charge in Binary Mixtures Yuki Uematsu et.al. 2508.13691 null
2025-08-19 PHECT: A lightweight computation tool for pulsar halo emission Kun Fang et.al. 2508.13667 null
2025-08-19 Calibrated Semantic Diffusion: A p-Laplacian Synthesis with Learnable Dissipation, Quantified Constants, and Graph-Aware Calibration Faruk Alpay et.al. 2508.13658 null
2025-08-19 Personalized Subgraph Federated Learning with Sheaf Collaboration Wenfei Liang et.al. 2508.13642 null
2025-08-19 V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task Jikai Chen et.al. 2508.13634 null
2025-08-19 Text2Weight: Bridging Natural Language and Neural Network Weight Spaces Bowen Tian et.al. 2508.13633 null
2025-08-20 DiffIER: Optimizing Diffusion Models with Iterative Error Reduction Ao Chen et.al. 2508.13628 null
2025-08-19 Bridging Clear and Adverse Driving Conditions Yoel Shapiro et.al. 2508.13592 null
2025-08-19 Temporal-Conditional Referring Video Object Segmentation with Noise-Free Text-to-Video Diffusion Model Ruixin Zhang et.al. 2508.13584 null
2025-08-19 Overcoming Quantum Resistivity Scaling in Nanoscale Interconnects Using Delafossite PdCoO2 Seoung-Hun Kang et.al. 2508.13573 null
2025-08-19 A stability-enhanced nonstandard finite difference framework for solving one and two-dimensional nonlocal differential equations Shweta Kumari et.al. 2508.13542 null
2025-08-20 2D Gaussians Meet Visual Tokenizer Yiang Shi et.al. 2508.13515 null
2025-08-19 A Monte Carlo simulation on the scattering coefficients of solar radio wave propagation Jiazhen Gan et.al. 2508.13494 null
2025-08-19 The Lévy flight foraging hypothesis: comparison between stationary distributions and anomalous diffusion Serena Dipierro et.al. 2508.13487 null
2025-08-19 EventTSF: Event-Aware Non-Stationary Time Series Forecasting Yunfeng Ge et.al. 2508.13434 null
2025-08-19 Hyperactive Magnetar Eruptions: Giant Flares, Baryon Ejections, and FRBs Ashley Bransgrove et.al. 2508.13419 null
2025-08-18 Counterfactual Probabilistic Diffusion with Expert Models Wenhao Mu et.al. 2508.13355 null
2025-08-18 Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction Sedigheh Dargahi et.al. 2508.13340 null
2025-08-18 Resistive diffusion and radiative cooling effects in magnetized oblique shocks R. Datta et.al. 2508.13310 null
2025-08-18 GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis Sirshapan Mitra et.al. 2508.13300 null
2025-08-18 Field-level Reconstruction from Foreground-Contaminated 21-cm Maps Shu-Fan Chen et.al. 2508.13265 null
2025-08-18 4DNeX: Feed-Forward 4D Generative Modeling Made Easy Zhaoxi Chen et.al. 2508.13154 null
2025-08-18 MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models Haoyu He et.al. 2508.13148 null
2025-08-18 Some semi-decoupled algorithms with optimal convergence for a four-field linear thermo-poroelastic model Ziliang Li et.al. 2508.13109 null
2025-08-18 Precise Action-to-Video Generation Through Visual Action Prompts Yuang Wang et.al. 2508.13104 null
2025-08-18 Denoising diffusion models for inverse design of inflatable structures with programmable deformations Sara Karimi et.al. 2508.13097 null
2025-08-18 DMS: Diffusion-Based Multi-Baseline Stereo Generation for Improving Self-Supervised Depth Estimation Zihua Liu et.al. 2508.13091 null
2025-08-18 ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset Qingwen Zeng et.al. 2508.13078 null
2025-08-18 From Transthoracic to Transesophageal: Cross-Modality Generation using LoRA Diffusion Emmanuel Oladokun et.al. 2508.13077 null
2025-08-18 Reinforced Context Order Recovery for Adaptive Reasoning and Planning Long Ma et.al. 2508.13070 null
2025-08-18 Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping Siddharth Khandelwal et.al. 2508.13065 null
2025-08-19 PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models Pengcheng Huang et.al. 2508.13021 null
2025-08-18 EgoTwin: Dreaming Body and View in First Person Jingqiao Xiu et.al. 2508.13013 null
2025-08-18 Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Xianglong He et.al. 2508.13009 null
2025-08-18 Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs Jose L. Bonilla et.al. 2508.12987 null
2025-08-18 The Leibenson process Viorel Barbu et.al. 2508.12979 null
2025-08-18 Compact Attention: Exploiting Structured Spatio-Temporal Sparsity for Fast Video Generation Qirui Li et.al. 2508.12969 null
2025-08-18 Self-Consistent Heating of the Magnetically Closed Solar Corona: Generation of Nanoflares, Thermodynamic Response of the Plasma and Observational Signatures Craig D. Johnston et.al. 2508.12952 null
2025-08-18 Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models Jianshu Zeng et.al. 2508.12945 null
2025-08-19 Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data Kyriaki-Margarita Bintsi et.al. 2508.12942 null
2025-08-18 7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models Elena Izzo et.al. 2508.12919 null
2025-08-18 FoleySpace: Vision-Aligned Binaural Spatial Audio Generation Lei Zhao et.al. 2508.12918 null
2025-08-18 S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models Chubin Chen et.al. 2508.12880 null
2025-08-18 E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model Ronghao Lin et.al. 2508.12854 null
2025-08-18 Strongly correlated stochastic systems Marco Biroli et.al. 2508.12818 null
2025-08-18 Next Visual Granularity Generation Yikai Wang et.al. 2508.12811 null
2025-08-18 Wavy Transformer Satoshi Noguchi et.al. 2508.12787 null
2025-08-18 Right and Wrong Ansätze for Nonlinear Waves in Stochastic PDEs C. H. S. Hamster et.al. 2508.12786 null
2025-08-18 Leveraging Diffusion Models for Stylization using Multiple Style Images Dan Ruta et.al. 2508.12784 null
2025-08-18 TURB-Scalar. A large database of passive scalar fields advected by 2D Navier-Stokes in the turbulent inverse cascade regime Chiara Calascibetta et.al. 2508.12762 null
2025-08-18 Effects of Defects on Thermal Transport across Solid/Solid Heterogeneous Interfaces Ershuai Yin et.al. 2508.12744 null
2025-08-18 Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score Syed Muhmmad Israr et.al. 2508.12718 null
2025-08-18 Hyperparameter Optimization in the Estimation of PDE and Delay-PDE models from data Oliver Mai et.al. 2508.12715 null
2025-08-18 Asymmetric Diffusion Recommendation Model Yongchun Zhu et.al. 2508.12706 null
2025-08-18 Deadline-Aware Bandwidth Allocation for Semantic Generative Communication with Diffusion Models Jinhyuk Choi et.al. 2508.12701 null
2025-08-18 MixCache: Mixture-of-Cache for Video Diffusion Transformer Acceleration Yuanxin Wei et.al. 2508.12691 null
2025-08-18 WP-CLIP: Leveraging CLIP to Predict Wölfflin’s Principles in Visual Art Abhijay Ghildyal et.al. 2508.12668 null
2025-08-18 Stable Diffusion-Based Approach for Human De-Occlusion Seung Young Noh et.al. 2508.12663 null
2025-08-18 Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery Jiyeon Kang et.al. 2508.12650 null
2025-08-18 Cognitive Structure Generation: From Educational Priors to Policy Optimization Hengnian Gu et.al. 2508.12647 null
2025-08-18 ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving Can Cui et.al. 2508.12603 null
2025-08-19 A Tale of Two Sightlines: Comparison of Hydrocarbon Dust Absorption Bands toward Cygnus OB2-12 and the Galactic Center Yvonne J. Pendleton et.al. 2508.12601 null
2025-08-17 Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference Denis Blessing et.al. 2508.12511 null
2025-08-17 Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality Yanming Xiu et.al. 2508.12498 null
2025-08-19 Portable Laser-Pumped Rb Atomic Clock with Digital Circuits Qiang Hao et.al. 2508.12437 null
2025-08-17 Spin decoherence dynamics of Er$^{3+}$ in CeO$_2$ film Sagar Kumar Seth et.al. 2508.12429 null
2025-08-17 TiP4GEN: Text to Immersive Panorama 4D Scene Generation Ke Xing et.al. 2508.12415 null
2025-08-17 Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position Zhixin Xie et.al. 2508.12398 null
2025-08-17 DeCoT: Decomposing Complex Instructions for Enhanced Text-to-Image Generation with Large Language Models Xiaochuan Lin et.al. 2508.12396 null
2025-08-17 Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models Xun Su et.al. 2508.12361 null
2025-08-17 Topological Dissipation as the Missing Link in Multiscale Polymer Dynamics Xu-Ze Zhang et.al. 2508.12359 null
2025-08-17 Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data Ahmet H. Güzel et.al. 2508.12356 null
2025-08-17 Semantic Discrepancy-aware Detector for Image Forgery Identification Ziye Wang et.al. 2508.12341 null
2025-08-17 Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR Fatemeh Ghorbani Lohesara et.al. 2508.12336 null
2025-08-17 Sketchar: Supporting Character Design and Illustration Prototyping Using Generative AI Long Ling et.al. 2508.12333 null
2025-08-17 Steering chiral active Brownian motion via stochastic position-orientation resetting Amir Shee et.al. 2508.12223 null
2025-08-17 Distribution Matching via Generalized Consistency Models Sagar Shrestha et.al. 2508.12222 null
2025-08-17 Self-Guided Action Diffusion Rhea Malhotra et.al. 2508.12189 null
2025-08-16 Critical Importance of Grain Boundaries to the Conductivity of Polycrystalline Molecular Crystals Shujit Chandra Paul et.al. 2508.12172 null
2025-08-16 Belief-Conditioned One-Step Diffusion: Real-Time Trajectory Planning with Just-Enough Sensing Gokul Puthumanaillam et.al. 2508.12166 null
2025-08-16 A Systematic Particle Filter for Estimating Time-Varying Parameters in Advection-Diffusion Equations with Source Terms Andrea Arnold et.al. 2508.12155 null
2025-08-16 Demystifying Foreground-Background Memorization in Diffusion Models Jimmy Z. Di et.al. 2508.12148 null
2025-08-16 Relativistic quintuple-zeta basis sets for the s block Marten L. Reitsma et.al. 2508.12144 null
2025-08-16 DualFit: A Two-Stage Virtual Try-On via Warping and Synthesis Minh Tran et.al. 2508.12131 null
2025-08-16 Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion Songwei Liu et.al. 2508.12094 null
2025-08-16 Strong overlap of deterministic and stochastic dynamics in a super-diffusive regime Muhammad Tayyab et.al. 2508.12091 null
2025-08-16 Generic Event Boundary Detection via Denoising Diffusion Jaejun Hwang et.al. 2508.12084 null
2025-08-16 Content Accuracy and Quality Aware Resource Allocation Based on LP-Guided DRL for ISAC-Driven AIGC Networks Ningzhe Shi et.al. 2508.12079 null
2025-08-16 Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization Kousuke Nakano et.al. 2508.12033 null
2025-08-16 Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems Szymon Pawlonka et.al. 2508.12026 null
2025-08-16 Virtual Trading in Multi-Settlement Electricity Markets Agostino Capponi et.al. 2508.11979 null
2025-08-16 UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding Yueming Xu et.al. 2508.11952 null
2025-08-19 Assessment of Using Synthetic Data in Brain Tumor Segmentation Aditi Jahagirdar et.al. 2508.11922 null
2025-08-16 SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress Lingyun Zhang et.al. 2508.11904 null
2025-08-16 OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation Jilei Mao et.al. 2508.11898 null
2025-08-16 Simulation of heavy quarkonium equilibration in the quark-gluon plasma Shouxing Zhao et.al. 2508.11897 null
2025-08-16 SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System Truong Thanh Hung Nguyen et.al. 2508.11873 null
2025-08-15 Serendipitous discovery of a young cluster of galaxies at $z \sim 0.5$ projected next to the nearby tadpole galaxy KUG 1138+327 Q. Daniel Wang et.al. 2508.11819 null
2025-08-15 FairTabGen: Unifying Counterfactual and Causal Fairness in Synthetic Tabular Data Generation Nitish Nagesh et.al. 2508.11810 null
2025-08-15 LoRAtorio: An intrinsic approach to LoRA Skill Composition Niki Foteinopoulou et.al. 2508.11624 null
2025-08-15 Dataset Creation for Visual Entailment using Generative AI Rob Reijtenbach et.al. 2508.11605 null
2025-08-15 CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion Zhe Zhu et.al. 2508.11603 null
2025-08-15 Low barrier ZrO$_x$-based Josephson junctions Jaehong Choi et.al. 2508.11593 null
2025-08-15 Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model Zuo Zuo et.al. 2508.11550 null
2025-08-15 Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series Juhi Soni et.al. 2508.11528 null
2025-08-15 CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models Xiaoxue Wu et.al. 2508.11484 null
2025-08-15 SPG: Style-Prompting Guidance for Style-Specific Content Creation Qian Liang et.al. 2508.11476 null
2025-08-15 DPI-SPR: A Differentiable Physical Inversion for Shadow Profile Reconstruction Framework in Forward Scatter Radar ShuQi Lei et.al. 2508.11470 null
2025-08-15 Simulation-based inference using splitting schemes for partially observed diffusions in chemical reaction networks Petar Jovanovski et.al. 2508.11438 null
2025-08-15 MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation Qian Liang et.al. 2508.11433 null
2025-08-15 Wavelength dependence of laser pulse filamentation around atomic resonances Gabor Demeter et.al. 2508.11417 null
2025-08-15 The Effect of Flow Parameters and Wall Models on Gas-Surface Interactions: A Numerical Investigation of dsmcFoam M. B. Agir et.al. 2508.11403 null
2025-08-15 Pairwise correlations of global times in one-dimensional Brownian motion under stochastic resetting Yihao Wang et.al. 2508.11387 null
2025-08-15 AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis Zonglin Wu et.al. 2508.11375 null
2025-08-15 GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition Md Asgor Hossain Reaj et.al. 2508.11334 null
2025-08-15 Noise Matters: Optimizing Matching Noise for Diffusion Classifiers Yanghao Wang et.al. 2508.11330 null
2025-08-18 TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation Yilin Mi et.al. 2508.11284 null
2025-08-15 Probing the Representational Power of Sparse Autoencoders in Vision Models Matthew Lyle Olson et.al. 2508.11277 null
2025-08-15 Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception Junjie Wang et.al. 2508.11256 null
2025-08-15 FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation MengChao Wang et.al. 2508.11255 null
2025-08-15 Graph Neural Diffusion via Generalized Opinion Dynamics Asela Hevapathige et.al. 2508.11249 null
2025-08-15 Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering Changjian Wang et.al. 2508.11247 null
2025-08-15 Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension Zhenhao Li et.al. 2508.11211 null
2025-08-15 StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation Seungmi Lee et.al. 2508.11203 null
2025-08-15 NGC 2392 and NGC 4361: Spectroscopic Diagnostics of Planetary Nebula Evolution Atul Kumar Singh et.al. 2508.11202 null
2025-08-15 Statistical Properties of Current Noise Induced by Electron-Phonon Scattering in Metallic Carbon Nanotubes Aina Sumiyoshi et.al. 2508.11201 null
2025-08-15 Representation Quantization for Collaborative Filtering Augmentation Yunze Luo et.al. 2508.11194 null
2025-08-15 Semi-supervised Image Dehazing via Expectation-Maximization and Bidirectional Brownian Bridge Diffusion Models Bing Liu et.al. 2508.11165 null
2025-08-15 LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction Maoquan Zhang et.al. 2508.11153 null
2025-08-15 Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation Bing Liu et.al. 2508.11134 null
2025-08-15 SQ-A: A Collision Triggered Starburst in Intra-Group Medium of Stephan’s Quintet C. K. Xu et.al. 2508.11124 null
2025-08-14 Diffusion is a code repair operator and generator Mukul Singh et.al. 2508.11110 null
2025-08-14 HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing Xinjie Gao et.al. 2508.11106 null
2025-08-14 GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning Kelin Yu et.al. 2508.11049 null
2025-08-14 A porous medium equation with spatially inhomogeneous absorption. Part II: Large time behavior Razvan Gabriel Iagar et.al. 2508.11046 null
2025-08-14 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation Nikolaos Gkanatsios et.al. 2508.11002 null
2025-08-14 Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling Tejomay Kishor Padole et.al. 2508.10995 null
2025-08-14 Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models Basile Lewandowski et.al. 2508.10993 null
2025-08-14 The extended molecular gas of the Circinus galaxy and NGC 1097 as seen by APEX Akhil Lasrado et.al. 2508.10982 null
2025-08-14 EVCtrl: Efficient Control Adapter for Visual Generation Zixiang Yang et.al. 2508.10963 null
2025-08-13 From Promise to Practical Reality: Transforming Diffusion MRI Analysis with Fast Deep Learning Enhancement Xinyi Wang et.al. 2508.10950 null
2025-08-14 Exchange-driven self-diffusion of nanoscale crystalline parahydrogen clusters on graphite K. M. Kolevski et.al. 2508.10883 null
2025-08-14 A Survey on Diffusion Language Models Tianyi Li et.al. 2508.10875 null
2025-08-14 Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation Harold Haodong Chen et.al. 2508.10858 null
2025-08-16 Object Fidelity Diffusion for Remote Sensing Image Generation Ziqi Ye et.al. 2508.10801 null
2025-08-14 Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior Zhenning Shi et.al. 2508.10779 null
2025-08-14 Video-BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation Youping Gu et.al. 2508.10774 null
2025-08-14 AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences Jieyu Li et.al. 2508.10771 null
2025-08-14 Formation and protection of an Eu-Ir surface compound below hexagonal boron nitride Alaa Mohammed Idris Bakhit et.al. 2508.10746 null
2025-08-14 A Kinetic Theory Approach to Ordered Fluids José A. Carrillo et.al. 2508.10744 null
2025-08-14 Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs Xiangqi Jin et.al. 2508.10736 null
2025-08-14 Exploiting Discriminative Codebook Prior for Autoregressive Image Generation Longxiang Tang et.al. 2508.10719 null
2025-08-14 NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale NextStep Team et.al. 2508.10711 null
2025-08-14 CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation Joohyeon Lee et.al. 2508.10710 null
2025-08-14 Probabilistic Forecasting Method for Offshore Wind Farm Cluster under Typhoon Conditions: a Score-Based Conditional Diffusion Model Jinhua He et.al. 2508.10705 null
2025-08-14 Effective permeability conditions for diffusive transport through impermeable membranes with gaps Molly Brennan et.al. 2508.10694 null
2025-08-14 Novel View Synthesis using DDIM Inversion Sehajdeep Singh et.al. 2508.10688 null
2025-08-14 MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control Yuchen Zhu et.al. 2508.10684 null
2025-08-14 Hybrid Generative Fusion for Efficient and Privacy-Preserving Face Recognition Dataset Generation Feiran Li et.al. 2508.10672 null
2025-08-14 Geospatial Diffusion for Land Cover Imperviousness Change Forecasting Debvrat Varshney et.al. 2508.10649 null
2025-08-14 Increasing the Utility of Synthetic Images through Chamfer Guidance Nicola Dall’Asen et.al. 2508.10631 null
2025-08-14 A Unified Framework from Boltzmann Transport to Proton Treatment Planning Andreas E. Kyprianou et.al. 2508.10596 null
2025-08-14 HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis Shiyu Liu et.al. 2508.10566 null
2025-08-14 Projected Coupled Diffusion for Test-Time Constrained Joint Generation Hao Luan et.al. 2508.10531 null
2025-08-14 EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba Quang Nguyen et.al. 2508.10522 null
2025-08-15 KDPE: A Kernel Density Estimation Strategy for Diffusion Policy Trajectory Selection Andrea Rosasco et.al. 2508.10511 null
2025-08-14 A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection Yangjie Xiao et.al. 2508.10509 null
2025-08-14 TweezeEdit: Consistent and Efficient Image Editing with Path Regularization Jianda Mao et.al. 2508.10498 null
2025-08-14 A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation Jiulin Li et.al. 2508.10494 null
2025-08-14 Jamming of active particles in narrow pores: Implications for ratchet effect and diffusion coefficient Šimon Pajger et.al. 2508.10483 null
2025-08-14 NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer Shanyuan Liu et.al. 2508.10424 null
2025-08-14 Extracting a stochastic model for predator-prey dynamic of turbulence and zonal flows with limited data J. C. Huang et.al. 2508.10408 null
2025-08-14 Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models Eunseo Koh et.al. 2508.10407 null
2025-08-14 PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection Haibin Sun et.al. 2508.10397 null
2025-08-14 EDIS: A Simulation Software for Dynamic Ion Intercalation/Deintercalation Processes in Electrode Materials Liqi Wang et.al. 2508.10384 null
2025-08-14 Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models Hyundo Lee et.al. 2508.10382 null
2025-08-14 A Semantic-Aware Framework for Safe and Intent-Integrative Assistance in Upper-Limb Exoskeletons Yu Chen et.al. 2508.10378 null
2025-08-14 Scalable Modeling of Nonlinear Network Dynamics in Neurodegenerative Disease Daniel Semchin et.al. 2508.10343 null
2025-08-14 ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver Wenxuan Song et.al. 2508.10333 null
2025-08-14 Cross-view Generalized Diffusion Model for Sparse-view CT Reconstruction Jixiang Chen et.al. 2508.10313 null
2025-08-14 DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space Exploration Arkapravo Ghosh et.al. 2508.10303 null
2025-08-14 Influence Maximization in Multi-layer Social Networks Based on Differentiated Graph Embeddings Ronghua Lin et.al. 2508.10289 null
2025-08-14 High Fidelity Text to Image Generation with Contrastive Alignment and Structural Guidance Danyi Gao et.al. 2508.10280 null
2025-08-14 A Spectral Solver to Capture Unsteady Dynamics in the Aerospike Nozzle Wake Zachary Pyle et.al. 2508.10275 null
2025-08-14 Non-Decaying Solutions to the 2D Dissipative Quasi-Geostrophic Equations David M. Ambrose et.al. 2508.10254 null
2025-08-13 Run-and-tumble dynamics with non-reciprocal transitions between three velocity states Julio C. R. Romo-Cruz et.al. 2508.10213 null
2025-08-13 Diffusive Braking of Penetrative Convection in Stably-Stratified Fluids Bradley W. Hindman et.al. 2508.10174 null
2025-08-13 Predicting First-Passage Dynamics in Disordered Systems Exactly: Application to Sparse Networks Daniel Marris et.al. 2508.10140 null
2025-08-13 The Perturbation Theory Approach to Stability in the Scattered Disk Matthew Belyakov et.al. 2508.10119 null
2025-08-13 Constrained Decoding of Diffusion LLMs with Context-Free Grammars Niels Mündler et.al. 2508.10111 null
2025-08-13 Quantum circuit simulation with a local time-dependent variational principle Aaron Sander et.al. 2508.10096 null
2025-08-13 Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design Yuhao Sun et.al. 2508.10065 null
2025-08-13 Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation Junyan Ye et.al. 2508.09987 null
2025-08-13 Story2Board: A Training-Free Approach for Expressive Storyboard Generation David Dinkevich et.al. 2508.09983 null
2025-08-13 Masquerade: Learning from In-the-wild Human Videos using Data-Editing Marion Lepert et.al. 2508.09976 null
2025-08-13 PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image Geonhee Sim et.al. 2508.09973 null
2025-08-13 Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models Luca Eyring et.al. 2508.09968 null
2025-08-13 Stable Diffusion Models are Secretly Good at Visual In-Context Learning Trevine Oorloff et.al. 2508.09949 null
2025-08-13 AST-n: A Fast Sampling Approach for Low-Dose CT Reconstruction using Diffusion Models Tomás de la Sotta et.al. 2508.09943 null
2025-08-13 Quo Vadis Handwritten Text Generation for Handwritten Text Recognition? Vittorio Pippi et.al. 2508.09936 null
2025-08-13 Active Particle Diffusion in Convection Roll Arrays Pulak Kumar Ghosh et.al. 2508.09924 null
2025-08-14 Prototype-Guided Diffusion: Visual Conditioning without External Memory Bilal Faye et.al. 2508.09922 null
2025-08-13 Hybrid Quantum-Classical Latent Diffusion Models for Medical Image Generation Kübra Yeter-Aydeniz et.al. 2508.09903 null
2025-08-13 Binary Mixtures in Linear Convection Arrays Pulak Kumar Ghosh et.al. 2508.09902 null
2025-08-13 Exploring the Physics of the Plasma Liner Experiment: A Multi-dimensional Study with FLASH, OSIRIS, and HELIOS E. C. Hansen et.al. 2508.09895 null
2025-08-13 Marketron Through the Looking Glass: From Equity Dynamics to Option Pricing in Incomplete Markets Igor Halperin et.al. 2508.09863 null
2025-08-13 HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics Weiqi Li et.al. 2508.09858 null
2025-08-13 Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance Dhruvraj Singh Rawat et.al. 2508.09847 null
2025-08-13 On the Generalization Limits of Quantum Generative Adversarial Networks with Pure State Generators Jasmin Frkatovic et.al. 2508.09844 null
2025-08-13 Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Weigao Sun et.al. 2508.09834 null
2025-08-13 Physical Autoregressive Model for Robotic Manipulation without Action Pretraining Zijian Song et.al. 2508.09822 null
2025-08-13 Feature Impact Analysis on Top Long-Jump Performances with Quantile Random Forest and Explainable AI Techniques Qi Gan et.al. 2508.09810 null
2025-08-13 Condition number for finite element discretisation of nonlocal PDE systems with applications to biology Olusegun E. Adebayo et.al. 2508.09781 null
2025-08-13 Impacts of the duration and intensity of grazing cycle on vegetation population dynamics in semi-arid ecosystems with seasonal succession Junhong Gan et.al. 2508.09760 null
2025-08-13 Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection Zhiqiu Zhang et.al. 2508.09746 null
2025-08-13 MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers Qianru Qiu et.al. 2508.09709 null
2025-08-13 Hydrodynamic approximations for driven dense colloidal mixtures in narrow pores Frantisek Slanina et.al. 2508.09686 null
2025-08-13 Anomalous Transport of Elongated Particles in Oscillatory Vortical Flows Shiyuan Hu et.al. 2508.09677 null
2025-08-13 GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors Xingyilang Yin et.al. 2508.09667 null
2025-08-13 NegFaceDiff: The Power of Negative Context in Identity-Conditioned Diffusion for Synthetic Face Generation Eduarda Caldeira et.al. 2508.09661 null
2025-08-13 Asymptotic-analysis-inspired boundary conditions aiming at eliminating polymer diffusive instability Ming Dong et.al. 2508.09635 null
2025-08-15 Preacher: Paper-to-Video Agentic System Jingwei Liu et.al. 2508.09632 null
2025-08-13 MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography Daniel Barco et.al. 2508.09616 null
2025-08-13 Global uniform regularity for the 3D incompressible MHD equations with slip boundary condition near a background magnetic field Jincheng Gao et.al. 2508.09609 null
2025-08-13 Images Speak Louder Than Scores: Failure Mode Escape for Enhancing Generative Quality Jie Shao et.al. 2508.09598 null
2025-08-13 Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion Jiwon Kim et.al. 2508.09575 null
2025-08-13 Zeolitic imidazolate framework glasses emit white light Zhencai Li et.al. 2508.09552 null
2025-08-13 Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification Haowen Wang et.al. 2508.09550 null
2025-08-13 Boron Clusters for Metal-Free Water Splitting Masaya Fujioka et.al. 2508.09538 null
2025-08-13 Ehrenfest Dynamics with Spontaneous Localization Anderson A. Tomaz et.al. 2508.09526 null
2025-08-13 Generation of Indian Sign Language Letters, Numbers, and Words Ajeet Kumar Yadav et.al. 2508.09522 null
2025-08-13 A hyperbolic finite difference scheme for anisotropic diffusion equations: preserving the discrete maximum principle Tokuhiro Eto et.al. 2508.09509 null
2025-08-13 Stingrays in the radio sky: Two unusual diffuse radio relic sources in the direction of the Magellanic Stream Zachary J Smeaton et.al. 2508.09495 null
2025-08-13 SARE: Semantic-Aware Reconstruction Error for Generalizable Diffusion-Generated Image Detection Ju Yeon Kang et.al. 2508.09487 null
2025-08-13 CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection Zhipeng Yuan et.al. 2508.09477 null
2025-08-14 From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts Yuji Wang et.al. 2508.09476 null
2025-08-13 Leveraging Failed Samples: A Few-Shot and Training-Free Framework for Generalized Deepfake Detection Shibo Yao et.al. 2508.09475 null
2025-08-13 Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy Hao Yu et.al. 2508.09461 null
2025-08-13 RASR: Retrieval-Augmented Super Resolution for Practical Reference-based Image Restoration Jiaqi Yan et.al. 2508.09449 null
2025-08-13 DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation Haoxiang Shi et.al. 2508.09444 null
2025-08-13 Scaling behaviour of rotating convection in a spherical shell with different Prandtl numbers Wei Fan et.al. 2508.09416 null
2025-08-13 Dynamos driven by top-heavy double-diffusive convection in the strong-field regime Wei Fan et.al. 2508.09410 null
2025-08-12 Understanding Dementia Speech Alignment with Diffusion-Based Image Generation Mansi et.al. 2508.09385 null
2025-08-12 X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents Guoxian Song et.al. 2508.09383 null
2025-08-12 UltraLight Med-Vision Mamba for Classification of Neoplastic Progression in Tubular Adenomas Aqsa Sultana et.al. 2508.09339 null
2025-08-12 Lung-DDPM+: Efficient Thoracic CT Image Synthesis using Diffusion Probabilistic Model Yifan Jiang et.al. 2508.09327 null
2025-08-12 Quantum correction to the Langevin cross section in resonant-exchange processes I. Simbotin et.al. 2508.09302 null
2025-08-12 Evolution of a Long-Lived Deep-Seated Main-Sequence Magnetic Field During White Dwarf Cooling Matias Castro-Tapia et.al. 2508.09268 null
2025-08-12 TFZ: Topology-Preserving Compression of 2D Symmetric and Asymmetric Second-Order Tensor Fields Nathaniel Gorski et.al. 2508.09235 null
2025-08-12 GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction Fan Ding et.al. 2508.09227 null
2025-08-12 Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models Wen Wang et.al. 2508.09138 null
2025-08-12 Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices Ya Zou et.al. 2508.09136 null
2025-08-13 Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer Zixin Yin et.al. 2508.09131 null
2025-08-13 Robust quantum computational advantage with programmable 3050-photon Gaussian boson sampling Hua-Liang Liu et.al. 2508.09092 null
2025-08-13 Direct Measurement of Electron Heating in Electron-Only Reconnection in a Laboratory Mini-Magnetosphere Lucas Rovige et.al. 2508.09086 null
2025-08-12 Rankin-Selberg integrals for $\mathrm{GSpin}$ groups with application to the global Gan-Gross-Prasad conjecture Pan Yan et.al. 2508.09066 null
2025-08-12 Per-Query Visual Concept Learning Ori Malca et.al. 2508.09045 null
2025-08-12 Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks Maxim Divilkovskiy et.al. 2508.09029 null
2025-08-12 Envisioning Generative Artificial Intelligence in Cartography and Mapmaking Yuhao Kang et.al. 2508.09028 null
2025-08-12 TaoCache: Structure-Maintained Video Generation Acceleration Zhentao Fan et.al. 2508.08978 null
2025-08-12 Urban-STA4CLC: Urban Theory-Informed Spatio-Temporal Attention Model for Predicting Post-Disaster Commercial Land Use Change Ziyi Guo et.al. 2508.08976 null
2025-08-12 Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation Soo-Whan Chung et.al. 2508.08953 null
2025-08-12 Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation Ao Ma et.al. 2508.08949 null
2025-08-12 EGGCodec: A Robust Neural Encodec Framework for EGG Reconstruction and F0 Extraction Rui Feng et.al. 2508.08924 null
2025-08-12 When and How Ultrasound Enhances Nanoparticle Diffusion in Hydrogels: A Stick-and-Release Mechanism Pablo M. Blanco et.al. 2508.08918 null
2025-08-12 Sound Signal Synthesis with Auxiliary Classifier GAN, COVID-19 cough as an example Yahya Sherif Solayman Mohamed Saleh et.al. 2508.08892 null
2025-08-12 Transient Noise Removal via Diffusion-based Speech Inpainting Mordehay Moradi et.al. 2508.08890 null
2025-08-12 DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI Bo-Hsun Chen et.al. 2508.08831 null
2025-08-12 Geometry-Aware Global Feature Aggregation for Real-Time Indirect Illumination Meng Gai et.al. 2508.08826 null
2025-08-12 TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models Yuqi Peng et.al. 2508.08812 null
2025-08-12 Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space Luis S. Luevano et.al. 2508.08808 null
2025-08-12 Anomalous Sodium Insertion in Highly Oriented Graphite: Thermodynamics, Kinetics and Evidence for Two-Sided Intercalation Chuanhai Gan et.al. 2508.08806 null
2025-08-14 Measurement-Based Quantum Diffusion Models Xinyu Liu et.al. 2508.08799 null
2025-08-12 DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation Tianyu Xiong et.al. 2508.08783 null
2025-08-12 Patient-Adaptive Focused Transmit Beamforming using Cognitive Ultrasound Wessel L. van Nierop et.al. 2508.08782 null
2025-08-12 Exploring Palette based Color Guidance in Diffusion Models Qianru Qiu et.al. 2508.08754 null
2025-08-12 Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models Ruofeng Yang et.al. 2508.08735 null
2025-08-13 A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models Lingzhe Zhang et.al. 2508.08712 null
2025-08-12 Towards Safe Imitation Learning via Potential Field-Guided Flow Matching Haoran Ding et.al. 2508.08707 null
2025-08-12 SafeFix: Targeted Model Repair via Controlled Image Generation Ouyang Xu et.al. 2508.08701 null
2025-08-12 Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos Qi Zheng et.al. 2508.08700 null
2025-08-12 DiffVolume: Diffusion Models for Volume Generation in Limit Order Books Zhuohan Wang et.al. 2508.08698 null
2025-08-12 Detecting Sterile Neutrino Dark Matter at MeV Gamma-Ray Observatories Subaru Fujisawa et.al. 2508.08695 null
2025-08-12 Expert-Guided Diffusion Planner for Auto-bidding Yunshan Peng et.al. 2508.08687 null
2025-08-12 In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality Chenrui Liu et.al. 2508.08673 null
2025-08-12 Nonlinear dynamics of reaction-diffusion wave trains under large and fully nonlocalized modulations Joannis Alexopoulos et.al. 2508.08637 null
2025-08-14 Yan: Foundational Interactive Video Generation Deheng Ye et.al. 2508.08601 null
2025-08-12 RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space Jingyun Liang et.al. 2508.08588 null
2025-08-12 Unlocking the Potential of Diffusion Priors in Blind Face Restoration Yunqi Miao et.al. 2508.08556 null
2025-08-12 UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction Dahai Yu et.al. 2508.08551 null
2025-08-12 Fluorescence time profile measurement of LAB based liquid scintillator in response to medium relativistic ion particles Xiaojie Luo et.al. 2508.08546 null
2025-08-12 Transition to Petschek Reconnection in Subrelativistic Pair Plasmas: Implications for Particle Acceleration Adam Robbins et.al. 2508.08533 null
2025-08-11 SynLLM: A Comparative Analysis of Large Language Models for Medical Tabular Synthetic Data Generation via Prompt Engineering Arshia Ilaty et.al. 2508.08529 null
2025-08-11 Control-affine Schrödinger Bridge and Generalized Bohm Potential Alexis M. H. Teter et.al. 2508.08511 null
2025-08-11 CObL: Toward Zero-Shot Ordinal Layering without User Prompting Aneel Damaraju et.al. 2508.08498 null
2025-08-11 MuGa-VTON: Multi-Garment Virtual Try-On via Diffusion Transformers with Prompt Customization Ankan Deria et.al. 2508.08488 null
2025-08-11 MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling Qian Wang et.al. 2508.08487 null
2025-08-11 Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features Pallabee Das et.al. 2508.08458 null
2025-08-11 Hot Jupiter formation in dense stellar clusters: A Monte Carlo model applied to 47 Tucanae J. A. Wirth et.al. 2508.08406 null
2025-08-11 Wave Propagation Dynamics via Lattice Difference Equations Eddy Kwessi et.al. 2508.08387 null
2025-08-11 Spatiotemporally Consistent Indoor Lighting Estimation with Diffusion Priors Mutian Tong et.al. 2508.08384 null
2025-08-11 Exponentially Improved Constant in Quantum Solution Extraction Gumaro Rendon et.al. 2508.08375 null
2025-08-11 StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation Shuyuan Tu et.al. 2508.08248 null
2025-08-12 Cut2Next: Generating Next Shot via In-Context Tuning Jingwen He et.al. 2508.08244 null
2025-08-13 BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion Qiayuan Liao et.al. 2508.08241 null
2025-08-11 OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution Zhiqiang Wu et.al. 2508.08227 null
2025-08-11 Learning User Preferences for Image Generation Model Wenyi Mo et.al. 2508.08220 null
2025-08-11 Reinforcement Learning in Vision: A Survey Weijia Wu et.al. 2508.08189 null
2025-08-13 CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data Chongke Bi et.al. 2508.08173 null
2025-08-11 ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction Chaojun Ni et.al. 2508.08170 null
2025-08-11 An effective potential for generative modelling with active matter Adrian Baule et.al. 2508.08146 null
2025-08-11 Reproducing and Extending Brownian Motion in Optical Trap: A Computational Reimplementation of Volpe and Volpe (2013) Eyad I. B Hamid et.al. 2508.08138 null
2025-08-11 FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting Yitong Yang et.al. 2508.08136 null
2025-08-11 Optimal Dividend, Reinsurance, and Capital Injection Strategies for an Insurer with Two Collaborating Business Lines Tim J. Boonen et.al. 2508.08130 null
2025-08-11 Learned Regularization for Microwave Tomography Bowen Tong et.al. 2508.08114 null
2025-08-11 TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning Junzhe Xu et.al. 2508.08098 null
2025-08-11 Fast and Generalizable parameter-embedded Neural Operators for Lithium-Ion Battery Simulation Amir Ali Panahi et.al. 2508.08087 null
2025-08-11 Matrix-3D: Omnidirectional Explorable 3D World Generation Zhongqi Yang et.al. 2508.08086 null
2025-08-12 Why Bohmian velocity might not be the only quantum velocity and the role of quantum diffusion flux in super-luminal wave packets Charalampos Antonakos et.al. 2508.08065 null
2025-08-11 S$^2$VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix Peng Dai et.al. 2508.08048 null
2025-08-12 Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation Fangyuan Mao et.al. 2508.07981 null
2025-08-11 Well-posedness for a fourth-order nonisothermal tumor growth model of Caginalp type Giulia Cavalleri et.al. 2508.07979 null
2025-08-12 Adaptive Multiple Access and Service Placement for Generative Diffusion Models Hamidreza Mazandarani et.al. 2508.07978 null
2025-08-11 Deep imaging of the galaxy Malin 2 shows new faint structures and a candidate satellite dwarf galaxy Junais et.al. 2508.07930 null
2025-08-11 Score Augmentation for Diffusion Models Liang Hou et.al. 2508.07926 null
2025-08-11 Generative Video Matting Yongtao Ge et.al. 2508.07905 null
2025-08-11 Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models Johanna P. Müller et.al. 2508.07903 null
2025-08-12 Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation Bowen Xue et.al. 2508.07901 null
2025-08-11 NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction Tianle Zeng et.al. 2508.07897 null
2025-08-11 Deep Learning-Based Desikan-Killiany Parcellation of the Brain Using Diffusion MRI Yousef Sadegheih et.al. 2508.07815 null
2025-08-11 DiTVR: Zero-Shot Diffusion Transformer for Video Restoration Sicheng Gao et.al. 2508.07811 null
2025-08-11 MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks Yushen Xu et.al. 2508.07803 null
2025-08-11 Generative Inversion for Property-Targeted Materials Design: Application to Shape Memory Alloys Cheng Li et.al. 2508.07798 null
2025-08-11 Feynman-Kac formula for general time dependent stochastic parabolic equation on a bounded domain and applications Yaozhong Hu et.al. 2508.07793 null
2025-08-13 AgentWorld: An Interactive Simulation Platform for Scene Construction and Mobile Robotic Manipulation Yizheng Zhang et.al. 2508.07770 null
2025-08-11 Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation Xiaoyan Liu et.al. 2508.07769 null
2025-08-11 Sea-Undistort: A Dataset for Through-Water Image Restoration in High Resolution Airborne Bathymetric Mapping Maximilian Kromer et.al. 2508.07760 null
2025-08-11 Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild Haoran Wang et.al. 2508.07759 null
2025-08-11 Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion Minseo Kim et.al. 2508.07755 null
2025-08-11 Grouped Speculative Decoding for Autoregressive Image Generation Junhyuk So et.al. 2508.07747 null
2025-08-11 Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder? Hui-Peng Du et.al. 2508.07711 null
2025-08-11 Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing Weitao Wang et.al. 2508.07700 null
2025-08-11 DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework Wenzhuo Ma et.al. 2508.07682 null
2025-08-11 LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering Xiaohang Zhan et.al. 2508.07647 null
2025-08-11 X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning Jian Ma et.al. 2508.07607 null
2025-08-11 LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation Wenhui Song et.al. 2508.07603 null
2025-08-11 ShoulderShot: Generating Over-the-Shoulder Dialogue Videos Yuang Zhang et.al. 2508.07597 null
2025-08-11 Procedural Mixture Sets Hendrik Rommeswinkel et.al. 2508.07588 null
2025-08-12 From Platform Migration to Cultural Integration: the Ingress and Diffusion of #wlw from TikTok to RedNote in Queer Women Communities Ziqi Pan et.al. 2508.07579 null
2025-08-11 UniFlow: Unifying Speech Front-End Tasks via Continuous Generative Modeling Ziqian Wang et.al. 2508.07558 null
2025-08-11 Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation Minghao Yin et.al. 2508.07557 null
2025-08-11 Physics-informed Multiresolution Wavelet Neural Network Method for Solving Partial Differential Equations Feng Han et.al. 2508.07546 null
2025-08-11 Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing Joonghyuk Shin et.al. 2508.07519 null
2025-08-10 Forecasting solar power output in Ibadan: A machine learning approach leveraging weather data and system specifications Obarotu Peter Urhuerhi et.al. 2508.07462 null
2025-08-10 Unified Semiclassical Theory of Nonlinear Hall Effect: Bridging Ballistic and Diffusive Transport Regime Xinyu Liu et.al. 2508.07445 null
2025-08-10 Robust, fast, and adaptive splitting schemes for nonlinear doubly-degenerate diffusion equations Ayesha Javed et.al. 2508.07420 null
2025-08-10 CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization Youqi Wang et.al. 2508.07413 null
2025-08-10 Conditional splitting probabilities for hidden-state inference in drift-diffusive processes Emir Sezik et.al. 2508.07386 null
2025-08-10 Supercritical fluids as a distinct state of matter characterized by sub-short-range structural order Sha Jin et.al. 2508.07385 null
2025-08-10 SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal Tingyu Yang et.al. 2508.07346 null
2025-08-10 CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation Fangtai Wu et.al. 2508.07341 null
2025-08-10 Linear-Quadratic Mean Field Games with Common Noise: A Direct Approach Wenyu Cong et.al. 2508.07271 null
2025-08-10 Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers Xin Ma et.al. 2508.07246 null
2025-08-10 Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation Chu Zhao et.al. 2508.07243 null
2025-08-10 HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation Xuepeng Liu et.al. 2508.07225 null
2025-08-10 Neural Bridge Processes Jian Xu et.al. 2508.07220 null
2025-08-10 Explainability-in-Action: Enabling Expressive Manipulation and Tacit Understanding by Bending Diffusion Models in ComfyUI Ahmed M. Abuzuraiq et.al. 2508.07183 null
2025-08-10 CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion Xiaotong Lin et.al. 2508.07162 null
2025-08-10 SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models Ruolin Yang et.al. 2508.07149 null
2025-08-10 Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction Yu Liu et.al. 2508.07146 null
2025-08-10 SketchConcept: Sketching-based Concept Recomposition for Product Design using Generative AI Runlin Duan et.al. 2508.07141 null
2025-08-10 Canvas3D: Empowering Precise Spatial Control for Image Generation with Constraints from a 3D Virtual Canvas Runlin Duan et.al. 2508.07135 null
2025-08-10 On the geometric Brownian motion with state-dependent variable exponent diffusion term Mustafa Avci et.al. 2508.07130 null
2025-08-10 Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays Gregory Schuit et.al. 2508.07128 null
2025-08-10 Modelling Human Skin Morphology and Simulating Transdermal Transport of 50 Chemicals Milana Tesfamarian et.al. 2508.07123 null
2025-08-09 DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit Aiden Swann et.al. 2508.07118 null
2025-08-09 Whisfusion: Parallel ASR Decoding via a Diffusion Transformer Taeyoun Kwon et.al. 2508.07048 null
2025-08-09 A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling Tiantian He et.al. 2508.07032 null
2025-08-09 Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities Anindya Bijoy Das et.al. 2508.07031 null
2025-08-09 Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings Mao Li et.al. 2508.07017 null
2025-08-12 HiMat: DiT-based Ultra-High Resolution SVBRDF Generation Zixiong Wang et.al. 2508.07011 null
2025-08-09 Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments Gian Mario Favero et.al. 2508.07006 null
2025-08-09 Mechanism of Anisotropic Crystallization and Phase Transitions under Van der Waals Squeezing Yuxiang Gao et.al. 2508.06992 null
2025-08-09 WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering Yixin Zhu et.al. 2508.06982 null
2025-08-09 Structure-Preserving Digital Twins via Conditional Neural Whitney Forms Brooks Kinch et.al. 2508.06981 null
2025-08-09 CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing Weiyan Xie et.al. 2508.06937 null
2025-08-09 Unveiling the Puzzle of Brittleness in Single Crystal Iridium Qing Cheng et.al. 2508.06929 null
2025-08-09 AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning Shihao Yuan et.al. 2508.06924 null
2025-08-09 Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing Shichao Ma et.al. 2508.06916 null
2025-08-09 MultiRef: Controllable Image Generation with Multiple Visual References Ruoxi Chen et.al. 2508.06905 null
2025-08-09 Text to Speech System for Meitei Mayek Script Gangular Singh Irengbam et.al. 2508.06870 null
2025-08-09 Speech Enhancement based on cascaded two flow Seonggyu Lee et.al. 2508.06842 null
2025-08-09 FlowSE: Flow Matching-based Speech Enhancement Seonggyu Lee et.al. 2508.06840 null
2025-08-09 Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models Shiqian Zhao et.al. 2508.06837 null
2025-08-09 A Score-based Diffusion Model Approach for Adaptive Learning of Stochastic Partial Differential Equation Solutions Toan Huynh et.al. 2508.06834 null
2025-08-09 Efficient data-driven regression for reduced-order modeling of spatial pattern formation Alessandro Alla et.al. 2508.06833 null
2025-08-09 Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation Xiao Huang et.al. 2508.06806 null
2025-08-09 D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning Shu-Ang Yu et.al. 2508.06804 null
2025-08-09 GaN/InN HEMT based UV photodetector on SiC with hexagonal boron nitride passivation Mustafa Kilin et.al. 2508.06782 null
2025-08-08 Topology Generation of UAV Covert Communication Networks: A Graph Diffusion Approach with Incentive Mechanism Xin Tang et.al. 2508.06746 null
2025-08-08 Design of high-mobility p-type GaN via the piezomobility tensor Jie-Cheng Chen et.al. 2508.06723 null
2025-08-08 Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video Jixuan He et.al. 2508.06715 null
2025-08-08 LightSwitch: Multi-view Relighting with Material-guided Diffusion Yehonathan Litman et.al. 2508.06494 null
2025-08-08 Weak approximation of stochastic differential equations with sticky boundary conditions Akash Sharma et.al. 2508.06487 null
2025-08-08 SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning Lingkun Long et.al. 2508.06447 null
2025-08-08 SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation Guido Manni et.al. 2508.06429 null
2025-08-08 4D operando X-ray nano-holo-tomography reveals multiscale chemomechanics in Silicon-Graphite anode Victor Vanpeene et.al. 2508.06413 null
2025-08-08 FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation Wenbin Teng et.al. 2508.06392 null
2025-08-08 Diffuse measures and nonlinear parabolic equations Francesco Petitta et.al. 2508.06384 null
2025-08-08 ActivityDiff: A diffusion model with Positive and Negative Activity Guidance for De Novo Drug Design Renyi Zhou et.al. 2508.06364 null
2025-08-08 Quantum Algorithm for Estimating Intrinsic Geometry Nhat A. Nghiem et.al. 2508.06355 null
2025-08-08 Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? Xin Ci Wong et.al. 2508.06327 null
2025-08-08 OM2P: Offline Multi-Agent Mean-Flow Policy Zhuoran Li et.al. 2508.06269 null
2025-08-08 ADPro: a Test-time Adaptive Diffusion Policy for Robot Manipulation via Manifold and Initial Noise Constraints Zezeng Li et.al. 2508.06266 null
2025-08-08 Tanaka formula for SDEs driven by fractional Brownian motion Tommi Sottinen et.al. 2508.06261 null
2025-08-08 Low dimensional dynamics of a sparse balanced synaptic network of quadratic integrate-and-fire neurons Maria V. Ageeva et.al. 2508.06253 null
2025-08-08 Light-Addressable Smart Nanostructures via Resonant Nanoheating Victor Tabouillot et.al. 2508.06215 null
2025-08-08 Inverse Source Problems for the Time-Fractional Evolution Equation Rahmonov Askar Ahmadovich et.al. 2508.06209 null
2025-08-08 Clinically-guided Data Synthesis for Laryngeal Lesion Detection Chiara Baldini et.al. 2508.06182 null
2025-08-08 Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation Ojonugwa Oluwafemi Ejiga Peter et.al. 2508.06170 null
2025-08-08 Sharp non-existence threshold for a parabolic Hardy-Hénon equation with quasilinear diffusion Razvan Gabriel Iagar et.al. 2508.06164 null
2025-08-08 Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment Zhenbang Du et.al. 2508.06160 null
2025-08-08 Revealing the Staging Structural Evolution and Li (De)Intercalation Kinetics in Graphite Anodes via Machine Learning Potential Liqi Wang et.al. 2508.06156 null
2025-08-08 VISTAR: A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation Kaiyuan Jiang et.al. 2508.06152 null
2025-08-08 Improving Diagnostic Accuracy for Oral Cancer with inpainting Synthesis Lesions Generated Using Diffusion Models Yong Oh Lee et.al. 2508.06151 null
2025-08-08 DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera Shaohua Pan et.al. 2508.06139 null
2025-08-08 GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving Jian Wang et.al. 2508.06113 null
2025-08-08 MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment Gui Zou et.al. 2508.06104 null
2025-08-08 UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization Yachun Mi et.al. 2508.06101 null
2025-08-08 MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows Xiquan Li et.al. 2508.06098 null
2025-08-08 E-React: Towards Emotionally Controlled Synthesis of Human Reactions Chen Zhu et.al. 2508.06093 null
2025-08-08 SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment Yanxiao Sun et.al. 2508.06082 null
2025-08-08 DreamVE: Unified Instruction-based Image and Video Editing Bin Xia et.al. 2508.06080 null
2025-08-08 Towards MR-Based Trochleoplasty Planning Michael Wehrli et.al. 2508.06076 null
2025-08-08 Radio continuum and H I 21-cm line observations of a nearby luminous infrared galaxy IRAS 17526+3253 Jianfeng Wu et.al. 2508.06075 null
2025-08-08 Real-time physics-informed reconstruction of transient fields using sensor guidance and higher-order time differentiation Hong-Kyun Noh et.al. 2508.06070 null
2025-08-08 ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Generation Daniel Lee et.al. 2508.06065 null
2025-08-08 NEP: Autoregressive Image Editing via Next Editing Token Prediction Huimin Wu et.al. 2508.06044 null
2025-08-08 Bayesian Radio Map Estimation: Fundamentals and Implementation via Diffusion Models Tien Ngoc Ha et.al. 2508.06037 null
2025-08-08 InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow Yiming Gong et.al. 2508.06033 null
2025-08-08 Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts Kiran Chhatre et.al. 2508.06032 null
2025-08-08 Improved Sub-Visible Particle Classification in Flow Imaging Microscopy via Generative AI-Based Image Synthesis Utku Ozbulak et.al. 2508.06021 null
2025-08-08 Vacuum Dealloyed Brass as Li-Metal Battery Current Collector: Effect of Zinc and Porosity Eric V Woods et.al. 2508.06015 null
2025-08-08 ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors Minsu Kim et.al. 2508.06014 null
2025-08-08 KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training Kai Zhang et.al. 2508.06001 null
2025-08-08 Global solutions in $L^{p}_{v}L^{\infty}_{x}$ for the Boltzmann equation in bounded domains Dingqun Deng et.al. 2508.05985 null
2025-08-08 Revisiting $μ$SR Studies of Ion Dynamics in the Light of Extended Kubo-Toyabe Model Takashi U. Ito et.al. 2508.05968 null
2025-08-08 Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents Han Lin et.al. 2508.05954 null
2025-08-08 A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image Yanxing Liang et.al. 2508.05950 null
2025-08-08 Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution Zhanyi Sun et.al. 2508.05941 null
2025-08-08 Reverse Diffusion Sequential Monte Carlo Samplers Luhuan Wu et.al. 2508.05926 null
2025-08-08 Fast, Convex and Conditioned Network for Multi-Fidelity Vectors and Stiff Univariate Differential Equations Siddharth Rout et.al. 2508.05921 null
2025-08-07 Measurement of All Flavor PeV Neutrino Flux using Combined Datasets from IceCube Emre Yildizci et.al. 2508.05886 null
2025-08-07 Emerging ultra-wide band gap semiconductors for future high-frequency electronics Emily M. Garrity et.al. 2508.05823 null
2025-08-07 FineDialFact: A benchmark for Fine-grained Dialogue Fact Verification Xiangyan Chen et.al. 2508.05782 null
2025-08-07 MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss Can Zhao et.al. 2508.05772 null
2025-08-07 UnGuide: Learning to Forget with LoRA-Guided Diffusion Models Agnieszka Polowczyk et.al. 2508.05755 null
2025-08-07 Quantum Reservoir GAN Hikaru Wakaura et.al. 2508.05716 null
2025-08-07 High multiplicity and global structure of coexistence states in a predator-prey model with saturation Kousuke Kuto et.al. 2508.05714 null
2025-08-07 Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Yue Liao et.al. 2508.05635 null
2025-08-07 GAP: Gaussianize Any Point Clouds with Text Guidance Weiqi Zhang et.al. 2508.05631 null
2025-08-07 Latent Space Diffusion for Topology Optimization Aaron Lutheran et.al. 2508.05624 null
2025-08-07 Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision Luozheng Qin et.al. 2508.05606 null
2025-08-07 Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations Hanzeng Guo et.al. 2508.05598 null
2025-08-07 Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis Yifan Wang et.al. 2508.05572 null
2025-08-07 MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Shibo Wang et.al. 2508.05506 null
2025-08-07 Heat and super-diffusive melting fronts in unsaturated porous media Eirik G. Flekkøy et.al. 2508.05451 null
2025-08-07 Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI Krzysztof Janowicz et.al. 2508.05432 null
2025-08-07 MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow Md Atik Ahamed et.al. 2508.05411 null
2025-08-07 UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Wonjun Kang et.al. 2508.05399 null
2025-08-07 Real-Time Iteration Scheme for Diffusion Policy Yufei Duan et.al. 2508.05396 null
2025-08-09 Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms Jie Xiao et.al. 2508.05387 null
2025-08-07 Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising Xiaoxi Cui et.al. 2508.05352 null
2025-08-07 Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties Susmita Chowdhury et.al. 2508.05330 null
2025-08-07 Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting Frank Ruis et.al. 2508.05323 null
2025-08-07 Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces Mathias Rose Bjare et.al. 2508.05306 null
2025-08-07 SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens Nikita Dragunov et.al. 2508.05305 null
2025-08-07 An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods Emil Løvbak et.al. 2508.05303 null
2025-08-07 Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection Xiaoyang Zhang et.al. 2508.05271 null
2025-08-07 B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding Changho Choi et.al. 2508.05269 null
2025-08-07 SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion Xiaoyang Zhang et.al. 2508.05264 null
2025-08-07 ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models Yatong Lan et.al. 2508.05236 null
2025-08-07 Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces Joly Romain et.al. 2508.05220 null
2025-08-07 An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling Junming Duan et.al. 2508.05166 null
2025-08-07 RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer Fangyu Du et.al. 2508.05115 null
2025-08-07 PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation Jingxuan He et.al. 2508.05091 null
2025-08-07 MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design Hao Li et.al. 2508.05076 null
2025-08-07 Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation Yongfu Zha et.al. 2508.05074 null
2025-08-07 FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer Jian Zhu et.al. 2508.05069 null
2025-08-07 DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion Yifeng Huang et.al. 2508.05060 null
2025-08-07 Observation of Super-ballistic Brownian Motion in Liquid Jason Boynewicz et.al. 2508.05031 null
2025-08-07 Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere Jeehyun Yang et.al. 2508.05007 null
2025-08-07 Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity Fubao Xi et.al. 2508.04997 null
2025-08-08 REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers Yuepeng Jiang et.al. 2508.04996 null
2025-08-07 Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression Zheng Chen et.al. 2508.04979 null
2025-08-06 Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids Cal J. Rising et.al. 2508.04930 null
2025-08-06 Taxonomy of Faults in Attention-Based Neural Networks Sigma Jahan et.al. 2508.04925 null
2025-08-08 Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model Luis Morales-Navarro et.al. 2508.04902 null
2025-08-06 The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models Leo Zhang et.al. 2508.04884 null
2025-08-06 Unified Flow Matching for Long Horizon Event Forecasting Xiao Shou et.al. 2508.04843 null
2025-08-06 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off Seungyong Lee et.al. 2508.04825 null
2025-08-06 Delay-constrained re-entry governs large-scale brain seizures and other network pathologies Paul Triebkorn et.al. 2508.04824 null
2025-08-06 Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models Mehrdad Moradi et.al. 2508.04818 null
2025-08-06 Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach Anderson O. Calixto et.al. 2508.04809 null
2025-08-06 Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture Bernard Parent et.al. 2508.04806 null
2025-08-06 ACM Multimedia Grand Challenge on ENT Endoscopy Analysis Trong-Thuan Nguyen et.al. 2508.04801 null
2025-08-08 Quantum-impurity sensing of altermagnetic order V. A. S. V. Bittencourt et.al. 2508.04788 null
2025-08-06 Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC) Nan Li et.al. 2508.04745 null
2025-08-06 A colossal dielectric response of HfxZr1-xO2 nanoparticles Oleksandr S. Pylypchuk et.al. 2508.04697 null
2025-08-06 Diffusion in a $d$-dimensional rough potential Jacob Jeffries et.al. 2508.04674 null
2025-08-06 HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models Young D. Kwon et.al. 2508.04663 null
2025-08-06 Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics Lars Torbjørn Stutzer et.al. 2508.04647 null
2025-08-06 A unified model for linear responses of physical networks José M. Ortiz-Tavárez et.al. 2508.04616 null
2025-08-06 Multitask Learning with Stochastic Interpolants Hugo Negrel et.al. 2508.04605 null
2025-08-07 A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI Nicola Casali et.al. 2508.04588 null
2025-08-06 Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming A. Tarik Leblebici et.al. 2508.04570 null
2025-08-06 DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling Yijie Li et.al. 2508.04568 null
2025-08-06 TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning Yunbi Liu et.al. 2508.04565 null
2025-08-06 Drone Detection with Event Cameras Gabriele Magrini et.al. 2508.04564 null
2025-08-06 One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose Jinxi Liu et.al. 2508.04559 null
2025-08-06 Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis Angang Zhang et.al. 2508.04551 null
2025-08-06 MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning Quang-Trung Truong et.al. 2508.04549 null
2025-08-06 X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids P. G. Heighway et.al. 2508.04525 null
2025-08-06 $β$-Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes José A. S. Laranjeira et.al. 2508.04506 null
2025-08-06 QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution Bowen Chai et.al. 2508.04485 null
2025-08-06 Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model Hongxu Chen et.al. 2508.04472 null
2025-08-06 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation Shuzhou Yang et.al. 2508.04467 null
2025-08-06 Case Studies of Generative Machine Learning Models for Dynamical Systems Nachiket U. Bapat et.al. 2508.04459 null
2025-08-06 Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach Alvaro Garrido Perez et.al. 2508.04435 null
2025-08-06 Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis Ethan Dack et.al. 2508.04429 null
2025-08-06 Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations Nick Vogeley et.al. 2508.04364 null
2025-08-06 Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting Eberhard Bänsch et.al. 2508.04360 null
2025-08-06 From Split to Share: Private Inference with Distributed Feature Sharing Zihan Liu et.al. 2508.04346 null
2025-08-06 Performative Market Making Charalampos Kleitsikas et.al. 2508.04344 null
2025-08-06 TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Xiaoxuan He et.al. 2508.04324 null
2025-08-06 Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation Miquel Cantallops et.al. 2508.04319 null
2025-08-06 Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations Margaux Boxho et.al. 2508.04318 null
2025-08-06 Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions Yuga Iguchi et.al. 2508.04287 null
2025-08-06 S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge JinYi Yoon et.al. 2508.04271 null
2025-08-06 Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications Vladislav Pimanov et.al. 2508.04261 null
2025-08-06 High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting Zhiren Ma et.al. 2508.04259 null
2025-08-06 Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions Nikolaos A. Burger et.al. 2508.04244 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification Saifullah Saifullah et.al. 2508.04233 null
2025-08-06 Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction Yu Liu et.al. 2508.04229 null
2025-08-06 LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation Kangrui Cen et.al. 2508.04228 null
2025-08-06 DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models Saifullah Saifullah et.al. 2508.04208 null
2025-08-06 A background-free signal of jet-induced diffusion wake in quark-gluon plasma Zhong Yang et.al. 2508.04194 null
2025-08-06 Deeper Inside Deep ViT Sungrae Hong et.al. 2508.04181 null
2025-08-06 Quasi-Clique Discovery via Energy Diffusion Yu Zhang et.al. 2508.04174 null
2025-08-06 Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles Mathis Guéneau et.al. 2508.04154 null
2025-08-06 IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control Lijuan Liu et.al. 2508.04147 null
2025-08-06 Polynomial-time sampling despite disorder chaos Eric Ma et.al. 2508.04133 null
2025-08-06 Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation Maximilian Ulmer et.al. 2508.04122 null
2025-08-06 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Yi-Ting Chen et.al. 2508.04090 null
2025-08-06 Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes Pierre Collet et.al. 2508.04089 null
2025-08-06 Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows Murray Cutforth et.al. 2508.04084 null
2025-08-06 POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model Huipeng Gu et.al. 2508.04082 null
2025-08-06 Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion Fangmin Zhao et.al. 2508.04055 null
2025-08-06 Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation Jiayi He et.al. 2508.04049 null
2025-08-06 Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws L. Miguel Rodrigues et.al. 2508.04023 null
2025-08-07 S$^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation Weilun Feng et.al. 2508.04016 null
2025-08-06 Constructing Generalized Sample Transition Probabilities with Biased Simulations Yanbin Wang et.al. 2508.03977 null
2025-08-05 Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm Lin Zhang et.al. 2508.03955 null
2025-08-05 Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model Shen Zhu et.al. 2508.03925 null
2025-08-05 Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations R. R. Ashurov et.al. 2508.03859 null
2025-08-05 VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations Yifei Zong et.al. 2508.03839 null
2025-08-05 HPSv3: Towards Wide-Spectrum Human Preference Score Yuhang Ma et.al. 2508.03789 null
2025-08-05 LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Jianxiong Gao et.al. 2508.03694 null
2025-08-05 LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences Ao Liang et.al. 2508.03692 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World Katherine Liu et.al. 2508.03669 null
2025-08-05 Rigidity for graph product von Neumann algebras Camille Horbez et.al. 2508.03662 null
2025-08-05 DiWA: Diffusion Policy Adaptation with World Models Akshay L Chandra et.al. 2508.03645 null
2025-08-05 Likelihood Matching for Diffusion Models Lei Qian et.al. 2508.03636 null
2025-08-05 Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion Shoji Mori et.al. 2508.03624 null
2025-08-05 Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions Robert Richardson et.al. 2508.03617 null
2025-08-05 CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models Ana Lawry Aguila et.al. 2508.03594 null
2025-08-05 Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection Long Qian et.al. 2508.03539 null
2025-08-05 X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations Silvia Pellegrini et.al. 2508.03536 null
2025-08-05 CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation Kaishen Yuan et.al. 2508.03535 null
2025-08-05 LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation Lianwei Yang et.al. 2508.03485 null
2025-08-05 When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models Dasol Choi Jihwan Lee et.al. 2508.03483 null
2025-08-05 Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models Hyungjin Kim et.al. 2508.03481 null
2025-08-05 VideoGuard: Protecting Video Content from Unauthorized Editing Junjie Cao et.al. 2508.03480 null
2025-08-05 Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation Zijun Zhan et.al. 2508.03464 null
2025-08-06 READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation Haotian Wang et.al. 2508.03457 null
2025-08-05 Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws Haruki Takemura et.al. 2508.03455 null
2025-08-05 RAAG: Ratio Aware Adaptive Guidance Shangwen Zhu et.al. 2508.03442 null
2025-08-05 Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN Shivangi Nigam et.al. 2508.03415 null
2025-08-05 SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models Pingchuan Ma et.al. 2508.03402 null
2025-08-05 Delay-facilitated self-assembly in compartmentalized systems Severin Angerpointner et.al. 2508.03383 null
2025-08-05 Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration Ni Tang et.al. 2508.03373 null
2025-08-05 A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design Xinyu Jin et.al. 2508.03370 null
2025-08-05 GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images Yifei Sun et.al. 2508.03357 null
2025-08-05 Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises Nikos I. Kavallaris et.al. 2508.03354 null
2025-08-06 Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation Xunzhi Xiang et.al. 2508.03334 null
2025-08-05 Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Peiyu Wang et.al. 2508.03320 null
2025-08-05 Thermal Metamaterials for Enhanced Non-Fourier Heat Transport Harry Mclean et.al. 2508.03316 null
2025-08-05 The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations Xinqiu Chen et.al. 2508.03311 null
2025-08-05 Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation Jun Luo et.al. 2508.03300 null
2025-08-05 Investigation on deep learning-based galaxy image translation models Hengxin Ruan et.al. 2508.03291 null
2025-08-07 Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the Lp Lq Maximal Regularity Setting Ken Furukawa et.al. 2508.03288 null
2025-08-07 Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension Bao-Ngoc Tran et.al. 2508.03268 null
2025-08-05 Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation Gang Dai et.al. 2508.03256 null
2025-08-05 V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models Jisoo Kim et.al. 2508.03254 null
2025-08-05 Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion Wentao Qu et.al. 2508.03252 null
2025-08-06 FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles Xingchao Yang et.al. 2508.03241 null
2025-08-05 BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models Yu Pan et.al. 2508.03221 null
2025-08-05 Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level Amir Seginer et.al. 2508.03220 null
2025-08-05 Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance Eliot Beyler et.al. 2508.03210 null
2025-08-05 Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models Muhammed Saeed et.al. 2508.03199 null
2025-08-05 An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys Qianxi Zhu et.al. 2508.03163 null
2025-08-05 SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance Yanshu Wang et.al. 2508.03143 null
2025-08-05 UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying Chengyu Bai et.al. 2508.03142 null
2025-08-05 Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations Igor G. Vladimirov et.al. 2508.03135 null
2025-08-05 Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback Jingyi Chen et.al. 2508.03123 null
2025-08-05 Power System Voltage Stability Boundary: Computational Results and Applications Zhenyao Li et.al. 2508.03119 null
2025-08-05 T2UE: Generating Unlearnable Examples from Text Descriptions Xingjun Ma et.al. 2508.03091 null
2025-08-05 MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation Youran Zhou et.al. 2508.03083 null
2025-08-05 Multi-human Interactive Talking Dataset Zeyu Zhu et.al. 2508.03050 null
2025-08-05 Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling Ruixing Zhang et.al. 2508.03042 null
2025-08-05 Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations Dimitri Breda et.al. 2508.03040 null
2025-08-05 MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention Qi Xie et.al. 2508.03034 null
2025-08-05 LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning Jie Lin et.al. 2508.03024 null
2025-08-05 Generating Light-based Fingerprints for Indoor Localization Hsun-Yu Lee et.al. 2508.03011 null
2025-08-05 Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models Fan Yang et.al. 2508.03006 null
2025-08-05 Diffusion Models with Adaptive Negative Sampling Without External Resources Alakh Desai et.al. 2508.02973 null
2025-08-05 Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver Jonathan Patsenker et.al. 2508.02964 null
2025-08-04 X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio Chenxu Zhang et.al. 2508.02944 null
2025-08-04 Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators Sourojit Ghosh et.al. 2508.02937 null
2025-08-06 A nonstandard finite difference scheme for an SEIQR epidemiological PDE model Achraf Zinihi et.al. 2508.02928 null
2025-08-04 Goal-Oriented Adaptive Finite Element Multilevel Quasi-Monte Carlo Joakim Beck et.al. 2508.02925 null
2025-08-04 How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution Minh-Hai Nguyen et.al. 2508.02923 null
2025-08-04 RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation Mehrdad Moradi et.al. 2508.02903 null
2025-08-04 REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport Farzad Beizaee et.al. 2508.02889 null
2025-08-04 Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters Tara Dacunha et.al. 2508.02837 null
2025-08-04 DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework Tongchun Zuo et.al. 2508.02807 null
2025-08-04 NASIM: Revealing the low surface brightness Universe from legacy VISTA data Elham Saremi et.al. 2508.02780 null
2025-08-04 D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss Guowei Zou et.al. 2508.02644 null
2025-08-04 CAK: Emergent Audio Effects from Minimal Deep Learning Austin Rockman et.al. 2508.02643 null
2025-08-04 Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters Pranshu Maan et.al. 2508.02638 null
2025-08-04 ReMoMask: Retrieval-Augmented Masked Motion Generation Zhengdao Li et.al. 2508.02605 null
2025-08-04 Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction Yuerong Song et.al. 2508.02558 null
2025-08-04 From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC Jingsong Liu et.al. 2508.02528 null
2025-08-06 xDeepServe: Model-as-a-Service on Huawei CloudMatrix384 Ao Xiao et.al. 2508.02520 null
2025-08-04 QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots Sheng Wu et.al. 2508.02512 null
2025-08-04 Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference Lars Dingeldein et.al. 2508.02509 null
2025-08-04 Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation Khoa Tuan Nguyen et.al. 2508.02482 null
2025-08-04 PoseGuard: Pose-Guided Generation with Safety Guardrails Kongxin Wang et.al. 2508.02476 null
2025-08-04 Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn$_3$Sn(0001) noncollinear antiferromagnetic films Surya N. Panda et.al. 2508.02415 null
2025-08-04 Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion Yimeng Liu et.al. 2508.02409 null
2025-08-04 Inference-time Scaling for Diffusion-based Audio Super-resolution Yizhu Jin et.al. 2508.02391 null
2025-08-04 Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction Matus Krajcovic et.al. 2508.02376 null
2025-08-04 Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory Marian Lupascu et.al. 2508.02363 null
2025-08-04 Qwen-Image Technical Report Chenfei Wu et.al. 2508.02324 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-05 LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training Sikui Zhang et.al. 2508.02308 null
2025-08-05 Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor Xiaoliu Guan et.al. 2508.02240 null
2025-08-04 Abstract Formulation of Mean-Field Models and Propagation of Chaos Tau Shean Lim et.al. 2508.02224 null
2025-08-04 A theory of strange metals Simone Fratini et.al. 2508.02221 null
2025-08-04 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Yuxuan Song et.al. 2508.02193 null
2025-08-04 DreamPainter: Image Background Inpainting for E-commerce Scenarios Sijie Zhao et.al. 2508.02155 null
2025-08-04 AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models Die Chen et.al. 2508.02151 null
2025-08-04 VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling Yuru Xiao et.al. 2508.02129 null
2025-08-04 AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation Zhiwen Li et.al. 2508.02107 null
2025-08-04 Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis Kaiyang Ji et.al. 2508.02106 null
2025-08-04 “Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch Yiqing Xu et.al. 2508.02093 null
2025-08-04 Unsupervised Multi-channel Speech Dereverberation via Diffusion Yulun Wu et.al. 2508.02071 null
2025-08-04 “Set It Up”: Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2508.02068 null
2025-08-04 StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion Haoxin Yang et.al. 2508.02056 null
2025-08-04 Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation Yuli Liu et.al. 2508.02050 null
2025-08-04 Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction Hui Xie et.al. 2508.02043 null
2025-08-04 Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging XuHao Yu et.al. 2508.02025 null
2025-08-04 Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths Le Tri Dat et.al. 2508.02024 null
2025-08-05 Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type Pierluigi Colli et.al. 2508.02021 null
2025-08-04 Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention Kyungmin Jo et.al. 2508.02004 null
2025-08-04 Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization Yu Lei et.al. 2508.02002 null
2025-08-04 Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids Toma Yoneya et.al. 2508.01991 null
2025-08-04 Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion Shutong Qiao et.al. 2508.01987 null
2025-08-04 Diffusion models for inverse problems Hyungjin Chung et.al. 2508.01975 null
2025-08-03 Distributed games with jumps: An $α$-potential game approach Xin Guo et.al. 2508.01929 null
2025-08-03 On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis Siamak Kazemzadeh Hannani et.al. 2508.01890 null
2025-08-03 DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization Siran Peng et.al. 2508.01873 null
2025-08-05 Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures Fanze Kong et.al. 2508.01854 null
2025-08-03 Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Yufei Zhang et.al. 2508.01835 null
2025-08-03 Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder Runxuan Yang et.al. 2508.01796 null
2025-08-03 Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus Peng Gao et.al. 2508.01794 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting Rui Ding et.al. 2508.01761 null
2025-08-03 Dynamic Coupling of Infiltration-Soil Moisture Feedback: Emergent Vegetation Patterns in a Water-Vegetation Model Juan Yan et.al. 2508.01755 null
2025-08-03 Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design Xiangwang Hou et.al. 2508.01745 null
2025-08-05 Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization Xin Ding et.al. 2508.01725 null
2025-08-03 ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models Haoyue Tan et.al. 2508.01719 null
2025-08-03 Versatile Transition Generation with Image-to-Video Diffusion Zuhao Yang et.al. 2508.01698 null
2025-08-03 DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing Yufeng Chi et.al. 2508.01684 null
2025-08-03 DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding Hanqing Wang et.al. 2508.01651 null
2025-08-03 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Na Zhang et.al. 2508.01650 null
2025-08-03 Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization Shoya Sasaki et.al. 2508.01640 null
2025-08-03 VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation Xuanran Zhai et.al. 2508.01622 null
2025-08-03 LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding Xuanzhao Dong et.al. 2508.01617 null
2025-08-03 TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data Yandong Yan et.al. 2508.01615 null
2025-08-03 Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models Haoran Dai et.al. 2508.01605 null
2025-08-03 Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment Lubin Gan et.al. 2508.01602 null
2025-08-03 CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation Sung-Wook Lee et.al. 2508.01600 null
2025-08-03 Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching Juyan Zhang et.al. 2508.01597 null
2025-08-03 A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation Hua Yu et.al. 2508.01590 null
2025-08-03 Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences Euihyun Kim et.al. 2508.01589 null
2025-08-03 Diffusion Models for Future Networks and Communications: A Comprehensive Survey Nguyen Cong Luong et.al. 2508.01586 null
2025-08-03 Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation Lei Xie et.al. 2508.01577 null
2025-08-03 Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature Xiao-Jie Wang et.al. 2508.01567 null
2025-08-03 MGCR-Net: Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection Chengming Wang et.al. 2508.01555 null
2025-08-02 A Reward-Directed Diffusion Framework for Generative Design Optimization Hadi Keramati et.al. 2508.01509 null
2025-08-02 Instruction-based Time Series Editing Jiaxing Qiu et.al. 2508.01504 null
2025-08-02 The role of zealots in the spread of linguistic traits Vivian Dornelas et.al. 2508.01500 null
2025-08-02 TreeDiff: AST-Guided Code Generation with Diffusion LLMs Yiming Zeng et.al. 2508.01473 null
2025-08-02 Regression Augmentation With Data-Driven Segmentation Shayan Alahyari et.al. 2508.01455 null
2025-08-02 Physically-based Lighting Augmentation for Robotic Manipulation Shutong Jin et.al. 2508.01442 null
2025-08-02 Viscosity Stabilized Plug-and-Play Reconstruction Arghya Sinha et.al. 2508.01441 null
2025-08-02 Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling Le Trong Thanh Bui et.al. 2508.01436 null
2025-08-02 Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas? Tarian Fu et.al. 2508.01408 null
2025-08-02 StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints Lingxiao Chen et.al. 2508.01335 null
2025-08-05 Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion Konstantinos Moutselos et.al. 2508.01334 null
2025-08-02 LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points Xuemiao Zhang et.al. 2508.01317 null
2025-08-02 CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis Alec Sargood et.al. 2508.01292 null
2025-08-02 PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation Zonglei Jing et.al. 2508.01272 null
2025-08-02 Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling Lexiao Zou et.al. 2508.01264 null
2025-08-02 NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection Jiazhen Yan et.al. 2508.01248 null
2025-08-02 Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model Jing Gao et.al. 2508.01246 null
2025-08-02 Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal Xiangqi Liu et.al. 2508.01241 null
2025-08-02 SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches Cheng Tan et.al. 2508.01237 null
2025-08-02 Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system Jiyong Kim et.al. 2508.01230 null
2025-08-02 StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling Yuanlin Yang et.al. 2508.01215 null
2025-08-02 Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory Nabin Upadhya Dhakal et.al. 2508.01194 null
2025-08-02 DELTAv2: Accelerating Dense 3D Tracking Tuan Duc Ngo et.al. 2508.01170 null
2025-08-02 RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots Jing Tang et.al. 2508.01165 null
2025-08-02 LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation Xinyu Yan et.al. 2508.01152 null
2025-08-02 Personalized Safety Alignment for Text-to-Image Diffusion Models Yu Lei et.al. 2508.01151 null
2025-08-02 Dataset Condensation with Color Compensation Huyu Wu et.al. 2508.01139 null
2025-08-01 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models Jinsong Li et.al. 2508.00819 null
2025-08-01 Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding Rui Chen et.al. 2508.00800 null
2025-08-01 Video Generators are Robot Policies Junbang Liang et.al. 2508.00795 null
2025-08-01 SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation Kien T. Pham et.al. 2508.00782 null
2025-08-01 Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data Timur Sattarov et.al. 2508.00758 null
2025-08-01 LeakyCLIP: Extracting Training Data from CLIP Yunhao Chen et.al. 2508.00756 null
2025-08-01 SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation Prerana Ramkumar et.al. 2508.00750 null
2025-08-01 AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation Le Wang et.al. 2508.00733 null
2025-08-01 YOLO-Count: Differentiable Object Counting for Text-to-Image Generation Guanning Zeng et.al. 2508.00728 null
2025-08-01 Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls Elisa Affili et.al. 2508.00713 null
2025-08-01 D3: Training-Free AI-Generated Video Detection Using Second-Order Features Chende Zheng et.al. 2508.00701 null
2025-08-01 On-Device Diffusion Transformer Policy for Efficient Robot Manipulation Yiming Wu et.al. 2508.00697 null
2025-08-01 Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network Young-ho Cho et.al. 2508.00692 null
2025-08-01 Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators Albert Matveev et.al. 2508.00643 null
2025-08-01 Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification Luisa Gallée et.al. 2508.00639 null
2025-08-01 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Junzhe Lu et.al. 2508.00599 null
2025-08-01 Wukong Framework for Not Safe For Work Detection in Text-to-Image systems Mingrui Liu et.al. 2508.00591 null
2025-08-01 Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints Jens U. Kreber et.al. 2508.00558 null
2025-08-01 DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification Chihan Huang et.al. 2508.00552 null
2025-08-01 Video Color Grading via Look-Up Table Generation Seunghyun Shin et.al. 2508.00548 null
2025-08-01 HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning Carlo Alessi et.al. 2508.00491 null
2025-08-01 LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer Yuzhuo Chen et.al. 2508.00477 null
2025-08-01 A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces Leonidas Akritidis et.al. 2508.00472 null
2025-08-01 Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution Yiwen Wang et.al. 2508.00471 null
2025-08-01 AutoDebias: Automated Framework for Debiasing Text-to-Image Models Hongyi Cai et.al. 2508.00445 null
2025-08-01 SDMatte: Grafting Diffusion Models for Interactive Matting Longfei Huang et.al. 2508.00443 null
2025-08-01 Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection Sumin Seo et.al. 2508.00438 null
2025-08-01 Accurate Latent Inversion for Generative Image Steganography via Rectified Flow Yuqi Qian et.al. 2508.00434 null
2025-08-01 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Nan Xiang et.al. 2508.00428 null
2025-08-01 Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Seunggeun Chi et.al. 2508.00427 null
2025-08-01 Collimated QED Cascades with Curved Plasma Mirror Xuesong Geng et.al. 2508.00417 null
2025-08-01 DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space Junyu Chen et.al. 2508.00413 null
2025-08-01 Sortblock: Similarity-Aware Feature Reuse for Diffusion Model Hanqi Chen et.al. 2508.00412 null
2025-08-01 Predictive information criterion for jump diffusion processes Yuma Uehara et.al. 2508.00411 null
2025-08-01 Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency Xi Xue et.al. 2508.00397 null
2025-08-01 Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization Yoonhyuk Choi et.al. 2508.00357 null
2025-08-01 BOOD: Boundary-based Out-Of-Distribution Data Generation Qilin Liao et.al. 2508.00350 null
2025-08-01 Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak SK Injamul Hoque et.al. 2508.00339 null
2025-08-01 Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems Surya Narayan Maharana et.al. 2508.00329 null
2025-08-01 Steering Guidance for Personalized Text-to-Image Diffusion Models Sunghyun Park et.al. 2508.00319 null
2025-08-01 GV-VAD: Exploring Video Generation for Weakly-Supervised Video Anomaly Detection Suhang Cai et.al. 2508.00312 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-08-01 Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence Danzhen Fu et.al. 2508.00299 null
2025-08-01 AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer Jin Lyu et.al. 2508.00298 null
2025-08-01 TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models Christian Simon et.al. 2508.00289 null
2025-08-01 UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents Jianqiang Xiao et.al. 2508.00288 null
2025-08-01 Towards Robust Semantic Correspondence: A Benchmark and Insights Wenyue Chong et.al. 2508.00272 null
2025-08-01 Jet Image Generation in High Energy Physics Using Diffusion Models Victor D. Martinez et.al. 2508.00250 null
2025-07-31 Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b Thomas Konings et.al. 2508.00177 null
2025-07-31 DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission Fupei Guo et.al. 2508.00172 null
2025-07-31 World Consistency Score: A Unified Metric for Video Generation Quality Akshat Rakheja et.al. 2508.00144 null
2025-07-31 Entanglement spreading and emergent locality in Brownian SYK chains Onkar Parrikar et.al. 2508.00060 null
2025-07-31 Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion Tong Nie et.al. 2508.00037 null
2025-07-31 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang et.al. 2507.23785 null
2025-07-31 SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions Jessica Bader et.al. 2507.23784 null
2025-07-31 General diffusions on metric graphs as limits of time-space Markov Chains Alexis Anagnostakis et.al. 2507.23724 null
2025-07-31 DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching Emery Pierson et.al. 2507.23715 null
2025-07-31 CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation Zhaoyue Xu et.al. 2507.23693 null
2025-07-31 UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration Zihan Cheng et.al. 2507.23685 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics Alexis Béjar-López et.al. 2507.23680 null
2025-07-31 DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data Rabeya Tus Sadia et.al. 2507.23676 null
2025-07-31 One-Step Flow Policy Mirror Descent Tianyi Chen et.al. 2507.23675 null
2025-07-31 Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis Kunpeng Qiu et.al. 2507.23652 null
2025-07-31 A stochastic heat equation with non-locally Lipschitz coefficients Le Chen et.al. 2507.23637 null
2025-07-31 DivControl: Knowledge Diversion for Controllable Image Generation Yucheng Xie et.al. 2507.23620 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization Michael L. Li et.al. 2507.23576 null
2025-08-01 H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation Hongzhe Bi et.al. 2507.23523 null
2025-07-31 Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings K. V. Nikolaev et.al. 2507.23513 null
2025-07-31 Emergence of long-range non-equilibrium correlations in free liquid diffusion Marco Bussoletti et.al. 2507.23507 null
2025-07-31 Digital literacy interventions can boost humans in discerning deepfakes Dominique Geissler et.al. 2507.23492 null
2025-07-31 Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Mutian Xu et.al. 2507.23483 null
2025-07-31 Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models Long Chen et.al. 2507.23443 null
2025-07-31 Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories Lemar Abdi et.al. 2507.23411 null
2025-07-31 An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients Yuan-Yuan Huang et.al. 2507.23408 null
2025-07-31 UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries Yijie Zhu et.al. 2507.23372 null
2025-07-31 IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025 Radu-Andrei Bourceanu et.al. 2507.23357 null
2025-07-31 Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads Yingjie Zhou et.al. 2507.23343 null
2025-07-31 EMU and the DRAGNs I: A Catalogue of DRAGNs Ray P. Norris et.al. 2507.23337 null
2025-07-31 Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions Kristen C. Dage et.al. 2507.23332 null
2025-07-31 The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models Alfio Ferrara et.al. 2507.23313 null
2025-07-31 PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving Xuewei Tang et.al. 2507.23309 null
2025-08-01 Training-free Geometric Image Editing on Diffusion Models Hanshen Zhu et.al. 2507.23300 null
2025-07-31 UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing Hao Tang et.al. 2507.23278 null
2025-07-31 PixNerd: Pixel Neural Field Diffusion Shuai Wang et.al. 2507.23268 null
2025-07-31 Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas Lei Xie et.al. 2507.23245 null
2025-07-31 BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks Zhuoyin Dai et.al. 2507.23236 null
2025-07-31 Adversarial-Guided Diffusion for Multimodal LLM Attacks Chengwei Xia et.al. 2507.23202 null
2025-07-30 X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention Xiaochen Zhao et.al. 2507.23143 null
2025-07-30 Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations Jin Kunwoo Lee et.al. 2507.23102 null
2025-07-30 Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems Jonathan Monsalve et.al. 2507.23065 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-07-30 Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube Alejandra Granados et.al. 2507.23040 null
2025-07-30 Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction Giuseppe Cartella et.al. 2507.23021 null
2025-07-30 Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods Siwoo Park et.al. 2507.23010 null
2025-07-30 LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis Jamil Fayyad et.al. 2507.23001 null
2025-07-29 Neural Autoregressive Modeling of Brain Aging Ridvan Yesiloglu et.al. 2507.22954 null
2025-07-30 AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS Hai Ling et.al. 2507.22880 null
2025-07-30 Robust Contract with Career Concerns Tan Gan et.al. 2507.22852 null
2025-07-30 Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication Yidong Ren et.al. 2507.22851 null
2025-07-30 DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Qingcheng Zhao et.al. 2507.22825 null
2025-07-30 Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit Md. Sad Abdullah Sami et.al. 2507.22803 null
2025-07-31 G-Core: A Simple, Scalable and Balanced RLHF Trainer Junyu Wu et.al. 2507.22789 null
2025-07-30 DO-EM: Density Operator Expectation Maximization Adit Vishnu et.al. 2507.22786 null
2025-08-01 Next Tokens Denoising for Speech Synthesis Yanqing Liu et.al. 2507.22746 null
2025-07-30 Zero-Shot Image Anomaly Detection Using Generative Foundation Models Lemar Abdi et.al. 2507.22692 null
2025-07-30 LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing Federico Girella et.al. 2507.22627 null
2025-07-30 Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions Yiting Qu et.al. 2507.22617 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning Xiefan Guo et.al. 2507.22604 null
2025-07-30 Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice Aaqib Zahoor et.al. 2507.22589 null
2025-07-30 DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement Chang Huang et.al. 2507.22501 null
2025-07-30 LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning Xiang Li et.al. 2507.22499 null
2025-07-30 Visual Language Models as Zero-Shot Deepfake Detectors Viacheslav Pirogov et.al. 2507.22469 null
2025-07-30 TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation Jiuming Liu et.al. 2507.22454 null
2025-07-30 GVD: Guiding Video Diffusion Model for Scalable Video Distillation Kunyang Li et.al. 2507.22360 null
2025-07-29 Trade-offs in Image Generation: How Do Different Dimensions Interact? Sicheng Zhang et.al. 2507.22100 null
2025-07-29 X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Zigang Geng et.al. 2507.22058 null
2025-07-30 See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs Ziyun Dai et.al. 2507.22003 null
2025-07-29 Enhancing Generalization in Data-free Quantization via Mixup-class Prompting Jiwoong Park et.al. 2507.21947 null
2025-07-29 Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is Ahmed B Mustafa et.al. 2507.21820 null
2025-07-29 Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection Yanxing Liu et.al. 2507.21816 null
2025-07-29 MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE Junzhe Li et.al. 2507.21802 null
2025-07-29 APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing Sangmin Han et.al. 2507.21690 null
2025-07-29 GuidPaint: Class-Guided Image Inpainting with Diffusion Models Qimin Wang et.al. 2507.21627 null
2025-07-29 Locally Controlled Face Aging with Latent Diffusion Models Lais Isabelle Alves dos Santos et.al. 2507.21600 null
2025-07-29 Neural network enabled wide field-of-view imaging with hyperbolic metalenses Joel Yeo et.al. 2507.21562 null
2025-07-29 Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance Mengling Xu et.al. 2507.21529 null
2025-07-29 BANG: Dividing 3D Assets via Generative Exploded Dynamics Longwen Zhang et.al. 2507.21493 null
2025-07-29 Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training Sodtavilan Odonchimed et.al. 2507.21452 null
2025-07-30 Multimodal LLMs as Customized Reward Models for Text-to-Image Generation Shijie Zhou et.al. 2507.21391 null
2025-07-28 Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation I-Hsiang Chen et.al. 2507.21367 null
2025-07-28 A Contrastive Diffusion-based Network (CDNet) for Time Series Classification Yaoyu Zhang et.al. 2507.21357 null
2025-07-28 HDR Environment Map Estimation with Latent Diffusion Models Jack Hilliard et.al. 2507.21261 null
2025-07-28 Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors Amartya Banerjee et.al. 2507.21260 null
2025-07-28 Learning from Limited and Imperfect Data Harsh Rangwani et.al. 2507.21205 null
2025-08-01 Flow Matching Policy Gradients David McAllister et.al. 2507.21053 null
2025-07-29 JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1 Xinhan Di et.al. 2507.20987 null
2025-07-28 Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision Xiao Fang et.al. 2507.20976 null

Industry

Publish Date Title Authors PDF Code
2025-08-28 Leveraging Generative Models for Real-Time Query-Driven Text Summarization in Large-Scale Web Search Zeyu Xiong et.al. 2508.20559 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-28 MedFoundationHub: A Lightweight and Secure Toolkit for Deploying Medical Vision Language Foundation Models Xiao Li et.al. 2508.20345 null
2025-08-26 APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration Shaobo Ma et.al. 2508.19087 null
2025-08-26 TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency Qianpeng Li et.al. 2508.18961 null
2025-08-26 ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive Xinhao Luo et.al. 2508.18850 null
2025-08-26 Strata: Hierarchical Context Caching for Long Context Language Model Serving Zhiqiang Xie et.al. 2508.18572 null
2025-08-25 Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators Ritvik Chaturvedi et.al. 2508.18206 null
2025-08-24 A Synthetic Dataset for Manometry Recognition in Robotic Applications Pedro Antonio Rabelo Saraiva et.al. 2508.17468 null
2025-08-24 MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models Krishna Teja Chitty-Venkata et.al. 2508.17467 null
2025-08-23 DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method Qingwen Zhang et.al. 2508.17054 null
2025-08-23 A Novel Local Focusing Mechanism for Deepfake Detection Generalization Mingliang Li et.al. 2508.17029 null
2025-08-22 GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI’s Open-Weight Mixture of Experts Model Deepak Kumar et.al. 2508.16700 null
2025-08-17 GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems Louie Sinadjan et.al. 2508.16639 null
2025-08-22 GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving Qunyou Liu et.al. 2508.16449 null
2025-08-22 Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars NVIDIA et.al. 2508.16401 null
2025-08-27 Hybrid Classical-Quantum Supercomputing: A demonstration of a multi-user, multi-QPU and multi-GPU environment Mateusz Slysz et.al. 2508.16297 null
2025-08-22 Bare-Metal RISC-V + NVDLA SoC for Efficient Deep Learning Inference Vineet Kumar et.al. 2508.16095 null
2025-08-22 A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection Qifeng Liu et.al. 2508.16069 null
2025-08-21 graph framework: A Domain Specific Compiler for Building Physics Applications M. Cianciosa et.al. 2508.15967 null
2025-08-17 Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations Mauro Belgiovine et.al. 2508.15816 null
2025-08-25 DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians Cong Wang et.al. 2508.15376 null
2025-08-20 Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds Jia Lu et.al. 2508.14892 null
2025-08-20 Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach Qiao Zhang et.al. 2508.14848 null
2025-08-20 FakeHunter: Multimodal Step-by-Step Reasoning for Explainable Video Forensics Chen Chen et.al. 2508.14581 null
2025-08-25 NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model NVIDIA et.al. 2508.14444 null
2025-08-19 The 9th AI City Challenge Zheng Tang et.al. 2508.13564 null
2025-08-18 Optimizing Allreduce Operations for Heterogeneous Architectures with Multiple Processes per GPU Michael Adams et.al. 2508.13397 null
2025-08-18 X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms Yueming Yuan et.al. 2508.13337 null
2025-07-28 Sustainable AI Training via Hardware-Software Co-Design on NVIDIA, AMD, and Emerging GPU Architectures Yashasvi Makin et.al. 2508.13163 null
2025-08-18 CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction Zhiwei Ning et.al. 2508.12917 null
2025-08-17 CarelessWhisper: Turning Whisper into a Causal Streaming Model Tomer Krichli et.al. 2508.12301 null
2025-08-17 TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform Jun Liu et.al. 2508.12279 null
2025-08-17 ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search Mauro Belgiovine et.al. 2508.12204 null
2025-08-16 Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization Kousuke Nakano et.al. 2508.12033 null
2025-08-18 Visual Perception Engine: Fast and Flexible Multi-Head Inference for Robotic Vision Tasks Jakub Łucki et.al. 2508.11584 null
2025-08-15 Efficient GPU-Centered Singular Value Decomposition Using the Divide-and-Conquer Method Shifang Liu et.al. 2508.11467 null
2025-08-15 Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking Haonan Zhang et.al. 2508.11323 null
2025-08-14 EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training Hasibul Jamil et.al. 2508.11035 null
2025-08-12 ViPE: Video Pose Engine for 3D Geometric Perception Jiahui Huang et.al. 2508.10934 null
2025-08-13 GPU accelerated MHD in the DISPATCH framework using directive-based programming Michael Haahr et.al. 2508.09568 null
2025-08-13 UWBa at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval Ladislav Lenc et.al. 2508.09517 null
2025-08-13 Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving Guangxun Zhu et.al. 2508.09404 null
2025-08-07 Camel: Energy-Aware LLM Inference on Resource-Constrained Devices Hao Xu et.al. 2508.09173 null
2025-08-12 Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective Afsara Benazir et.al. 2508.08531 null
2025-08-11 Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson – Extended Abhinaba Chakraborty et.al. 2508.08430 null
2025-08-10 Weather-Driven Agricultural Decision-Making Using Digital Twins Under Imperfect Conditions Tamim Ahmed et.al. 2508.08326 null
2025-08-11 Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions Bangsheng Tang et.al. 2508.08192 null
2025-08-11 TLV-HGNN: Thinking Like a Vertex for Memory-efficient HGNN Inference Dengke Han et.al. 2508.07796 null
2025-08-10 An Experimental Exploration of In-Memory Computing for Multi-Layer Perceptrons Pedro Carrinho et.al. 2508.07317 null
2025-08-09 The Fused Kernel Library: A C++ API to Develop Highly-Efficient GPU Libraries Oscar Amoros et.al. 2508.07071 null
2025-08-27 From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving Antonio Guillen-Perez et.al. 2508.07029 null
2025-08-09 A Portable Multi-GPU Solver for Collisional Plasmas with Coulombic Interactions James Almgren-Bell et.al. 2508.06771 null
2025-08-02 PiKV: KV Cache Management System for Mixture of Experts Dong Liu et.al. 2508.06526 null
2025-08-08 MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows Xiquan Li et.al. 2508.06098 null
2025-08-07 CleanUpBench: Embodied Sweeping and Grasping Benchmark Wenbo Li et.al. 2508.05543 null
2025-08-07 MedMambaLite: Hardware-Aware Mamba for Medical Image Classification Romina Aalishah et.al. 2508.05049 null
2025-08-07 CSRAP: Enhanced Canvas Attention Scheduling for Real-Time Mission Critical Perception Md Iftekharul Islam Sakib et.al. 2508.04976 null
2025-08-07 Real-Time Doppler and Ionospheric Dispersion Correction Techniques for Arbitrary Waveforms Utilizing GPU Compute Daniel J. Vickers et.al. 2508.04951 null
2025-08-05 AIC CTU@FEVER 8: On-premise fact checking through long context RAG Herbert Ullrich et.al. 2508.04390 null
2025-08-06 A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks Kun Gui et.al. 2508.04316 null
2025-08-11 Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems Luai Abuelsamen et.al. 2508.04146 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Understanding the Landscape of Ampere GPU Memory Errors Zhu Zhu et.al. 2508.03513 null
2025-08-05 Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning Osama Mohammed et.al. 2508.03251 null
2025-08-04 MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models Wenyuan Liu et.al. 2508.02343 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis Yuzhuang Xu et.al. 2508.02322 null
2025-08-04 GPU in the Blind Spot: Overlooked Security Risks in Transportation Sefatun-Noor Puspa et.al. 2508.01995 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-02 A Parallel Algorithm for Finding Robust Spanners in Large Social Networks Arindam Khanda et.al. 2508.01485 null
2025-08-01 Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection Cheng-You Lu et.al. 2508.01014 null
2025-08-01 Optimal Scheduling Algorithms for LLM Inference: Theory and Practice Agrim Bari et.al. 2508.01002 null
2025-07-29 Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling Rajeev Patwari et.al. 2508.00904 null
2025-08-12 Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving Stefan Englmeier et.al. 2508.00589 null
2025-08-09 DGEMM without FP64 Arithmetic – Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme Daichi Mukunoki et.al. 2508.00441 null
2025-08-01 On Learning Closed-Loop Probabilistic Multi-Agent Simulator Juanwu Lu et.al. 2508.00384 null
2025-08-01 Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization Belman Jahir Rodriguez et.al. 2508.00307 null
2025-07-31 FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Donghyun Lee et.al. 2507.23480 null
2025-07-31 InterfO-RAN: Real-Time In-band Cellular Uplink Interference Detection with GPU-Accelerated dApps Neagin Neasamoni Santhi et.al. 2507.23177 null
2025-07-30 On the Sustainability of AI Inferences in the Edge Ghazal Sobhani et.al. 2507.23093 null
2025-07-30 Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving Santosh Patapati et.al. 2507.23042 null
2025-07-28 Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery Deepak Joshi et.al. 2507.20680 null
2025-07-27 SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer Pansharpening Zeyu Xia et.al. 2507.20311 null
2025-07-26 Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures Mufakir Qamar Ansari et.al. 2507.20063 null
2025-07-26 A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling Louis Sugy et.al. 2507.19926 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-07-25 TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability Mohammad Aflah Khan et.al. 2507.19419 null
2025-07-25 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences Yusuke Hirota et.al. 2507.19362 null
2025-07-25 SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models Zhen Wan et.al. 2507.19361 null
2025-07-25 High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins Lorenzo Cazzella et.al. 2507.19173 null
2025-07-24 SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time Yun Chen et.al. 2507.18713 null
2025-07-24 Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping Chong Cheng et.al. 2507.18541 null
2025-07-24 Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++ Giulio Malenza et.al. 2507.18268 null
2025-07-26 MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation Zhongzhen Wen et.al. 2507.17773 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-24 Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners Kostas Karakontis et.al. 2507.17519 null
2025-07-25 HuNavSim 2.0: An Enhanced Human Navigation Simulator for Human-Aware Robot Navigation Miguel Escudero-Jiménez et.al. 2507.17317 null
2025-07-23 GPU Benchmark through QPE Emulator with cuQuantum for Practical Quantum Applications Takaki Akiba et.al. 2507.17175 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 Model Compression Engine for Wearable Devices Skin Cancer Diagnosis Jacob M. Delgado-López et.al. 2507.17125 null
2025-07-23 Computer Vision for Real-Time Monkeypox Diagnosis on Embedded Systems Jacob M. Delgado-López et.al. 2507.17123 null
2025-07-22 Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems Imran Latif et.al. 2507.16781 null
2025-07-22 AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase Andrei-Leonard Nicusan et.al. 2507.16710 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-21 MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition Hanwen Liu et.al. 2507.15914 null
2025-07-30 GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis Guoxi Liu et.al. 2507.15230 null
2025-07-19 Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall Shayan Rokhva et.al. 2507.14662 null
2025-07-16 GPU-Accelerated Interpretable Generalization for Rapid Cyberattack Detection and Forensics Shu-Ting Huang et.al. 2507.14222 null
2025-08-12 CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Xiaoya Li et.al. 2507.14111 null
2025-07-23 Photonic Fabric Platform for AI Accelerators Jing Ding et.al. 2507.14000 null
2025-07-18 Leveraging Multi-Instance GPUs through moldable task scheduling Jorge Villarrubia et.al. 2507.13601 null
2025-07-17 Performance Portable Gradient Computations Using Source Transformation Kim Liegeois et.al. 2507.13204 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 HyDRA: A Hybrid Dual-Mode Network for Closed- and Open-Set RFFI with Optimized VMD Hanwen Liu et.al. 2507.12133 null
2025-07-16 PoTPTQ: A Two-step Power-of-Two Post-training for LLMs Xinyu Wang et.al. 2507.11959 null
2025-07-15 MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving Ruihao Li et.al. 2507.11507 null
2025-07-15 MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit Yinuo Wang et.al. 2507.11067 null
2025-07-15 Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems Sehyun Ryu et.al. 2507.11064 null
2025-07-15 Modernizing CNN-based Weather Forecast Model towards Higher Computational Efficiency Minjong Cheon et.al. 2507.10893 null
2025-07-21 Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks Aaron Jarmusch et.al. 2507.10789 null
2025-07-14 A Benchmarking Framework for AI models in Automotive Aerodynamics Kaustubh Tangsali et.al. 2507.10747 null
2025-07-14 Quantize-then-Rectify: Efficient VQ-VAE Training Borui Zhang et.al. 2507.10547 null
2025-07-30 Designing quantum chemistry algorithms with just-in-time compilation Xiaojie Wu et.al. 2507.09772 null
2025-07-13 GeoWarp: An automatically differentiable and GPU-accelerated implicit MPM framework for geomechanics based on NVIDIA Warp Yidong Zhao et.al. 2507.09435 null
2025-07-12 Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering Shucheng Kang et.al. 2507.09165 null
2025-07-10 Vidyut3d: a GPU accelerated fluid solver for non-equilibrium plasmas on adaptive grids Hariswaran Sitaraman et.al. 2507.08200 null
2025-07-10 GPUHammer: Rowhammer Attacks on GPU Memories are Practical Chris S. Lin et.al. 2507.08166 null
2025-07-03 Collective Communication Profiling of Modern-day Machine Learning Workloads Jit Gupta et.al. 2507.07117 null
2025-07-09 StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception Marcel Vosshans et.al. 2507.06687 null
2025-07-09 EA: An Event Autoencoder for High-Speed Vision Sensing Riadul Islam et.al. 2507.06459 null
2025-07-08 CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation Kushal Gajjar et.al. 2507.06013 null
2025-07-07 Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Mengyao Xu et.al. 2507.05513 null
2025-07-07 Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation Inayat Rasool et.al. 2507.05432 null
2025-07-23 Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms Zhiyi Hu et.al. 2507.04786 null
2025-07-05 ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments Guile Wu et.al. 2507.03886 null
2025-07-24 Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Chong Cheng et.al. 2507.03737 null
2025-07-03 NVIDIA GPU Confidential Computing Demystified Zhongshu Gu et.al. 2507.02770 null
2025-07-03 Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources Roopkatha Banerjee et.al. 2507.02295 null
2025-07-02 SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan Fumikazu Konishi et.al. 2507.02124 null
2025-07-02 Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization Giuseppe Ruggeri et.al. 2507.01676 null
2025-06-20 PyTorch-based Geometric Learning with Non-CUDA Processing Units: Experiences from Intel Gaudi-v2 HPUs Fanchen Bu et.al. 2507.01031 null
2025-07-01 Anatomy of High-Performance Column-Pivoted QR Decomposition Maksim Melnichenko et.al. 2507.00976 null
2025-07-01 Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms Zain Taufique et.al. 2507.00491 null
2025-07-01 Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs Mohammad Firas Sada et.al. 2507.00418 null
2025-07-01 Question Decomposition for Retrieval-Augmented Generation Paul J. L. Ammann et.al. 2507.00355 null
2025-06-24 AdaDeDup: Adaptive Hybrid Data Pruning for Efficient Large-Scale Object Detection Training Feiyang Kang et.al. 2507.00049 null
2025-06-30 Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model Mu-Chi Chen et.al. 2506.23635 null
2025-06-30 Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset Tim Puphal et.al. 2506.23433 null
2025-06-29 CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms Faaiq Waqar et.al. 2506.23405 null
2025-06-28 FF-INT8: Efficient Forward-Forward DNN Training on Edge Devices with INT8 Precision Jingxiao Ma et.al. 2506.22771 null
2025-06-27 Quantum-Classical Auxiliary Field Quantum Monte Carlo with Matchgate Shadows on Trapped Ion Quantum Computers Luning Zhao et.al. 2506.22408 null
2025-06-27 MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism Zheng Zhang et.al. 2506.22175 null
2025-06-27 MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators Zheng Zhang et.al. 2506.22169 null
2025-07-08 BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting Zipei Ma et.al. 2506.22099 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-06-23 TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge Zhiyuan Zhang et.al. 2506.21618 null
2025-06-26 SAM4D: Segment Anything in Camera and LiDAR Streams Jianyun Xu et.al. 2506.21547 null
2025-06-26 Portable High-Performance Kernel Generation for a Computational Fluid Dynamics Code with DaCe Måns I. Andersson et.al. 2506.20994 null
2025-06-25 Characterization and Mitigation of Training Instabilities in Microscaling Formats Huangyuan Su et.al. 2506.20752 null
2025-06-24 MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models Hoa La et.al. 2506.20686 null
2025-06-25 SuperSONIC: Cloud-Native Infrastructure for ML Inferencing Dmitry Kondratyev et.al. 2506.20657 null
2025-06-25 Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking Ben Kang et.al. 2506.20381 null
2025-06-24 Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification Minghao Qin et.al. 2506.19225 null
2025-06-23 Let Your Video Listen to Your Music! Xinyu Zhang et.al. 2506.18881 null
2025-06-23 Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano Berk Yilmaz et.al. 2506.18220 null
2025-06-22 AMD Versal Implementations of FAM and SSCA Estimators Carol Jingyi Li et.al. 2506.18003 null
2025-06-20 Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms Kaushik Kulkarni et.al. 2506.17471 null
2025-06-19 VideoGAN-based Trajectory Proposal for Automated Vehicles Annajoyce Mariani et.al. 2506.16209 null
2025-06-19 Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs Xun Wang et.al. 2506.16196 null
2025-06-19 HetGPU: The pursuit of making binary compatibility towards GPUs Yiwei Yang et.al. 2506.15993 null
2025-06-18 Early Attentive Sparsification Accelerates Neural Speech Transcription Zifei Xu et.al. 2506.15912 null
2025-06-18 UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting Kai He et.al. 2506.15673 null
2025-06-18 Engineering Supercomputing Platforms for Biomolecular Applications Robert Welch et.al. 2506.15585 null
2025-07-30 Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention Syed Haider Ali et.al. 2506.15562 null
2025-06-17 Align Your Flow: Scaling Continuous-Time Flow Map Distillation Amirmojtaba Sabour et.al. 2506.14603 null
2025-06-18 Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren et.al. 2506.09042 null
2025-06-10 Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions David Acuna et.al. 2506.08927 null
2025-07-18 Controllable Weather Synthesis and Removal with Video Diffusion Models Chih-Hao Lin et.al. 2505.00704 null
2025-04-21 LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception Yuan-Hong Liao et.al. 2504.15362 null
2025-04-15 PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond Minghua Liu et.al. 2504.11451 null
2025-04-17 VideoPanda: Video Panoramic Diffusion with Multi-view Attention Kevin Xie et.al. 2504.11389 null
2025-04-01 Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control NVIDIA et.al. 2503.14492 null
2025-03-05 GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Xuanchi Ren et.al. 2503.03751 null
2025-03-03 Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jay Zhangjie Wu et.al. 2503.01774 null
2025-03-22 DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models Ruofan Liang et.al. 2501.18590 null
2025-07-09 Cosmos World Foundation Model Platform for Physical AI NVIDIA et.al. 2501.03575 null
2025-06-26 InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models Yifan Lu et.al. 2412.03934 null
2025-04-01 Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos Hanxue Liang et.al. 2412.03526 null
2024-11-14 LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Zhengyi Wang et.al. 2411.09595 null
2025-02-28 ReMatching Dynamic Reconstruction Flow Sara Oblak et.al. 2411.00705 null
2024-10-26 SCube: Instant Large-Scale Scene Reconstruction using VoxSplats Xuanchi Ren et.al. 2410.20030 null
2025-02-11 SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes Tianchang Shen et.al. 2409.20562 null
2024-09-28 G3R: Gradient Guided Generalizable Reconstruction Yun Chen et.al. 2409.19405 null
2024-09-27 UniCal: Unified Neural Sensor Calibration Ze Yang et.al. 2409.18953 null
2024-09-26 Learning to Drive via Asymmetric Self-Play Chris Zhang et.al. 2409.18218 null
2024-09-15 Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models Yuan-Hong Liao et.al. 2409.09788 null
2025-04-19 OmniRe: Omni Urban Scene Reconstruction Ziyu Chen et.al. 2408.16760 null
2024-08-19 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering Ruofan Liang et.al. 2408.09702 null
2025-03-20 Wolf: Dense Video Captioning with a World Summarization Framework Boyi Li et.al. 2407.18908 null
2024-07-15 SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation Jordan Juravsky et.al. 2407.10481 null
2024-10-10 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes Nicolas Moenne-Loccoz et.al. 2407.07090 null
2024-07-01 fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence Francis Williams et.al. 2407.01781 null
2024-10-31 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-14 L4GM: Large 4D Gaussian Reconstruction Model Jiawei Ren et.al. 2406.10324 null
2024-06-12 UnO: Unsupervised Occupancy Fields for Perception and Forecasting Ben Agro et.al. 2406.08691 null
2024-06-12 Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata Dongsu Zhang et.al. 2406.08292 null
2024-06-13 DeTra: A Unified Model for Object Detection and Trajectory Forecasting Sergio Casas et.al. 2406.04426 null
2024-04-24 NeRF-XL: Scaling NeRFs with Multiple GPUs Ruilong Li et.al. 2404.16221 null
2024-04-22 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Amirmojtaba Sabour et.al. 2404.14507 null
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765 null
2025-05-26 Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves? Yuan-Hong Liao et.al. 2404.06510 null
2024-04-01 QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving Sourav Biswas et.al. 2404.01486 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385 null
2024-03-22 Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks Aqeel Anwar et.al. 2403.15370 null
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739 null
2023-12-28 Compact Neural Graphics Primitives with Learned Hash Probing Towaki Takikawa et.al. 2312.17241 null
2024-01-03 Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Huan Ling et.al. 2312.13763 null
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654 null
2024-04-14 Trajeglish: Traffic Modeling as Next-Token Prediction Jonah Philion et.al. 2312.04535 null
2024-06-25 XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies Xuanchi Ren et.al. 2312.03806 null
2024-04-12 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570 null
2023-11-16 Adaptive Shells for Efficient Neural Radiance Field Rendering Zian Wang et.al. 2311.10091 null
2023-11-09 Real-Time Neural Rasterization for Large Scenes Jeffrey Yunfan Liu et.al. 2311.05607 null
2023-11-09 Reconstructing Objects in-the-wild for Realistic Sensor Simulation Ze Yang et.al. 2311.05602 null
2023-11-07 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Chenfeng Xu et.al. 2311.04391 null
2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Jiawei Yang et.al. 2311.02077 null
2023-11-03 Towards Unsupervised Object Detection From LiDAR Point Clouds Lunjun Zhang et.al. 2311.02007 null
2023-11-02 MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory Enxu Li et.al. 2311.01556 null
2023-11-17 4D-Former: Multimodal 4D Panoptic Segmentation Ali Athar et.al. 2311.01520 null
2023-11-02 UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation Yuwen Xiong et.al. 2311.01448 null
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447 null
2023-11-02 Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation Jay Sarva et.al. 2311.01446 null
2023-11-02 LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds Anqi Joyce Yang et.al. 2311.01444 null
2023-11-02 Learning Realistic Traffic Agents in Closed-loop Chris Zhang et.al. 2311.01394 null
2024-04-01 Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion Lunjun Zhang et.al. 2311.01017 null
2024-01-26 ViR: Towards Efficient Vision Retention Backbones Ali Hatamizadeh et.al. 2310.19731 null
2023-10-20 TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models Tianshi Cao et.al. 2310.13772 null
2023-09-11 Towards Viewpoint Robustness in Bird’s Eye View Segmentation Tzofi Klinghoffer et.al. 2309.05192 null
2023-08-10 Flexible Isosurface Extraction for Gradient-Based Mesh Optimization Tianchang Shen et.al. 2308.05371 null
2023-08-03 UniSim: A Neural Closed-Loop Sensor Simulator Ze Yang et.al. 2308.01898 null
2023-08-02 Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving Ben Agro et.al. 2308.01471 null
2023-07-14 DreamTeacher: Pretraining Image Backbones with Deep Generative Models Daiqing Li et.al. 2307.07487 null
2023-06-27 Rethinking Closed-loop Training for Autonomous Driving Chris Zhang et.al. 2306.15713 null
2023-06-06 ATT3D: Amortized Text-to-3D Object Synthesis Jonathan Lorraine et.al. 2306.07349 null
2023-06-09 Neural Kernel Surface Reconstruction Jiahui Huang et.al. 2305.19590 null
2023-08-13 Neural LiDAR Fields for Novel View Synthesis Shengyu Huang et.al. 2305.01643 null
2023-04-19 NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models Seung Wook Kim et.al. 2304.09787 null
2023-12-28 Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models Andreas Blattmann et.al. 2304.08818 null
2023-04-06 Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes Zian Wang et.al. 2304.03266 null
2023-04-04 Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion Davis Rempe et.al. 2304.01893 null
2023-03-25 VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion Yiming Li et.al. 2302.12251 null
2023-02-09 Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting Viraj Prabhu et.al. 2302.04832 null
2023-02-02 Synthesizing Physical Character-Scene Interactions Mohamed Hassan et.al. 2302.00883 null
2023-01-31 PADL: Language-Directed Physics-Based Character Control Jordan Juravsky et.al. 2301.13868 null
2023-03-25 Magic3D: High-Resolution Text-to-3D Content Creation Chen-Hsuan Lin et.al. 2211.10440 null
2022-11-08 GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting Alexander Cui et.al. 2211.02545 null
2022-10-12 LION: Latent Point Diffusion Models for 3D Shape Generation Xiaohui Zeng et.al. 2210.06978 null
2022-10-06 XDGAN: Multi-Modal 3D Shape Generation in 2D Space Hassan Abu Alhaija et.al. 2210.03007 null
2022-10-03 Optimizing Data Collection for Machine Learning Rafid Mahmood et.al. 2210.01234 null
2022-09-26 EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations Ahmad Darkhalil et.al. 2209.13064 null
2022-09-22 GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images Jun Gao et.al. 2209.11163 null
2022-08-19 Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion Zian Wang et.al. 2208.09480 null
2022-08-18 MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation Gopal Sharma et.al. 2208.08580 null
2022-07-05 Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention Gary Leung et.al. 2207.02126 null
2022-07-13 How Much More Data Do I Need? Estimating Requirements for Downstream Tasks Rafid Mahmood et.al. 2207.01725 null
2022-06-19 Scalable Neural Data Server: A Data Recommender for Transfer Learning Tianshi Cao et.al. 2206.09386 null
2022-06-16 Virtual Correspondence: Humans as a Cue for Extreme-View Geometry Wei-Chiu Ma et.al. 2206.08365 null
2022-06-15 Variable Bitrate Neural Fields Towaki Takikawa et.al. 2206.07707 null
2022-06-06 Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps Seung Wook Kim et.al. 2206.02903 null
2022-05-05 ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters Xue Bin Peng et.al. 2205.01906 null
2022-04-19 M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation Enze Xie et.al. 2204.05088 null
2022-04-06 AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis Zhiqin Chen et.al. 2204.03105 null

Autonomous Driving

Publish Date Title Authors PDF Code
2025-08-28 DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes Yajiao Xiong et.al. 2508.20965 null
2025-08-28 Surfel-based 3D Registration with Equivariant SE(3) Features Xueyang Kang et.al. 2508.20789 null
2025-08-28 SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer Fachri Najm Noer Kartiman et.al. 2508.20762 null
2025-08-28 UTA-Sign: Unsupervised Thermal Video Augmentation via Event-Assisted Traffic Signage Sketching Yuqi Han et.al. 2508.20594 null
2025-08-28 Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts Zixuan Hu et.al. 2508.20488 null
2025-08-28 Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation Jiusi Li et.al. 2508.20471 null
2025-08-27 Streamlining the Development of Active Learning Methods in Real-World Object Detection Moussa Kassem Sbeyti et.al. 2508.19906 null
2025-08-27 Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities Imad Ali Shah et.al. 2508.19905 null
2025-08-27 Generalizing Monocular 3D Object Detection Abhinav Kumar et.al. 2508.19593 null
2025-08-25 Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation Alexandros Gkillas et.al. 2508.19290 null
2025-08-26 Interpretable Decision-Making for End-to-End Autonomous Driving Mona Mirzaie et.al. 2508.18898 null
2025-08-26 EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding Luqing Luo et.al. 2508.18785 null
2025-08-20 GM-Skip: Metric-Guided Transformer Block Skipping for Efficient Vision-Language Models Lianming Huang et.al. 2508.18227 null
2025-08-25 EventTracer: Fast Path Tracing-based Event Stream Rendering Zhenyang Li et.al. 2508.18071 null
2025-08-25 Integration of Computer Vision with Adaptive Control for Autonomous Driving Using ADORE Abu Shad Ahammed et.al. 2508.17985 null
2025-08-25 Enhanced Drift-Aware Computer Vision Architecture for Autonomous Driving Md Shahi Amran Hossain et.al. 2508.17975 null
2025-08-25 Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction Yunxiang Liu et.al. 2508.17797 null
2025-08-23 A Rapid Iterative Trajectory Planning Method for Automated Parking through Differential Flatness Zhouheng Li et.al. 2508.17038 null
2025-08-23 A Survey of Deep Learning-based Point Cloud Denoising Jinxi Wang et.al. 2508.17011 null
2025-08-23 Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model Fan Ding et.al. 2508.16947 null
2025-08-22 Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation Guangyu Sun et.al. 2508.16568 null
2025-08-22 Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation Chun-Peng Chang et.al. 2508.16512 null
2025-08-22 SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather Edoardo Palladin et.al. 2508.16408 null
2025-08-22 MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction Ziyang Yan et.al. 2508.15653 null
2025-08-23 ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors Kaiyuan Tan et.al. 2508.15529 null
2025-08-21 RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features Olga Matykina et.al. 2508.15353 null
2025-08-21 RATopo: Improving Lane Topology Reasoning via Redundancy Assignment Han Li et.al. 2508.15272 null
2025-08-21 Adversarial Agent Behavior Learning in Autonomous Driving Using Deep Reinforcement Learning Arjun Srinivasan et.al. 2508.15207 null
2025-08-25 MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion Xuyang Chen et.al. 2508.15169 null
2025-08-28 Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving Dianzhao Li et.al. 2508.14926 null
2025-08-20 Multiscale Video Transformers for Class Agnostic Segmentation in Autonomous Driving Leila Cheshmi et.al. 2508.14729 null
2025-08-20 MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation Guile Wu et.al. 2508.14327 null
2025-08-19 ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving Xianda Guo et.al. 2508.13977 null
2025-08-19 Unleashing Semantic and Geometric Priors for 3D Scene Completion Shiyuan Chen et.al. 2508.13601 null
2025-08-25 Bridging Clear and Adverse Driving Conditions Yoel Shapiro et.al. 2508.13592 null
2025-08-19 Generative Model-Based Feature Attention Module for Video Action Analysis Guiqin Wang et.al. 2508.13565 null
2025-08-19 CORENet: Cross-Modal 4D Radar Denoising Network with LiDAR Supervision for Autonomous Driving Fuyang Liu et.al. 2508.13485 null
2025-08-19 Structured Prompting and Multi-Agent Knowledge Distillation for Traffic Video Interpretation and Risk Inference Yunxiang Yang et.al. 2508.13439 null
2025-08-18 Incremental Generalized Hybrid A* Sidharth Talia et.al. 2508.13392 null
2025-08-18 Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving Minhao Xiong et.al. 2508.13305 null
2025-08-18 SpotVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer Chen Qian et.al. 2508.12638 null
2025-08-18 ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving Can Cui et.al. 2508.12603 null
2025-08-17 An Initial Study of Bird’s-Eye View Generation for Autonomous Vehicles using Cross-View Transformers Felipe Carlos dos Santos et.al. 2508.12520 null
2025-08-17 LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving Nan Song et.al. 2508.12404 null
2025-08-17 DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection Yuval Haitman et.al. 2508.12330 null
2025-08-17 TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform Jun Liu et.al. 2508.12279 null
2025-08-16 InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes Hongyuan Liu et.al. 2508.12015 null
2025-08-16 Saliency-Based Attention Shifting: A Framework for Improving Driver Situational Awareness of Out-of-Label Hazards Yousra Shleibik et.al. 2508.11887 null
2025-08-16 Data Shift of Object Detection in Autonomous Driving Lida Xu et.al. 2508.11868 null
2025-08-15 Relative Position Matters: Trajectory Prediction and Planning with Polar Representation Bozhou Zhang et.al. 2508.11492 null
2025-08-15 Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving Bozhou Zhang et.al. 2508.11488 null
2025-08-15 EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback Jiayue Jin et.al. 2508.11453 null
2025-08-15 ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving Jingyu Li et.al. 2508.11428 null
2025-08-15 Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking Haonan Zhang et.al. 2508.11323 null
2025-08-15 A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving Jialin Li et.al. 2508.11218 null
2025-08-14 CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving Jiarong Li et.al. 2508.10962 null
2025-08-18 HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffision Model Qi Liu et.al. 2508.10935 null
2025-08-14 Towards Powerful and Practical Patch Attacks for 2D Object Detection in Autonomous Driving Yuxin Cao et.al. 2508.10600 null
2025-08-14 SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving Philipp Wolters et.al. 2508.10567 null
2025-08-14 Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies Ayushman Sarkar et.al. 2508.10523 null
2025-08-14 STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes Keishi Ishihara et.al. 2508.10427 null
2025-08-14 From Pixel to Mask: A Survey of Out-of-Distribution Segmentation Wenjie Zhao et.al. 2508.10309 null
2025-08-13 BridgeTA: Bridging the Representation Gap in Knowledge Distillation via Teacher Assistant for Bird’s Eye View Map Segmentation Beomjun Kim et.al. 2508.09599 null
2025-08-13 Offline Auto Labeling: BAAS Stefan Haag et.al. 2508.09585 null
2025-08-13 Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving Guangxun Zhu et.al. 2508.09404 null
2025-08-12 VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception Fuhao Chang et.al. 2508.09061 null
2025-08-12 A Pseudo Global Fusion Paradigm-Based Cross-View Network for LiDAR-Based Place Recognition Jintao Cheng et.al. 2508.08917 null
2025-08-21 ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction Chaojun Ni et.al. 2508.08170 null
2025-08-18 TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation Huawei Sun et.al. 2508.08038 null
2025-08-11 CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving Qi Xiang et.al. 2508.07838 null
2025-08-11 Risk Map As Middleware: Towards Interpretable Cooperative End-to-end Autonomous Driving for Risk-Aware Planning Mingyue Lei et.al. 2508.07686 null
2025-08-11 Progressive Bird’s Eye View Perception for Safety-Critical Autonomous Driving: A Comprehensive Survey Yan Gong et.al. 2508.07560 null
2025-08-12 Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring Ludan Zhang et.al. 2508.07552 null
2025-08-10 Noise-Aware Generative Microscopic Traffic Simulation Vindula Jayawardana et.al. 2508.07453 null
2025-08-09 An Evolutionary Game-Theoretic Merging Decision-Making Considering Social Acceptance for Autonomous Driving Haolin Liu et.al. 2508.07080 null
2025-08-27 From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving Antonio Guillen-Perez et.al. 2508.07029 null
2025-08-09 WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering Yixin Zhu et.al. 2508.06982 null
2025-08-08 Robust-Sub-Gaussian Model Predictive Control for Safe Ultrasound-Image-Guided Robotic Spinal Surgery Yunke Ao et.al. 2508.06744 null
2025-08-15 IRL-VLA: Training an Vision-Language-Action Policy via Reward World Model Anqing Jiang et.al. 2508.06571 null
2025-08-20 MetAdv: A Unified and Interactive Adversarial Testing Platform for Autonomous Driving Aishan Liu et.al. 2508.06534 null
2025-08-02 RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving Jiayuan Wang et.al. 2508.06529 null
2025-08-12 GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving Jian Wang et.al. 2508.06113 null
2025-08-08 ME$^3$-BEV: Mamba-Enhanced Deep Reinforcement Learning for End-to-End Autonomous Driving with BEV-Perception Siyi Lu et.al. 2508.06074 null
2025-08-07 VISTA: Vision-Language Imitation of Situational Thinking and Attention for Human-Like Driver Focus in Dynamic Environments Kaiser Hamid et.al. 2508.05852 null
2025-08-07 SMOL-MapSeg: Show Me One Label Yunshuang Yuan et.al. 2508.05501 null
2025-08-07 Physical Adversarial Camouflage through Gradient Calibration and Regularization Jiawei Liang et.al. 2508.05414 null
2025-08-07 DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model Rui Yu et.al. 2508.05402 null
2025-08-07 ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models Yatong Lan et.al. 2508.05236 null
2025-08-07 PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems Qi Guo et.al. 2508.05167 null
2025-08-07 AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics Stella Su et.al. 2508.04955 null
2025-08-06 Occupancy Learning with Spatiotemporal Memory Ziyang Leng et.al. 2508.04705 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case Baihui Xiao et.al. 2508.04642 null
2025-08-06 Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark Xiao Wang et.al. 2508.04260 null
2025-08-06 DRIVE: Dynamic Rule Inference and Verified Evaluation for Constraint-Aware Autonomous Driving Longling Geng et.al. 2508.04066 null
2025-08-05 LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences Ao Liang et.al. 2508.03692 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-13 MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention Qi Xie et.al. 2508.03034 null
2025-08-04 Context-aware Risk Assessment and Its Application in Autonomous Driving Boyang Tian et.al. 2508.02919 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera Byeonggyu Park et.al. 2508.02348 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 Test-Time Model Adaptation for Quantized Neural Networks Zeshuai Deng et.al. 2508.02180 null
2025-08-04 Beyond RGB and Events: Enhancing Object Detection under Adverse Lighting with Monocular Normal Maps Mingjie Liu et.al. 2508.02127 null
2025-08-04 Mapillary Vistas Validation for Fine-Grained Traffic Signs: A Benchmark Revealing Vision-Language Model Limitations Sparsh Garg et.al. 2508.02047 null
2025-08-20 Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving Tianyuan Zhang et.al. 2508.02028 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-03 StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding Haolin Yang et.al. 2508.01875 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving Luqi Cheng et.al. 2508.01704 null
2025-08-03 Adverse Weather-Independent Framework Towards Autonomous Driving Perception through Temporal Correlation and Unfolded Regularization Wei-Bin Kou et.al. 2508.01583 null
2025-08-02 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Zhan Shi et.al. 2508.01197 null
2025-08-01 CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception Chenyi Wang et.al. 2508.01062 null
2025-08-12 Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance Fengze Yang et.al. 2508.01057 null
2025-07-31 Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems Shiyao Sang et.al. 2508.00947 null
2025-08-01 Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR Adwait Chandorkar et.al. 2508.00744 null
2025-08-12 Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving Stefan Englmeier et.al. 2508.00589 null
2025-08-01 Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection Marc Hölle et.al. 2508.00587 null
2025-08-01 Pro2Guard: Proactive Runtime Enforcement of LLM Agent Safety via Probabilistic Model Checking Haoyu Wang et.al. 2508.00500 null
2025-08-01 Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence Danzhen Fu et.al. 2508.00299 null
2025-07-21 AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks Ahmet Melih Ince et.al. 2508.00011 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation Yuchen Zhou et.al. 2507.23599 null
2025-08-09 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving Yi Zhang et.al. 2507.23540 null
2025-07-31 MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting Xingyue Peng et.al. 2507.23340 null
2025-07-31 Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision Qiang Lu et.al. 2507.23331 null
2025-07-31 FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models Yiming Yang et.al. 2507.23325 null
2025-08-02 FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning Jiajun Cao et.al. 2507.23318 null
2025-08-04 PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving Xuewei Tang et.al. 2507.23309 null
2025-07-30 Causal-Inspired Multi-Agent Decision-Making via Graph Reinforcement Learning Jing Wang et.al. 2507.23080 null
2025-08-05 Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints Santosh Patapati et.al. 2507.23064 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-08-07 Bayesian Optimization applied for accelerated Virtual Validation of the Autonomous Driving Function Satyesh Shanker Awasthi et.al. 2507.22769 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation Jiuming Liu et.al. 2507.22454 null
2025-07-30 Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators Kaustav Chakraborty et.al. 2507.22389 null
2025-07-29 Hierarchical Game-Based Multi-Agent Decision-Making for Autonomous Vehicles Mushuang Liu et.al. 2507.21941 null
2025-07-31 MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors Shouyi Lu et.al. 2507.21872 null
2025-07-29 SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking Qianxiong Xu et.al. 2507.21732 null
2025-08-16 Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition Ruiyang Hao et.al. 2507.21610 null
2025-07-29 SafeDriveRAG: Towards Safe Autonomous Driving with Knowledge Graph-based Retrieval-Augmented Generation Hao Ye et.al. 2507.21585 null
2025-07-30 No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering Linye Wei et.al. 2507.21572 null
2025-07-29 RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors Tianhui Cai et.al. 2507.21567 null
2025-07-29 SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity Xingyang Li et.al. 2507.21499 null
2025-07-29 MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving Thomas Monninger et.al. 2507.21423 null
2025-08-03 Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy Jicheng Yuan et.al. 2507.21358 null
2025-07-25 Seeing Beyond Frames: Zero-Shot Pedestrian Intention Prediction with Raw Temporal Video and Multimodal Cues Pallavi Zambare et.al. 2507.21161 null
2025-07-28 GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction Tianhao Li et.al. 2507.20963 null
2025-07-25 Event-Based De-Snowing for Autonomous Driving Manasi Muglikar et.al. 2507.20901 null
2025-07-28 DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception Weicheng Zheng et.al. 2507.20879 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving Levente Tempfli et.al. 2507.20397 null
2025-07-27 Solving Scene Understanding for Autonomous Navigation in Unstructured Environments Naveen Mathews Renji et.al. 2507.20389 null
2025-07-27 VLMPlanner: Integrating Visual Language Models with Motion Planning Zhipeng Tang et.al. 2507.20342 null
2025-07-27 MambaMap: Online Vectorized HD Map Construction using State Space Model Ruizi Yang et.al. 2507.20224 null
2025-07-27 LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks Fei Kong et.al. 2507.20174 null
2025-07-27 Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning Ziyi Liang et.al. 2507.20089 null
2025-07-26 Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application Tongjie Li et.al. 2507.19974 null
2025-08-12 DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes Rishav Kumar et.al. 2507.19912 null
2025-07-26 Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA Ahmed Abouelazm et.al. 2507.19883 null
2025-07-26 FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving Tao Lian et.al. 2507.19881 null
2025-07-30 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-26 A 4D Radar Camera Extrinsic Calibration Tool Based on 3D Uncertainty Perspective N Points Chuan Cao et.al. 2507.19829 null
2025-07-25 PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction Haichuan Li et.al. 2507.19701 null
2025-07-25 Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing Haichuan Li et.al. 2507.19691 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-07-25 An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles Matthias Weiß et.al. 2507.19446 null
2025-07-25 SDVDiag: A Modular Platform for the Diagnosis of Connected Vehicle Functions Matthias Weiß et.al. 2507.19403 null
2025-07-25 BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving Felix Brandstaetter et.al. 2507.19370 null
2025-07-25 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences Yusuke Hirota et.al. 2507.19362 null
2025-07-25 SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence Viktar Dubovik et.al. 2507.19321 null
2025-07-25 CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception Jiaru Zhong et.al. 2507.19239 null
2025-07-25 VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions Haoang Lu et.al. 2507.19188 null
2025-07-25 Continual Learning-Based Unified Model for Unpaired Image Restoration Tasks Kotha Kartheek et.al. 2507.19184 null
2025-07-25 Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL Ahmed Abouelazm et.al. 2507.19146 null
2025-07-31 PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction Yanghong Liu et.al. 2507.19119 null
2025-07-25 Fine-Grained Traffic Inference from Road to Lane via Spatio-Temporal Graph Node Generation Shuhao Li et.al. 2507.19089 null
2025-07-25 HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback Elham Soltani Kazemi et.al. 2507.18921 null
2025-07-24 Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving Keshav Gupta et.al. 2507.18763 null
2025-07-24 Linear Memory SE(2) Invariant Attention Ethan Pronovost et.al. 2507.18597 null
2025-07-24 GaussianFusionOcc: A Seamless Sensor Fusion Approach for 3D Occupancy Prediction Using 3D Gaussians Tomislav Pavković et.al. 2507.18522 null
2025-07-24 Delving into Mapping Uncertainty for Mapless Trajectory Prediction Zongzheng Zhang et.al. 2507.18498 null
2025-07-24 Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments Xiao Yang et.al. 2507.18484 null
2025-07-24 CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting Haoran Xu et.al. 2507.18473 null
2025-07-24 LONG3R: Long Sequence Streaming 3D Reconstruction Zhuoguang Chen et.al. 2507.18255 null
2025-07-24 GenAI for Automotive Software Development: From Requirements to Wheels Nenad Petrovic et.al. 2507.18223 null
2025-07-24 Goal-based Trajectory Prediction for improved Cross-Dataset Generalization Daniel Grimm et.al. 2507.18196 null
2025-07-24 Policy Disruption in Reinforcement Learning: Adversarial Attack with Large Language Models and Critical State Identification Junyong Jiang et.al. 2507.18113 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-23 Reusing Attention for One-stage Lane Topology Understanding Yang Li et.al. 2507.17617 null
2025-07-23 InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling Xiaoxue Chen et.al. 2507.17613 null
2025-07-24 PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving Maciej K. Wozniak et.al. 2507.17596 null
2025-07-23 SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving Chuang Chen et.al. 2507.17479 null
2025-07-23 VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization Sania Waheed et.al. 2507.17455 null
2025-07-23 Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning Joobin Jin et.al. 2507.17418 null
2025-08-06 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study Mandar Pitale et.al. 2507.17118 null
2025-07-22 SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction Zaipeng Duan et.al. 2507.17083 null
2025-07-22 Few-Shot Learning in Video and 3D Object Detection: A Survey Md Meftahul Ferdaus et.al. 2507.17079 null
2025-07-22 Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach Adithya Mohan et.al. 2507.17070 null
2025-07-22 Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption Keneni W. Tesema et.al. 2507.16743 null
2025-07-22 Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control Zongzheng Zhang et.al. 2507.16645 null
2025-07-22 A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System Lorenzo Gentilini et.al. 2507.16621 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-22 A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization Yifan Zhang et.al. 2507.16177 null
2025-07-21 Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity Huiling Yang et.al. 2507.15601 null
2025-07-21 Robots for Kiwifruit Harvesting and Pollination Jamie Bell et.al. 2507.15484 null
2025-07-21 VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Haichao Liu et.al. 2507.15266 null
2025-07-20 CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning Pan Hu et.al. 2507.14903 null
2025-07-23 GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving Chi Wan et.al. 2507.14456 null
2025-07-18 Preference-based Multi-Objective Reinforcement Learning Ni Mu et.al. 2507.14066 null
2025-07-18 Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors Jochen Wulf et.al. 2507.14034 null
2025-07-18 Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection Yujian Mo et.al. 2507.13899 null
2025-07-18 Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation Max van den Hoven et.al. 2507.13857 null
2025-07-18 One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion Haoang Lu et.al. 2507.13801 null
2025-07-18 AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework Yu Yao et.al. 2507.13729 null
2025-07-17 CaSTFormer: Causal Spatio-Temporal Transformer for Driving Intention Prediction Sirui Wang et.al. 2507.13425 null
2025-07-16 From Binary to Semantic: Utilizing Large-Scale Binary Occupancy Data for 3D Semantic Occupancy Prediction Chihiro Noguchi et.al. 2507.13387 null
2025-07-17 Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models Arian Mousakhan et.al. 2507.13162 null
2025-07-17 Channel-wise Motion Features for Efficient Motion Segmentation Riku Inoue et.al. 2507.13082 null
2025-07-23 LaViPlan: Language-Guided Visual Path Planning with RLVR Hayeon Oh et.al. 2507.12911 null
2025-07-17 World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving Yanchen Guan et.al. 2507.12762 null
2025-07-17 Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation Yanchen Guan et.al. 2507.12755 null
2025-07-16 ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving Yuhang Lu et.al. 2507.12499 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models Santosh Vasa et.al. 2507.12414 null
2025-07-21 AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving Jiawei Xu et.al. 2507.12137 null
2025-07-16 LidarPainter: One-Step Away From Any Lidar View To Novel Guidance Yuzhou Ji et.al. 2507.12114 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers Mohammed Hassanin et.al. 2507.11852 null
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-15 A Survey on Interpretability in Visual Recognition Qiyang Wan et.al. 2507.11099 null
2025-07-14 RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding Benjamin Stoler et.al. 2507.10749 null
2025-07-14 Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance Kyungtae Han et.al. 2507.10500 null

Traffic Simulation

Publish Date Title Authors PDF Code
2025-08-28 HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning Zhi Su et.al. 2508.21043 null
2025-08-28 Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees Yaniv Hassidof et.al. 2508.21001 null
2025-08-28 Deep Fuzzy Optimization for Batch-Size and Nearest Neighbors in Optimal Robot Motion Planning Liding Zhang et.al. 2508.20884 null
2025-08-28 Uncertainty Aware-Predictive Control Barrier Functions: Safer Human Robot Interaction through Probabilistic Motion Forecasting Lorenzo Busellato et.al. 2508.20812 null
2025-08-28 CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network Reza Akbari Movahed et.al. 2508.20734 null
2025-08-27 Regulation-Aware Game-Theoretic Motion Planning for Autonomous Racing Francesco Prignoli et.al. 2508.20203 null
2025-08-27 Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning Jinhao Liang et.al. 2508.20095 null
2025-08-27 APT*: Asymptotically Optimal Motion Planning via Adaptively Prolated Elliptical R-Nearest Neighbors Liding Zhang et.al. 2508.19790 null
2025-08-27 Tree-Based Grafting Approach for Bidirectional Motion Planning with Local Subsets Optimization Liding Zhang et.al. 2508.19776 null
2025-08-27 Elliptical K-Nearest Neighbors – Path Optimization via Coulomb’s Law and Invalid Vertices in C-space Obstacles Liding Zhang et.al. 2508.19771 null
2025-08-27 Autonomous Aerial Manipulation at Arbitrary Pose in SE(3) with Robust Control and Whole-body Planning Dongjae Lee et.al. 2508.19608 null
2025-08-25 Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning Antonio Guillen-Perez et.al. 2508.18397 null
2025-08-26 FlowVLA: Thinking in Motion with a Visual Chain of Thought Zhide Zhong et.al. 2508.18269 null
2025-08-25 Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction Yunxiang Liu et.al. 2508.17797 null
2025-08-23 LLM-based Human-like Traffic Simulation for Self-driving Tests Wendi Li et.al. 2508.16962 null
2025-08-23 Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model Fan Ding et.al. 2508.16947 null
2025-08-21 Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation Huy Hoang Nguyen et.al. 2508.15427 null
2025-08-20 TRUST-Planner: Topology-guided Robust Trajectory Planner for AAVs with Uncertain Obstacle Spatial-temporal Avoidance Junzhi Li et.al. 2508.14610 null
2025-08-20 FiReFly: Fair Distributed Receding Horizon Planning for Multiple UAVs Nicole Fronda et.al. 2508.14381 null
2025-08-16 Task and Motion Planning for Humanoid Loco-manipulation Michal Ciebielski et.al. 2508.14099 null
2025-08-20 Accelerating Signal-Temporal-Logic-Based Task and Motion Planning of Bipedal Navigation using Benders Decomposition Jiming Ren et.al. 2508.13407 null
2025-08-18 BOW: Bayesian Optimization over Windows for Motion Planning in Complex Environments Sourav Raxit et.al. 2508.13052 null
2025-08-28 On the complexity of constrained reconfiguration and motion planning Nicolas Bousquet et.al. 2508.13032 null
2025-08-26 SocialTrack: Multi-Object Tracking in Complex Urban Traffic Scenes Inspired by Social Behavior Wenguang Tao et.al. 2508.12777 null
2025-08-17 Autonomous Oil Spill Response Through Liquid Neural Trajectory Modeling and Coordinated Marine Robotics Hadas C. Kuzmenko et.al. 2508.12456 null
2025-08-17 EgoLoc: A Generalizable Solution for Temporal Interaction Localization in Egocentric Videos Junyi Ma et.al. 2508.12349 null
2025-08-15 A Comparative Study of Floating-Base Space Parameterizations for Agile Whole-Body Motion Planning Evangelos Tsiatsianas et.al. 2508.11520 null
2025-08-15 Relative Position Matters: Trajectory Prediction and Planning with Polar Representation Bozhou Zhang et.al. 2508.11492 null
2025-08-15 EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback Jiayue Jin et.al. 2508.11453 null
2025-08-15 ReachVox: Clutter-free Reachability Visualization for Robot Motion Planning in Virtual Reality Steffen Hauck et.al. 2508.11426 null
2025-08-15 Learning Differentiable Reachability Maps for Optimization-based Humanoid Motion Generation Masaki Murooka et.al. 2508.11275 null
2025-08-15 A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving Jialin Li et.al. 2508.11218 null
2025-08-20 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation Nikolaos Gkanatsios et.al. 2508.11002 null
2025-08-14 SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving Philipp Wolters et.al. 2508.10567 null
2025-08-14 STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes Keishi Ishihara et.al. 2508.10427 null
2025-08-12 CLF-RL: Control Lyapunov Function Guided Reinforcement Learning Kejun Li et.al. 2508.09354 null
2025-08-10 Whole-Body Coordination for Dynamic Object Grasping with Legged Manipulators Qiwei Liang et.al. 2508.08328 null
2025-08-11 Learning an Implicit Physics Model for Image-based Fluid Simulation Emily Yue-Ting Jia et.al. 2508.08254 null
2025-08-10 A Learning-Based Framework for Collision-Free Motion Planning Mateus Salomão et.al. 2508.07502 null
2025-08-10 Noise-Aware Generative Microscopic Traffic Simulation Vindula Jayawardana et.al. 2508.07453 null
2025-08-10 Bio-Inspired Topological Autonomous Navigation with Active Inference in Robotics Daria de Tinguy et.al. 2508.07267 null
2025-08-12 Understanding Dynamic Scenes in Ego Centric 4D Point Clouds Junsheng Huang et.al. 2508.07251 null
2025-08-10 CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion Xiaotong Lin et.al. 2508.07162 null
2025-08-10 Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction Yu Liu et.al. 2508.07146 null
2025-08-09 ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting Sandro Papais et.al. 2508.07089 null
2025-08-09 Model Predictive Control for Crowd Navigation via Learning-Based Trajectory Prediction Mohamed Parvez Aslam et.al. 2508.07079 null
2025-08-05 Historical Prediction Attention Mechanism based Trajectory Forecasting for Proactive Work Zone Safety in a Digital Twin Environment Minhaj Uddin Ahmad et.al. 2508.06544 null
2025-08-04 Symbolic Learning of Interpretable Reduced-Order Models for Jumping Quadruped Robots Gioele Buriani et.al. 2508.06538 null
2025-08-08 V*: An Efficient Motion Planning Algorithm for Autonomous Vehicles Abdullah Zareh Andaryan et.al. 2508.06404 null
2025-08-08 Incremental Language Understanding for Online Motion Planning of Robot Manipulators Mitchell Abrams et.al. 2508.06095 null
2025-08-08 Dynamical Trajectory Planning of Disturbance Consciousness for Air-Land Bimodal Unmanned Aerial Vehicles Shaoting Liu et.al. 2508.05972 null
2025-08-07 TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution Zhikai Zhao et.al. 2508.05616 null
2025-08-07 Benchmarking Shortcutting Techniques for Multi-Robot-Arm Motion Planning Philip Huang et.al. 2508.05027 null
2025-08-06 LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan et.al. 2508.04847 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 Incorporating Stochastic Models of Controller Behavior into Kinodynamic Efficiently Adaptive State Lattices for Mobile Robot Motion Planning in Off-Road Environments Eric R. Damm et.al. 2508.04384 null
2025-08-06 Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction Yu Liu et.al. 2508.04229 null
2025-08-11 Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems Luai Abuelsamen et.al. 2508.04146 null
2025-08-05 Constraint-Preserving Data Generation for Visuomotor Policy Learning Kevin Lin et.al. 2508.03944 null
2025-08-05 Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions Ergi Tushe et.al. 2508.03541 null
2025-08-04 X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio Chenxu Zhang et.al. 2508.02944 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering Xu Wang et.al. 2508.02362 null
2025-08-19 Adaptive Lattice-based Motion Planning Abhishek Dhar et.al. 2508.02350 null
2025-08-04 Framework for Robust Motion Planning of Tethered Multi-Robot Systems in Marine Environments Markus Buchholz et.al. 2508.02287 null
2025-08-04 AID4AD: Aerial Image Data for Automated Driving Perception Daniel Lengerer et.al. 2508.02140 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction Hua Yu et.al. 2508.01585 null
2025-07-29 A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles Jiayuan Wang et.al. 2508.00917 null
2025-08-01 On Learning Closed-Loop Probabilistic Multi-Agent Simulator Juanwu Lu et.al. 2508.00384 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-07-31 Data-Driven Motion Planning for Uncertain Nonlinear Systems Babak Esmaeili et.al. 2508.00154 null
2025-07-31 OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction Yang Gao et.al. 2507.23657 null
2025-07-31 A Framework for Ethical Decision-Making in Automated Vehicles through Human Reasons-based Supervision Lucas Elbert Suryana et.al. 2507.23308 null
2025-07-31 Simulation-based planning of Motion Sequences for Automated Procedure Optimization in Multi-Robot Assembly Cells Loris Schneider et.al. 2507.23270 null
2025-08-01 Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future Guoping Xu et.al. 2507.22792 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators Kaustav Chakraborty et.al. 2507.22389 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 VLMPlanner: Integrating Visual Language Models with Motion Planning Zhipeng Tang et.al. 2507.20342 null
2025-07-27 PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks Clinton Ansun Mo et.al. 2507.20170 null
2025-07-25 PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction Haichuan Li et.al. 2507.19701 null
2025-07-25 RAKOMO: Reachability-Aware K-Order Markov Path Optimization for Quadrupedal Loco-Manipulation Mattia Risiglione et.al. 2507.19652 null
2025-07-25 High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins Lorenzo Cazzella et.al. 2507.19173 null
2025-07-31 PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction Yanghong Liu et.al. 2507.19119 null
2025-07-24 Probabilistic Collision Risk Estimation through Gauss-Legendre Cubature and Non-Homogeneous Poisson Processes Trent Weiss et.al. 2507.18819 null
2025-07-24 Delving into Mapping Uncertainty for Mapless Trajectory Prediction Zongzheng Zhang et.al. 2507.18498 null
2025-07-24 Goal-based Trajectory Prediction for improved Cross-Dataset Generalization Daniel Grimm et.al. 2507.18196 null
2025-07-24 DanceGraph: A Complementary Architecture for Synchronous Dancing Online David Sinclair et.al. 2507.18052 null
2025-07-23 Safety Assurance for Quadrotor Kinodynamic Motion Planning Theodoros Tavoulareas et.al. 2507.17679 null
2025-07-23 IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception Haichuan Li et.al. 2507.17445 null
2025-08-06 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 Falconry-like palm landing by a flapping-wing drone based on the human gesture interaction and distance-aware flight planning Kazuki Numazato et.al. 2507.17144 null
2025-07-22 RAPTAR: Radar Radiation Pattern Acquisition through Automated Collaborative Robotics Maaz Qureshi et.al. 2507.16988 null
2025-07-21 Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection Zihao Chen et.al. 2507.16109 null
2025-07-21 Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction Shiyang Li et.al. 2507.15832 null
2025-07-21 Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs Ruochu Yang et.al. 2507.15782 null
2025-07-21 Selective Densification for Rapid Motion Planning in High Dimensions with Narrow Passages Lu Huang et.al. 2507.15710 null
2025-07-21 A Universal Vehicle-Trailer Navigation System with Neural Kinematics and Online Residual Learning Yanbo Chen et.al. 2507.15607 null
2025-07-21 VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Haichao Liu et.al. 2507.15266 null
2025-07-20 Search-Based Autonomous Vehicle Motion Planning Using Game Theory Pouya Panahandeh et.al. 2507.15088 null
2025-07-20 CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning Pan Hu et.al. 2507.14903 null
2025-07-18 Context-Aware Behavior Learning with Heuristic Motion Memory for Underwater Manipulation Markus Buchholz et.al. 2507.14099 null
2025-07-18 NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning Qingyi Chen et.al. 2507.13940 null
2025-07-18 Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification Sihang Wei et.al. 2507.13613 null
2025-08-08 Trustworthy Pedestrian Trajectory Prediction via Pattern-Aware Interaction Modeling Kaiyuan Zhai et.al. 2507.13397 null
2025-07-25 Signal Temporal Logic Compliant Co-design of Planning and Control Manas Sashank Juvvi et.al. 2507.13225 null
2025-07-22 Predictability-Aware Motion Prediction for Edge XR via High-Order Error-State Kalman Filtering Ziyu Zhong et.al. 2507.13179 null
2025-07-17 Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning Giwon Lee et.al. 2507.12977 null
2025-07-17 FFI-VTR: Lightweight and Robust Visual Teach and Repeat Navigation based on Feature Flow Indicator and Probabilistic Motion Planning Jikai Wang et.al. 2507.12800 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios Van-Hoang-Anh Phan et.al. 2507.12449 null
2025-07-16 Regrasp Maps for Sequential Manipulation Planning Svetlana Levit et.al. 2507.12407 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 A Fast Method for Planning All Optimal Homotopic Configurations for Tethered Robots and Its Extended Applications Jinyuan Liu et.al. 2507.11880 null
2025-07-15 MPC-based Coarse-to-Fine Motion Planning for Robotic Object Transportation in Cluttered Environments Chen Cai et.al. 2507.11211 null
2025-07-15 Enhancing Autonomous Manipulator Control with Human-in-loop for Uncertain Assembly Environments Ashutosh Mishra et.al. 2507.11006 null
2025-07-15 OffsetCrust: Variable-Radius Offset Approximation with Power Diagrams Zihan Zhao et.al. 2507.10924 null
2025-07-15 Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets Savva Morozov et.al. 2507.10878 null
2025-07-14 A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments Yuchen Wang et.al. 2507.10792 null
2025-07-23 Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis Yue Ding et.al. 2507.10382 null
2025-07-16 TOP: Trajectory Optimization via Parallel Optimization towards Constant Time Complexity Jiajun Yu et.al. 2507.10290 null
2025-07-14 MP-RBFN: Learning-based Vehicle Motion Primitives using Radial Basis Function Networks Marc Kaufeld et.al. 2507.10047 null
2025-07-22 Active Probing with Multimodal Predictions for Motion Planning Darshan Gadginmath et.al. 2507.09822 null
2025-07-13 Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions Yuanhong Zheng et.al. 2507.09446 null
2025-07-12 Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields Wondmgezahu Teshome et.al. 2507.09383 null
2025-07-19 Informed Hybrid Zonotope-based Motion Planning Algorithm Peng Xie et.al. 2507.09309 null
2025-07-12 Integrating Planning and Predictive Control Using the Path Feasibility Governor Shu Zhang et.al. 2507.09134 null
2025-07-09 Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination Xishun Liao et.al. 2507.08871 null
2025-07-14 STRAP: Spatial-Temporal Risk-Attentive Vehicle Trajectory Prediction for Autonomous Driving Xinyi Ning et.al. 2507.08563 null
2025-07-11 Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer Francesco De Cristofaro et.al. 2507.08365 null
2025-07-11 Neural Parameter-varying Data-enabled Predictive Control of Cold Atmospheric Pressure Plasma Jets Pegah GhafGhanbari et.al. 2507.08259 null
2025-07-10 GGMotion: Group Graph Dynamics-Kinematics Networks for Human Motion Prediction Shuaijin Wan et.al. 2507.07515 null
2025-07-10 Towards Safe Autonomous Driving: A Real-Time Safeguarding Concept for Motion Planning Algorithms Korbinian Moller et.al. 2507.07444 null
2025-07-09 When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior Chengyuan Zhang et.al. 2507.07012 null
2025-07-09 Robust signal decompositions on the circle Aral Kose et.al. 2507.07007 null
2025-07-09 ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture Mingjin Zeng et.al. 2507.06531 null
2025-07-08 AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization Deepak Raina et.al. 2507.05979 null
2025-07-08 DRO-EDL-MPC: Evidential Deep Learning-Based Distributionally Robust Model Predictive Control for Safe Autonomous Driving Hyeongchan Ham et.al. 2507.05710 null
2025-07-07 From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving Fabian Konstantinidis et.al. 2507.05254 null
2025-07-07 Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance Tobias Demmler et.al. 2507.05098 null
2025-07-07 Unifying Robot Optimization: Monte Carlo Tree Search with Tensor Factorization Teng Xue et.al. 2507.04949 null
2025-07-25 Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning Giwon Lee et.al. 2507.04790 null
2025-07-07 LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction Yixin Yan et.al. 2507.04634 null
2025-07-06 Free-Space Optical Communication-Driven NMPC Framework for Multi-Rotor Aerial Vehicles in Structured Inspection Scenarios Giuseppe Silano et.al. 2507.04443 null
2025-07-05 Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic Jianwei Tang et.al. 2507.04062 null
2025-07-05 Temporal Continual Learning with Prior Compensation for Human Motion Prediction Jianwei Tang et.al. 2507.04060 null
2025-07-05 DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments Qi Chen et.al. 2507.03878 null
2025-07-05 Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs Ishan Khurjekar et.al. 2507.03863 null
2025-07-04 Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues Hanfang Liang et.al. 2507.03365 null
2025-07-03 Trajectory Optimization for Differential Drive Mobile Manipulators via Topological Paths Search and Arc Length-Yaw Parameterization Long Xu et.al. 2507.02761 null
2025-07-03 Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization Caio Azevedo et.al. 2507.02406 null
2025-07-03 Path Planning using a One-shot-sampling Skeleton Map Gabriel O. Flores-Aquino et.al. 2507.02328 null
2025-07-02 GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters Wanjia Zhao et.al. 2507.02085 null
2025-07-09 Test-Time Scaling with Reflective Generative Model Zixiao Wang et.al. 2507.01951 null
2025-07-06 AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction Bin Rao et.al. 2507.01801 null
2025-07-02 Efficient Collision Detection for Long and Slender Robotic Links in Euclidean Distance Fields: Application to a Forestry Crane Marc-Philip Ecker et.al. 2507.01705 null
2025-07-02 LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction Muhammad Atta ur Rahman et.al. 2507.01308 null
2025-07-01 Search-Based Robot Motion Planning With Distance-Based Adaptive Motion Primitives Benjamin Kraljusic et.al. 2507.01198 null
2025-07-01 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations Ying Guo et.al. 2507.00472 null
2025-06-30 Rethink 3D Object Detection from Physical World Satoshi Tanaka et.al. 2507.00190 null
2025-06-30 Epona: Autoregressive Diffusion World Model for Autonomous Driving Kaiwen Zhang et.al. 2506.24113 null
2025-06-30 STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems Mingfei Cheng et.al. 2506.23995 null
2025-06-29 InfGen: Scenario Generation as Next Token Group Prediction Zhenghao Peng et.al. 2506.23316 null
2025-06-29 Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models Maarten Hugenholtz et.al. 2506.23164 null
2025-06-28 Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example Bei Zhou et.al. 2506.22894 null
2025-06-27 Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD Ruthvik Bokkasam et.al. 2506.22111 null
2025-06-27 A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments Akshay Jaitly et.al. 2506.21982 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-07-14 Ark: An Open-source Python-based Framework for Robot Learning Magnus Dierking et.al. 2506.21628 null
2025-06-26 GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction Muleilan Pei et.al. 2506.21121 null
2025-06-25 Near Time-Optimal Hybrid Motion Planning for Timber Cranes Marc-Philip Ecker et.al. 2506.20314 null
2025-06-24 Trajectory Prediction in Dynamic Object Tracking: A Critical Study Zhongping Dong et.al. 2506.19341 null
2025-06-25 AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation Ziyan Zhao et.al. 2506.19269 null
2025-08-04 Faster Motion Planning via Restarts Nancy Amato et.al. 2506.19016 null
2025-06-23 SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives Yizhou Chen et.al. 2506.18825 null
2025-06-23 Design, fabrication and control of a cable-driven parallel robot Dhruv Sorathiya et.al. 2506.18526 null
2025-06-23 Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances Zhe Zhang et.al. 2506.18410 null
2025-06-23 Selective Social-Interaction via Individual Importance for Fast Human Trajectory Prediction Yota Urano et.al. 2506.18291 null
2025-06-23 Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning Yue Li et.al. 2506.18234 null
2025-06-20 Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation Xiuyu Yang et.al. 2506.17213 null
2025-06-20 Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control Albert H. Li et.al. 2506.17184 null
2025-07-11 Experimental Setup and Software Pipeline to Evaluate Optimization based Autonomous Multi-Robot Search Algorithms Aditya Bhatt et.al. 2506.16710 null