Updated on 2025.08.08

This page is maintained by Leheng Li and lists papers he is interested in. The source code for this site is available here.

3D

Publish Date Title Authors PDF Code
2025-08-07 GAP: Gaussianize Any Point Clouds with Text Guidance Weiqi Zhang et.al. 2508.05631 null
2025-08-07 Physically Controllable Relighting of Photographs Chris Careaga et.al. 2508.05626 null
2025-08-07 Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity Yuhan Zhang et.al. 2508.05609 null
2025-08-07 Robust adaptive fuzzy sliding mode control for trajectory tracking for of cylindrical manipulator Van Cuong Pham et.al. 2508.05584 null
2025-08-07 Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis Kunyu Feng et.al. 2508.05580 null
2025-08-07 Point cloud segmentation for 3D Clothed Human Layering Davide Garavaso et.al. 2508.05531 null
2025-08-07 Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking Zewei Wu et.al. 2508.05514 null
2025-08-07 MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Shibo Wang et.al. 2508.05506 null
2025-08-07 Symmetry Understanding of 3D Shapes via Chirality Disentanglement Weikang Wang et.al. 2508.05505 null
2025-08-07 Computational Design and Fabrication of Modular Robots with Untethered Control Manas Bhargava et.al. 2508.05410 null
2025-08-07 CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation Hamza Kalisch et.al. 2508.05375 null
2025-08-07 3DGabSplat: 3D Gabor Splatting for Frequency-adaptive Radiance Field Rendering Junyu Zhou et.al. 2508.05343 null
2025-08-07 CF3: Compact and Fast 3D Feature Fields Hyunjoon Lee et.al. 2508.05254 null
2025-08-07 Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer Junyi Wang et.al. 2508.05240 null
2025-08-07 EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery Bingyu Yang et.al. 2508.05205 null
2025-08-07 Refining Gaussian Splatting: A Volumetric Densification Approach Mohamed Abdul Gafoor et.al. 2508.05187 null
2025-08-07 Learning to See and Act: Task-Aware View Planning for Robotic Manipulation Yongjie Bai et.al. 2508.05186 null
2025-08-07 FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction Mohammed Daba et.al. 2508.05153 null
2025-08-07 FedGIN: Federated Learning with Dynamic Global Intensity Non-linear Augmentation for Organ Segmentation using Multi-modal Images Sachin Dudda Nagaraju et.al. 2508.05137 null
2025-08-07 A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding Mahmoud Chick Zaouali et.al. 2508.05064 null
2025-08-07 DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion Yifeng Huang et.al. 2508.05060 null
2025-08-07 MAG-Nav: Language-Driven Object Navigation Leveraging Memory-Reserved Active Grounding Weifan Zhang et.al. 2508.05021 null
2025-08-07 Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion Shenglun Chen et.al. 2508.04984 null
2025-08-07 UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS Zhihao Guo et.al. 2508.04968 null
2025-08-07 Laplacian Analysis Meets Dynamics Modelling: Gaussian Splatting for 4D Reconstruction Yifan Zhou et.al. 2508.04966 null
2025-08-07 Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting Zijian Wang et.al. 2508.04965 null
2025-08-06 CryoGS: Gaussian Splatting for Cryo-EM Homogeneous Reconstruction Suyi Chen et.al. 2508.04929 null
2025-08-06 LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan et.al. 2508.04847 null
2025-08-06 Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models Mehrdad Moradi et.al. 2508.04818 null
2025-08-06 Occupancy Learning with Spatiotemporal Memory Ziyang Leng et.al. 2508.04705 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics Ye Pan et.al. 2508.04687 null
2025-08-06 PixCuboid: Room Layout Estimation from Multi-view Featuremetric Alignment Gustav Hanning et.al. 2508.04659 null
2025-08-06 OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment Tongfan Guan et.al. 2508.04611 null
2025-08-06 $NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything Lingfeng Zhang et.al. 2508.04598 null
2025-08-06 Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline Linqing Zhao et.al. 2508.04597 null
2025-08-06 LA-CaRe-CNN: Cascading Refinement CNN for Left Atrial Scar Segmentation Franz Thaler et.al. 2508.04553 null
2025-08-06 Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds Haodong Zhu et.al. 2508.04508 null
2025-08-06 MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos Daisheng Jin et.al. 2508.04505 null
2025-08-06 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation Shuzhou Yang et.al. 2508.04467 null
2025-08-06 Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models Yinan Yu et.al. 2508.04406 null
2025-08-06 RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization Yanyan Li et.al. 2508.04335 null
2025-08-07 Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research Ke Li et.al. 2508.04326 null
2025-08-06 MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction Yaopeng Lou et.al. 2508.04297 null
2025-08-06 PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space Chenlei Lv et.al. 2508.04286 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition Jiahui Li et.al. 2508.04224 null
2025-08-06 Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification Jianxun Yu et.al. 2508.04205 null
2025-08-06 IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control Lijuan Liu et.al. 2508.04147 null
2025-08-06 DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting Zexu Huang et.al. 2508.04099 null
2025-08-06 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Yi-Ting Chen et.al. 2508.04090 null
2025-08-06 RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting Zhan Li et.al. 2508.04078 null
2025-08-06 Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation Jiayi He et.al. 2508.04049 null
2025-08-06 JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation Zheng Zhang et.al. 2508.03997 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 Inland-LOAM: Voxel-Based Structural Semantic Mapping for Inland Waterways Zhongbi Luo et.al. 2508.03672 null
2025-08-05 OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World Katherine Liu et.al. 2508.03669 null
2025-08-06 Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images Xiangyu Sun et.al. 2508.03643 null
2025-08-05 FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation Nassim Ali Ousalah et.al. 2508.03618 null
2025-08-05 CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models Ana Lawry Aguila et.al. 2508.03594 null
2025-08-05 Spatial Imputation Drives Cross-Domain Alignment for EEG Classification Hongjun Liu et.al. 2508.03437 null
2025-08-05 WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval Junlong Ren et.al. 2508.03343 null
2025-08-05 Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion Wentao Qu et.al. 2508.03252 null
2025-08-05 Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing Hongyu Shen et.al. 2508.03227 null
2025-08-05 Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling Heng Wu et.al. 2508.03186 null
2025-08-05 Duplex-GS: Proxy-Guided Weighted Blending for Real-Time Order-Independent Gaussian Splatting Weihang Liu et.al. 2508.03180 null
2025-08-05 H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction Heng Jia et.al. 2508.03118 null
2025-08-05 Point2Act: Efficient 3D Distillation of Multimodal LLMs for Zero-Shot Context-Aware Grasping Sang Min Kim et.al. 2508.03099 null
2025-08-05 RobustGS: Unified Boosting of Feedforward 3D Gaussian Splatting under Low-Quality Conditions Anran Wu et.al. 2508.03077 null
2025-08-05 SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation Bo Zhang et.al. 2508.03069 null
2025-08-05 A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation Tongxu Zhang et.al. 2508.03057 null
2025-08-05 SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting Liheng Zhang et.al. 2508.03017 null
2025-08-05 ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion Meng Zhou et.al. 2508.03008 null
2025-08-05 GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring Linji Wang et.al. 2508.02988 null
2025-08-04 Evaluation of 3D Counterfactual Brain MRI Generation Pengwei Sun et.al. 2508.02880 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing Mikołaj Zieliński et.al. 2508.02831 null
2025-08-04 PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation Zongyou Yang et.al. 2508.02806 null
2025-08-04 PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting Yijun Xu et.al. 2508.02660 null
2025-08-04 RL-U$^2$Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart Segmentation Jierui Qu et.al. 2508.02557 null
2025-08-04 Uncertainty-Aware Perception-Based Control for Autonomous Racing Jelena Trisovic et.al. 2508.02494 null
2025-08-05 Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting Jianchao Wang et.al. 2508.02493 null
2025-08-06 GR-Gaussian: Graph-Based Radiative Gaussian Splatting for Sparse-View CT Reconstruction Yikuang Yuluo et.al. 2508.02408 null
2025-08-04 Correspondence-Free Fast and Robust Spherical Point Pattern Registration Anik Sarker et.al. 2508.02339 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering Fangxin Liu et.al. 2508.02304 null
2025-08-04 Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection Jae-Young Kang et.al. 2508.02288 null
2025-08-04 SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion Rui Qian et.al. 2508.02261 null
2025-08-04 GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting Lei Yao et.al. 2508.02172 null
2025-08-04 Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes Tom Fischer et.al. 2508.02157 null
2025-08-04 ScrewSplat: An End-to-End Method for Articulated Object Recognition Seungyeon Kim et.al. 2508.02146 null
2025-08-04 VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling Yuru Xiao et.al. 2508.02129 null
2025-08-04 REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification Hongzhao Chen et.al. 2508.02104 null
2025-08-04 StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion Haoxin Yang et.al. 2508.02056 null
2025-08-04 Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure Ziling Wang et.al. 2508.02034 null
2025-08-04 On-the-Fly Object-aware Representative Point Selection in Point Cloud Xiaoyu Zhang et.al. 2508.01980 null
2025-08-04 From Photons to Physics: Autonomous Indoor Drones and the Future of Objective Property Assessment Petteri Teikari et.al. 2508.01965 null
2025-08-03 Less is More: AMBER-AFNO – a New Benchmark for Lightweight 3D Medical Image Segmentation Andrea Dosi et.al. 2508.01941 null
2025-08-03 MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning Akash Venkateshwaran et.al. 2508.01907 null
2025-08-03 Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems Zhongliang Guo et.al. 2508.01845 null
2025-08-03 OmniEvent: Unified Event Representation Learning Weiqi Yan et.al. 2508.01842 null
2025-08-03 Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Yufei Zhang et.al. 2508.01835 null
2025-08-03 Skip priors and add graph-based anatomical information, for point-based Couinaud segmentation Xiaotong Zhang et.al. 2508.01785 null
2025-08-05 VPN: Visual Prompt Navigation Shuo Feng et.al. 2508.01766 null
2025-08-03 AG$^2$aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing Zhaonan Wang et.al. 2508.01740 null
2025-08-03 OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping Danyang Li et.al. 2508.01723 null
2025-08-03 LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving Luqi Cheng et.al. 2508.01704 null
2025-08-03 Register Anything: Estimating “Corresponding Prompts” for Segment Anything Model Shiqi Huang et.al. 2508.01697 null
2025-08-03 DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing Yufeng Chi et.al. 2508.01684 null
2025-08-03 DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding Hanqing Wang et.al. 2508.01651 null
2025-08-03 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Na Zhang et.al. 2508.01650 null
2025-08-03 Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection Hanxi Li et.al. 2508.01591 null
2025-08-03 A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction Hua Yu et.al. 2508.01585 null
2025-08-03 Deeply Supervised Multi-Task Autoencoder for Biological Brain Age estimation using three dimensional T$_1$-weighted magnetic resonance imaging Mehreen Kanwal et.al. 2508.01565 null
2025-08-03 Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion Sara Shoouri et.al. 2508.01562 null
2025-08-02 Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning Jack Zeng et.al. 2508.01522 null
2025-08-02 EfficientGFormer: Multimodal Brain Tumor Segmentation via Pruned Graph-Augmented Transformer Fatemeh Ziaeetabar et.al. 2508.01465 null
2025-08-02 Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians Quankai Gao et.al. 2508.01464 null
2025-08-02 Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation Sikha O K et.al. 2508.01460 null
2025-08-05 3DRot: 3D Rotation Augmentation for RGB-Based 3D Tasks Shitian Yang et.al. 2508.01423 null
2025-08-02 ReMu: Reconstructing Multi-layer 3D Clothed Human from Image Layers Onat Vuran et.al. 2508.01381 null
2025-08-02 P3P Made Easy Seong Hun Lee et.al. 2508.01312 null
2025-08-02 C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor Haoquan Lu et.al. 2508.01311 null
2025-08-02 CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis Alec Sargood et.al. 2508.01292 null
2025-08-02 Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching Chuang-Wei Liu et.al. 2508.01275 null
2025-08-05 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh Shuangkang Fang et.al. 2508.01242 null
2025-08-02 OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS Han Ling et.al. 2508.01239 null
2025-08-02 Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system Jiyong Kim et.al. 2508.01230 null
2025-08-02 MoGaFace: Momentum-Guided and Texture-Aware Gaussian Avatars for Consistent Facial Geometry Yujian Liu et.al. 2508.01218 null
2025-08-02 Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization? Bolei Chen et.al. 2508.01216 null
2025-08-02 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Zhan Shi et.al. 2508.01197 null
2025-08-02 Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning Xinhang Wan et.al. 2508.01184 null
2025-08-02 No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views Ranran Huang et.al. 2508.01171 null
2025-08-02 DELTAv2: Accelerating Dense 3D Tracking Tuan Duc Ngo et.al. 2508.01170 null
2025-08-02 OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding Dianyi Yang et.al. 2508.01150 null
2025-08-02 Design of Q8bot: A Miniature, Low-Cost, Dynamic Quadruped Built with Zero Wires Yufeng Wu et.al. 2508.01149 null
2025-08-02 UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation Chaitanya Patel et.al. 2508.01126 null
2025-08-01 DreamSat-2.0: Towards a General Single-View Asteroid 3D Reconstruction Santiago Diaz et.al. 2508.01079 null
2025-08-01 Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation Fenghe Tang et.al. 2508.01064 null
2025-08-01 Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans Theo Di Piazza et.al. 2508.01045 null
2025-08-01 3D Reconstruction via Incremental Structure From Motion Muhammad Zeeshan et.al. 2508.01019 null
2025-08-01 Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection Cheng-You Lu et.al. 2508.01014 null
2025-08-01 Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF Massoud Pourmandi et.al. 2508.00967 null
2025-07-31 Investigating Crossing Perception in 3D Graph Visualisation Ying Zhang et.al. 2508.00950 null
2025-08-01 IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation Wenxuan Guo et.al. 2508.00823 null
2025-08-01 Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning Alexander Nikitas Dimopoulos et.al. 2508.00822 null
2025-08-01 GECO: Geometrically Consistent Embedding with Lightspeed Inference Regine Hartwig et.al. 2508.00746 null
2025-08-01 Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR Adwait Chandorkar et.al. 2508.00744 null
2025-08-04 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Junzhe Lu et.al. 2508.00599 null
2025-08-01 OmniUnet: A Multimodal Network for Unstructured Terrain Segmentation on Planetary Rovers Using RGB, Depth, and Thermal Imagery Raul Castilla-Arquillo et.al. 2508.00580 null
2025-08-04 LesiOnTime – Joint Temporal and Clinical Modeling for Small Breast Lesion Segmentation in Longitudinal DCE-MRI Mohammed Kamran et.al. 2508.00496 null
2025-08-01 HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection Jiaping Cao et.al. 2508.00473 null
2025-08-01 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Nan Xiang et.al. 2508.00428 null
2025-08-01 Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Seunggeun Chi et.al. 2508.00427 null
2025-08-01 Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents Janika Deborah Gajo et.al. 2508.00400 null
2025-08-01 Occlusion-robust Stylization for Drawing-based 3D Animation Sunjae Yoon et.al. 2508.00398 null
2025-08-01 SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies Liang Han et.al. 2508.00366 null
2025-08-01 Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering Yan Gong et.al. 2508.00358 null
2025-08-01 Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging Tianshuang Qiu et.al. 2508.00354 null
2025-08-01 AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer Jin Lyu et.al. 2508.00298 null
2025-08-01 Towards Robust Semantic Correspondence: A Benchmark and Insights Wenyue Chong et.al. 2508.00272 null
2025-08-05 Multimodal Referring Segmentation: A Survey Henghui Ding et.al. 2508.00265 null
2025-08-01 PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting Wentao Sun et.al. 2508.00259 null
2025-08-01 Weakly Supervised Intracranial Aneurysm Detection and Segmentation in MR angiography via Multi-task UNet with Vesselness Prior Erin Rainville et.al. 2508.00235 null
2025-07-31 Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs Bhavya Goyal et.al. 2508.00169 null
2025-07-31 GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation Tomasz Szczepański et.al. 2508.00155 null
2025-07-31 Stress-Aware Resilient Neural Training Ashkan Shakarami et.al. 2508.00098 null
2025-07-31 Punching Bag vs. Punching Person: Motion Transferability in Videos Raiyaan Abdullah et.al. 2508.00085 null
2025-07-31 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang et.al. 2507.23785 null
2025-07-31 Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions Li Siyao et.al. 2507.23778 null
2025-07-31 SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting Di Li et.al. 2507.23772 null
2025-08-05 Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic Liu Li et.al. 2507.23763 null
2025-07-31 Enhanced Velocity Field Modeling for Gaussian Video Reconstruction Zhenyang Li et.al. 2507.23704 null
2025-07-31 Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents Shaofei Cai et.al. 2507.23698 null
2025-07-31 High-resolution eikonal imaging and uncertainty quantification of the Kilauea caldera Angela F. Gao et.al. 2507.23692 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes Xiaohan Li et.al. 2507.23677 null
2025-07-31 DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation Yuchen Zhou et.al. 2507.23599 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization Maxime Pietrantoni et.al. 2507.23569 null
2025-07-31 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection Yung-Hsu Yang et.al. 2507.23567 null
2025-08-01 H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation Hongzhe Bi et.al. 2507.23523 null
2025-07-31 Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Mutian Xu et.al. 2507.23483 null
2025-07-31 FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Donghyun Lee et.al. 2507.23480 null
2025-07-31 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding Ting Huang et.al. 2507.23478 null
2025-07-31 NeRF Is a Valuable Assistant for 3D Gaussian Splatting Shuangkang Fang et.al. 2507.23374 null
2025-07-31 MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting Xingyue Peng et.al. 2507.23340 null
2025-08-01 Training-free Geometric Image Editing on Diffusion Models Hanshen Zhu et.al. 2507.23300 null
2025-07-31 iLRM: An Iterative Large 3D Reconstruction Model Gyeongjin Kang et.al. 2507.23277 null
2025-07-31 GSFusion: Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting Jaeseok Park et.al. 2507.23273 null
2025-07-31 Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2 Solha Kang et.al. 2507.23272 null
2025-07-30 Details Matter for Indoor Open-vocabulary 3D Instance Segmentation Sanghun Jung et.al. 2507.23134 null
2025-07-30 Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation Zheyuan Zhang et.al. 2507.23110 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-07-30 Adaptive Time-step Training for Enhancing Spike-Based Neural Radiance Fields Ranxi Lin et.al. 2507.23033 null
2025-07-30 Learning to Prune Branches in Modern Tree-Fruit Orchards Abhinav Jain et.al. 2507.23015 null
2025-07-30 Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction Zhensheng Yuan et.al. 2507.23006 null
2025-07-30 Viser: Imperative, Web-based 3D Visualization in Python Brent Yi et.al. 2507.22885 null
2025-07-30 DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Qingcheng Zhao et.al. 2507.22825 null
2025-07-30 Wall Shear Stress Estimation in Abdominal Aortic Aneurysms: Towards Generalisable Neural Surrogate Models Patryk Rygiel et.al. 2507.22817 null
2025-07-30 Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques Weide Liu et.al. 2507.22791 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks Hang Su et.al. 2507.22733 null
2025-07-30 Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints Thuy Tran et.al. 2507.22699 null
2025-07-30 Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation Hongbin Lin et.al. 2507.22668 null
2025-07-30 trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images MohammadAmin Alamalhoda et.al. 2507.22635 null
2025-07-30 Estimating 2D Camera Motion with Hybrid Motion Basis Haipeng Li et.al. 2507.22480 null
2025-07-30 UAVScenes: A Multi-Modal Dataset for UAVs Sijie Wang et.al. 2507.22412 null
2025-07-30 UFV-Splatter: Pose-Free Feed-Forward 3D Gaussian Splatting Adapted to Unfavorable Views Yuki Fujimura et.al. 2507.22342 null
2025-07-30 A Segmentation Framework for Accurate Diagnosis of Amyloid Positivity without Structural Images Penghan Zhu et.al. 2507.22336 null
2025-07-29 Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception Christian Ellis et.al. 2507.22194 null
2025-07-29 Enhancing efficiency in paediatric brain tumour segmentation using a pathologically diverse single-center clinical dataset A. Piffer et.al. 2507.22152 null
2025-07-29 Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos Ziren Gong et.al. 2507.22052 null
2025-07-29 ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports Mohammed Baharoon et.al. 2507.22030 null
2025-07-29 Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images Yutao Hu et.al. 2507.22024 null
2025-07-29 XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation Raju Ningappa Mulawade et.al. 2507.22020 null
2025-07-29 DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments Yufei Jia et.al. 2507.21981 null
2025-07-29 PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction Jiahui Ren et.al. 2507.21960 null
2025-07-31 MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors Shouyi Lu et.al. 2507.21872 null
2025-07-29 VidFuncta: Towards Generalizable Neural Representations for Ultrasound Videos Julia Wolleb et.al. 2507.21863 null
2025-07-29 HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels HunyuanWorld Team et.al. 2507.21809 null
2025-07-29 AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion Zhishu Liu et.al. 2507.21778 null
2025-07-29 Multi-UAV Deployment in Obstacle-Cluttered Environments with LOS Connectivity Yuda Chen et.al. 2507.21772 null
2025-07-30 No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering Linye Wei et.al. 2507.21572 null
2025-07-29 Multi-View Reconstruction with Global Context for 3D Anomaly Detection Yihan Sun et.al. 2507.21555 null
2025-07-29 LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments Junhao Chen et.al. 2507.21517 null
2025-07-29 ST-DAI: Single-shot 2.5D Spatial Transcriptomics with Intra-Sample Domain Adaptive Imputation for Cost-efficient 3D Reconstruction Jiahe Qian et.al. 2507.21516 null
2025-07-29 BANG: Dividing 3D Assets via Generative Exploded Dynamics Longwen Zhang et.al. 2507.21493 null
2025-07-29 Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval Zhichuan Wang et.al. 2507.21489 null
2025-07-28 Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View Zitong Zhang et.al. 2507.21371 null
2025-08-03 Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy Jicheng Yuan et.al. 2507.21358 null
2025-07-28 DEM-NeRF: A Neuro-Symbolic Method for Scientific Discovery through Physics-Informed Simulation Wenkai Tan et.al. 2507.21350 null
2025-07-28 GLCP: Global-to-Local Connectivity Preservation for Tubular Structure Segmentation Feixiang Zhou et.al. 2507.21328 null
2025-07-28 VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction Martin de La Gorce et.al. 2507.21311 null
2025-07-28 Fluidically Innervated Lattices Make Versatile and Durable Tactile Sensors Annan Zhang et.al. 2507.21225 null
2025-08-03 Reconstructing 4D Spatial Intelligence: A Survey Yukang Cao et.al. 2507.21045 null
2025-07-28 GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction Tianhao Li et.al. 2507.20963 null
2025-07-28 $S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping Ruoyu Fan et.al. 2507.20854 null
2025-07-28 An Efficient Machine Learning Framework for Forest Height Estimation from Multi-Polarimetric Multi-Baseline SAR data Francesca Razzano et.al. 2507.20798 null
2025-07-28 KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video Zhuoer Yin et.al. 2507.20763 null
2025-07-28 Methods for the Segmentation of Reticular Structures Using 3D LiDAR Data: A Comparative Evaluation Francisco J. Soler Mora et.al. 2507.20589 null
2025-07-28 M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast Jiacheng Lu et.al. 2507.20582 null
2025-07-28 Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation Hyung Kyu Kim et.al. 2507.20568 null
2025-07-28 MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization Hyung Kyu Kim et.al. 2507.20562 null
2025-07-28 Uni-Mapper: Unified Mapping Framework for Multi-modal LiDARs in Complex and Dynamic Environments Gilhwan Kang et.al. 2507.20538 null
2025-07-28 Enhancing Spatial Reasoning through Visual and Textual Thinking Xun Liang et.al. 2507.20529 null
2025-07-28 GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections Haiyang Bai et.al. 2507.20512 null
2025-07-28 Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features Shiyang Liu et.al. 2507.20480 null
2025-07-29 From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos Chenjian Gao et.al. 2507.20331 null
2025-07-27 Decomposing Densification in Gaussian Splatting for Faster 3D Scene Reconstruction Binxiao Huang et.al. 2507.20239 null
2025-07-27 NeuroVoxel-LM: Language-Aligned 3D Perception via Dynamic Voxelization and Meta-Embedding Shiyu Liu et.al. 2507.20110 null
2025-07-26 High-Speed Event Vision-Based Tactile Roller Sensor for Large Surface Measurements Akram Khairi et.al. 2507.19914 null
2025-07-30 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-26 Taking Language Embedded 3D Gaussian Splatting into the Wild Yuze Wang et.al. 2507.19830 null
2025-07-25 GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting David Bauer et.al. 2507.19718 null
2025-07-25 DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations Ziren Gong et.al. 2507.19474 null
2025-07-25 Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization Pol Francesch Huc et.al. 2507.19459 null
2025-07-25 NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography Kirsten W. H. Maas et.al. 2507.19328 null
2025-07-25 3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering Wei-Hsing Huang et.al. 2507.19133 null
2025-07-25 Gaussian Set Surface Reconstruction through Per-Gaussian Optimization Zhentao Huang et.al. 2507.18923 null
2025-07-24 SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time Yun Chen et.al. 2507.18713 null
2025-07-24 Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping Chong Cheng et.al. 2507.18541 null
2025-07-24 G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM Gyuhyeon Pak et.al. 2507.18344 null
2025-07-24 LONG3R: Long Sequence Streaming 3D Reconstruction Zhuoguang Chen et.al. 2507.18255 null
2025-07-24 PS-GS: Gaussian Splatting for Multi-View Photometric Stereo Yixiao Chen et.al. 2507.18231 null
2025-07-24 High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details Jun Zhou et.al. 2507.18023 null
2025-07-24 Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners Kostas Karakontis et.al. 2507.17519 null
2025-07-23 Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field Yuzhe Zhu et.al. 2507.17351 null
2025-07-23 Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting Hyeongmin Lee et.al. 2507.17336 null
2025-07-24 PolarAnything: Diffusion-based Polarimetric Image Synthesis Kailong Zhang et.al. 2507.17268 null
2025-07-22 StreamME: Simplify 3D Gaussian Avatar within Live Stream Luchuan Song et.al. 2507.17029 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-22 Sparse-View 3D Reconstruction: Recent Advances and Open Challenges Tanveer Younis et.al. 2507.16406 null
2025-07-22 Dens3R: A Foundation Model for 3D Geometry Prediction Xianze Fang et.al. 2507.16290 null
2025-07-22 LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images Guichen Huang et.al. 2507.16144 null
2025-07-21 Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS Jisu Shin et.al. 2507.15748 null
2025-07-21 DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting Hung Nguyen et.al. 2507.15690 null
2025-07-21 Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing Boni Hu et.al. 2507.15683 null
2025-07-21 Gaussian Splatting with Discretized SDF for Relightable Assets Zuo-Liang Zhu et.al. 2507.15629 null
2025-07-28 SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting Zihui Gao et.al. 2507.15602 null
2025-07-21 ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting Ruijie Zhu et.al. 2507.15454 null
2025-07-25 GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing Minnan Pei et.al. 2507.15300 null
2025-07-20 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline Kaishva Chintan Shah et.al. 2507.14924 null
2025-07-20 Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction Xiufeng Huang et.al. 2507.14921 null
2025-07-20 An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks Xinyi Wu et.al. 2507.14798 null
2025-07-30 Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey Jiahui Zhang et.al. 2507.14501 null
2025-07-19 Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation Han Gong et.al. 2507.14454 null
2025-07-19 Adaptive 3D Gaussian Splatting Video Streaming Han Gong et.al. 2507.14432 null
2025-08-01 C-DOG: Multi-View Multi-instance Feature Association Using Connected δ-Overlap Graphs Yung-Hong Sun et.al. 2507.14095 null
2025-07-18 TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views Hsiang-Hui Hung et.al. 2507.13929 null
2025-07-18 Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading Efstratios Geronikolakis et.al. 2507.13917 null
2025-07-21 PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations Yu Wei et.al. 2507.13891 null
2025-07-18 EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation Seungjun Moon et.al. 2507.13648 null
2025-07-18 Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation Masahiro Ogawa et.al. 2507.13628 null
2025-07-19 AutoPartGen: Autogressive 3D Part Generation and Discovery Minghao Chen et.al. 2507.13346 null
2025-07-16 VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians Siyuan Yao et.al. 2507.12667 null
2025-07-16 NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting Kuangshi Ai et.al. 2507.12621 null
2025-07-21 Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition Beizhen Zhao et.al. 2507.12498 null
2025-07-19 SpatialTrackerV2: 3D Point Tracking Made Easy Yuxi Xiao et.al. 2507.12462 null
2025-07-16 Revealing the Ancient Beauty: Digital Reconstruction of Temple Tiles using Computer Vision Arkaprabha Basu et.al. 2507.12195 null
2025-07-16 DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi Navid Hasanzadeh et.al. 2507.12132 null
2025-07-16 BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images Davide Di Nucci et.al. 2507.12095 null
2025-07-16 SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation Beining Xu et.al. 2507.12027 null
2025-07-16 HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing Tielong Wang et.al. 2507.11971 null
2025-07-16 Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark Jingqian Wu et.al. 2507.11931 null
2025-07-16 CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning Peiwen Xia et.al. 2507.11834 null
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-21 Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling Hayeon Kim et.al. 2507.11061 null
2025-07-14 ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions Shivangi Aneja et.al. 2507.10542 null
2025-07-14 Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry Geyou Zhang et.al. 2507.10009 null
2025-07-19 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. 2507.09993 null
2025-07-14 VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling Zihang Zeng et.al. 2507.09987 null
2025-07-11 From images to properties: a NeRF-driven framework for granular material parameter inversion Cheng-Hsi Hsiao et.al. 2507.09005 null
2025-07-11 An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan Mengyuan Liu et.al. 2507.08690 null
2025-07-11 Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance Gábor Baranyi et.al. 2507.08624 null
2025-07-11 Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT Wei Zhang et.al. 2507.08448 null
2025-07-11 RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting Ji Hyun Seo et.al. 2507.08434 null
2025-07-11 CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations Wenbo Cui et.al. 2507.08262 null
2025-07-10 Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction Hyungjun Doh et.al. 2507.08137 null
2025-07-18 RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration Chong Cheng et.al. 2507.08136 null
2025-07-10 Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions Longfei Li et.al. 2507.07978 null
2025-07-10 RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection Yongyang Zhou et.al. 2507.07733 null

Diffusion

Publish Date Title Authors PDF Code
2025-08-07 Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Yue Liao et.al. 2508.05635 null
2025-08-07 GAP: Gaussianize Any Point Clouds with Text Guidance Weiqi Zhang et.al. 2508.05631 null
2025-08-07 Latent Space Diffusion for Topology Optimization Aaron Lutheran et.al. 2508.05624 null
2025-08-07 Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision Luozheng Qin et.al. 2508.05606 null
2025-08-07 Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations Hanzeng Guo et.al. 2508.05598 null
2025-08-07 Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis Yifan Wang et.al. 2508.05572 null
2025-08-07 MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips Shibo Wang et.al. 2508.05506 null
2025-08-07 Heat and super-diffusive melting fronts in unsaturated porous media Eirik G. Flekkøy et.al. 2508.05451 null
2025-08-07 Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI Krzysztof Janowicz et.al. 2508.05432 null
2025-08-07 MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow Md Atik Ahamed et.al. 2508.05411 null
2025-08-07 UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Wonjun Kang et.al. 2508.05399 null
2025-08-07 Real-Time Iteration Scheme for Diffusion Policy Yufei Duan et.al. 2508.05396 null
2025-08-07 Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms Jie Xiao et.al. 2508.05387 null
2025-08-07 Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising Xiaoxi Cui et.al. 2508.05352 null
2025-08-07 Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties Susmita Chowdhury et.al. 2508.05330 null
2025-08-07 Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting Frank Ruis et.al. 2508.05323 null
2025-08-07 Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces Mathias Rose Bjare et.al. 2508.05306 null
2025-08-07 SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens Nikita Dragunov et.al. 2508.05305 null
2025-08-07 An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods Emil Løvbak et.al. 2508.05303 null
2025-08-07 Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection Xiaoyang Zhang et.al. 2508.05271 null
2025-08-07 B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding Changho Choi et.al. 2508.05269 null
2025-08-07 SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion Xiaoyang Zhang et.al. 2508.05264 null
2025-08-07 ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models Yatong Lan et.al. 2508.05236 null
2025-08-07 Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces Joly Romain et.al. 2508.05220 null
2025-08-07 An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling Junming Duan et.al. 2508.05166 null
2025-08-07 RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer Fangyu Du et.al. 2508.05115 null
2025-08-07 PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation Jingxuan He et.al. 2508.05091 null
2025-08-07 MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design Hao Li et.al. 2508.05076 null
2025-08-07 Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation Yongfu Zha et.al. 2508.05074 null
2025-08-07 FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer Jian Zhu et.al. 2508.05069 null
2025-08-07 DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion Yifeng Huang et.al. 2508.05060 null
2025-08-07 Observation of Super-ballistic Brownian Motion in Liquid Jason Boynewicz et.al. 2508.05031 null
2025-08-07 Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere Jeehyun Yang et.al. 2508.05007 null
2025-08-07 Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity Fubao Xi et.al. 2508.04997 null
2025-08-07 REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers Yuepeng Jiang et.al. 2508.04996 null
2025-08-07 Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression Zheng Chen et.al. 2508.04979 null
2025-08-06 Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids Cal J. Rising et.al. 2508.04930 null
2025-08-06 Taxonomy of Faults in Attention-Based Neural Networks Sigma Jahan et.al. 2508.04925 null
2025-08-06 Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model Luis Morales-Navarro et.al. 2508.04902 null
2025-08-06 The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models Leo Zhang et.al. 2508.04884 null
2025-08-06 Unified Flow Matching for Long Horizon Event Forecasting Xiao Shou et.al. 2508.04843 null
2025-08-06 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off Seungyong Lee et.al. 2508.04825 null
2025-08-06 Delay-constrained re-entry governs large-scale brain seizures and other network pathologies Paul Triebkorn et.al. 2508.04824 null
2025-08-06 Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models Mehrdad Moradi et.al. 2508.04818 null
2025-08-06 Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach Anderson O. Calixto et.al. 2508.04809 null
2025-08-06 Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture Bernard Parent et.al. 2508.04806 null
2025-08-06 ACM Multimedia Grand Challenge on ENT Endoscopy Analysis Trong-Thuan Nguyen et.al. 2508.04801 null
2025-08-06 Quantum-impurity sensing of altermagnetic order V. A. S. V. Bittencourt et.al. 2508.04788 null
2025-08-06 Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC) Nan Li et.al. 2508.04745 null
2025-08-06 A colossal dielectric response of HfxZr1-xO2 nanoparticles Oleksandr S. Pylypchuk et.al. 2508.04697 null
2025-08-06 Diffusion in a $d$-dimensional rough potential Jacob Jeffries et.al. 2508.04674 null
2025-08-06 HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models Young D. Kwon et.al. 2508.04663 null
2025-08-06 Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics Lars Torbjørn Stutzer et.al. 2508.04647 null
2025-08-06 A unified model for linear responses of physical networks José M. Ortiz-Tavárez et.al. 2508.04616 null
2025-08-06 Multitask Learning with Stochastic Interpolants Hugo Negrel et.al. 2508.04605 null
2025-08-07 A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI Nicola Casali et.al. 2508.04588 null
2025-08-06 Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming A. Tarik Leblebici et.al. 2508.04570 null
2025-08-06 DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling Yijie Li et.al. 2508.04568 null
2025-08-06 TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning Yunbi Liu et.al. 2508.04565 null
2025-08-06 Drone Detection with Event Cameras Gabriele Magrini et.al. 2508.04564 null
2025-08-06 One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose Jinxi Liu et.al. 2508.04559 null
2025-08-06 Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis Angang Zhang et.al. 2508.04551 null
2025-08-06 MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning Quang-Trung Truong et.al. 2508.04549 null
2025-08-06 X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids P. G. Heighway et.al. 2508.04525 null
2025-08-06 $\beta$-Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes José A. S. Laranjeira et.al. 2508.04506 null
2025-08-06 QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution Bowen Chai et.al. 2508.04485 null
2025-08-06 Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model Hongxu Chen et.al. 2508.04472 null
2025-08-06 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation Shuzhou Yang et.al. 2508.04467 null
2025-08-06 Case Studies of Generative Machine Learning Models for Dynamical Systems Nachiket U. Bapat et.al. 2508.04459 null
2025-08-06 Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach Alvaro Garrido Perez et.al. 2508.04435 null
2025-08-06 Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis Ethan Dack et.al. 2508.04429 null
2025-08-06 Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations Nick Vogeley et.al. 2508.04364 null
2025-08-06 Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting Eberhard Bänsch et.al. 2508.04360 null
2025-08-06 From Split to Share: Private Inference with Distributed Feature Sharing Zihan Liu et.al. 2508.04346 null
2025-08-06 Performative Market Making Charalampos Kleitsikas et.al. 2508.04344 null
2025-08-06 TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Xiaoxuan He et.al. 2508.04324 null
2025-08-06 Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation Miquel Cantallops et.al. 2508.04319 null
2025-08-06 Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations Margaux Boxho et.al. 2508.04318 null
2025-08-06 Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions Yuga Iguchi et.al. 2508.04287 null
2025-08-06 S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge JinYi Yoon et.al. 2508.04271 null
2025-08-06 Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications Vladislav Pimanov et.al. 2508.04261 null
2025-08-06 High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting Zhiren Ma et.al. 2508.04259 null
2025-08-06 Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions Nikolaos A. Burger et.al. 2508.04244 null
2025-08-06 PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction Muhua Zhu et.al. 2508.04236 null
2025-08-06 DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification Saifullah Saifullah et.al. 2508.04233 null
2025-08-06 Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction Yu Liu et.al. 2508.04229 null
2025-08-06 LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation Kangrui Cen et.al. 2508.04228 null
2025-08-06 DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models Saifullah Saifullah et.al. 2508.04208 null
2025-08-06 A background-free signal of jet-induced diffusion wake in quark-gluon plasma Zhong Yang et.al. 2508.04194 null
2025-08-06 Deeper Inside Deep ViT Sungrae Hong et.al. 2508.04181 null
2025-08-06 Quasi-Clique Discovery via Energy Diffusion Yu Zhang et.al. 2508.04174 null
2025-08-06 Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles Mathis Guéneau et.al. 2508.04154 null
2025-08-06 IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control Lijuan Liu et.al. 2508.04147 null
2025-08-06 Polynomial-time sampling despite disorder chaos Eric Ma et.al. 2508.04133 null
2025-08-06 Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation Maximilian Ulmer et.al. 2508.04122 null
2025-08-06 Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework Yi-Ting Chen et.al. 2508.04090 null
2025-08-06 Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes Pierre Collet et.al. 2508.04089 null
2025-08-06 Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows Murray Cutforth et.al. 2508.04084 null
2025-08-06 POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model Huipeng Gu et.al. 2508.04082 null
2025-08-06 Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion Fangmin Zhao et.al. 2508.04055 null
2025-08-06 Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation Jiayi He et.al. 2508.04049 null
2025-08-06 Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws L. Miguel Rodrigues et.al. 2508.04023 null
2025-08-07 S$^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation Weilun Feng et.al. 2508.04016 null
2025-08-06 Constructing Generalized Sample Transition Probabilities with Biased Simulations Yanbin Wang et.al. 2508.03977 null
2025-08-05 Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm Lin Zhang et.al. 2508.03955 null
2025-08-05 Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model Shen Zhu et.al. 2508.03925 null
2025-08-05 Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations R. R. Ashurov et.al. 2508.03859 null
2025-08-05 VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations Yifei Zong et.al. 2508.03839 null
2025-08-05 HPSv3: Towards Wide-Spectrum Human Preference Score Yuhang Ma et.al. 2508.03789 null
2025-08-05 LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Jianxiong Gao et.al. 2508.03694 null
2025-08-05 LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences Ao Liang et.al. 2508.03692 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World Katherine Liu et.al. 2508.03669 null
2025-08-05 Rigidity for graph product von Neumann algebras Camille Horbez et.al. 2508.03662 null
2025-08-05 DiWA: Diffusion Policy Adaptation with World Models Akshay L Chandra et.al. 2508.03645 null
2025-08-05 Likelihood Matching for Diffusion Models Lei Qian et.al. 2508.03636 null
2025-08-05 Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion Shoji Mori et.al. 2508.03624 null
2025-08-05 Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions Robert Richardson et.al. 2508.03617 null
2025-08-05 CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models Ana Lawry Aguila et.al. 2508.03594 null
2025-08-05 Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection Long Qian et.al. 2508.03539 null
2025-08-05 X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations Silvia Pellegrini et.al. 2508.03536 null
2025-08-05 CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation Kaishen Yuan et.al. 2508.03535 null
2025-08-05 LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation Lianwei Yang et.al. 2508.03485 null
2025-08-05 When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models Dasol Choi Jihwan Lee et.al. 2508.03483 null
2025-08-05 Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models Hyungjin Kim et.al. 2508.03481 null
2025-08-05 VideoGuard: Protecting Video Content from Unauthorized Editing Junjie Cao et.al. 2508.03480 null
2025-08-05 Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation Zijun Zhan et.al. 2508.03464 null
2025-08-06 READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation Haotian Wang et.al. 2508.03457 null
2025-08-05 Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws Haruki Takemura et.al. 2508.03455 null
2025-08-05 RAAG: Ratio Aware Adaptive Guidance Shangwen Zhu et.al. 2508.03442 null
2025-08-05 Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN Shivangi Nigam et.al. 2508.03415 null
2025-08-05 SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models Pingchuan Ma et.al. 2508.03402 null
2025-08-05 Delay-facilitated self-assembly in compartmentalized systems Severin Angerpointner et.al. 2508.03383 null
2025-08-05 Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration Ni Tang et.al. 2508.03373 null
2025-08-05 A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design Xinyu Jin et.al. 2508.03370 null
2025-08-05 GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images Yifei Sun et.al. 2508.03357 null
2025-08-05 Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises Nikos I. Kavallaris et.al. 2508.03354 null
2025-08-06 Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation Xunzhi Xiang et.al. 2508.03334 null
2025-08-05 Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Peiyu Wang et.al. 2508.03320 null
2025-08-05 Thermal Metamaterials for Enhanced Non-Fourier Heat Transport Harry Mclean et.al. 2508.03316 null
2025-08-05 The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations Xinqiu Chen et.al. 2508.03311 null
2025-08-05 Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation Jun Luo et.al. 2508.03300 null
2025-08-05 Investigation on deep learning-based galaxy image translation models Hengxin Ruan et.al. 2508.03291 null
2025-08-07 Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the $L_p$-$L_q$ Maximal Regularity Setting Ken Furukawa et.al. 2508.03288 null
2025-08-07 Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension Bao-Ngoc Tran et.al. 2508.03268 null
2025-08-05 Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation Gang Dai et.al. 2508.03256 null
2025-08-05 V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models Jisoo Kim et.al. 2508.03254 null
2025-08-05 Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion Wentao Qu et.al. 2508.03252 null
2025-08-06 FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles Xingchao Yang et.al. 2508.03241 null
2025-08-05 BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models Yu Pan et.al. 2508.03221 null
2025-08-05 Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level Amir Seginer et.al. 2508.03220 null
2025-08-05 Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance Eliot Beyler et.al. 2508.03210 null
2025-08-05 Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models Muhammed Saeed et.al. 2508.03199 null
2025-08-05 An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys Qianxi Zhu et.al. 2508.03163 null
2025-08-05 SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance Yanshu Wang et.al. 2508.03143 null
2025-08-05 UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying Chengyu Bai et.al. 2508.03142 null
2025-08-05 Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations Igor G. Vladimirov et.al. 2508.03135 null
2025-08-05 Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback Jingyi Chen et.al. 2508.03123 null
2025-08-05 Power System Voltage Stability Boundary: Computational Results and Applications Zhenyao Li et.al. 2508.03119 null
2025-08-05 T2UE: Generating Unlearnable Examples from Text Descriptions Xingjun Ma et.al. 2508.03091 null
2025-08-05 MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation Youran Zhou et.al. 2508.03083 null
2025-08-05 Multi-human Interactive Talking Dataset Zeyu Zhu et.al. 2508.03050 null
2025-08-05 Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling Ruixing Zhang et.al. 2508.03042 null
2025-08-05 Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations Dimitri Breda et.al. 2508.03040 null
2025-08-05 MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention Qi Xie et.al. 2508.03034 null
2025-08-05 LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning Jie Lin et.al. 2508.03024 null
2025-08-05 Generating Light-based Fingerprints for Indoor Localization Hsun-Yu Lee et.al. 2508.03011 null
2025-08-05 Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models Fan Yang et.al. 2508.03006 null
2025-08-05 Diffusion Models with Adaptive Negative Sampling Without External Resources Alakh Desai et.al. 2508.02973 null
2025-08-05 Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver Jonathan Patsenker et.al. 2508.02964 null
2025-08-04 X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio Chenxu Zhang et.al. 2508.02944 null
2025-08-04 Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators Sourojit Ghosh et.al. 2508.02937 null
2025-08-06 A nonstandard finite difference scheme for an SEIQR epidemiological PDE model Achraf Zinihi et.al. 2508.02928 null
2025-08-04 Goal-Oriented Adaptive Finite Element Multilevel Quasi-Monte Carlo Joakim Beck et.al. 2508.02925 null
2025-08-04 How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution Minh-Hai Nguyen et.al. 2508.02923 null
2025-08-04 RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation Mehrdad Moradi et.al. 2508.02903 null
2025-08-04 REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport Farzad Beizaee et.al. 2508.02889 null
2025-08-04 Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters Tara Dacunha et.al. 2508.02837 null
2025-08-04 DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework Tongchun Zuo et.al. 2508.02807 null
2025-08-04 NASIM: Revealing the low surface brightness Universe from legacy VISTA data Elham Saremi et.al. 2508.02780 null
2025-08-04 D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss Guowei Zou et.al. 2508.02644 null
2025-08-04 CAK: Emergent Audio Effects from Minimal Deep Learning Austin Rockman et.al. 2508.02643 null
2025-08-04 Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters Pranshu Maan et.al. 2508.02638 null
2025-08-04 ReMoMask: Retrieval-Augmented Masked Motion Generation Zhengdao Li et.al. 2508.02605 null
2025-08-04 Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction Yuerong Song et.al. 2508.02558 null
2025-08-04 From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC Jingsong Liu et.al. 2508.02528 null
2025-08-06 xDeepServe: Model-as-a-Service on Huawei CloudMatrix384 Ao Xiao et.al. 2508.02520 null
2025-08-04 QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots Sheng Wu et.al. 2508.02512 null
2025-08-04 Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference Lars Dingeldein et.al. 2508.02509 null
2025-08-04 Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation Khoa Tuan Nguyen et.al. 2508.02482 null
2025-08-04 PoseGuard: Pose-Guided Generation with Safety Guardrails Kongxin Wang et.al. 2508.02476 null
2025-08-04 Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn$_3$Sn(0001) noncollinear antiferromagnetic films Surya N. Panda et.al. 2508.02415 null
2025-08-04 Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion Yimeng Liu et.al. 2508.02409 null
2025-08-04 Inference-time Scaling for Diffusion-based Audio Super-resolution Yizhu Jin et.al. 2508.02391 null
2025-08-04 Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction Matus Krajcovic et.al. 2508.02376 null
2025-08-04 Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory Marian Lupascu et.al. 2508.02363 null
2025-08-04 Qwen-Image Technical Report Chenfei Wu et.al. 2508.02324 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-05 LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training Sikui Zhang et.al. 2508.02308 null
2025-08-05 Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor Xiaoliu Guan et.al. 2508.02240 null
2025-08-04 Abstract Formulation of Mean-Field Models and Propagation of Chaos Tau Shean Lim et.al. 2508.02224 null
2025-08-04 A theory of strange metals Simone Fratini et.al. 2508.02221 null
2025-08-04 Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Yuxuan Song et.al. 2508.02193 null
2025-08-04 DreamPainter: Image Background Inpainting for E-commerce Scenarios Sijie Zhao et.al. 2508.02155 null
2025-08-04 AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models Die Chen et.al. 2508.02151 null
2025-08-04 VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling Yuru Xiao et.al. 2508.02129 null
2025-08-04 AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation Zhiwen Li et.al. 2508.02107 null
2025-08-04 Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis Kaiyang Ji et.al. 2508.02106 null
2025-08-04 “Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch Yiqing Xu et.al. 2508.02093 null
2025-08-04 Unsupervised Multi-channel Speech Dereverberation via Diffusion Yulun Wu et.al. 2508.02071 null
2025-08-04 “Set It Up”: Functional Object Arrangement with Compositional Generative Models Yiqing Xu et.al. 2508.02068 null
2025-08-04 StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion Haoxin Yang et.al. 2508.02056 null
2025-08-04 Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation Yuli Liu et.al. 2508.02050 null
2025-08-04 Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction Hui Xie et.al. 2508.02043 null
2025-08-04 Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging XuHao Yu et.al. 2508.02025 null
2025-08-04 Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths Le Tri Dat et.al. 2508.02024 null
2025-08-05 Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type Pierluigi Colli et.al. 2508.02021 null
2025-08-04 Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention Kyungmin Jo et.al. 2508.02004 null
2025-08-04 Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization Yu Lei et.al. 2508.02002 null
2025-08-04 Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids Toma Yoneya et.al. 2508.01991 null
2025-08-04 Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion Shutong Qiao et.al. 2508.01987 null
2025-08-04 Diffusion models for inverse problems Hyungjin Chung et.al. 2508.01975 null
2025-08-03 Distributed games with jumps: An $α$-potential game approach Xin Guo et.al. 2508.01929 null
2025-08-03 On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis Siamak Kazemzadeh Hannani et.al. 2508.01890 null
2025-08-03 DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization Siran Peng et.al. 2508.01873 null
2025-08-05 Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures Fanze Kong et.al. 2508.01854 null
2025-08-03 Diffusion-based 3D Hand Motion Recovery with Intuitive Physics Yufei Zhang et.al. 2508.01835 null
2025-08-03 Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder Runxuan Yang et.al. 2508.01796 null
2025-08-03 Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus Peng Gao et.al. 2508.01794 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting Rui Ding et.al. 2508.01761 null
2025-08-03 Dynamic Coupling of Infiltration-Soil Moisture Feedback: Emergent Vegetation Patterns in a Water-Vegetation Model Juan Yan et.al. 2508.01755 null
2025-08-03 Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design Xiangwang Hou et.al. 2508.01745 null
2025-08-05 Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization Xin Ding et.al. 2508.01725 null
2025-08-03 ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models Haoyue Tan et.al. 2508.01719 null
2025-08-03 Versatile Transition Generation with Image-to-Video Diffusion Zuhao Yang et.al. 2508.01698 null
2025-08-03 DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing Yufeng Chi et.al. 2508.01684 null
2025-08-03 DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding Hanqing Wang et.al. 2508.01651 null
2025-08-03 StrandDesigner: Towards Practical Strand Generation with Sketch Guidance Na Zhang et.al. 2508.01650 null
2025-08-03 Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization Shoya Sasaki et.al. 2508.01640 null
2025-08-03 VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation Xuanran Zhai et.al. 2508.01622 null
2025-08-03 LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding Xuanzhao Dong et.al. 2508.01617 null
2025-08-03 TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data Yandong Yan et.al. 2508.01615 null
2025-08-03 Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models Haoran Dai et.al. 2508.01605 null
2025-08-03 Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment Lubin Gan et.al. 2508.01602 null
2025-08-03 CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation Sung-Wook Lee et.al. 2508.01600 null
2025-08-03 Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching Juyan Zhang et.al. 2508.01597 null
2025-08-03 A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation Hua Yu et.al. 2508.01590 null
2025-08-03 Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences Euihyun Kim et.al. 2508.01589 null
2025-08-03 Diffusion Models for Future Networks and Communications: A Comprehensive Survey Nguyen Cong Luong et.al. 2508.01586 null
2025-08-03 Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation Lei Xie et.al. 2508.01577 null
2025-08-03 Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature Xiao-Jie Wang et.al. 2508.01567 null
2025-08-03 MGCR-Net: Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection Chengming Wang et.al. 2508.01555 null
2025-08-02 A Reward-Directed Diffusion Framework for Generative Design Optimization Hadi Keramati et.al. 2508.01509 null
2025-08-02 Instruction-based Time Series Editing Jiaxing Qiu et.al. 2508.01504 null
2025-08-02 The role of zealots in the spread of linguistic traits Vivian Dornelas et.al. 2508.01500 null
2025-08-02 TreeDiff: AST-Guided Code Generation with Diffusion LLMs Yiming Zeng et.al. 2508.01473 null
2025-08-02 Regression Augmentation With Data-Driven Segmentation Shayan Alahyari et.al. 2508.01455 null
2025-08-02 Physically-based Lighting Augmentation for Robotic Manipulation Shutong Jin et.al. 2508.01442 null
2025-08-02 Viscosity Stabilized Plug-and-Play Reconstruction Arghya Sinha et.al. 2508.01441 null
2025-08-02 Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling Le Trong Thanh Bui et.al. 2508.01436 null
2025-08-02 Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas? Tarian Fu et.al. 2508.01408 null
2025-08-02 StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints Lingxiao Chen et.al. 2508.01335 null
2025-08-05 Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion Konstantinos Moutselos et.al. 2508.01334 null
2025-08-02 LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points Xuemiao Zhang et.al. 2508.01317 null
2025-08-02 CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis Alec Sargood et.al. 2508.01292 null
2025-08-02 PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation Zonglei Jing et.al. 2508.01272 null
2025-08-02 Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling Lexiao Zou et.al. 2508.01264 null
2025-08-02 NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection Jiazhen Yan et.al. 2508.01248 null
2025-08-02 Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model Jing Gao et.al. 2508.01246 null
2025-08-02 Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal Xiangqi Liu et.al. 2508.01241 null
2025-08-02 SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches Cheng Tan et.al. 2508.01237 null
2025-08-02 Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system Jiyong Kim et.al. 2508.01230 null
2025-08-02 StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling Yuanlin Yang et.al. 2508.01215 null
2025-08-02 Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory Nabin Upadhya Dhakal et.al. 2508.01194 null
2025-08-02 DELTAv2: Accelerating Dense 3D Tracking Tuan Duc Ngo et.al. 2508.01170 null
2025-08-02 RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots Jing Tang et.al. 2508.01165 null
2025-08-02 LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation Xinyu Yan et.al. 2508.01152 null
2025-08-02 Personalized Safety Alignment for Text-to-Image Diffusion Models Yu Lei et.al. 2508.01151 null
2025-08-02 Dataset Condensation with Color Compensation Huyu Wu et.al. 2508.01139 null
2025-08-01 Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models Jinsong Li et.al. 2508.00819 null
2025-08-01 Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding Rui Chen et.al. 2508.00800 null
2025-08-01 Video Generators are Robot Policies Junbang Liang et.al. 2508.00795 null
2025-08-01 SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation Kien T. Pham et.al. 2508.00782 null
2025-08-01 Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data Timur Sattarov et.al. 2508.00758 null
2025-08-01 LeakyCLIP: Extracting Training Data from CLIP Yunhao Chen et.al. 2508.00756 null
2025-08-01 SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation Prerana Ramkumar et.al. 2508.00750 null
2025-08-01 AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation Le Wang et.al. 2508.00733 null
2025-08-01 YOLO-Count: Differentiable Object Counting for Text-to-Image Generation Guanning Zeng et.al. 2508.00728 null
2025-08-01 Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls Elisa Affili et.al. 2508.00713 null
2025-08-01 D3: Training-Free AI-Generated Video Detection Using Second-Order Features Chende Zheng et.al. 2508.00701 null
2025-08-01 On-Device Diffusion Transformer Policy for Efficient Robot Manipulation Yiming Wu et.al. 2508.00697 null
2025-08-01 Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network Young-ho Cho et.al. 2508.00692 null
2025-08-01 Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators Albert Matveev et.al. 2508.00643 null
2025-08-01 Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification Luisa Gallée et.al. 2508.00639 null
2025-08-01 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior Junzhe Lu et.al. 2508.00599 null
2025-08-01 Wukong Framework for Not Safe For Work Detection in Text-to-Image systems Mingrui Liu et.al. 2508.00591 null
2025-08-01 Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints Jens U. Kreber et.al. 2508.00558 null
2025-08-01 DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification Chihan Huang et.al. 2508.00552 null
2025-08-01 Video Color Grading via Look-Up Table Generation Seunghyun Shin et.al. 2508.00548 null
2025-08-01 HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning Carlo Alessi et.al. 2508.00491 null
2025-08-01 LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer Yuzhuo Chen et.al. 2508.00477 null
2025-08-01 A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces Leonidas Akritidis et.al. 2508.00472 null
2025-08-01 Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution Yiwen Wang et.al. 2508.00471 null
2025-08-01 AutoDebias: Automated Framework for Debiasing Text-to-Image Models Hongyi Cai et.al. 2508.00445 null
2025-08-01 SDMatte: Grafting Diffusion Models for Interactive Matting Longfei Huang et.al. 2508.00443 null
2025-08-01 Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection Sumin Seo et.al. 2508.00438 null
2025-08-01 Accurate Latent Inversion for Generative Image Steganography via Rectified Flow Yuqi Qian et.al. 2508.00434 null
2025-08-01 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Nan Xiang et.al. 2508.00428 null
2025-08-01 Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting Seunggeun Chi et.al. 2508.00427 null
2025-08-01 Collimated QED Cascades with Curved Plasma Mirror Xuesong Geng et.al. 2508.00417 null
2025-08-01 DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space Junyu Chen et.al. 2508.00413 null
2025-08-01 Sortblock: Similarity-Aware Feature Reuse for Diffusion Model Hanqi Chen et.al. 2508.00412 null
2025-08-01 Predictive information criterion for jump diffusion processes Yuma Uehara et.al. 2508.00411 null
2025-08-01 Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency Xi Xue et.al. 2508.00397 null
2025-08-01 Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization Yoonhyuk Choi et.al. 2508.00357 null
2025-08-01 BOOD: Boundary-based Out-Of-Distribution Data Generation Qilin Liao et.al. 2508.00350 null
2025-08-01 Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak SK Injamul Hoque et.al. 2508.00339 null
2025-08-01 Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems Surya Narayan Maharana et.al. 2508.00329 null
2025-08-01 Steering Guidance for Personalized Text-to-Image Diffusion Models Sunghyun Park et.al. 2508.00319 null
2025-08-01 GV-VAD: Exploring Video Generation for Weakly-Supervised Video Anomaly Detection Suhang Cai et.al. 2508.00312 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-08-01 Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence Danzhen Fu et.al. 2508.00299 null
2025-08-01 AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer Jin Lyu et.al. 2508.00298 null
2025-08-01 TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models Christian Simon et.al. 2508.00289 null
2025-08-01 UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents Jianqiang Xiao et.al. 2508.00288 null
2025-08-01 Towards Robust Semantic Correspondence: A Benchmark and Insights Wenyue Chong et.al. 2508.00272 null
2025-08-01 Jet Image Generation in High Energy Physics Using Diffusion Models Victor D. Martinez et.al. 2508.00250 null
2025-07-31 Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b Thomas Konings et.al. 2508.00177 null
2025-07-31 DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission Fupei Guo et.al. 2508.00172 null
2025-07-31 World Consistency Score: A Unified Metric for Video Generation Quality Akshat Rakheja et.al. 2508.00144 null
2025-07-31 Entanglement spreading and emergent locality in Brownian SYK chains Onkar Parrikar et.al. 2508.00060 null
2025-07-31 Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion Tong Nie et.al. 2508.00037 null
2025-07-31 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis Bowen Zhang et.al. 2507.23785 null
2025-07-31 SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions Jessica Bader et.al. 2507.23784 null
2025-07-31 General diffusions on metric graphs as limits of time-space Markov Chains Alexis Anagnostakis et.al. 2507.23724 null
2025-07-31 DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching Emery Pierson et.al. 2507.23715 null
2025-07-31 CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation Zhaoyue Xu et.al. 2507.23693 null
2025-07-31 UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration Zihan Cheng et.al. 2507.23685 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics Alexis Béjar-López et.al. 2507.23680 null
2025-07-31 DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data Rabeya Tus Sadia et.al. 2507.23676 null
2025-07-31 One-Step Flow Policy Mirror Descent Tianyi Chen et.al. 2507.23675 null
2025-07-31 Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis Kunpeng Qiu et.al. 2507.23652 null
2025-07-31 A stochastic heat equation with non-locally Lipschitz coefficients Le Chen et.al. 2507.23637 null
2025-07-31 DivControl: Knowledge Diversion for Controllable Image Generation Yucheng Xie et.al. 2507.23620 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization Michael L. Li et.al. 2507.23576 null
2025-08-01 H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation Hongzhe Bi et.al. 2507.23523 null
2025-07-31 Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings K. V. Nikolaev et.al. 2507.23513 null
2025-07-31 Emergence of long-range non-equilibrium correlations in free liquid diffusion Marco Bussoletti et.al. 2507.23507 null
2025-07-31 Digital literacy interventions can boost humans in discerning deepfakes Dominique Geissler et.al. 2507.23492 null
2025-07-31 Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion Mutian Xu et.al. 2507.23483 null
2025-07-31 Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models Long Chen et.al. 2507.23443 null
2025-07-31 Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories Lemar Abdi et.al. 2507.23411 null
2025-07-31 An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients Yuan-Yuan Huang et.al. 2507.23408 null
2025-07-31 UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries Yijie Zhu et.al. 2507.23372 null
2025-07-31 IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025 Radu-Andrei Bourceanu et.al. 2507.23357 null
2025-07-31 Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads Yingjie Zhou et.al. 2507.23343 null
2025-07-31 EMU and the DRAGNs I: A Catalogue of DRAGNs Ray P. Norris et.al. 2507.23337 null
2025-07-31 Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions Kristen C. Dage et.al. 2507.23332 null
2025-07-31 The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models Alfio Ferrara et.al. 2507.23313 null
2025-07-31 PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving Xuewei Tang et.al. 2507.23309 null
2025-08-01 Training-free Geometric Image Editing on Diffusion Models Hanshen Zhu et.al. 2507.23300 null
2025-07-31 UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing Hao Tang et.al. 2507.23278 null
2025-07-31 PixNerd: Pixel Neural Field Diffusion Shuai Wang et.al. 2507.23268 null
2025-07-31 Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas Lei Xie et.al. 2507.23245 null
2025-07-31 BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks Zhuoyin Dai et.al. 2507.23236 null
2025-07-31 Adversarial-Guided Diffusion for Multimodal LLM Attacks Chengwei Xia et.al. 2507.23202 null
2025-07-30 X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention Xiaochen Zhao et.al. 2507.23143 null
2025-07-30 Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations Jin Kunwoo Lee et.al. 2507.23102 null
2025-07-30 Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems Jonathan Monsalve et.al. 2507.23065 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-07-30 Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube Alejandra Granados et.al. 2507.23040 null
2025-07-30 Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction Giuseppe Cartella et.al. 2507.23021 null
2025-07-30 Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods Siwoo Park et.al. 2507.23010 null
2025-07-30 LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis Jamil Fayyad et.al. 2507.23001 null
2025-07-29 Neural Autoregressive Modeling of Brain Aging Ridvan Yesiloglu et.al. 2507.22954 null
2025-07-30 AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS Hai Ling et.al. 2507.22880 null
2025-07-30 Robust Contract with Career Concerns Tan Gan et.al. 2507.22852 null
2025-07-30 Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication Yidong Ren et.al. 2507.22851 null
2025-07-30 DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion Qingcheng Zhao et.al. 2507.22825 null
2025-07-30 Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit Md. Sad Abdullah Sami et.al. 2507.22803 null
2025-07-31 G-Core: A Simple, Scalable and Balanced RLHF Trainer Junyu Wu et.al. 2507.22789 null
2025-07-30 DO-EM: Density Operator Expectation Maximization Adit Vishnu et.al. 2507.22786 null
2025-08-01 Next Tokens Denoising for Speech Synthesis Yanqing Liu et.al. 2507.22746 null
2025-07-30 Zero-Shot Image Anomaly Detection Using Generative Foundation Models Lemar Abdi et.al. 2507.22692 null
2025-07-30 LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing Federico Girella et.al. 2507.22627 null
2025-07-30 Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions Yiting Qu et.al. 2507.22617 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning Xiefan Guo et.al. 2507.22604 null
2025-07-30 Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice Aaqib Zahoor et.al. 2507.22589 null
2025-07-30 DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement Chang Huang et.al. 2507.22501 null
2025-07-30 LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning Xiang Li et.al. 2507.22499 null
2025-07-30 Visual Language Models as Zero-Shot Deepfake Detectors Viacheslav Pirogov et.al. 2507.22469 null
2025-07-30 TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation Jiuming Liu et.al. 2507.22454 null
2025-07-30 GVD: Guiding Video Diffusion Model for Scalable Video Distillation Kunyang Li et.al. 2507.22360 null
2025-07-29 Trade-offs in Image Generation: How Do Different Dimensions Interact? Sicheng Zhang et.al. 2507.22100 null
2025-07-29 X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Zigang Geng et.al. 2507.22058 null
2025-07-30 See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs Ziyun Dai et.al. 2507.22003 null
2025-07-29 Enhancing Generalization in Data-free Quantization via Mixup-class Prompting Jiwoong Park et.al. 2507.21947 null
2025-07-29 Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is Ahmed B Mustafa et.al. 2507.21820 null
2025-07-29 Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection Yanxing Liu et.al. 2507.21816 null
2025-07-29 MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE Junzhe Li et.al. 2507.21802 null
2025-07-29 APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing Sangmin Han et.al. 2507.21690 null
2025-07-29 GuidPaint: Class-Guided Image Inpainting with Diffusion Models Qimin Wang et.al. 2507.21627 null
2025-07-29 Locally Controlled Face Aging with Latent Diffusion Models Lais Isabelle Alves dos Santos et.al. 2507.21600 null
2025-07-29 Neural network enabled wide field-of-view imaging with hyperbolic metalenses Joel Yeo et.al. 2507.21562 null
2025-07-29 Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance Mengling Xu et.al. 2507.21529 null
2025-07-29 BANG: Dividing 3D Assets via Generative Exploded Dynamics Longwen Zhang et.al. 2507.21493 null
2025-07-29 Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training Sodtavilan Odonchimed et.al. 2507.21452 null
2025-07-30 Multimodal LLMs as Customized Reward Models for Text-to-Image Generation Shijie Zhou et.al. 2507.21391 null
2025-07-28 Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation I-Hsiang Chen et.al. 2507.21367 null
2025-07-28 A Contrastive Diffusion-based Network (CDNet) for Time Series Classification Yaoyu Zhang et.al. 2507.21357 null
2025-07-28 HDR Environment Map Estimation with Latent Diffusion Models Jack Hilliard et.al. 2507.21261 null
2025-07-28 Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors Amartya Banerjee et.al. 2507.21260 null
2025-07-28 Learning from Limited and Imperfect Data Harsh Rangwani et.al. 2507.21205 null
2025-08-01 Flow Matching Policy Gradients David McAllister et.al. 2507.21053 null
2025-07-29 JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1 Xinhan Di et.al. 2507.20987 null
2025-07-28 Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision Xiao Fang et.al. 2507.20976 null

Industry

Publish Date Title Authors PDF Code
2025-08-07 CleanUpBench: Embodied Sweeping and Grasping Benchmark Wenbo Li et.al. 2508.05543 null
2025-08-07 MedMambaLite: Hardware-Aware Mamba for Medical Image Classification Romina Aalishah et.al. 2508.05049 null
2025-08-07 CSRAP: Enhanced Canvas Attention Scheduling for Real-Time Mission Critical Perception Md Iftekharul Islam Sakib et.al. 2508.04976 null
2025-08-07 Real-Time Doppler and Ionospheric Dispersion Correction Techniques for Arbitrary Waveforms Utilizing GPU Compute Daniel J. Vickers et.al. 2508.04951 null
2025-08-05 AIC CTU@FEVER 8: On-premise fact checking through long context RAG Herbert Ullrich et.al. 2508.04390 null
2025-08-06 A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks Kun Gui et.al. 2508.04316 null
2025-08-06 Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems Luai Abuelsamen et.al. 2508.04146 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Understanding the Landscape of Ampere GPU Memory Errors Zhu Zhu et.al. 2508.03513 null
2025-08-05 Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning Osama Mohammed et.al. 2508.03251 null
2025-08-04 MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models Wenyuan Liu et.al. 2508.02343 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis Yuzhuang Xu et.al. 2508.02322 null
2025-08-04 GPU in the Blind Spot: Overlooked Security Risks in Transportation Sefatun-Noor Puspa et.al. 2508.01995 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-02 A Parallel Algorithm for Finding Robust Spanners in Large Social Networks Arindam Khanda et.al. 2508.01485 null
2025-08-01 Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection Cheng-You Lu et.al. 2508.01014 null
2025-08-01 Optimal Scheduling Algorithms for LLM Inference: Theory and Practice Agrim Bari et.al. 2508.01002 null
2025-07-29 Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling Rajeev Patwari et.al. 2508.00904 null
2025-08-01 Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving Stefan Englmeier et.al. 2508.00589 null
2025-08-01 On Learning Closed-Loop Probabilistic Multi-Agent Simulator Juanwu Lu et.al. 2508.00384 null
2025-08-01 Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization Belman Jahir Rodriguez et.al. 2508.00307 null
2025-07-31 FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction Donghyun Lee et.al. 2507.23480 null
2025-07-31 InterfO-RAN: Real-Time In-band Cellular Uplink Interference Detection with GPU-Accelerated dApps Neagin Neasamoni Santhi et.al. 2507.23177 null
2025-07-30 On the Sustainability of AI Inferences in the Edge Ghazal Sobhani et.al. 2507.23093 null
2025-07-30 Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving Santosh Patapati et.al. 2507.23042 null
2025-07-28 Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery Deepak Joshi et.al. 2507.20680 null
2025-07-27 SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer Pansharpening Zeyu Xia et.al. 2507.20311 null
2025-07-26 Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures Mufakir Qamar Ansari et.al. 2507.20063 null
2025-07-26 A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling Louis Sugy et.al. 2507.19926 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-07-25 TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability Mohammad Aflah Khan et.al. 2507.19419 null
2025-07-25 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences Yusuke Hirota et.al. 2507.19362 null
2025-07-25 SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models Zhen Wan et.al. 2507.19361 null
2025-07-25 High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins Lorenzo Cazzella et.al. 2507.19173 null
2025-07-24 SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time Yun Chen et.al. 2507.18713 null
2025-07-24 Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping Chong Cheng et.al. 2507.18541 null
2025-07-24 Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++ Giulio Malenza et.al. 2507.18268 null
2025-07-26 MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation Zhongzhen Wen et.al. 2507.17773 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-24 Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners Kostas Karakontis et.al. 2507.17519 null
2025-07-25 HuNavSim 2.0: An Enhanced Human Navigation Simulator for Human-Aware Robot Navigation Miguel Escudero-Jiménez et.al. 2507.17317 null
2025-07-23 GPU Benchmark through QPE Emulator with cuQuantum for Practical Quantum Applications Takaki Akiba et.al. 2507.17175 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 Model Compression Engine for Wearable Devices Skin Cancer Diagnosis Jacob M. Delgado-López et.al. 2507.17125 null
2025-07-23 Computer Vision for Real-Time Monkeypox Diagnosis on Embedded Systems Jacob M. Delgado-López et.al. 2507.17123 null
2025-07-22 Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems Imran Latif et.al. 2507.16781 null
2025-07-22 AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase Andrei-Leonard Nicusan et.al. 2507.16710 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-21 MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition Hanwen Liu et.al. 2507.15914 null
2025-07-30 GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis Guoxi Liu et.al. 2507.15230 null
2025-07-19 Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall Shayan Rokhva et.al. 2507.14662 null
2025-07-16 GPU-Accelerated Interpretable Generalization for Rapid Cyberattack Detection and Forensics Shu-Ting Huang et.al. 2507.14222 null
2025-08-02 CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Xiaoya Li et.al. 2507.14111 null
2025-07-23 Photonic Fabric Platform for AI Accelerators Jing Ding et.al. 2507.14000 null
2025-07-18 Leveraging Multi-Instance GPUs through moldable task scheduling Jorge Villarrubia et.al. 2507.13601 null
2025-07-17 Performance Portable Gradient Computations Using Source Transformation Kim Liegeois et.al. 2507.13204 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 HyDRA: A Hybrid Dual-Mode Network for Closed- and Open-Set RFFI with Optimized VMD Hanwen Liu et.al. 2507.12133 null
2025-07-16 PoTPTQ: A Two-step Power-of-Two Post-training for LLMs Xinyu Wang et.al. 2507.11959 null
2025-07-15 MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving Ruihao Li et.al. 2507.11507 null
2025-07-15 MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit Yinuo Wang et.al. 2507.11067 null
2025-07-15 Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems Sehyun Ryu et.al. 2507.11064 null
2025-07-15 Modernizing CNN-based Weather Forecast Model towards Higher Computational Efficiency Minjong Cheon et.al. 2507.10893 null
2025-07-21 Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks Aaron Jarmusch et.al. 2507.10789 null
2025-07-14 A Benchmarking Framework for AI models in Automotive Aerodynamics Kaustubh Tangsali et.al. 2507.10747 null
2025-07-14 Quantize-then-Rectify: Efficient VQ-VAE Training Borui Zhang et.al. 2507.10547 null
2025-07-30 Designing quantum chemistry algorithms with just-in-time compilation Xiaojie Wu et.al. 2507.09772 null
2025-07-13 GeoWarp: An automatically differentiable and GPU-accelerated implicit MPM framework for geomechanics based on NVIDIA Warp Yidong Zhao et.al. 2507.09435 null
2025-07-12 Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering Shucheng Kang et.al. 2507.09165 null
2025-07-10 Vidyut3d: a GPU accelerated fluid solver for non-equilibrium plasmas on adaptive grids Hariswaran Sitaraman et.al. 2507.08200 null
2025-07-10 GPUHammer: Rowhammer Attacks on GPU Memories are Practical Chris S. Lin et.al. 2507.08166 null
2025-07-03 Collective Communication Profiling of Modern-day Machine Learning Workloads Jit Gupta et.al. 2507.07117 null
2025-07-09 StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception Marcel Vosshans et.al. 2507.06687 null
2025-07-09 EA: An Event Autoencoder for High-Speed Vision Sensing Riadul Islam et.al. 2507.06459 null
2025-07-08 CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation Kushal Gajjar et.al. 2507.06013 null
2025-07-07 Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model Mengyao Xu et.al. 2507.05513 null
2025-07-07 Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation Inayat Rasool et.al. 2507.05432 null
2025-07-23 Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms Zhiyi Hu et.al. 2507.04786 null
2025-07-05 ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments Guile Wu et.al. 2507.03886 null
2025-07-24 Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps Chong Cheng et.al. 2507.03737 null
2025-07-03 NVIDIA GPU Confidential Computing Demystified Zhongshu Gu et.al. 2507.02770 null
2025-07-03 Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources Roopkatha Banerjee et.al. 2507.02295 null
2025-07-02 SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan Fumikazu Konishi et.al. 2507.02124 null
2025-07-02 Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization Giuseppe Ruggeri et.al. 2507.01676 null
2025-06-20 PyTorch-based Geometric Learning with Non-CUDA Processing Units: Experiences from Intel Gaudi-v2 HPUs Fanchen Bu et.al. 2507.01031 null
2025-07-01 Anatomy of High-Performance Column-Pivoted QR Decomposition Maksim Melnichenko et.al. 2507.00976 null
2025-07-01 Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms Zain Taufique et.al. 2507.00491 null
2025-07-01 Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs Mohammad Firas Sada et.al. 2507.00418 null
2025-07-01 Question Decomposition for Retrieval-Augmented Generation Paul J. L. Ammann et.al. 2507.00355 null
2025-06-24 AdaDeDup: Adaptive Hybrid Data Pruning for Efficient Large-Scale Object Detection Training Feiyang Kang et.al. 2507.00049 null
2025-06-30 Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model Mu-Chi Chen et.al. 2506.23635 null
2025-06-30 Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset Tim Puphal et.al. 2506.23433 null
2025-06-29 CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms Faaiq Waqar et.al. 2506.23405 null
2025-06-28 FF-INT8: Efficient Forward-Forward DNN Training on Edge Devices with INT8 Precision Jingxiao Ma et.al. 2506.22771 null
2025-06-27 Quantum-Classical Auxiliary Field Quantum Monte Carlo with Matchgate Shadows on Trapped Ion Quantum Computers Luning Zhao et.al. 2506.22408 null
2025-06-27 MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism Zheng Zhang et.al. 2506.22175 null
2025-06-27 MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators Zheng Zhang et.al. 2506.22169 null
2025-07-08 BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting Zipei Ma et.al. 2506.22099 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-06-23 TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge Zhiyuan Zhang et.al. 2506.21618 null
2025-06-26 SAM4D: Segment Anything in Camera and LiDAR Streams Jianyun Xu et.al. 2506.21547 null
2025-06-26 Portable High-Performance Kernel Generation for a Computational Fluid Dynamics Code with DaCe Måns I. Andersson et.al. 2506.20994 null
2025-06-25 Characterization and Mitigation of Training Instabilities in Microscaling Formats Huangyuan Su et.al. 2506.20752 null
2025-06-24 MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models Hoa La et.al. 2506.20686 null
2025-06-25 SuperSONIC: Cloud-Native Infrastructure for ML Inferencing Dmitry Kondratyev et.al. 2506.20657 null
2025-06-25 Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking Ben Kang et.al. 2506.20381 null
2025-06-24 Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification Minghao Qin et.al. 2506.19225 null
2025-06-23 Let Your Video Listen to Your Music! Xinyu Zhang et.al. 2506.18881 null
2025-06-23 Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano Berk Yilmaz et.al. 2506.18220 null
2025-06-22 AMD Versal Implementations of FAM and SSCA Estimators Carol Jingyi Li et.al. 2506.18003 null
2025-06-20 Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms Kaushik Kulkarni et.al. 2506.17471 null
2025-06-19 VideoGAN-based Trajectory Proposal for Automated Vehicles Annajoyce Mariani et.al. 2506.16209 null
2025-06-19 Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs Xun Wang et.al. 2506.16196 null
2025-06-19 HetGPU: The pursuit of making binary compatibility towards GPUs Yiwei Yang et.al. 2506.15993 null
2025-06-18 Early Attentive Sparsification Accelerates Neural Speech Transcription Zifei Xu et.al. 2506.15912 null
2025-06-18 UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting Kai He et.al. 2506.15673 null
2025-06-18 Engineering Supercomputing Platforms for Biomolecular Applications Robert Welch et.al. 2506.15585 null
2025-07-30 Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention Syed Haider Ali et.al. 2506.15562 null
2025-06-17 Align Your Flow: Scaling Continuous-Time Flow Map Distillation Amirmojtaba Sabour et.al. 2506.14603 null
2025-06-18 Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren et.al. 2506.09042 null
2025-06-10 Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions David Acuna et.al. 2506.08927 null
2025-07-18 Controllable Weather Synthesis and Removal with Video Diffusion Models Chih-Hao Lin et.al. 2505.00704 null
2025-04-21 LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception Yuan-Hong Liao et.al. 2504.15362 null
2025-04-15 PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond Minghua Liu et.al. 2504.11451 null
2025-04-17 VideoPanda: Video Panoramic Diffusion with Multi-view Attention Kevin Xie et.al. 2504.11389 null
2025-04-01 Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control NVIDIA et.al. 2503.14492 null
2025-03-05 GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Xuanchi Ren et.al. 2503.03751 null
2025-03-03 Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Jay Zhangjie Wu et.al. 2503.01774 null
2025-03-22 DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models Ruofan Liang et.al. 2501.18590 null
2025-07-09 Cosmos World Foundation Model Platform for Physical AI NVIDIA et.al. 2501.03575 null
2025-06-26 InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models Yifan Lu et.al. 2412.03934 null
2025-04-01 Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos Hanxue Liang et.al. 2412.03526 null
2024-11-14 LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Zhengyi Wang et.al. 2411.09595 null
2025-02-28 ReMatching Dynamic Reconstruction Flow Sara Oblak et.al. 2411.00705 null
2024-10-26 SCube: Instant Large-Scale Scene Reconstruction using VoxSplats Xuanchi Ren et.al. 2410.20030 null
2025-02-11 SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes Tianchang Shen et.al. 2409.20562 null
2024-09-28 G3R: Gradient Guided Generalizable Reconstruction Yun Chen et.al. 2409.19405 null
2024-09-27 UniCal: Unified Neural Sensor Calibration Ze Yang et.al. 2409.18953 null
2024-09-26 Learning to Drive via Asymmetric Self-Play Chris Zhang et.al. 2409.18218 null
2024-09-15 Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models Yuan-Hong Liao et.al. 2409.09788 null
2025-04-19 OmniRe: Omni Urban Scene Reconstruction Ziyu Chen et.al. 2408.16760 null
2024-08-19 Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering Ruofan Liang et.al. 2408.09702 null
2025-03-20 Wolf: Dense Video Captioning with a World Summarization Framework Boyi Li et.al. 2407.18908 null
2024-07-15 SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation Jordan Juravsky et.al. 2407.10481 null
2024-10-10 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes Nicolas Moenne-Loccoz et.al. 2407.07090 null
2024-07-01 fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence Francis Williams et.al. 2407.01781 null
2024-10-31 DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features Letian Wang et.al. 2406.12095 null
2024-06-14 L4GM: Large 4D Gaussian Reconstruction Model Jiawei Ren et.al. 2406.10324 null
2024-06-12 UnO: Unsupervised Occupancy Fields for Perception and Forecasting Ben Agro et.al. 2406.08691 null
2024-06-12 Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata Dongsu Zhang et.al. 2406.08292 null
2024-06-13 DeTra: A Unified Model for Object Detection and Trajectory Forecasting Sergio Casas et.al. 2406.04426 null
2024-04-24 NeRF-XL: Scaling NeRFs with Multiple GPUs Ruilong Li et.al. 2404.16221 null
2024-04-22 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Amirmojtaba Sabour et.al. 2404.14507 null
2024-04-16 RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting Ashkan Mirzaei et.al. 2404.10765 null
2025-05-26 Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves? Yuan-Hong Liao et.al. 2404.06510 null
2024-04-01 QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving Sourav Biswas et.al. 2404.01486 null
2024-03-22 LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis Kevin Xie et.al. 2403.15385 null
2024-03-22 Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks Aqeel Anwar et.al. 2403.15370 null
2024-01-22 EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata et.al. 2401.11739 null
2023-12-28 Compact Neural Graphics Primitives with Learned Hash Probing Towaki Takikawa et.al. 2312.17241 null
2024-01-03 Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models Huan Ling et.al. 2312.13763 null
2023-12-11 LightSim: Neural Lighting Simulation for Urban Scenes Ava Pun et.al. 2312.06654 null
2024-04-14 Trajeglish: Traffic Modeling as Next-Token Prediction Jonah Philion et.al. 2312.04535 null
2024-06-25 XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies Xuanchi Ren et.al. 2312.03806 null
2024-04-12 WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space Katja Schwarz et.al. 2311.13570 null
2023-11-16 Adaptive Shells for Efficient Neural Radiance Field Rendering Zian Wang et.al. 2311.10091 null
2023-11-09 Real-Time Neural Rasterization for Large Scenes Jeffrey Yunfan Liu et.al. 2311.05607 null
2023-11-09 Reconstructing Objects in-the-wild for Realistic Sensor Simulation Ze Yang et.al. 2311.05602 null
2023-11-07 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Chenfeng Xu et.al. 2311.04391 null
2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision Jiawei Yang et.al. 2311.02077 null
2023-11-03 Towards Unsupervised Object Detection From LiDAR Point Clouds Lunjun Zhang et.al. 2311.02007 null
2023-11-02 MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory Enxu Li et.al. 2311.01556 null
2023-11-17 4D-Former: Multimodal 4D Panoptic Segmentation Ali Athar et.al. 2311.01520 null
2023-11-02 UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation Yuwen Xiong et.al. 2311.01448 null
2023-11-02 CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation Jingkang Wang et.al. 2311.01447 null
2023-11-02 Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation Jay Sarva et.al. 2311.01446 null
2023-11-02 LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds Anqi Joyce Yang et.al. 2311.01444 null
2023-11-02 Learning Realistic Traffic Agents in Closed-loop Chris Zhang et.al. 2311.01394 null
2024-04-01 Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion Lunjun Zhang et.al. 2311.01017 null
2024-01-26 ViR: Towards Efficient Vision Retention Backbones Ali Hatamizadeh et.al. 2310.19731 null
2023-10-20 TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models Tianshi Cao et.al. 2310.13772 null
2023-09-11 Towards Viewpoint Robustness in Bird’s Eye View Segmentation Tzofi Klinghoffer et.al. 2309.05192 null
2023-08-10 Flexible Isosurface Extraction for Gradient-Based Mesh Optimization Tianchang Shen et.al. 2308.05371 null
2023-08-03 UniSim: A Neural Closed-Loop Sensor Simulator Ze Yang et.al. 2308.01898 null
2023-08-02 Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving Ben Agro et.al. 2308.01471 null
2023-07-14 DreamTeacher: Pretraining Image Backbones with Deep Generative Models Daiqing Li et.al. 2307.07487 null
2023-06-27 Rethinking Closed-loop Training for Autonomous Driving Chris Zhang et.al. 2306.15713 null
2023-06-06 ATT3D: Amortized Text-to-3D Object Synthesis Jonathan Lorraine et.al. 2306.07349 null
2023-06-09 Neural Kernel Surface Reconstruction Jiahui Huang et.al. 2305.19590 null
2023-08-13 Neural LiDAR Fields for Novel View Synthesis Shengyu Huang et.al. 2305.01643 null
2023-04-19 NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models Seung Wook Kim et.al. 2304.09787 null
2023-12-28 Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models Andreas Blattmann et.al. 2304.08818 null
2023-04-06 Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes Zian Wang et.al. 2304.03266 null
2023-04-04 Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion Davis Rempe et.al. 2304.01893 null
2023-03-25 VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion Yiming Li et.al. 2302.12251 null
2023-02-09 Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting Viraj Prabhu et.al. 2302.04832 null
2023-02-02 Synthesizing Physical Character-Scene Interactions Mohamed Hassan et.al. 2302.00883 null
2023-01-31 PADL: Language-Directed Physics-Based Character Control Jordan Juravsky et.al. 2301.13868 null
2023-03-25 Magic3D: High-Resolution Text-to-3D Content Creation Chen-Hsuan Lin et.al. 2211.10440 null
2022-11-08 GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting Alexander Cui et.al. 2211.02545 null
2022-10-12 LION: Latent Point Diffusion Models for 3D Shape Generation Xiaohui Zeng et.al. 2210.06978 null
2022-10-06 XDGAN: Multi-Modal 3D Shape Generation in 2D Space Hassan Abu Alhaija et.al. 2210.03007 null
2022-10-03 Optimizing Data Collection for Machine Learning Rafid Mahmood et.al. 2210.01234 null
2022-09-26 EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations Ahmad Darkhalil et.al. 2209.13064 null
2022-09-22 GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images Jun Gao et.al. 2209.11163 null
2022-08-19 Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion Zian Wang et.al. 2208.09480 null
2022-08-18 MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation Gopal Sharma et.al. 2208.08580 null
2022-07-05 Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention Gary Leung et.al. 2207.02126 null
2022-07-13 How Much More Data Do I Need? Estimating Requirements for Downstream Tasks Rafid Mahmood et.al. 2207.01725 null
2022-06-19 Scalable Neural Data Server: A Data Recommender for Transfer Learning Tianshi Cao et.al. 2206.09386 null
2022-06-16 Virtual Correspondence: Humans as a Cue for Extreme-View Geometry Wei-Chiu Ma et.al. 2206.08365 null
2022-06-15 Variable Bitrate Neural Fields Towaki Takikawa et.al. 2206.07707 null
2022-06-06 Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps Seung Wook Kim et.al. 2206.02903 null
2022-05-05 ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters Xue Bin Peng et.al. 2205.01906 null
2022-04-19 M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation Enze Xie et.al. 2204.05088 null
2022-04-06 AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis Zhiqin Chen et.al. 2204.03105 null

Autonomous Driving

Publish Date Title Authors PDF Code
2025-08-07 SMOL-MapSeg: Show Me One Label Yunshuang Yuan et.al. 2508.05501 null
2025-08-07 Physical Adversarial Camouflage through Gradient Calibration and Regularization Jiawei Liang et.al. 2508.05414 null
2025-08-07 DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model Rui Yu et.al. 2508.05402 null
2025-08-07 ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models Yatong Lan et.al. 2508.05236 null
2025-08-07 PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems Qi Guo et.al. 2508.05167 null
2025-08-07 AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics Stella Su et.al. 2508.04955 null
2025-08-06 Occupancy Learning with Spatiotemporal Memory Ziyang Leng et.al. 2508.04705 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case Baihui Xiao et.al. 2508.04642 null
2025-08-06 Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark Xiao Wang et.al. 2508.04260 null
2025-08-06 DRIVE: Dynamic Rule Inference and Verified Evaluation for Constraint-Aware Autonomous Driving Longling Geng et.al. 2508.04066 null
2025-08-05 LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences Ao Liang et.al. 2508.03692 null
2025-08-05 La La LiDAR: Large-Scale Layout Generation from LiDAR Data Youquan Liu et.al. 2508.03691 null
2025-08-05 Veila: Panoramic LiDAR Generation from a Monocular RGB Image Youquan Liu et.al. 2508.03690 null
2025-08-05 MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention Qi Xie et.al. 2508.03034 null
2025-08-04 Context-aware Risk Assessment and Its Application in Autonomous Driving Boyang Tian et.al. 2508.02919 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera Byeonggyu Park et.al. 2508.02348 null
2025-08-04 Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images Philipp Wulff et.al. 2508.02323 null
2025-08-04 Test-Time Model Adaptation for Quantized Neural Networks Zeshuai Deng et.al. 2508.02180 null
2025-08-04 Beyond RGB and Events: Enhancing Object Detection under Adverse Lighting with Monocular Normal Maps Mingjie Liu et.al. 2508.02127 null
2025-08-04 Mapillary Vistas Validation for Fine-Grained Traffic Signs: A Benchmark Revealing Vision-Language Model Limitations Sparsh Garg et.al. 2508.02047 null
2025-08-04 Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving Tianyuan Zhang et.al. 2508.02028 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-03 StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding Haolin Yang et.al. 2508.01875 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving Luqi Cheng et.al. 2508.01704 null
2025-08-03 Adverse Weather-Independent Framework Towards Autonomous Driving Perception through Temporal Correlation and Unfolded Regularization Wei-Bin Kou et.al. 2508.01583 null
2025-08-02 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Zhan Shi et.al. 2508.01197 null
2025-08-01 CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception Chenyi Wang et.al. 2508.01062 null
2025-08-01 REACT: A Real-Time Edge-AI Based V2X Framework for Accident Avoidance in Autonomous Driving System Fengze Yang et.al. 2508.01057 null
2025-07-31 Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems Shiyao Sang et.al. 2508.00947 null
2025-08-01 Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR Adwait Chandorkar et.al. 2508.00744 null
2025-08-01 Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving Stefan Englmeier et.al. 2508.00589 null
2025-08-01 Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection Marc Hölle et.al. 2508.00587 null
2025-08-01 Pro2Guard: Proactive Runtime Enforcement of LLM Agent Safety via Probabilistic Model Checking Haoyu Wang et.al. 2508.00500 null
2025-08-01 Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence Danzhen Fu et.al. 2508.00299 null
2025-07-21 AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks Ahmet Melih Ince et.al. 2508.00011 null
2025-07-31 I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation Jialei Chen et.al. 2507.23683 null
2025-07-31 DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation Yuchen Zhou et.al. 2507.23599 null
2025-08-02 MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction Zijian Dong et.al. 2507.23597 null
2025-07-31 A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving Yi Zhang et.al. 2507.23540 null
2025-07-31 MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting Xingyue Peng et.al. 2507.23340 null
2025-07-31 Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision Qiang Lu et.al. 2507.23331 null
2025-07-31 FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models Yiming Yang et.al. 2507.23325 null
2025-08-02 FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning Jiajun Cao et.al. 2507.23318 null
2025-08-04 PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving Xuewei Tang et.al. 2507.23309 null
2025-07-30 Causal-Inspired Multi-Agent Decision-Making via Graph Reinforcement Learning Jing Wang et.al. 2507.23080 null
2025-08-05 Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints Santosh Patapati et.al. 2507.23064 null
2025-07-30 Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation Alexandru Buburuzan et.al. 2507.23058 null
2025-08-07 Bayesian Optimization applied for accelerated Virtual Validation of the Autonomous Driving Function Satyesh Shanker Awasthi et.al. 2507.22769 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation Jiuming Liu et.al. 2507.22454 null
2025-07-30 Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators Kaustav Chakraborty et.al. 2507.22389 null
2025-07-29 Hierarchical Game-Based Multi-Agent Decision-Making for Autonomous Vehicles Mushuang Liu et.al. 2507.21941 null
2025-07-31 MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors Shouyi Lu et.al. 2507.21872 null
2025-07-29 SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking Qianxiong Xu et.al. 2507.21732 null
2025-07-29 Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition Ruiyang Hao et.al. 2507.21610 null
2025-07-29 SafeDriveRAG: Towards Safe Autonomous Driving with Knowledge Graph-based Retrieval-Augmented Generation Hao Ye et.al. 2507.21585 null
2025-07-30 No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering Linye Wei et.al. 2507.21572 null
2025-07-29 RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors Tianhui Cai et.al. 2507.21567 null
2025-07-29 SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity Xingyang Li et.al. 2507.21499 null
2025-07-29 MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving Thomas Monninger et.al. 2507.21423 null
2025-08-03 Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy Jicheng Yuan et.al. 2507.21358 null
2025-07-25 Seeing Beyond Frames: Zero-Shot Pedestrian Intention Prediction with Raw Temporal Video and Multimodal Cues Pallavi Zambare et.al. 2507.21161 null
2025-07-28 GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction Tianhao Li et.al. 2507.20963 null
2025-07-25 Event-Based De-Snowing for Autonomous Driving Manasi Muglikar et.al. 2507.20901 null
2025-07-28 DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception Weicheng Zheng et.al. 2507.20879 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving Levente Tempfli et.al. 2507.20397 null
2025-07-27 Solving Scene Understanding for Autonomous Navigation in Unstructured Environments Naveen Mathews Renji et.al. 2507.20389 null
2025-07-27 VLMPlanner: Integrating Visual Language Models with Motion Planning Zhipeng Tang et.al. 2507.20342 null
2025-07-27 MambaMap: Online Vectorized HD Map Construction using State Space Model Ruizi Yang et.al. 2507.20224 null
2025-07-27 LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks Fei Kong et.al. 2507.20174 null
2025-07-27 Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning Ziyi Liang et.al. 2507.20089 null
2025-07-26 Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application Tongjie Li et.al. 2507.19974 null
2025-07-29 DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes Rishav Kumar et.al. 2507.19912 null
2025-07-26 Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA Ahmed Abouelazm et.al. 2507.19883 null
2025-07-26 FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving Tao Lian et.al. 2507.19881 null
2025-07-30 RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection Xiaokai Bai et.al. 2507.19856 null
2025-07-26 A 4D Radar Camera Extrinsic Calibration Tool Based on 3D Uncertainty Perspective N Points Chuan Cao et.al. 2507.19829 null
2025-07-25 PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction Haichuan Li et.al. 2507.19701 null
2025-07-25 Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing Haichuan Li et.al. 2507.19691 null
2025-08-02 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting Baijun Ye et.al. 2507.19451 null
2025-07-25 An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles Matthias Weiß et.al. 2507.19446 null
2025-07-25 SDVDiag: A Modular Platform for the Diagnosis of Connected Vehicle Functions Matthias Weiß et.al. 2507.19403 null
2025-07-25 BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving Felix Brandstaetter et.al. 2507.19370 null
2025-07-25 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences Yusuke Hirota et.al. 2507.19362 null
2025-07-25 SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence Viktar Dubovik et.al. 2507.19321 null
2025-07-25 CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception Jiaru Zhong et.al. 2507.19239 null
2025-07-25 VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions Haoang Lu et.al. 2507.19188 null
2025-07-25 Continual Learning-Based Unified Model for Unpaired Image Restoration Tasks Kotha Kartheek et.al. 2507.19184 null
2025-07-25 Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL Ahmed Abouelazm et.al. 2507.19146 null
2025-07-31 PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction Yanghong Liu et.al. 2507.19119 null
2025-07-25 Fine-Grained Traffic Inference from Road to Lane via Spatio-Temporal Graph Node Generation Shuhao Li et.al. 2507.19089 null
2025-07-25 HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback Elham Soltani Kazemi et.al. 2507.18921 null
2025-07-24 Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving Keshav Gupta et.al. 2507.18763 null
2025-07-24 Linear Memory SE(2) Invariant Attention Ethan Pronovost et.al. 2507.18597 null
2025-07-24 GaussianFusionOcc: A Seamless Sensor Fusion Approach for 3D Occupancy Prediction Using 3D Gaussians Tomislav Pavković et.al. 2507.18522 null
2025-07-24 Delving into Mapping Uncertainty for Mapless Trajectory Prediction Zongzheng Zhang et.al. 2507.18498 null
2025-07-24 Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments Xiao Yang et.al. 2507.18484 null
2025-07-24 CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting Haoran Xu et.al. 2507.18473 null
2025-07-24 LONG3R: Long Sequence Streaming 3D Reconstruction Zhuoguang Chen et.al. 2507.18255 null
2025-07-24 GenAI for Automotive Software Development: From Requirements to Wheels Nenad Petrovic et.al. 2507.18223 null
2025-07-24 Goal-based Trajectory Prediction for improved Cross-Dataset Generalization Daniel Grimm et.al. 2507.18196 null
2025-07-24 Policy Disruption in Reinforcement Learning: Adversarial Attack with Large Language Models and Critical State Identification Junyong Jiang et.al. 2507.18113 null
2025-07-23 BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems Malsha Ashani Mahawatta Dona et.al. 2507.17722 null
2025-07-23 Reusing Attention for One-stage Lane Topology Understanding Yang Li et.al. 2507.17617 null
2025-07-23 InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling Xiaoxue Chen et.al. 2507.17613 null
2025-07-24 PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving Maciej K. Wozniak et.al. 2507.17596 null
2025-07-23 SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving Chuang Chen et.al. 2507.17479 null
2025-07-23 VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization Sania Waheed et.al. 2507.17455 null
2025-07-23 Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning Joobin Jin et.al. 2507.17418 null
2025-08-06 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study Mandar Pitale et.al. 2507.17118 null
2025-07-22 SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction Zaipeng Duan et.al. 2507.17083 null
2025-07-22 Few-Shot Learning in Video and 3D Object Detection: A Survey Md Meftahul Ferdaus et.al. 2507.17079 null
2025-07-22 Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach Adithya Mohan et.al. 2507.17070 null
2025-07-22 Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption Keneni W. Tesema et.al. 2507.16743 null
2025-07-22 Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control Zongzheng Zhang et.al. 2507.16645 null
2025-07-22 A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System Lorenzo Gentilini et.al. 2507.16621 null
2025-07-22 VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences Kai Deng et.al. 2507.16443 null
2025-07-22 A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization Yifan Zhang et.al. 2507.16177 null
2025-07-21 Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity Huiling Yang et.al. 2507.15601 null
2025-07-21 Robots for Kiwifruit Harvesting and Pollination Jamie Bell et.al. 2507.15484 null
2025-07-21 VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Haichao Liu et.al. 2507.15266 null
2025-07-20 CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning Pan Hu et.al. 2507.14903 null
2025-07-23 GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving Chi Wan et.al. 2507.14456 null
2025-07-18 Preference-based Multi-Objective Reinforcement Learning Ni Mu et.al. 2507.14066 null
2025-07-18 Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors Jochen Wulf et.al. 2507.14034 null
2025-07-18 Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection Yujian Mo et.al. 2507.13899 null
2025-07-18 Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation Max van den Hoven et.al. 2507.13857 null
2025-07-18 One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion Haoang Lu et.al. 2507.13801 null
2025-07-18 AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework Yu Yao et.al. 2507.13729 null
2025-07-17 CaSTFormer: Causal Spatio-Temporal Transformer for Driving Intention Prediction Sirui Wang et.al. 2507.13425 null
2025-07-16 From Binary to Semantic: Utilizing Large-Scale Binary Occupancy Data for 3D Semantic Occupancy Prediction Chihiro Noguchi et.al. 2507.13387 null
2025-07-17 Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models Arian Mousakhan et.al. 2507.13162 null
2025-07-17 Channel-wise Motion Features for Efficient Motion Segmentation Riku Inoue et.al. 2507.13082 null
2025-07-23 LaViPlan: Language-Guided Visual Path Planning with RLVR Hayeon Oh et.al. 2507.12911 null
2025-07-17 World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving Yanchen Guan et.al. 2507.12762 null
2025-07-17 Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation Yanchen Guan et.al. 2507.12755 null
2025-07-16 ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving Yuhang Lu et.al. 2507.12499 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models Santosh Vasa et.al. 2507.12414 null
2025-07-21 AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving Jiawei Xu et.al. 2507.12137 null
2025-07-16 LidarPainter: One-Step Away From Any Lidar View To Novel Guidance Yuzhou Ji et.al. 2507.12114 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers Mohammed Hassanin et.al. 2507.11852 null
2025-07-15 Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation Zhen Xu et.al. 2507.11540 null
2025-07-15 A Survey on Interpretability in Visual Recognition Qiyang Wan et.al. 2507.11099 null
2025-07-14 RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding Benjamin Stoler et.al. 2507.10749 null
2025-07-14 Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance Kyungtae Han et.al. 2507.10500 null

Traffic Simulation

Publish Date Title Authors PDF Code
2025-08-07 TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution Zhikai Zhao et.al. 2508.05616 null
2025-08-07 Benchmarking Shortcutting Techniques for Multi-Robot-Arm Motion Planning Philip Huang et.al. 2508.05027 null
2025-08-06 LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction Md Zahidul Hasan et.al. 2508.04847 null
2025-08-06 BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning Ziyang Leng et.al. 2508.04702 null
2025-08-06 Incorporating Stochastic Models of Controller Behavior into Kinodynamic Efficiently Adaptive State Lattices for Mobile Robot Motion Planning in Off-Road Environments Eric R. Damm et.al. 2508.04384 null
2025-08-06 Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction Yu Liu et.al. 2508.04229 null
2025-08-06 Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems Luai Abuelsamen et.al. 2508.04146 null
2025-08-05 Constraint-Preserving Data Generation for Visuomotor Policy Learning Kevin Lin et.al. 2508.03944 null
2025-08-05 Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions Ergi Tushe et.al. 2508.03541 null
2025-08-04 X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio Chenxu Zhang et.al. 2508.02944 null
2025-08-04 MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model Tianheng Zhu et.al. 2508.02858 null
2025-08-04 Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering Xu Wang et.al. 2508.02362 null
2025-08-04 Adaptive Lattice-based Motion Planning Abhishek Dhar et.al. 2508.02350 null
2025-08-04 Framework for Robust Motion Planning of Tethered Multi-Robot Systems in Marine Environments Markus Buchholz et.al. 2508.02287 null
2025-08-04 AID4AD: Aerial Image Data for Automated Driving Perception Daniel Lengerer et.al. 2508.02140 null
2025-08-03 Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving Hunter Schofield et.al. 2508.01922 null
2025-08-03 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion Zhigang Sun et.al. 2508.01778 null
2025-08-03 A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction Hua Yu et.al. 2508.01585 null
2025-07-29 A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles Jiayuan Wang et.al. 2508.00917 null
2025-08-01 On Learning Closed-Loop Probabilistic Multi-Agent Simulator Juanwu Lu et.al. 2508.00384 null
2025-08-01 TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps Zehui Xu et.al. 2508.00303 null
2025-07-31 Data-Driven Motion Planning for Uncertain Nonlinear Systems Babak Esmaeili et.al. 2508.00154 null
2025-07-31 OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction Yang Gao et.al. 2507.23657 null
2025-07-31 A Framework for Ethical Decision-Making in Automated Vehicles through Human Reasons-based Supervision Lucas Elbert Suryana et.al. 2507.23308 null
2025-07-31 Simulation-based planning of Motion Sequences for Automated Procedure Optimization in Multi-Robot Assembly Cells Loris Schneider et.al. 2507.23270 null
2025-08-01 Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future Guoping Xu et.al. 2507.22792 null
2025-07-30 Social-Pose: Enhancing Trajectory Prediction with Human Body Pose Yang Gao et.al. 2507.22742 null
2025-07-30 Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model Daehee Park et.al. 2507.22615 null
2025-07-30 Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators Kaustav Chakraborty et.al. 2507.22389 null
2025-07-27 Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars Mattia Piccinini et.al. 2507.20427 null
2025-07-27 VLMPlanner: Integrating Visual Language Models with Motion Planning Zhipeng Tang et.al. 2507.20342 null
2025-07-27 PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks Clinton Ansun Mo et.al. 2507.20170 null
2025-07-25 PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction Haichuan Li et.al. 2507.19701 null
2025-07-25 RAKOMO: Reachability-Aware K-Order Markov Path Optimization for Quadrupedal Loco-Manipulation Mattia Risiglione et.al. 2507.19652 null
2025-07-25 High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins Lorenzo Cazzella et.al. 2507.19173 null
2025-07-31 PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction Yanghong Liu et.al. 2507.19119 null
2025-07-24 Probabilistic Collision Risk Estimation through Gauss-Legendre Cubature and Non-Homogeneous Poisson Processes Trent Weiss et.al. 2507.18819 null
2025-07-24 Delving into Mapping Uncertainty for Mapless Trajectory Prediction Zongzheng Zhang et.al. 2507.18498 null
2025-07-24 Goal-based Trajectory Prediction for improved Cross-Dataset Generalization Daniel Grimm et.al. 2507.18196 null
2025-07-24 DanceGraph: A Complementary Architecture for Synchronous Dancing Online David Sinclair et.al. 2507.18052 null
2025-07-23 Safety Assurance for Quadrotor Kinodynamic Motion Planning Theodoros Tavoulareas et.al. 2507.17679 null
2025-07-23 IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception Haichuan Li et.al. 2507.17445 null
2025-08-06 DeMo++: Motion Decoupling for Autonomous Driving Bozhou Zhang et.al. 2507.17342 null
2025-07-23 JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction Fangze Lin et.al. 2507.17152 null
2025-07-23 Falconry-like palm landing by a flapping-wing drone based on the human gesture interaction and distance-aware flight planning Kazuki Numazato et.al. 2507.17144 null
2025-07-22 RAPTAR: Radar Radiation Pattern Acquisition through Automated Collaborative Robotics Maaz Qureshi et.al. 2507.16988 null
2025-07-21 Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection Zihao Chen et.al. 2507.16109 null
2025-07-21 Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction Shiyang Li et.al. 2507.15832 null
2025-07-21 Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs Ruochu Yang et.al. 2507.15782 null
2025-07-21 Selective Densification for Rapid Motion Planning in High Dimensions with Narrow Passages Lu Huang et.al. 2507.15710 null
2025-07-21 A Universal Vehicle-Trailer Navigation System with Neural Kinematics and Online Residual Learning Yanbo Chen et.al. 2507.15607 null
2025-07-21 VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Haichao Liu et.al. 2507.15266 null
2025-07-20 Search-Based Autonomous Vehicle Motion Planning Using Game Theory Pouya Panahandeh et.al. 2507.15088 null
2025-07-20 CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning Pan Hu et.al. 2507.14903 null
2025-07-18 Context-Aware Behavior Learning with Heuristic Motion Memory for Underwater Manipulation Markus Buchholz et.al. 2507.14099 null
2025-07-18 NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning Qingyi Chen et.al. 2507.13940 null
2025-07-18 Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification Sihang Wei et.al. 2507.13613 null
2025-07-16 InSyn: Modeling Complex Interactions for Pedestrian Trajectory Prediction Kaiyuan Zhai et.al. 2507.13397 null
2025-07-25 Signal Temporal Logic Compliant Co-design of Planning and Control Manas Sashank Juvvi et.al. 2507.13225 null
2025-07-22 Predictability-Aware Motion Prediction for Edge XR via High-Order Error-State Kalman Filtering Ziyu Zhong et.al. 2507.13179 null
2025-07-17 Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning Giwon Lee et.al. 2507.12977 null
2025-07-17 FFI-VTR: Lightweight and Robust Visual Teach and Repeat Navigation based on Feature Flow Indicator and Probabilistic Motion Planning Jikai Wang et.al. 2507.12800 null
2025-07-16 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding Renjie Li et.al. 2507.12463 null
2025-07-16 Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios Van-Hoang-Anh Phan et.al. 2507.12449 null
2025-07-16 Regrasp Maps for Sequential Manipulation Planning Svetlana Levit et.al. 2507.12407 null
2025-07-16 Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics Muleilan Pei et.al. 2507.12083 null
2025-07-16 IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving Kanghyun Ryu et.al. 2507.11940 null
2025-07-16 A Fast Method for Planning All Optimal Homotopic Configurations for Tethered Robots and Its Extended Applications Jinyuan Liu et.al. 2507.11880 null
2025-07-15 MPC-based Coarse-to-Fine Motion Planning for Robotic Object Transportation in Cluttered Environments Chen Cai et.al. 2507.11211 null
2025-07-15 Enhancing Autonomous Manipulator Control with Human-in-loop for Uncertain Assembly Environments Ashutosh Mishra et.al. 2507.11006 null
2025-07-15 OffsetCrust: Variable-Radius Offset Approximation with Power Diagrams Zihan Zhao et.al. 2507.10924 null
2025-07-15 Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets Savva Morozov et.al. 2507.10878 null
2025-07-14 A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments Yuchen Wang et.al. 2507.10792 null
2025-07-23 Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis Yue Ding et.al. 2507.10382 null
2025-07-16 TOP: Trajectory Optimization via Parallel Optimization towards Constant Time Complexity Jiajun Yu et.al. 2507.10290 null
2025-07-14 MP-RBFN: Learning-based Vehicle Motion Primitives using Radial Basis Function Networks Marc Kaufeld et.al. 2507.10047 null
2025-07-22 Active Probing with Multimodal Predictions for Motion Planning Darshan Gadginmath et.al. 2507.09822 null
2025-07-13 Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions Yuanhong Zheng et.al. 2507.09446 null
2025-07-12 Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields Wondmgezahu Teshome et.al. 2507.09383 null
2025-07-19 Informed Hybrid Zonotope-based Motion Planning Algorithm Peng Xie et.al. 2507.09309 null
2025-07-12 Integrating Planning and Predictive Control Using the Path Feasibility Governor Shu Zhang et.al. 2507.09134 null
2025-07-09 Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination Xishun Liao et.al. 2507.08871 null
2025-07-14 STRAP: Spatial-Temporal Risk-Attentive Vehicle Trajectory Prediction for Autonomous Driving Xinyi Ning et.al. 2507.08563 null
2025-07-11 Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer Francesco De Cristofaro et.al. 2507.08365 null
2025-07-11 Neural Parameter-varying Data-enabled Predictive Control of Cold Atmospheric Pressure Plasma Jets Pegah GhafGhanbari et.al. 2507.08259 null
2025-07-10 GGMotion: Group Graph Dynamics-Kinematics Networks for Human Motion Prediction Shuaijin Wan et.al. 2507.07515 null
2025-07-10 Towards Safe Autonomous Driving: A Real-Time Safeguarding Concept for Motion Planning Algorithms Korbinian Moller et.al. 2507.07444 null
2025-07-09 When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior Chengyuan Zhang et.al. 2507.07012 null
2025-07-09 Robust signal decompositions on the circle Aral Kose et.al. 2507.07007 null
2025-07-09 ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture Mingjin Zeng et.al. 2507.06531 null
2025-07-08 AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization Deepak Raina et.al. 2507.05979 null
2025-07-08 DRO-EDL-MPC: Evidential Deep Learning-Based Distributionally Robust Model Predictive Control for Safe Autonomous Driving Hyeongchan Ham et.al. 2507.05710 null
2025-07-07 From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving Fabian Konstantinidis et.al. 2507.05254 null
2025-07-07 Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance Tobias Demmler et.al. 2507.05098 null
2025-07-07 Unifying Robot Optimization: Monte Carlo Tree Search with Tensor Factorization Teng Xue et.al. 2507.04949 null
2025-07-25 Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning Giwon Lee et.al. 2507.04790 null
2025-07-07 LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction Yixin Yan et.al. 2507.04634 null
2025-07-06 Free-Space Optical Communication-Driven NMPC Framework for Multi-Rotor Aerial Vehicles in Structured Inspection Scenarios Giuseppe Silano et.al. 2507.04443 null
2025-07-05 Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic Jianwei Tang et.al. 2507.04062 null
2025-07-05 Temporal Continual Learning with Prior Compensation for Human Motion Prediction Jianwei Tang et.al. 2507.04060 null
2025-07-05 DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments Qi Chen et.al. 2507.03878 null
2025-07-05 Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs Ishan Khurjekar et.al. 2507.03863 null
2025-07-04 Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues Hanfang Liang et.al. 2507.03365 null
2025-07-03 Trajectory Optimization for Differential Drive Mobile Manipulators via Topological Paths Search and Arc Length-Yaw Parameterization Long Xu et.al. 2507.02761 null
2025-07-03 Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization Caio Azevedo et.al. 2507.02406 null
2025-07-03 Path Planning using a One-shot-sampling Skeleton Map Gabriel O. Flores-Aquino et.al. 2507.02328 null
2025-07-02 GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters Wanjia Zhao et.al. 2507.02085 null
2025-07-09 Test-Time Scaling with Reflective Generative Model Zixiao Wang et.al. 2507.01951 null
2025-07-06 AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction Bin Rao et.al. 2507.01801 null
2025-07-02 Efficient Collision Detection for Long and Slender Robotic Links in Euclidean Distance Fields: Application to a Forestry Crane Marc-Philip Ecker et.al. 2507.01705 null
2025-07-02 LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction Muhammad Atta ur Rahman et.al. 2507.01308 null
2025-07-01 Search-Based Robot Motion Planning With Distance-Based Adaptive Motion Primitives Benjamin Kraljusic et.al. 2507.01198 null
2025-07-01 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations Ying Guo et.al. 2507.00472 null
2025-06-30 Rethink 3D Object Detection from Physical World Satoshi Tanaka et.al. 2507.00190 null
2025-06-30 Epona: Autoregressive Diffusion World Model for Autonomous Driving Kaiwen Zhang et.al. 2506.24113 null
2025-06-30 STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems Mingfei Cheng et.al. 2506.23995 null
2025-06-29 InfGen: Scenario Generation as Next Token Group Prediction Zhenghao Peng et.al. 2506.23316 null
2025-06-29 Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models Maarten Hugenholtz et.al. 2506.23164 null
2025-06-28 Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example Bei Zhou et.al. 2506.22894 null
2025-06-27 Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD Ruthvik Bokkasam et.al. 2506.22111 null
2025-06-27 A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments Akshay Jaitly et.al. 2506.21982 null
2025-06-27 SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model Shuhan Tan et.al. 2506.21976 null
2025-07-14 Ark: An Open-source Python-based Framework for Robot Learning Magnus Dierking et.al. 2506.21628 null
2025-06-26 GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction Muleilan Pei et.al. 2506.21121 null
2025-06-25 Near Time-Optimal Hybrid Motion Planning for Timber Cranes Marc-Philip Ecker et.al. 2506.20314 null
2025-06-24 Trajectory Prediction in Dynamic Object Tracking: A Critical Study Zhongping Dong et.al. 2506.19341 null
2025-06-25 AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation Ziyan Zhao et.al. 2506.19269 null
2025-08-04 Faster Motion Planning via Restarts Nancy Amato et.al. 2506.19016 null
2025-06-23 SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives Yizhou Chen et.al. 2506.18825 null
2025-06-23 Design, fabrication and control of a cable-driven parallel robot Dhruv Sorathiya et.al. 2506.18526 null
2025-06-23 Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances Zhe Zhang et.al. 2506.18410 null
2025-06-23 Selective Social-Interaction via Individual Importance for Fast Human Trajectory Prediction Yota Urano et.al. 2506.18291 null
2025-06-23 Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning Yue Li et.al. 2506.18234 null
2025-06-20 Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation Xiuyu Yang et.al. 2506.17213 null
2025-06-20 Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control Albert H. Li et.al. 2506.17184 null
2025-07-11 Experimental Setup and Software Pipeline to Evaluate Optimization based Autonomous Multi-Robot Search Algorithms Aditya Bhatt et.al. 2506.16710 null