Updated on 2025.08.08
This page is maintained by Leheng Li and lists papers he is interested in. The source code of this site is available here.
3D
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-07 | GAP: Gaussianize Any Point Clouds with Text Guidance | Weiqi Zhang et.al. | 2508.05631 | null |
2025-08-07 | Physically Controllable Relighting of Photographs | Chris Careaga et.al. | 2508.05626 | null |
2025-08-07 | Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity | Yuhan Zhang et.al. | 2508.05609 | null |
2025-08-07 | Robust adaptive fuzzy sliding mode control for trajectory tracking for of cylindrical manipulator | Van Cuong Pham et.al. | 2508.05584 | null |
2025-08-07 | Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis | Kunyu Feng et.al. | 2508.05580 | null |
2025-08-07 | Point cloud segmentation for 3D Clothed Human Layering | Davide Garavaso et.al. | 2508.05531 | null |
2025-08-07 | Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking | Zewei Wu et.al. | 2508.05514 | null |
2025-08-07 | MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips | Shibo Wang et.al. | 2508.05506 | null |
2025-08-07 | Symmetry Understanding of 3D Shapes via Chirality Disentanglement | Weikang Wang et.al. | 2508.05505 | null |
2025-08-07 | Computational Design and Fabrication of Modular Robots with Untethered Control | Manas Bhargava et.al. | 2508.05410 | null |
2025-08-07 | CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation | Hamza Kalisch et.al. | 2508.05375 | null |
2025-08-07 | 3DGabSplat: 3D Gabor Splatting for Frequency-adaptive Radiance Field Rendering | Junyu Zhou et.al. | 2508.05343 | null |
2025-08-07 | CF3: Compact and Fast 3D Feature Fields | Hyunjoon Lee et.al. | 2508.05254 | null |
2025-08-07 | Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer | Junyi Wang et.al. | 2508.05240 | null |
2025-08-07 | EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery | Bingyu Yang et.al. | 2508.05205 | null |
2025-08-07 | Refining Gaussian Splatting: A Volumetric Densification Approach | Mohamed Abdul Gafoor et.al. | 2508.05187 | null |
2025-08-07 | Learning to See and Act: Task-Aware View Planning for Robotic Manipulation | Yongjie Bai et.al. | 2508.05186 | null |
2025-08-07 | FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction | Mohammed Daba et.al. | 2508.05153 | null |
2025-08-07 | FedGIN: Federated Learning with Dynamic Global Intensity Non-linear Augmentation for Organ Segmentation using Multi-modal Images | Sachin Dudda Nagaraju et.al. | 2508.05137 | null |
2025-08-07 | A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding | Mahmoud Chick Zaouali et.al. | 2508.05064 | null |
2025-08-07 | DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion | Yifeng Huang et.al. | 2508.05060 | null |
2025-08-07 | MAG-Nav: Language-Driven Object Navigation Leveraging Memory-Reserved Active Grounding | Weifan Zhang et.al. | 2508.05021 | null |
2025-08-07 | Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion | Shenglun Chen et.al. | 2508.04984 | null |
2025-08-07 | UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS | Zhihao Guo et.al. | 2508.04968 | null |
2025-08-07 | Laplacian Analysis Meets Dynamics Modelling: Gaussian Splatting for 4D Reconstruction | Yifan Zhou et.al. | 2508.04966 | null |
2025-08-07 | Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting | Zijian Wang et.al. | 2508.04965 | null |
2025-08-06 | CryoGS: Gaussian Splatting for Cryo-EM Homogeneous Reconstruction | Suyi Chen et.al. | 2508.04929 | null |
2025-08-06 | LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction | Md Zahidul Hasan et.al. | 2508.04847 | null |
2025-08-06 | Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models | Mehrdad Moradi et.al. | 2508.04818 | null |
2025-08-06 | Occupancy Learning with Spatiotemporal Memory | Ziyang Leng et.al. | 2508.04705 | null |
2025-08-06 | BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning | Ziyang Leng et.al. | 2508.04702 | null |
2025-08-06 | MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics | Ye Pan et.al. | 2508.04687 | null |
2025-08-06 | PixCuboid: Room Layout Estimation from Multi-view Featuremetric Alignment | Gustav Hanning et.al. | 2508.04659 | null |
2025-08-06 | OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment | Tongfan Guan et.al. | 2508.04611 | null |
2025-08-06 | $NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything | Lingfeng Zhang et.al. | 2508.04598 | null |
2025-08-06 | Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline | Linqing Zhao et.al. | 2508.04597 | null |
2025-08-06 | LA-CaRe-CNN: Cascading Refinement CNN for Left Atrial Scar Segmentation | Franz Thaler et.al. | 2508.04553 | null |
2025-08-06 | Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds | Haodong Zhu et.al. | 2508.04508 | null |
2025-08-06 | MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos | Daisheng Jin et.al. | 2508.04505 | null |
2025-08-06 | 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation | Shuzhou Yang et.al. | 2508.04467 | null |
2025-08-06 | Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models | Yinan Yu et.al. | 2508.04406 | null |
2025-08-06 | RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization | Yanyan Li et.al. | 2508.04335 | null |
2025-08-07 | Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research | Ke Li et.al. | 2508.04326 | null |
2025-08-06 | MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction | Yaopeng Lou et.al. | 2508.04297 | null |
2025-08-06 | PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space | Chenlei Lv et.al. | 2508.04286 | null |
2025-08-06 | PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction | Muhua Zhu et.al. | 2508.04236 | null |
2025-08-06 | SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition | Jiahui Li et.al. | 2508.04224 | null |
2025-08-06 | Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification | Jianxun Yu et.al. | 2508.04205 | null |
2025-08-06 | IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control | Lijuan Liu et.al. | 2508.04147 | null |
2025-08-06 | DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting | Zexu Huang et.al. | 2508.04099 | null |
2025-08-06 | Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework | Yi-Ting Chen et.al. | 2508.04090 | null |
2025-08-06 | RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting | Zhan Li et.al. | 2508.04078 | null |
2025-08-06 | Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation | Jiayi He et.al. | 2508.04049 | null |
2025-08-06 | JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation | Zheng Zhang et.al. | 2508.03997 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Veila: Panoramic LiDAR Generation from a Monocular RGB Image | Youquan Liu et.al. | 2508.03690 | null |
2025-08-05 | Inland-LOAM: Voxel-Based Structural Semantic Mapping for Inland Waterways | Zhongbi Luo et.al. | 2508.03672 | null |
2025-08-05 | OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World | Katherine Liu et.al. | 2508.03669 | null |
2025-08-06 | Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images | Xiangyu Sun et.al. | 2508.03643 | null |
2025-08-05 | FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation | Nassim Ali Ousalah et.al. | 2508.03618 | null |
2025-08-05 | CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models | Ana Lawry Aguila et.al. | 2508.03594 | null |
2025-08-05 | Spatial Imputation Drives Cross-Domain Alignment for EEG Classification | Hongjun Liu et.al. | 2508.03437 | null |
2025-08-05 | WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval | Junlong Ren et.al. | 2508.03343 | null |
2025-08-05 | Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion | Wentao Qu et.al. | 2508.03252 | null |
2025-08-05 | Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing | Hongyu Shen et.al. | 2508.03227 | null |
2025-08-05 | Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling | Heng Wu et.al. | 2508.03186 | null |
2025-08-05 | Duplex-GS: Proxy-Guided Weighted Blending for Real-Time Order-Independent Gaussian Splatting | Weihang Liu et.al. | 2508.03180 | null |
2025-08-05 | H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction | Heng Jia et.al. | 2508.03118 | null |
2025-08-05 | Point2Act: Efficient 3D Distillation of Multimodal LLMs for Zero-Shot Context-Aware Grasping | Sang Min Kim et.al. | 2508.03099 | null |
2025-08-05 | RobustGS: Unified Boosting of Feedforward 3D Gaussian Splatting under Low-Quality Conditions | Anran Wu et.al. | 2508.03077 | null |
2025-08-05 | SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation | Bo Zhang et.al. | 2508.03069 | null |
2025-08-05 | A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation | Tongxu Zhang et.al. | 2508.03057 | null |
2025-08-05 | SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting | Liheng Zhang et.al. | 2508.03017 | null |
2025-08-05 | ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion | Meng Zhou et.al. | 2508.03008 | null |
2025-08-05 | GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring | Linji Wang et.al. | 2508.02988 | null |
2025-08-04 | Evaluation of 3D Counterfactual Brain MRI Generation | Pengwei Sun et.al. | 2508.02880 | null |
2025-08-04 | MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model | Tianheng Zhu et.al. | 2508.02858 | null |
2025-08-04 | GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing | Mikołaj Zieliński et.al. | 2508.02831 | null |
2025-08-04 | PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation | Zongyou Yang et.al. | 2508.02806 | null |
2025-08-04 | PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting | Yijun Xu et.al. | 2508.02660 | null |
2025-08-04 | RL-U$^2$Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart Segmentation | Jierui Qu et.al. | 2508.02557 | null |
2025-08-04 | Uncertainty-Aware Perception-Based Control for Autonomous Racing | Jelena Trisovic et.al. | 2508.02494 | null |
2025-08-05 | Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting | Jianchao Wang et.al. | 2508.02493 | null |
2025-08-06 | GR-Gaussian: Graph-Based Radiative Gaussian Splatting for Sparse-View CT Reconstruction | Yikuang Yuluo et.al. | 2508.02408 | null |
2025-08-04 | Correspondence-Free Fast and Robust Spherical Point Pattern Registration | Anik Sarker et.al. | 2508.02339 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-04 | ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering | Fangxin Liu et.al. | 2508.02304 | null |
2025-08-04 | Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection | Jae-Young Kang et.al. | 2508.02288 | null |
2025-08-04 | SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion | Rui Qian et.al. | 2508.02261 | null |
2025-08-04 | GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting | Lei Yao et.al. | 2508.02172 | null |
2025-08-04 | Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes | Tom Fischer et.al. | 2508.02157 | null |
2025-08-04 | ScrewSplat: An End-to-End Method for Articulated Object Recognition | Seungyeon Kim et.al. | 2508.02146 | null |
2025-08-04 | VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling | Yuru Xiao et.al. | 2508.02129 | null |
2025-08-04 | REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification | Hongzhao Chen et.al. | 2508.02104 | null |
2025-08-04 | StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion | Haoxin Yang et.al. | 2508.02056 | null |
2025-08-04 | Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure | Ziling Wang et.al. | 2508.02034 | null |
2025-08-04 | On-the-Fly Object-aware Representative Point Selection in Point Cloud | Xiaoyu Zhang et.al. | 2508.01980 | null |
2025-08-04 | From Photons to Physics: Autonomous Indoor Drones and the Future of Objective Property Assessment | Petteri Teikari et.al. | 2508.01965 | null |
2025-08-03 | Less is More: AMBER-AFNO – a New Benchmark for Lightweight 3D Medical Image Segmentation | Andrea Dosi et.al. | 2508.01941 | null |
2025-08-03 | MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning | Akash Venkateshwaran et.al. | 2508.01907 | null |
2025-08-03 | Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems | Zhongliang Guo et.al. | 2508.01845 | null |
2025-08-03 | OmniEvent: Unified Event Representation Learning | Weiqi Yan et.al. | 2508.01842 | null |
2025-08-03 | Diffusion-based 3D Hand Motion Recovery with Intuitive Physics | Yufei Zhang et.al. | 2508.01835 | null |
2025-08-03 | Skip priors and add graph-based anatomical information, for point-based Couinaud segmentation | Xiaotong Zhang et.al. | 2508.01785 | null |
2025-08-05 | VPN: Visual Prompt Navigation | Shuo Feng et.al. | 2508.01766 | null |
2025-08-03 | AG$^2$aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing | Zhaonan Wang et.al. | 2508.01740 | null |
2025-08-03 | OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping | Danyang Li et.al. | 2508.01723 | null |
2025-08-03 | LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving | Luqi Cheng et.al. | 2508.01704 | null |
2025-08-03 | Register Anything: Estimating “Corresponding Prompts” for Segment Anything Model | Shiqi Huang et.al. | 2508.01697 | null |
2025-08-03 | DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing | Yufeng Chi et.al. | 2508.01684 | null |
2025-08-03 | DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding | Hanqing Wang et.al. | 2508.01651 | null |
2025-08-03 | StrandDesigner: Towards Practical Strand Generation with Sketch Guidance | Na Zhang et.al. | 2508.01650 | null |
2025-08-03 | Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection | Hanxi Li et.al. | 2508.01591 | null |
2025-08-03 | A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction | Hua Yu et.al. | 2508.01585 | null |
2025-08-03 | Deeply Supervised Multi-Task Autoencoder for Biological Brain Age estimation using three dimensional T$_1$-weighted magnetic resonance imaging | Mehreen Kanwal et.al. | 2508.01565 | null |
2025-08-03 | Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion | Sara Shoouri et.al. | 2508.01562 | null |
2025-08-02 | Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning | Jack Zeng et.al. | 2508.01522 | null |
2025-08-02 | EfficientGFormer: Multimodal Brain Tumor Segmentation via Pruned Graph-Augmented Transformer | Fatemeh Ziaeetabar et.al. | 2508.01465 | null |
2025-08-02 | Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians | Quankai Gao et.al. | 2508.01464 | null |
2025-08-02 | Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation | Sikha O K et.al. | 2508.01460 | null |
2025-08-05 | 3DRot: 3D Rotation Augmentation for RGB-Based 3D Tasks | Shitian Yang et.al. | 2508.01423 | null |
2025-08-02 | ReMu: Reconstructing Multi-layer 3D Clothed Human from Image Layers | Onat Vuran et.al. | 2508.01381 | null |
2025-08-02 | P3P Made Easy | Seong Hun Lee et.al. | 2508.01312 | null |
2025-08-02 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al. | 2508.01311 | null |
2025-08-02 | CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis | Alec Sargood et.al. | 2508.01292 | null |
2025-08-02 | Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching | Chuang-Wei Liu et.al. | 2508.01275 | null |
2025-08-05 | MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh | Shuangkang Fang et.al. | 2508.01242 | null |
2025-08-02 | OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS | Han Ling et.al. | 2508.01239 | null |
2025-08-02 | Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system | Jiyong Kim et.al. | 2508.01230 | null |
2025-08-02 | MoGaFace: Momentum-Guided and Texture-Aware Gaussian Avatars for Consistent Facial Geometry | Yujian Liu et.al. | 2508.01218 | null |
2025-08-02 | Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization? | Bolei Chen et.al. | 2508.01216 | null |
2025-08-02 | A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding | Zhan Shi et.al. | 2508.01197 | null |
2025-08-02 | Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning | Xinhang Wan et.al. | 2508.01184 | null |
2025-08-02 | No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2508.01171 | null |
2025-08-02 | DELTAv2: Accelerating Dense 3D Tracking | Tuan Duc Ngo et.al. | 2508.01170 | null |
2025-08-02 | OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding | Dianyi Yang et.al. | 2508.01150 | null |
2025-08-02 | Design of Q8bot: A Miniature, Low-Cost, Dynamic Quadruped Built with Zero Wires | Yufeng Wu et.al. | 2508.01149 | null |
2025-08-02 | UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation | Chaitanya Patel et.al. | 2508.01126 | null |
2025-08-01 | DreamSat-2.0: Towards a General Single-View Asteroid 3D Reconstruction | Santiago Diaz et.al. | 2508.01079 | null |
2025-08-01 | Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation | Fenghe Tang et.al. | 2508.01064 | null |
2025-08-01 | Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans | Theo Di Piazza et.al. | 2508.01045 | null |
2025-08-01 | 3D Reconstruction via Incremental Structure From Motion | Muhammad Zeeshan et.al. | 2508.01019 | null |
2025-08-01 | Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection | Cheng-You Lu et.al. | 2508.01014 | null |
2025-08-01 | Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF | Massoud Pourmandi et.al. | 2508.00967 | null |
2025-07-31 | Investigating Crossing Perception in 3D Graph Visualisation | Ying Zhang et.al. | 2508.00950 | null |
2025-08-01 | IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation | Wenxuan Guo et.al. | 2508.00823 | null |
2025-08-01 | Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning | Alexander Nikitas Dimopoulos et.al. | 2508.00822 | null |
2025-08-01 | GECO: Geometrically Consistent Embedding with Lightspeed Inference | Regine Hartwig et.al. | 2508.00746 | null |
2025-08-01 | Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR | Adwait Chandorkar et.al. | 2508.00744 | null |
2025-08-04 | DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior | Junzhe Lu et.al. | 2508.00599 | null |
2025-08-01 | OmniUnet: A Multimodal Network for Unstructured Terrain Segmentation on Planetary Rovers Using RGB, Depth, and Thermal Imagery | Raul Castilla-Arquillo et.al. | 2508.00580 | null |
2025-08-04 | LesiOnTime – Joint Temporal and Clinical Modeling for Small Breast Lesion Segmentation in Longitudinal DCE-MRI | Mohammed Kamran et.al. | 2508.00496 | null |
2025-08-01 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al. | 2508.00473 | null |
2025-08-01 | Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation | Nan Xiang et.al. | 2508.00428 | null |
2025-08-01 | Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting | Seunggeun Chi et.al. | 2508.00427 | null |
2025-08-01 | Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents | Janika Deborah Gajo et.al. | 2508.00400 | null |
2025-08-01 | Occlusion-robust Stylization for Drawing-based 3D Animation | Sunjae Yoon et.al. | 2508.00398 | null |
2025-08-01 | SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies | Liang Han et.al. | 2508.00366 | null |
2025-08-01 | Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering | Yan Gong et.al. | 2508.00358 | null |
2025-08-01 | Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging | Tianshuang Qiu et.al. | 2508.00354 | null |
2025-08-01 | AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer | Jin Lyu et.al. | 2508.00298 | null |
2025-08-01 | Towards Robust Semantic Correspondence: A Benchmark and Insights | Wenyue Chong et.al. | 2508.00272 | null |
2025-08-05 | Multimodal Referring Segmentation: A Survey | Henghui Ding et.al. | 2508.00265 | null |
2025-08-01 | PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting | Wentao Sun et.al. | 2508.00259 | null |
2025-08-01 | Weakly Supervised Intracranial Aneurysm Detection and Segmentation in MR angiography via Multi-task UNet with Vesselness Prior | Erin Rainville et.al. | 2508.00235 | null |
2025-07-31 | Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs | Bhavya Goyal et.al. | 2508.00169 | null |
2025-07-31 | GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation | Tomasz Szczepański et.al. | 2508.00155 | null |
2025-07-31 | Stress-Aware Resilient Neural Training | Ashkan Shakarami et.al. | 2508.00098 | null |
2025-07-31 | Punching Bag vs. Punching Person: Motion Transferability in Videos | Raiyaan Abdullah et.al. | 2508.00085 | null |
2025-07-31 | Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis | Bowen Zhang et.al. | 2507.23785 | null |
2025-07-31 | Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions | Li Siyao et.al. | 2507.23778 | null |
2025-07-31 | SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting | Di Li et.al. | 2507.23772 | null |
2025-08-05 | Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic | Liu Li et.al. | 2507.23763 | null |
2025-07-31 | Enhanced Velocity Field Modeling for Gaussian Video Reconstruction | Zhenyang Li et.al. | 2507.23704 | null |
2025-07-31 | Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents | Shaofei Cai et.al. | 2507.23698 | null |
2025-07-31 | High-resolution eikonal imaging and uncertainty quantification of the Kilauea caldera | Angela F. Gao et.al. | 2507.23692 | null |
2025-07-31 | I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation | Jialei Chen et.al. | 2507.23683 | null |
2025-07-31 | Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes | Xiaohan Li et.al. | 2507.23677 | null |
2025-07-31 | DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation | Yuchen Zhou et.al. | 2507.23599 | null |
2025-08-02 | MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction | Zijian Dong et.al. | 2507.23597 | null |
2025-07-31 | Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization | Maxime Pietrantoni et.al. | 2507.23569 | null |
2025-07-31 | 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection | Yung-Hsu Yang et.al. | 2507.23567 | null |
2025-08-01 | H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation | Hongzhe Bi et.al. | 2507.23523 | null |
2025-07-31 | Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion | Mutian Xu et.al. | 2507.23483 | null |
2025-07-31 | FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction | Donghyun Lee et.al. | 2507.23480 | null |
2025-07-31 | 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding | Ting Huang et.al. | 2507.23478 | null |
2025-07-31 | NeRF Is a Valuable Assistant for 3D Gaussian Splatting | Shuangkang Fang et.al. | 2507.23374 | null |
2025-07-31 | MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting | Xingyue Peng et.al. | 2507.23340 | null |
2025-08-01 | Training-free Geometric Image Editing on Diffusion Models | Hanshen Zhu et.al. | 2507.23300 | null |
2025-07-31 | iLRM: An Iterative Large 3D Reconstruction Model | Gyeongjin Kang et.al. | 2507.23277 | null |
2025-07-31 | GSFusion: Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting | Jaeseok Park et.al. | 2507.23273 | null |
2025-07-31 | Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2 | Solha Kang et.al. | 2507.23272 | null |
2025-07-30 | Details Matter for Indoor Open-vocabulary 3D Instance Segmentation | Sanghun Jung et.al. | 2507.23134 | null |
2025-07-30 | Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation | Zheyuan Zhang et.al. | 2507.23110 | null |
2025-07-30 | Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation | Alexandru Buburuzan et.al. | 2507.23058 | null |
2025-07-30 | Adaptive Time-step Training for Enhancing Spike-Based Neural Radiance Fields | Ranxi Lin et.al. | 2507.23033 | null |
2025-07-30 | Learning to Prune Branches in Modern Tree-Fruit Orchards | Abhinav Jain et.al. | 2507.23015 | null |
2025-07-30 | Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction | Zhensheng Yuan et.al. | 2507.23006 | null |
2025-07-30 | Viser: Imperative, Web-based 3D Visualization in Python | Brent Yi et.al. | 2507.22885 | null |
2025-07-30 | DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion | Qingcheng Zhao et.al. | 2507.22825 | null |
2025-07-30 | Wall Shear Stress Estimation in Abdominal Aortic Aneurysms: Towards Generalisable Neural Surrogate Models | Patryk Rygiel et.al. | 2507.22817 | null |
2025-07-30 | Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques | Weide Liu et.al. | 2507.22791 | null |
2025-07-30 | Social-Pose: Enhancing Trajectory Prediction with Human Body Pose | Yang Gao et.al. | 2507.22742 | null |
2025-07-30 | A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks | Hang Su et.al. | 2507.22733 | null |
2025-07-30 | Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints | Thuy Tran et.al. | 2507.22699 | null |
2025-07-30 | Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation | Hongbin Lin et.al. | 2507.22668 | null |
2025-07-30 | trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images | MohammadAmin Alamalhoda et.al. | 2507.22635 | null |
2025-07-30 | Estimating 2D Camera Motion with Hybrid Motion Basis | Haipeng Li et.al. | 2507.22480 | null |
2025-07-30 | UAVScenes: A Multi-Modal Dataset for UAVs | Sijie Wang et.al. | 2507.22412 | null |
2025-07-30 | UFV-Splatter: Pose-Free Feed-Forward 3D Gaussian Splatting Adapted to Unfavorable Views | Yuki Fujimura et.al. | 2507.22342 | null |
2025-07-30 | A Segmentation Framework for Accurate Diagnosis of Amyloid Positivity without Structural Images | Penghan Zhu et.al. | 2507.22336 | null |
2025-07-29 | Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception | Christian Ellis et.al. | 2507.22194 | null |
2025-07-29 | Enhancing efficiency in paediatric brain tumour segmentation using a pathologically diverse single-center clinical dataset | A. Piffer et.al. | 2507.22152 | null |
2025-07-29 | Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos | Ziren Gong et.al. | 2507.22052 | null |
2025-07-29 | ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports | Mohammed Baharoon et.al. | 2507.22030 | null |
2025-07-29 | Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images | Yutao Hu et.al. | 2507.22024 | null |
2025-07-29 | XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation | Raju Ningappa Mulawade et.al. | 2507.22020 | null |
2025-07-29 | DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments | Yufei Jia et.al. | 2507.21981 | null |
2025-07-29 | PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction | Jiahui Ren et.al. | 2507.21960 | null |
2025-07-31 | MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors | Shouyi Lu et.al. | 2507.21872 | null |
2025-07-29 | VidFuncta: Towards Generalizable Neural Representations for Ultrasound Videos | Julia Wolleb et.al. | 2507.21863 | null |
2025-07-29 | HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels | HunyuanWorld Team et.al. | 2507.21809 | null |
2025-07-29 | AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion | Zhishu Liu et.al. | 2507.21778 | null |
2025-07-29 | Multi-UAV Deployment in Obstacle-Cluttered Environments with LOS Connectivity | Yuda Chen et.al. | 2507.21772 | null |
2025-07-30 | No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering | Linye Wei et.al. | 2507.21572 | null |
2025-07-29 | Multi-View Reconstruction with Global Context for 3D Anomaly Detection | Yihan Sun et.al. | 2507.21555 | null |
2025-07-29 | LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments | Junhao Chen et.al. | 2507.21517 | null |
2025-07-29 | ST-DAI: Single-shot 2.5D Spatial Transcriptomics with Intra-Sample Domain Adaptive Imputation for Cost-efficient 3D Reconstruction | Jiahe Qian et.al. | 2507.21516 | null |
2025-07-29 | BANG: Dividing 3D Assets via Generative Exploded Dynamics | Longwen Zhang et.al. | 2507.21493 | null |
2025-07-29 | Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval | Zhichuan Wang et.al. | 2507.21489 | null |
2025-07-28 | Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View | Zitong Zhang et.al. | 2507.21371 | null |
2025-08-03 | Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy | Jicheng Yuan et.al. | 2507.21358 | null |
2025-07-28 | DEM-NeRF: A Neuro-Symbolic Method for Scientific Discovery through Physics-Informed Simulation | Wenkai Tan et.al. | 2507.21350 | null |
2025-07-28 | GLCP: Global-to-Local Connectivity Preservation for Tubular Structure Segmentation | Feixiang Zhou et.al. | 2507.21328 | null |
2025-07-28 | VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction | Martin de La Gorce et.al. | 2507.21311 | null |
2025-07-28 | Fluidically Innervated Lattices Make Versatile and Durable Tactile Sensors | Annan Zhang et.al. | 2507.21225 | null |
2025-08-03 | Reconstructing 4D Spatial Intelligence: A Survey | Yukang Cao et.al. | 2507.21045 | null |
2025-07-28 | GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction | Tianhao Li et.al. | 2507.20963 | null |
2025-07-28 | $S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping | Ruoyu Fan et.al. | 2507.20854 | null |
2025-07-28 | An Efficient Machine Learning Framework for Forest Height Estimation from Multi-Polarimetric Multi-Baseline SAR data | Francesca Razzano et.al. | 2507.20798 | null |
2025-07-28 | KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video | Zhuoer Yin et.al. | 2507.20763 | null |
2025-07-28 | Methods for the Segmentation of Reticular Structures Using 3D LiDAR Data: A Comparative Evaluation | Francisco J. Soler Mora et.al. | 2507.20589 | null |
2025-07-28 | M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast | Jiacheng Lu et.al. | 2507.20582 | null |
2025-07-28 | Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation | Hyung Kyu Kim et.al. | 2507.20568 | null |
2025-07-28 | MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization | Hyung Kyu Kim et.al. | 2507.20562 | null |
2025-07-28 | Uni-Mapper: Unified Mapping Framework for Multi-modal LiDARs in Complex and Dynamic Environments | Gilhwan Kang et.al. | 2507.20538 | null |
2025-07-28 | Enhancing Spatial Reasoning through Visual and Textual Thinking | Xun Liang et.al. | 2507.20529 | null |
2025-07-28 | GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections | Haiyang Bai et.al. | 2507.20512 | null |
2025-07-28 | Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features | Shiyang Liu et.al. | 2507.20480 | null |
2025-07-29 | From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos | Chenjian Gao et.al. | 2507.20331 | null |
2025-07-27 | Decomposing Densification in Gaussian Splatting for Faster 3D Scene Reconstruction | Binxiao Huang et.al. | 2507.20239 | null |
2025-07-27 | NeuroVoxel-LM: Language-Aligned 3D Perception via Dynamic Voxelization and Meta-Embedding | Shiyu Liu et.al. | 2507.20110 | null |
2025-07-26 | High-Speed Event Vision-Based Tactile Roller Sensor for Large Surface Measurements | Akram Khairi et.al. | 2507.19914 | null |
2025-07-30 | RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection | Xiaokai Bai et.al. | 2507.19856 | null |
2025-07-26 | Taking Language Embedded 3D Gaussian Splatting into the Wild | Yuze Wang et.al. | 2507.19830 | null |
2025-07-25 | GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting | David Bauer et.al. | 2507.19718 | null |
2025-07-25 | DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations | Ziren Gong et.al. | 2507.19474 | null |
2025-07-25 | Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization | Pol Francesch Huc et.al. | 2507.19459 | null |
2025-07-25 | NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography | Kirsten W. H. Maas et.al. | 2507.19328 | null |
2025-07-25 | 3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering | Wei-Hsing Huang et.al. | 2507.19133 | null |
2025-07-25 | Gaussian Set Surface Reconstruction through Per-Gaussian Optimization | Zhentao Huang et.al. | 2507.18923 | null |
2025-07-24 | SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time | Yun Chen et.al. | 2507.18713 | null |
2025-07-24 | Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping | Chong Cheng et.al. | 2507.18541 | null |
2025-07-24 | G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM | Gyuhyeon Pak et.al. | 2507.18344 | null |
2025-07-24 | LONG3R: Long Sequence Streaming 3D Reconstruction | Zhuoguang Chen et.al. | 2507.18255 | null |
2025-07-24 | PS-GS: Gaussian Splatting for Multi-View Photometric Stereo | Yixiao Chen et.al. | 2507.18231 | null |
2025-07-24 | High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details | Jun Zhou et.al. | 2507.18023 | null |
2025-07-24 | Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners | Kostas Karakontis et.al. | 2507.17519 | null |
2025-07-23 | Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field | Yuzhe Zhu et.al. | 2507.17351 | null |
2025-07-23 | Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting | Hyeongmin Lee et.al. | 2507.17336 | null |
2025-07-24 | PolarAnything: Diffusion-based Polarimetric Image Synthesis | Kailong Zhang et.al. | 2507.17268 | null |
2025-07-22 | StreamME: Simplify 3D Gaussian Avatar within Live Stream | Luchuan Song et.al. | 2507.17029 | null |
2025-07-22 | VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences | Kai Deng et.al. | 2507.16443 | null |
2025-07-22 | Sparse-View 3D Reconstruction: Recent Advances and Open Challenges | Tanveer Younis et.al. | 2507.16406 | null |
2025-07-22 | Dens3R: A Foundation Model for 3D Geometry Prediction | Xianze Fang et.al. | 2507.16290 | null |
2025-07-22 | LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images | Guichen Huang et.al. | 2507.16144 | null |
2025-07-21 | Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS | Jisu Shin et.al. | 2507.15748 | null |
2025-07-21 | DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting | Hung Nguyen et.al. | 2507.15690 | null |
2025-07-21 | Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing | Boni Hu et.al. | 2507.15683 | null |
2025-07-21 | Gaussian Splatting with Discretized SDF for Relightable Assets | Zuo-Liang Zhu et.al. | 2507.15629 | null |
2025-07-28 | SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting | Zihui Gao et.al. | 2507.15602 | null |
2025-07-21 | ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting | Ruijie Zhu et.al. | 2507.15454 | null |
2025-07-25 | GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing | Minnan Pei et.al. | 2507.15300 | null |
2025-07-20 | 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline | Kaishva Chintan Shah et.al. | 2507.14924 | null |
2025-07-20 | Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction | Xiufeng Huang et.al. | 2507.14921 | null |
2025-07-20 | An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks | Xinyi Wu et.al. | 2507.14798 | null |
2025-07-30 | Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey | Jiahui Zhang et.al. | 2507.14501 | null |
2025-07-19 | Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation | Han Gong et.al. | 2507.14454 | null |
2025-07-19 | Adaptive 3D Gaussian Splatting Video Streaming | Han Gong et.al. | 2507.14432 | null |
2025-08-01 | C-DOG: Multi-View Multi-instance Feature Association Using Connected δ-Overlap Graphs | Yung-Hong Sun et.al. | 2507.14095 | null |
2025-07-18 | TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views | Hsiang-Hui Hung et.al. | 2507.13929 | null |
2025-07-18 | Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading | Efstratios Geronikolakis et.al. | 2507.13917 | null |
2025-07-21 | PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations | Yu Wei et.al. | 2507.13891 | null |
2025-07-18 | EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation | Seungjun Moon et.al. | 2507.13648 | null |
2025-07-18 | Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation | Masahiro Ogawa et.al. | 2507.13628 | null |
2025-07-19 | AutoPartGen: Autogressive 3D Part Generation and Discovery | Minghao Chen et.al. | 2507.13346 | null |
2025-07-16 | VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians | Siyuan Yao et.al. | 2507.12667 | null |
2025-07-16 | NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting | Kuangshi Ai et.al. | 2507.12621 | null |
2025-07-21 | Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition | Beizhen Zhao et.al. | 2507.12498 | null |
2025-07-19 | SpatialTrackerV2: 3D Point Tracking Made Easy | Yuxi Xiao et.al. | 2507.12462 | null |
2025-07-16 | Revealing the Ancient Beauty: Digital Reconstruction of Temple Tiles using Computer Vision | Arkaprabha Basu et.al. | 2507.12195 | null |
2025-07-16 | DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi | Navid Hasanzadeh et.al. | 2507.12132 | null |
2025-07-16 | BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images | Davide Di Nucci et.al. | 2507.12095 | null |
2025-07-16 | SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation | Beining Xu et.al. | 2507.12027 | null |
2025-07-16 | HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing | Tielong Wang et.al. | 2507.11971 | null |
2025-07-16 | Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark | Jingqian Wu et.al. | 2507.11931 | null |
2025-07-16 | CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning | Peiwen Xia et.al. | 2507.11834 | null |
2025-07-15 | Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Zhen Xu et.al. | 2507.11540 | null |
2025-07-21 | Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling | Hayeon Kim et.al. | 2507.11061 | null |
2025-07-14 | ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions | Shivangi Aneja et.al. | 2507.10542 | null |
2025-07-14 | Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry | Geyou Zhang et.al. | 2507.10009 | null |
2025-07-19 | 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving | Yixun Zhang et.al. | 2507.09993 | null |
2025-07-14 | VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling | Zihang Zeng et.al. | 2507.09987 | null |
2025-07-11 | From images to properties: a NeRF-driven framework for granular material parameter inversion | Cheng-Hsi Hsiao et.al. | 2507.09005 | null |
2025-07-11 | An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan | Mengyuan Liu et.al. | 2507.08690 | null |
2025-07-11 | Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance | Gábor Baranyi et.al. | 2507.08624 | null |
2025-07-11 | Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT | Wei Zhang et.al. | 2507.08448 | null |
2025-07-11 | RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting | Ji Hyun Seo et.al. | 2507.08434 | null |
2025-07-11 | CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations | Wenbo Cui et.al. | 2507.08262 | null |
2025-07-10 | Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction | Hyungjun Doh et.al. | 2507.08137 | null |
2025-07-18 | RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration | Chong Cheng et.al. | 2507.08136 | null |
2025-07-10 | Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions | Longfei Li et.al. | 2507.07978 | null |
2025-07-10 | RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection | Yongyang Zhou et.al. | 2507.07733 | null |
Diffusion
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2025-08-07 | Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation | Yue Liao et.al. | 2508.05635 | null |
2025-08-07 | GAP: Gaussianize Any Point Clouds with Text Guidance | Weiqi Zhang et.al. | 2508.05631 | null |
2025-08-07 | Latent Space Diffusion for Topology Optimization | Aaron Lutheran et.al. | 2508.05624 | null |
2025-08-07 | Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision | Luozheng Qin et.al. | 2508.05606 | null |
2025-08-07 | Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations | Hanzeng Guo et.al. | 2508.05598 | null |
2025-08-07 | Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis | Yifan Wang et.al. | 2508.05572 | null |
2025-08-07 | MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips | Shibo Wang et.al. | 2508.05506 | null |
2025-08-07 | Heat and super-diffusive melting fronts in unsaturated porous media | Eirik G. Flekkøy et.al. | 2508.05451 | null |
2025-08-07 | Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI | Krzysztof Janowicz et.al. | 2508.05432 | null |
2025-08-07 | MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow | Md Atik Ahamed et.al. | 2508.05411 | null |
2025-08-07 | UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation | Wonjun Kang et.al. | 2508.05399 | null |
2025-08-07 | Real-Time Iteration Scheme for Diffusion Policy | Yufei Duan et.al. | 2508.05396 | null |
2025-08-07 | Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms | Jie Xiao et.al. | 2508.05387 | null |
2025-08-07 | Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising | Xiaoxi Cui et.al. | 2508.05352 | null |
2025-08-07 | Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties | Susmita Chowdhury et.al. | 2508.05330 | null |
2025-08-07 | Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting | Frank Ruis et.al. | 2508.05323 | null |
2025-08-07 | Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces | Mathias Rose Bjare et.al. | 2508.05306 | null |
2025-08-07 | SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens | Nikita Dragunov et.al. | 2508.05305 | null |
2025-08-07 | An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods | Emil Løvbak et.al. | 2508.05303 | null |
2025-08-07 | Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection | Xiaoyang Zhang et.al. | 2508.05271 | null |
2025-08-07 | B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding | Changho Choi et.al. | 2508.05269 | null |
2025-08-07 | SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion | Xiaoyang Zhang et.al. | 2508.05264 | null |
2025-08-07 | ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models | Yatong Lan et.al. | 2508.05236 | null |
2025-08-07 | Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces | Joly Romain et.al. | 2508.05220 | null |
2025-08-07 | An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling | Junming Duan et.al. | 2508.05166 | null |
2025-08-07 | RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer | Fangyu Du et.al. | 2508.05115 | null |
2025-08-07 | PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation | Jingxuan He et.al. | 2508.05091 | null |
2025-08-07 | MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design | Hao Li et.al. | 2508.05076 | null |
2025-08-07 | Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation | Yongfu Zha et.al. | 2508.05074 | null |
2025-08-07 | FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer | Jian Zhu et.al. | 2508.05069 | null |
2025-08-07 | DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion | Yifeng Huang et.al. | 2508.05060 | null |
2025-08-07 | Observation of Super-ballistic Brownian Motion in Liquid | Jason Boynewicz et.al. | 2508.05031 | null |
2025-08-07 | Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere | Jeehyun Yang et.al. | 2508.05007 | null |
2025-08-07 | Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity | Fubao Xi et.al. | 2508.04997 | null |
2025-08-07 | REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers | Yuepeng Jiang et.al. | 2508.04996 | null |
2025-08-07 | Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression | Zheng Chen et.al. | 2508.04979 | null |
2025-08-06 | Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids | Cal J. Rising et.al. | 2508.04930 | null |
2025-08-06 | Taxonomy of Faults in Attention-Based Neural Networks | Sigma Jahan et.al. | 2508.04925 | null |
2025-08-06 | Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model | Luis Morales-Navarro et.al. | 2508.04902 | null |
2025-08-06 | The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models | Leo Zhang et.al. | 2508.04884 | null |
2025-08-06 | Unified Flow Matching for Long Horizon Event Forecasting | Xiao Shou et.al. | 2508.04843 | null |
2025-08-06 | Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off | Seungyong Lee et.al. | 2508.04825 | null |
2025-08-06 | Delay-constrained re-entry governs large-scale brain seizures and other network pathologies | Paul Triebkorn et.al. | 2508.04824 | null |
2025-08-06 | Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models | Mehrdad Moradi et.al. | 2508.04818 | null |
2025-08-06 | Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach | Anderson O. Calixto et.al. | 2508.04809 | null |
2025-08-06 | Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture | Bernard Parent et.al. | 2508.04806 | null |
2025-08-06 | ACM Multimedia Grand Challenge on ENT Endoscopy Analysis | Trong-Thuan Nguyen et.al. | 2508.04801 | null |
2025-08-06 | Quantum-impurity sensing of altermagnetic order | V. A. S. V. Bittencourt et.al. | 2508.04788 | null |
2025-08-06 | Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC) | Nan Li et.al. | 2508.04745 | null |
2025-08-06 | A colossal dielectric response of HfxZr1-xO2 nanoparticles | Oleksandr S. Pylypchuk et.al. | 2508.04697 | null |
2025-08-06 | Diffusion in a $d$-dimensional rough potential | Jacob Jeffries et.al. | 2508.04674 | null |
2025-08-06 | HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models | Young D. Kwon et.al. | 2508.04663 | null |
2025-08-06 | Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics | Lars Torbjørn Stutzer et.al. | 2508.04647 | null |
2025-08-06 | A unified model for linear responses of physical networks | José M. Ortiz-Tavárez et.al. | 2508.04616 | null |
2025-08-06 | Multitask Learning with Stochastic Interpolants | Hugo Negrel et.al. | 2508.04605 | null |
2025-08-07 | A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI | Nicola Casali et.al. | 2508.04588 | null |
2025-08-06 | Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming | A. Tarik Leblebici et.al. | 2508.04570 | null |
2025-08-06 | DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling | Yijie Li et.al. | 2508.04568 | null |
2025-08-06 | TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning | Yunbi Liu et.al. | 2508.04565 | null |
2025-08-06 | Drone Detection with Event Cameras | Gabriele Magrini et.al. | 2508.04564 | null |
2025-08-06 | One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose | Jinxi Liu et.al. | 2508.04559 | null |
2025-08-06 | Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis | Angang Zhang et.al. | 2508.04551 | null |
2025-08-06 | MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning | Quang-Trung Truong et.al. | 2508.04549 | null |
2025-08-06 | X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids | P. G. Heighway et.al. | 2508.04525 | null |
2025-08-06 | $β$-Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes | José A. S. Laranjeira et.al. | 2508.04506 | null |
2025-08-06 | QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution | Bowen Chai et.al. | 2508.04485 | null |
2025-08-06 | Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model | Hongxu Chen et.al. | 2508.04472 | null |
2025-08-06 | 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation | Shuzhou Yang et.al. | 2508.04467 | null |
2025-08-06 | Case Studies of Generative Machine Learning Models for Dynamical Systems | Nachiket U. Bapat et.al. | 2508.04459 | null |
2025-08-06 | Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach | Alvaro Garrido Perez et.al. | 2508.04435 | null |
2025-08-06 | Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis | Ethan Dack et.al. | 2508.04429 | null |
2025-08-06 | Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations | Nick Vogeley et.al. | 2508.04364 | null |
2025-08-06 | Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting | Eberhard Bänsch et.al. | 2508.04360 | null |
2025-08-06 | From Split to Share: Private Inference with Distributed Feature Sharing | Zihan Liu et.al. | 2508.04346 | null |
2025-08-06 | Performative Market Making | Charalampos Kleitsikas et.al. | 2508.04344 | null |
2025-08-06 | TempFlow-GRPO: When Timing Matters for GRPO in Flow Models | Xiaoxuan He et.al. | 2508.04324 | null |
2025-08-06 | Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation | Miquel Cantallops et.al. | 2508.04319 | null |
2025-08-06 | Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations | Margaux Boxho et.al. | 2508.04318 | null |
2025-08-06 | Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions | Yuga Iguchi et.al. | 2508.04287 | null |
2025-08-06 | S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge | JinYi Yoon et.al. | 2508.04271 | null |
2025-08-06 | Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications | Vladislav Pimanov et.al. | 2508.04261 | null |
2025-08-06 | High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting | Zhiren Ma et.al. | 2508.04259 | null |
2025-08-06 | Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions | Nikolaos A. Burger et.al. | 2508.04244 | null |
2025-08-06 | PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction | Muhua Zhu et.al. | 2508.04236 | null |
2025-08-06 | DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification | Saifullah Saifullah et.al. | 2508.04233 | null |
2025-08-06 | Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction | Yu Liu et.al. | 2508.04229 | null |
2025-08-06 | LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation | Kangrui Cen et.al. | 2508.04228 | null |
2025-08-06 | DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models | Saifullah Saifullah et.al. | 2508.04208 | null |
2025-08-06 | A background-free signal of jet-induced diffusion wake in quark-gluon plasma | Zhong Yang et.al. | 2508.04194 | null |
2025-08-06 | Deeper Inside Deep ViT | Sungrae Hong et.al. | 2508.04181 | null |
2025-08-06 | Quasi-Clique Discovery via Energy Diffusion | Yu Zhang et.al. | 2508.04174 | null |
2025-08-06 | Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles | Mathis Guéneau et.al. | 2508.04154 | null |
2025-08-06 | IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control | Lijuan Liu et.al. | 2508.04147 | null |
2025-08-06 | Polynomial-time sampling despite disorder chaos | Eric Ma et.al. | 2508.04133 | null |
2025-08-06 | Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation | Maximilian Ulmer et.al. | 2508.04122 | null |
2025-08-06 | Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework | Yi-Ting Chen et.al. | 2508.04090 | null |
2025-08-06 | Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes | Pierre Collet et.al. | 2508.04089 | null |
2025-08-06 | Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows | Murray Cutforth et.al. | 2508.04084 | null |
2025-08-06 | POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model | Huipeng Gu et.al. | 2508.04082 | null |
2025-08-06 | Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion | Fangmin Zhao et.al. | 2508.04055 | null |
2025-08-06 | Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation | Jiayi He et.al. | 2508.04049 | null |
2025-08-06 | Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws | L. Miguel Rodrigues et.al. | 2508.04023 | null |
2025-08-07 | S$^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation | Weilun Feng et.al. | 2508.04016 | null |
2025-08-06 | Constructing Generalized Sample Transition Probabilities with Biased Simulations | Yanbin Wang et.al. | 2508.03977 | null |
2025-08-05 | Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm | Lin Zhang et.al. | 2508.03955 | null |
2025-08-05 | Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model | Shen Zhu et.al. | 2508.03925 | null |
2025-08-05 | Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations | R. R. Ashurov et.al. | 2508.03859 | null |
2025-08-05 | VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations | Yifei Zong et.al. | 2508.03839 | null |
2025-08-05 | HPSv3: Towards Wide-Spectrum Human Preference Score | Yuhang Ma et.al. | 2508.03789 | null |
2025-08-05 | LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation | Jianxiong Gao et.al. | 2508.03694 | null |
2025-08-05 | LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences | Ao Liang et.al. | 2508.03692 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Veila: Panoramic LiDAR Generation from a Monocular RGB Image | Youquan Liu et.al. | 2508.03690 | null |
2025-08-05 | OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World | Katherine Liu et.al. | 2508.03669 | null |
2025-08-05 | Rigidity for graph product von Neumann algebras | Camille Horbez et.al. | 2508.03662 | null |
2025-08-05 | DiWA: Diffusion Policy Adaptation with World Models | Akshay L Chandra et.al. | 2508.03645 | null |
2025-08-05 | Likelihood Matching for Diffusion Models | Lei Qian et.al. | 2508.03636 | null |
2025-08-05 | Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion | Shoji Mori et.al. | 2508.03624 | null |
2025-08-05 | Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions | Robert Richardson et.al. | 2508.03617 | null |
2025-08-05 | CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models | Ana Lawry Aguila et.al. | 2508.03594 | null |
2025-08-05 | Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection | Long Qian et.al. | 2508.03539 | null |
2025-08-05 | X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations | Silvia Pellegrini et.al. | 2508.03536 | null |
2025-08-05 | CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation | Kaishen Yuan et.al. | 2508.03535 | null |
2025-08-05 | LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation | Lianwei Yang et.al. | 2508.03485 | null |
2025-08-05 | When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models | Dasol Choi Jihwan Lee et.al. | 2508.03483 | null |
2025-08-05 | Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models | Hyungjin Kim et.al. | 2508.03481 | null |
2025-08-05 | VideoGuard: Protecting Video Content from Unauthorized Editing | Junjie Cao et.al. | 2508.03480 | null |
2025-08-05 | Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation | Zijun Zhan et.al. | 2508.03464 | null |
2025-08-06 | READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation | Haotian Wang et.al. | 2508.03457 | null |
2025-08-05 | Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws | Haruki Takemura et.al. | 2508.03455 | null |
2025-08-05 | RAAG: Ratio Aware Adaptive Guidance | Shangwen Zhu et.al. | 2508.03442 | null |
2025-08-05 | Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN | Shivangi Nigam et.al. | 2508.03415 | null |
2025-08-05 | SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models | Pingchuan Ma et.al. | 2508.03402 | null |
2025-08-05 | Delay-facilitated self-assembly in compartmentalized systems | Severin Angerpointner et.al. | 2508.03383 | null |
2025-08-05 | Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration | Ni Tang et.al. | 2508.03373 | null |
2025-08-05 | A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design | Xinyu Jin et.al. | 2508.03370 | null |
2025-08-05 | GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images | Yifei Sun et.al. | 2508.03357 | null |
2025-08-05 | Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises | Nikos I. Kavallaris et.al. | 2508.03354 | null |
2025-08-06 | Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation | Xunzhi Xiang et.al. | 2508.03334 | null |
2025-08-05 | Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation | Peiyu Wang et.al. | 2508.03320 | null |
2025-08-05 | Thermal Metamaterials for Enhanced Non-Fourier Heat Transport | Harry Mclean et.al. | 2508.03316 | null |
2025-08-05 | The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations | Xinqiu Chen et.al. | 2508.03311 | null |
2025-08-05 | Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation | Jun Luo et.al. | 2508.03300 | null |
2025-08-05 | Investigation on deep learning-based galaxy image translation models | Hengxin Ruan et.al. | 2508.03291 | null |
2025-08-07 | Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the Lp Lq Maximal Regularity Setting | Ken Furukawa et.al. | 2508.03288 | null |
2025-08-07 | Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension | Bao-Ngoc Tran et.al. | 2508.03268 | null |
2025-08-05 | Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation | Gang Dai et.al. | 2508.03256 | null |
2025-08-05 | V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models | Jisoo Kim et.al. | 2508.03254 | null |
2025-08-05 | Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion | Wentao Qu et.al. | 2508.03252 | null |
2025-08-06 | FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles | Xingchao Yang et.al. | 2508.03241 | null |
2025-08-05 | BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models | Yu Pan et.al. | 2508.03221 | null |
2025-08-05 | Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level | Amir Seginer et.al. | 2508.03220 | null |
2025-08-05 | Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance | Eliot Beyler et.al. | 2508.03210 | null |
2025-08-05 | Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models | Muhammed Saeed et.al. | 2508.03199 | null |
2025-08-05 | An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys | Qianxi Zhu et.al. | 2508.03163 | null |
2025-08-05 | SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance | Yanshu Wang et.al. | 2508.03143 | null |
2025-08-05 | UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying | Chengyu Bai et.al. | 2508.03142 | null |
2025-08-05 | Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations | Igor G. Vladimirov et.al. | 2508.03135 | null |
2025-08-05 | Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback | Jingyi Chen et.al. | 2508.03123 | null |
2025-08-05 | Power System Voltage Stability Boundary: Computational Results and Applications | Zhenyao Li et.al. | 2508.03119 | null |
2025-08-05 | T2UE: Generating Unlearnable Examples from Text Descriptions | Xingjun Ma et.al. | 2508.03091 | null |
2025-08-05 | MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation | Youran Zhou et.al. | 2508.03083 | null |
2025-08-05 | Multi-human Interactive Talking Dataset | Zeyu Zhu et.al. | 2508.03050 | null |
2025-08-05 | Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling | Ruixing Zhang et.al. | 2508.03042 | null |
2025-08-05 | Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations | Dimitri Breda et.al. | 2508.03040 | null |
2025-08-05 | MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention | Qi Xie et.al. | 2508.03034 | null |
2025-08-05 | LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning | Jie Lin et.al. | 2508.03024 | null |
2025-08-05 | Generating Light-based Fingerprints for Indoor Localization | Hsun-Yu Lee et.al. | 2508.03011 | null |
2025-08-05 | Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models | Fan Yang et.al. | 2508.03006 | null |
2025-08-05 | Diffusion Models with Adaptive Negative Sampling Without External Resources | Alakh Desai et.al. | 2508.02973 | null |
2025-08-05 | Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver | Jonathan Patsenker et.al. | 2508.02964 | null |
2025-08-04 | X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio | Chenxu Zhang et.al. | 2508.02944 | null |
2025-08-04 | Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators | Sourojit Ghosh et.al. | 2508.02937 | null |
2025-08-06 | A nonstandard finite difference scheme for an SEIQR epidemiological PDE model | Achraf Zinihi et.al. | 2508.02928 | null |
2025-08-04 | Goal-Oriented Adaptive Finite Element Multilevel Quasi-Monte Carlo | Joakim Beck et.al. | 2508.02925 | null |
2025-08-04 | How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution | Minh-Hai Nguyen et.al. | 2508.02923 | null |
2025-08-04 | RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation | Mehrdad Moradi et.al. | 2508.02903 | null |
2025-08-04 | REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport | Farzad Beizaee et.al. | 2508.02889 | null |
2025-08-04 | Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters | Tara Dacunha et.al. | 2508.02837 | null |
2025-08-04 | DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework | Tongchun Zuo et.al. | 2508.02807 | null |
2025-08-04 | NASIM: Revealing the low surface brightness Universe from legacy VISTA data | Elham Saremi et.al. | 2508.02780 | null |
2025-08-04 | D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss | Guowei Zou et.al. | 2508.02644 | null |
2025-08-04 | CAK: Emergent Audio Effects from Minimal Deep Learning | Austin Rockman et.al. | 2508.02643 | null |
2025-08-04 | Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters | Pranshu Maan et.al. | 2508.02638 | null |
2025-08-04 | ReMoMask: Retrieval-Augmented Masked Motion Generation | Zhengdao Li et.al. | 2508.02605 | null |
2025-08-04 | Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction | Yuerong Song et.al. | 2508.02558 | null |
2025-08-04 | From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC | Jingsong Liu et.al. | 2508.02528 | null |
2025-08-06 | xDeepServe: Model-as-a-Service on Huawei CloudMatrix384 | Ao Xiao et.al. | 2508.02520 | null |
2025-08-04 | QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots | Sheng Wu et.al. | 2508.02512 | null |
2025-08-04 | Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference | Lars Dingeldein et.al. | 2508.02509 | null |
2025-08-04 | Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation | Khoa Tuan Nguyen et.al. | 2508.02482 | null |
2025-08-04 | PoseGuard: Pose-Guided Generation with Safety Guardrails | Kongxin Wang et.al. | 2508.02476 | null |
2025-08-04 | Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn$_3$Sn(0001) noncollinear antiferromagnetic films | Surya N. Panda et.al. | 2508.02415 | null |
2025-08-04 | Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion | Yimeng Liu et.al. | 2508.02409 | null |
2025-08-04 | Inference-time Scaling for Diffusion-based Audio Super-resolution | Yizhu Jin et.al. | 2508.02391 | null |
2025-08-04 | Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction | Matus Krajcovic et.al. | 2508.02376 | null |
2025-08-04 | Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory | Marian Lupascu et.al. | 2508.02363 | null |
2025-08-04 | Qwen-Image Technical Report | Chenfei Wu et.al. | 2508.02324 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-05 | LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training | Sikui Zhang et.al. | 2508.02308 | null |
2025-08-05 | Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor | Xiaoliu Guan et.al. | 2508.02240 | null |
2025-08-04 | Abstract Formulation of Mean-Field Models and Propagation of Chaos | Tau Shean Lim et.al. | 2508.02224 | null |
2025-08-04 | A theory of strange metals | Simone Fratini et.al. | 2508.02221 | null |
2025-08-04 | Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference | Yuxuan Song et.al. | 2508.02193 | null |
2025-08-04 | DreamPainter: Image Background Inpainting for E-commerce Scenarios | Sijie Zhao et.al. | 2508.02155 | null |
2025-08-04 | AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models | Die Chen et.al. | 2508.02151 | null |
2025-08-04 | VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling | Yuru Xiao et.al. | 2508.02129 | null |
2025-08-04 | AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation | Zhiwen Li et.al. | 2508.02107 | null |
2025-08-04 | Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis | Kaiyang Ji et.al. | 2508.02106 | null |
2025-08-04 | “Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch | Yiqing Xu et.al. | 2508.02093 | null |
2025-08-04 | Unsupervised Multi-channel Speech Dereverberation via Diffusion | Yulun Wu et.al. | 2508.02071 | null |
2025-08-04 | “Set It Up”: Functional Object Arrangement with Compositional Generative Models | Yiqing Xu et.al. | 2508.02068 | null |
2025-08-04 | StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion | Haoxin Yang et.al. | 2508.02056 | null |
2025-08-04 | Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation | Yuli Liu et.al. | 2508.02050 | null |
2025-08-04 | Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction | Hui Xie et.al. | 2508.02043 | null |
2025-08-04 | Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging | XuHao Yu et.al. | 2508.02025 | null |
2025-08-04 | Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths | Le Tri Dat et.al. | 2508.02024 | null |
2025-08-05 | Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type | Pierluigi Colli et.al. | 2508.02021 | null |
2025-08-04 | Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention | Kyungmin Jo et.al. | 2508.02004 | null |
2025-08-04 | Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization | Yu Lei et.al. | 2508.02002 | null |
2025-08-04 | Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids | Toma Yoneya et.al. | 2508.01991 | null |
2025-08-04 | Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion | Shutong Qiao et.al. | 2508.01987 | null |
2025-08-04 | Diffusion models for inverse problems | Hyungjin Chung et.al. | 2508.01975 | null |
2025-08-03 | Distributed games with jumps: An $\alpha$-potential game approach | Xin Guo et.al. | 2508.01929 | null |
2025-08-03 | On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis | Siamak Kazemzadeh Hannani et.al. | 2508.01890 | null |
2025-08-03 | DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization | Siran Peng et.al. | 2508.01873 | null |
2025-08-05 | Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures | Fanze Kong et.al. | 2508.01854 | null |
2025-08-03 | Diffusion-based 3D Hand Motion Recovery with Intuitive Physics | Yufei Zhang et.al. | 2508.01835 | null |
2025-08-03 | Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder | Runxuan Yang et.al. | 2508.01796 | null |
2025-08-03 | Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus | Peng Gao et.al. | 2508.01794 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-08-03 | Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting | Rui Ding et.al. | 2508.01761 | null |
2025-08-03 | Dynamic Coupling of Infiltration-Soil Moisture Feedback: Emergent Vegetation Patterns in a Water-Vegetation Model | Juan Yan et.al. | 2508.01755 | null |
2025-08-03 | Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design | Xiangwang Hou et.al. | 2508.01745 | null |
2025-08-05 | Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization | Xin Ding et.al. | 2508.01725 | null |
2025-08-03 | ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models | Haoyue Tan et.al. | 2508.01719 | null |
2025-08-03 | Versatile Transition Generation with Image-to-Video Diffusion | Zuhao Yang et.al. | 2508.01698 | null |
2025-08-03 | DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing | Yufeng Chi et.al. | 2508.01684 | null |
2025-08-03 | DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding | Hanqing Wang et.al. | 2508.01651 | null |
2025-08-03 | StrandDesigner: Towards Practical Strand Generation with Sketch Guidance | Na Zhang et.al. | 2508.01650 | null |
2025-08-03 | Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization | Shoya Sasaki et.al. | 2508.01640 | null |
2025-08-03 | VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation | Xuanran Zhai et.al. | 2508.01622 | null |
2025-08-03 | LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding | Xuanzhao Dong et.al. | 2508.01617 | null |
2025-08-03 | TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data | Yandong Yan et.al. | 2508.01615 | null |
2025-08-03 | Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models | Haoran Dai et.al. | 2508.01605 | null |
2025-08-03 | Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment | Lubin Gan et.al. | 2508.01602 | null |
2025-08-03 | CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation | Sung-Wook Lee et.al. | 2508.01600 | null |
2025-08-03 | Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching | Juyan Zhang et.al. | 2508.01597 | null |
2025-08-03 | A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation | Hua Yu et.al. | 2508.01590 | null |
2025-08-03 | Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences | Euihyun Kim et.al. | 2508.01589 | null |
2025-08-03 | Diffusion Models for Future Networks and Communications: A Comprehensive Survey | Nguyen Cong Luong et.al. | 2508.01586 | null |
2025-08-03 | Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation | Lei Xie et.al. | 2508.01577 | null |
2025-08-03 | Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature | Xiao-Jie Wang et.al. | 2508.01567 | null |
2025-08-03 | MGCR-Net: Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection | Chengming Wang et.al. | 2508.01555 | null |
2025-08-02 | A Reward-Directed Diffusion Framework for Generative Design Optimization | Hadi Keramati et.al. | 2508.01509 | null |
2025-08-02 | Instruction-based Time Series Editing | Jiaxing Qiu et.al. | 2508.01504 | null |
2025-08-02 | The role of zealots in the spread of linguistic traits | Vivian Dornelas et.al. | 2508.01500 | null |
2025-08-02 | TreeDiff: AST-Guided Code Generation with Diffusion LLMs | Yiming Zeng et.al. | 2508.01473 | null |
2025-08-02 | Regression Augmentation With Data-Driven Segmentation | Shayan Alahyari et.al. | 2508.01455 | null |
2025-08-02 | Physically-based Lighting Augmentation for Robotic Manipulation | Shutong Jin et.al. | 2508.01442 | null |
2025-08-02 | Viscosity Stabilized Plug-and-Play Reconstruction | Arghya Sinha et.al. | 2508.01441 | null |
2025-08-02 | Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling | Le Trong Thanh Bui et.al. | 2508.01436 | null |
2025-08-02 | Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas? | Tarian Fu et.al. | 2508.01408 | null |
2025-08-02 | StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints | Lingxiao Chen et.al. | 2508.01335 | null |
2025-08-05 | Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion | Konstantinos Moutselos et.al. | 2508.01334 | null |
2025-08-02 | LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points | Xuemiao Zhang et.al. | 2508.01317 | null |
2025-08-02 | CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis | Alec Sargood et.al. | 2508.01292 | null |
2025-08-02 | PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation | Zonglei Jing et.al. | 2508.01272 | null |
2025-08-02 | Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling | Lexiao Zou et.al. | 2508.01264 | null |
2025-08-02 | NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection | Jiazhen Yan et.al. | 2508.01248 | null |
2025-08-02 | Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model | Jing Gao et.al. | 2508.01246 | null |
2025-08-02 | Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal | Xiangqi Liu et.al. | 2508.01241 | null |
2025-08-02 | SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches | Cheng Tan et.al. | 2508.01237 | null |
2025-08-02 | Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system | Jiyong Kim et.al. | 2508.01230 | null |
2025-08-02 | StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling | Yuanlin Yang et.al. | 2508.01215 | null |
2025-08-02 | Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory | Nabin Upadhya Dhakal et.al. | 2508.01194 | null |
2025-08-02 | DELTAv2: Accelerating Dense 3D Tracking | Tuan Duc Ngo et.al. | 2508.01170 | null |
2025-08-02 | RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots | Jing Tang et.al. | 2508.01165 | null |
2025-08-02 | LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation | Xinyu Yan et.al. | 2508.01152 | null |
2025-08-02 | Personalized Safety Alignment for Text-to-Image Diffusion Models | Yu Lei et.al. | 2508.01151 | null |
2025-08-02 | Dataset Condensation with Color Compensation | Huyu Wu et.al. | 2508.01139 | null |
2025-08-01 | Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models | Jinsong Li et.al. | 2508.00819 | null |
2025-08-01 | Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding | Rui Chen et.al. | 2508.00800 | null |
2025-08-01 | Video Generators are Robot Policies | Junbang Liang et.al. | 2508.00795 | null |
2025-08-01 | SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation | Kien T. Pham et.al. | 2508.00782 | null |
2025-08-01 | Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data | Timur Sattarov et.al. | 2508.00758 | null |
2025-08-01 | LeakyCLIP: Extracting Training Data from CLIP | Yunhao Chen et.al. | 2508.00756 | null |
2025-08-01 | SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation | Prerana Ramkumar et.al. | 2508.00750 | null |
2025-08-01 | AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation | Le Wang et.al. | 2508.00733 | null |
2025-08-01 | YOLO-Count: Differentiable Object Counting for Text-to-Image Generation | Guanning Zeng et.al. | 2508.00728 | null |
2025-08-01 | Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls | Elisa Affili et.al. | 2508.00713 | null |
2025-08-01 | D3: Training-Free AI-Generated Video Detection Using Second-Order Features | Chende Zheng et.al. | 2508.00701 | null |
2025-08-01 | On-Device Diffusion Transformer Policy for Efficient Robot Manipulation | Yiming Wu et.al. | 2508.00697 | null |
2025-08-01 | Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network | Young-ho Cho et.al. | 2508.00692 | null |
2025-08-01 | Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators | Albert Matveev et.al. | 2508.00643 | null |
2025-08-01 | Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification | Luisa Gallée et.al. | 2508.00639 | null |
2025-08-01 | DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior | Junzhe Lu et.al. | 2508.00599 | null |
2025-08-01 | Wukong Framework for Not Safe For Work Detection in Text-to-Image systems | Mingrui Liu et.al. | 2508.00591 | null |
2025-08-01 | Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints | Jens U. Kreber et.al. | 2508.00558 | null |
2025-08-01 | DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification | Chihan Huang et.al. | 2508.00552 | null |
2025-08-01 | Video Color Grading via Look-Up Table Generation | Seunghyun Shin et.al. | 2508.00548 | null |
2025-08-01 | HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning | Carlo Alessi et.al. | 2508.00491 | null |
2025-08-01 | LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer | Yuzhuo Chen et.al. | 2508.00477 | null |
2025-08-01 | A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces | Leonidas Akritidis et.al. | 2508.00472 | null |
2025-08-01 | Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution | Yiwen Wang et.al. | 2508.00471 | null |
2025-08-01 | AutoDebias: Automated Framework for Debiasing Text-to-Image Models | Hongyi Cai et.al. | 2508.00445 | null |
2025-08-01 | SDMatte: Grafting Diffusion Models for Interactive Matting | Longfei Huang et.al. | 2508.00443 | null |
2025-08-01 | Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection | Sumin Seo et.al. | 2508.00438 | null |
2025-08-01 | Accurate Latent Inversion for Generative Image Steganography via Rectified Flow | Yuqi Qian et.al. | 2508.00434 | null |
2025-08-01 | Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation | Nan Xiang et.al. | 2508.00428 | null |
2025-08-01 | Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting | Seunggeun Chi et.al. | 2508.00427 | null |
2025-08-01 | Collimated QED Cascades with Curved Plasma Mirror | Xuesong Geng et.al. | 2508.00417 | null |
2025-08-01 | DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space | Junyu Chen et.al. | 2508.00413 | null |
2025-08-01 | Sortblock: Similarity-Aware Feature Reuse for Diffusion Model | Hanqi Chen et.al. | 2508.00412 | null |
2025-08-01 | Predictive information criterion for jump diffusion processes | Yuma Uehara et.al. | 2508.00411 | null |
2025-08-01 | Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency | Xi Xue et.al. | 2508.00397 | null |
2025-08-01 | Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization | Yoonhyuk Choi et.al. | 2508.00357 | null |
2025-08-01 | BOOD: Boundary-based Out-Of-Distribution Data Generation | Qilin Liao et.al. | 2508.00350 | null |
2025-08-01 | Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak | SK Injamul Hoque et.al. | 2508.00339 | null |
2025-08-01 | Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems | Surya Narayan Maharana et.al. | 2508.00329 | null |
2025-08-01 | Steering Guidance for Personalized Text-to-Image Diffusion Models | Sunghyun Park et.al. | 2508.00319 | null |
2025-08-01 | GV-VAD: Exploring Video Generation for Weakly-Supervised Video Anomaly Detection | Suhang Cai et.al. | 2508.00312 | null |
2025-08-01 | TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps | Zehui Xu et.al. | 2508.00303 | null |
2025-08-01 | Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence | Danzhen Fu et.al. | 2508.00299 | null |
2025-08-01 | AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer | Jin Lyu et.al. | 2508.00298 | null |
2025-08-01 | TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models | Christian Simon et.al. | 2508.00289 | null |
2025-08-01 | UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents | Jianqiang Xiao et.al. | 2508.00288 | null |
2025-08-01 | Towards Robust Semantic Correspondence: A Benchmark and Insights | Wenyue Chong et.al. | 2508.00272 | null |
2025-08-01 | Jet Image Generation in High Energy Physics Using Diffusion Models | Victor D. Martinez et.al. | 2508.00250 | null |
2025-07-31 | Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b | Thomas Konings et.al. | 2508.00177 | null |
2025-07-31 | DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission | Fupei Guo et.al. | 2508.00172 | null |
2025-07-31 | World Consistency Score: A Unified Metric for Video Generation Quality | Akshat Rakheja et.al. | 2508.00144 | null |
2025-07-31 | Entanglement spreading and emergent locality in Brownian SYK chains | Onkar Parrikar et.al. | 2508.00060 | null |
2025-07-31 | Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion | Tong Nie et.al. | 2508.00037 | null |
2025-07-31 | Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis | Bowen Zhang et.al. | 2507.23785 | null |
2025-07-31 | SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions | Jessica Bader et.al. | 2507.23784 | null |
2025-07-31 | General diffusions on metric graphs as limits of time-space Markov Chains | Alexis Anagnostakis et.al. | 2507.23724 | null |
2025-07-31 | DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching | Emery Pierson et.al. | 2507.23715 | null |
2025-07-31 | CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation | Zhaoyue Xu et.al. | 2507.23693 | null |
2025-07-31 | UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration | Zihan Cheng et.al. | 2507.23685 | null |
2025-07-31 | I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation | Jialei Chen et.al. | 2507.23683 | null |
2025-07-31 | Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics | Alexis Béjar-López et.al. | 2507.23680 | null |
2025-07-31 | DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data | Rabeya Tus Sadia et.al. | 2507.23676 | null |
2025-07-31 | One-Step Flow Policy Mirror Descent | Tianyi Chen et.al. | 2507.23675 | null |
2025-07-31 | Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis | Kunpeng Qiu et.al. | 2507.23652 | null |
2025-07-31 | A stochastic heat equation with non-locally Lipschitz coefficients | Le Chen et.al. | 2507.23637 | null |
2025-07-31 | DivControl: Knowledge Diversion for Controllable Image Generation | Yucheng Xie et.al. | 2507.23620 | null |
2025-08-02 | MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction | Zijian Dong et.al. | 2507.23597 | null |
2025-07-31 | Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization | Michael L. Li et.al. | 2507.23576 | null |
2025-08-01 | H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation | Hongzhe Bi et.al. | 2507.23523 | null |
2025-07-31 | Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings | K. V. Nikolaev et.al. | 2507.23513 | null |
2025-07-31 | Emergence of long-range non-equilibrium correlations in free liquid diffusion | Marco Bussoletti et.al. | 2507.23507 | null |
2025-07-31 | Digital literacy interventions can boost humans in discerning deepfakes | Dominique Geissler et.al. | 2507.23492 | null |
2025-07-31 | Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion | Mutian Xu et.al. | 2507.23483 | null |
2025-07-31 | Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models | Long Chen et.al. | 2507.23443 | null |
2025-07-31 | Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories | Lemar Abdi et.al. | 2507.23411 | null |
2025-07-31 | An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients | Yuan-Yuan Huang et.al. | 2507.23408 | null |
2025-07-31 | UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries | Yijie Zhu et.al. | 2507.23372 | null |
2025-07-31 | IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025 | Radu-Andrei Bourceanu et.al. | 2507.23357 | null |
2025-07-31 | Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads | Yingjie Zhou et.al. | 2507.23343 | null |
2025-07-31 | EMU and the DRAGNs I: A Catalogue of DRAGNs | Ray P. Norris et.al. | 2507.23337 | null |
2025-07-31 | Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions | Kristen C. Dage et.al. | 2507.23332 | null |
2025-07-31 | The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models | Alfio Ferrara et.al. | 2507.23313 | null |
2025-07-31 | PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving | Xuewei Tang et.al. | 2507.23309 | null |
2025-08-01 | Training-free Geometric Image Editing on Diffusion Models | Hanshen Zhu et.al. | 2507.23300 | null |
2025-07-31 | UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing | Hao Tang et.al. | 2507.23278 | null |
2025-07-31 | PixNerd: Pixel Neural Field Diffusion | Shuai Wang et.al. | 2507.23268 | null |
2025-07-31 | Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas | Lei Xie et.al. | 2507.23245 | null |
2025-07-31 | BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks | Zhuoyin Dai et.al. | 2507.23236 | null |
2025-07-31 | Adversarial-Guided Diffusion for Multimodal LLM Attacks | Chengwei Xia et.al. | 2507.23202 | null |
2025-07-30 | X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention | Xiaochen Zhao et.al. | 2507.23143 | null |
2025-07-30 | Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations | Jin Kunwoo Lee et.al. | 2507.23102 | null |
2025-07-30 | Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems | Jonathan Monsalve et.al. | 2507.23065 | null |
2025-07-30 | Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation | Alexandru Buburuzan et.al. | 2507.23058 | null |
2025-07-30 | Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube | Alejandra Granados et.al. | 2507.23040 | null |
2025-07-30 | Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction | Giuseppe Cartella et.al. | 2507.23021 | null |
2025-07-30 | Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods | Siwoo Park et.al. | 2507.23010 | null |
2025-07-30 | LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis | Jamil Fayyad et.al. | 2507.23001 | null |
2025-07-29 | Neural Autoregressive Modeling of Brain Aging | Ridvan Yesiloglu et.al. | 2507.22954 | null |
2025-07-30 | AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS | Hai Ling et.al. | 2507.22880 | null |
2025-07-30 | Robust Contract with Career Concerns | Tan Gan et.al. | 2507.22852 | null |
2025-07-30 | Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication | Yidong Ren et.al. | 2507.22851 | null |
2025-07-30 | DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion | Qingcheng Zhao et.al. | 2507.22825 | null |
2025-07-30 | Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit | Md. Sad Abdullah Sami et.al. | 2507.22803 | null |
2025-07-31 | G-Core: A Simple, Scalable and Balanced RLHF Trainer | Junyu Wu et.al. | 2507.22789 | null |
2025-07-30 | DO-EM: Density Operator Expectation Maximization | Adit Vishnu et.al. | 2507.22786 | null |
2025-08-01 | Next Tokens Denoising for Speech Synthesis | Yanqing Liu et.al. | 2507.22746 | null |
2025-07-30 | Zero-Shot Image Anomaly Detection Using Generative Foundation Models | Lemar Abdi et.al. | 2507.22692 | null |
2025-07-30 | LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing | Federico Girella et.al. | 2507.22627 | null |
2025-07-30 | Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions | Yiting Qu et.al. | 2507.22617 | null |
2025-07-30 | Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model | Daehee Park et.al. | 2507.22615 | null |
2025-07-30 | ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning | Xiefan Guo et.al. | 2507.22604 | null |
2025-07-30 | Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice | Aaqib Zahoor et.al. | 2507.22589 | null |
2025-07-30 | DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement | Chang Huang et.al. | 2507.22501 | null |
2025-07-30 | LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning | Xiang Li et.al. | 2507.22499 | null |
2025-07-30 | Visual Language Models as Zero-Shot Deepfake Detectors | Viacheslav Pirogov et.al. | 2507.22469 | null |
2025-07-30 | TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation | Jiuming Liu et.al. | 2507.22454 | null |
2025-07-30 | GVD: Guiding Video Diffusion Model for Scalable Video Distillation | Kunyang Li et.al. | 2507.22360 | null |
2025-07-29 | Trade-offs in Image Generation: How Do Different Dimensions Interact? | Sicheng Zhang et.al. | 2507.22100 | null |
2025-07-29 | X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again | Zigang Geng et.al. | 2507.22058 | null |
2025-07-30 | See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs | Ziyun Dai et.al. | 2507.22003 | null |
2025-07-29 | Enhancing Generalization in Data-free Quantization via Mixup-class Prompting | Jiwoong Park et.al. | 2507.21947 | null |
2025-07-29 | Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is | Ahmed B Mustafa et.al. | 2507.21820 | null |
2025-07-29 | Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection | Yanxing Liu et.al. | 2507.21816 | null |
2025-07-29 | MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE | Junzhe Li et.al. | 2507.21802 | null |
2025-07-29 | APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing | Sangmin Han et.al. | 2507.21690 | null |
2025-07-29 | GuidPaint: Class-Guided Image Inpainting with Diffusion Models | Qimin Wang et.al. | 2507.21627 | null |
2025-07-29 | Locally Controlled Face Aging with Latent Diffusion Models | Lais Isabelle Alves dos Santos et.al. | 2507.21600 | null |
2025-07-29 | Neural network enabled wide field-of-view imaging with hyperbolic metalenses | Joel Yeo et.al. | 2507.21562 | null |
2025-07-29 | Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance | Mengling Xu et.al. | 2507.21529 | null |
2025-07-29 | BANG: Dividing 3D Assets via Generative Exploded Dynamics | Longwen Zhang et.al. | 2507.21493 | null |
2025-07-29 | Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training | Sodtavilan Odonchimed et.al. | 2507.21452 | null |
2025-07-30 | Multimodal LLMs as Customized Reward Models for Text-to-Image Generation | Shijie Zhou et.al. | 2507.21391 | null |
2025-07-28 | Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation | I-Hsiang Chen et.al. | 2507.21367 | null |
2025-07-28 | A Contrastive Diffusion-based Network (CDNet) for Time Series Classification | Yaoyu Zhang et.al. | 2507.21357 | null |
2025-07-28 | HDR Environment Map Estimation with Latent Diffusion Models | Jack Hilliard et.al. | 2507.21261 | null |
2025-07-28 | Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors | Amartya Banerjee et.al. | 2507.21260 | null |
2025-07-28 | Learning from Limited and Imperfect Data | Harsh Rangwani et.al. | 2507.21205 | null |
2025-08-01 | Flow Matching Policy Gradients | David McAllister et.al. | 2507.21053 | null |
2025-07-29 | JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1 | Xinhan Di et.al. | 2507.20987 | null |
2025-07-28 | Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision | Xiao Fang et.al. | 2507.20976 | null |
Industry
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-07 | CleanUpBench: Embodied Sweeping and Grasping Benchmark | Wenbo Li et.al. | 2508.05543 | null |
2025-08-07 | MedMambaLite: Hardware-Aware Mamba for Medical Image Classification | Romina Aalishah et.al. | 2508.05049 | null |
2025-08-07 | CSRAP: Enhanced Canvas Attention Scheduling for Real-Time Mission Critical Perception | Md Iftekharul Islam Sakib et.al. | 2508.04976 | null |
2025-08-07 | Real-Time Doppler and Ionospheric Dispersion Correction Techniques for Arbitrary Waveforms Utilizing GPU Compute | Daniel J. Vickers et.al. | 2508.04951 | null |
2025-08-05 | AIC CTU@FEVER 8: On-premise fact checking through long context RAG | Herbert Ullrich et.al. | 2508.04390 | null |
2025-08-06 | A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks | Kun Gui et.al. | 2508.04316 | null |
2025-08-06 | Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems | Luai Abuelsamen et.al. | 2508.04146 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Understanding the Landscape of Ampere GPU Memory Errors | Zhu Zhu et.al. | 2508.03513 | null |
2025-08-05 | Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning | Osama Mohammed et.al. | 2508.03251 | null |
2025-08-04 | MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models | Wenyuan Liu et.al. | 2508.02343 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-04 | CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis | Yuzhuang Xu et.al. | 2508.02322 | null |
2025-08-04 | GPU in the Blind Spot: Overlooked Security Risks in Transportation | Sefatun-Noor Puspa et.al. | 2508.01995 | null |
2025-08-03 | Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving | Hunter Schofield et.al. | 2508.01922 | null |
2025-08-02 | A Parallel Algorithm for Finding Robust Spanners in Large Social Networks | Arindam Khanda et.al. | 2508.01485 | null |
2025-08-01 | Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection | Cheng-You Lu et.al. | 2508.01014 | null |
2025-08-01 | Optimal Scheduling Algorithms for LLM Inference: Theory and Practice | Agrim Bari et.al. | 2508.01002 | null |
2025-07-29 | Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling | Rajeev Patwari et.al. | 2508.00904 | null |
2025-08-01 | Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving | Stefan Englmeier et.al. | 2508.00589 | null |
2025-08-01 | On Learning Closed-Loop Probabilistic Multi-Agent Simulator | Juanwu Lu et.al. | 2508.00384 | null |
2025-08-01 | Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization | Belman Jahir Rodriguez et.al. | 2508.00307 | null |
2025-07-31 | FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction | Donghyun Lee et.al. | 2507.23480 | null |
2025-07-31 | InterfO-RAN: Real-Time In-band Cellular Uplink Interference Detection with GPU-Accelerated dApps | Neagin Neasamoni Santhi et.al. | 2507.23177 | null |
2025-07-30 | On the Sustainability of AI Inferences in the Edge | Ghazal Sobhani et.al. | 2507.23093 | null |
2025-07-30 | Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving | Santosh Patapati et.al. | 2507.23042 | null |
2025-07-28 | Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery | Deepak Joshi et.al. | 2507.20680 | null |
2025-07-27 | SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer Pansharpening | Zeyu Xia et.al. | 2507.20311 | null |
2025-07-26 | Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures | Mufakir Qamar Ansari et.al. | 2507.20063 | null |
2025-07-26 | A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling | Louis Sugy et.al. | 2507.19926 | null |
2025-08-02 | GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting | Baijun Ye et.al. | 2507.19451 | null |
2025-07-25 | TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability | Mohammad Aflah Khan et.al. | 2507.19419 | null |
2025-07-25 | LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences | Yusuke Hirota et.al. | 2507.19362 | null |
2025-07-25 | SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models | Zhen Wan et.al. | 2507.19361 | null |
2025-07-25 | High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins | Lorenzo Cazzella et.al. | 2507.19173 | null |
2025-07-24 | SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time | Yun Chen et.al. | 2507.18713 | null |
2025-07-24 | Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping | Chong Cheng et.al. | 2507.18541 | null |
2025-07-24 | Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++ | Giulio Malenza et.al. | 2507.18268 | null |
2025-07-26 | MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation | Zhongzhen Wen et.al. | 2507.17773 | null |
2025-07-23 | BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems | Malsha Ashani Mahawatta Dona et.al. | 2507.17722 | null |
2025-07-24 | Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners | Kostas Karakontis et.al. | 2507.17519 | null |
2025-07-25 | HuNavSim 2.0: An Enhanced Human Navigation Simulator for Human-Aware Robot Navigation | Miguel Escudero-Jiménez et.al. | 2507.17317 | null |
2025-07-23 | GPU Benchmark through QPE Emulator with cuQuantum for Practical Quantum Applications | Takaki Akiba et.al. | 2507.17175 | null |
2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
2025-07-23 | Model Compression Engine for Wearable Devices Skin Cancer Diagnosis | Jacob M. Delgado-López et.al. | 2507.17125 | null |
2025-07-23 | Computer Vision for Real-Time Monkeypox Diagnosis on Embedded Systems | Jacob M. Delgado-López et.al. | 2507.17123 | null |
2025-07-22 | Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems | Imran Latif et.al. | 2507.16781 | null |
2025-07-22 | AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase | Andrei-Leonard Nicusan et.al. | 2507.16710 | null |
2025-07-22 | VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences | Kai Deng et.al. | 2507.16443 | null |
2025-07-21 | MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition | Hanwen Liu et.al. | 2507.15914 | null |
2025-07-30 | GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis | Guoxi Liu et.al. | 2507.15230 | null |
2025-07-19 | Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall | Shayan Rokhva et.al. | 2507.14662 | null |
2025-07-16 | GPU-Accelerated Interpretable Generalization for Rapid Cyberattack Detection and Forensics | Shu-Ting Huang et.al. | 2507.14222 | null |
2025-08-02 | CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning | Xiaoya Li et.al. | 2507.14111 | null |
2025-07-23 | Photonic Fabric Platform for AI Accelerators | Jing Ding et.al. | 2507.14000 | null |
2025-07-18 | Leveraging Multi-Instance GPUs through moldable task scheduling | Jorge Villarrubia et.al. | 2507.13601 | null |
2025-07-17 | Performance Portable Gradient Computations Using Source Transformation | Kim Liegeois et.al. | 2507.13204 | null |
2025-07-16 | MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding | Renjie Li et.al. | 2507.12463 | null |
2025-07-16 | HyDRA: A Hybrid Dual-Mode Network for Closed- and Open-Set RFFI with Optimized VMD | Hanwen Liu et.al. | 2507.12133 | null |
2025-07-16 | PoTPTQ: A Two-step Power-of-Two Post-training for LLMs | Xinyu Wang et.al. | 2507.11959 | null |
2025-07-15 | MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving | Ruihao Li et.al. | 2507.11507 | null |
2025-07-15 | MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit | Yinuo Wang et.al. | 2507.11067 | null |
2025-07-15 | Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems | Sehyun Ryu et.al. | 2507.11064 | null |
2025-07-15 | Modernizing CNN-based Weather Forecast Model towards Higher Computational Efficiency | Minjong Cheon et.al. | 2507.10893 | null |
2025-07-21 | Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks | Aaron Jarmusch et.al. | 2507.10789 | null |
2025-07-14 | A Benchmarking Framework for AI models in Automotive Aerodynamics | Kaustubh Tangsali et.al. | 2507.10747 | null |
2025-07-14 | Quantize-then-Rectify: Efficient VQ-VAE Training | Borui Zhang et.al. | 2507.10547 | null |
2025-07-30 | Designing quantum chemistry algorithms with just-in-time compilation | Xiaojie Wu et.al. | 2507.09772 | null |
2025-07-13 | GeoWarp: An automatically differentiable and GPU-accelerated implicit MPM framework for geomechanics based on NVIDIA Warp | Yidong Zhao et.al. | 2507.09435 | null |
2025-07-12 | Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering | Shucheng Kang et.al. | 2507.09165 | null |
2025-07-10 | Vidyut3d: a GPU accelerated fluid solver for non-equilibrium plasmas on adaptive grids | Hariswaran Sitaraman et.al. | 2507.08200 | null |
2025-07-10 | GPUHammer: Rowhammer Attacks on GPU Memories are Practical | Chris S. Lin et.al. | 2507.08166 | null |
2025-07-03 | Collective Communication Profiling of Modern-day Machine Learning Workloads | Jit Gupta et.al. | 2507.07117 | null |
2025-07-09 | StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception | Marcel Vosshans et.al. | 2507.06687 | null |
2025-07-09 | EA: An Event Autoencoder for High-Speed Vision Sensing | Riadul Islam et.al. | 2507.06459 | null |
2025-07-08 | CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation | Kushal Gajjar et.al. | 2507.06013 | null |
2025-07-07 | Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model | Mengyao Xu et.al. | 2507.05513 | null |
2025-07-07 | Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation | Inayat Rasool et.al. | 2507.05432 | null |
2025-07-23 | Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms | Zhiyi Hu et.al. | 2507.04786 | null |
2025-07-05 | ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments | Guile Wu et.al. | 2507.03886 | null |
2025-07-24 | Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps | Chong Cheng et.al. | 2507.03737 | null |
2025-07-03 | NVIDIA GPU Confidential Computing Demystified | Zhongshu Gu et.al. | 2507.02770 | null |
2025-07-03 | Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources | Roopkatha Banerjee et.al. | 2507.02295 | null |
2025-07-02 | SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan | Fumikazu Konishi et.al. | 2507.02124 | null |
2025-07-02 | Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization | Giuseppe Ruggeri et.al. | 2507.01676 | null |
2025-06-20 | PyTorch-based Geometric Learning with Non-CUDA Processing Units: Experiences from Intel Gaudi-v2 HPUs | Fanchen Bu et.al. | 2507.01031 | null |
2025-07-01 | Anatomy of High-Performance Column-Pivoted QR Decomposition | Maksim Melnichenko et.al. | 2507.00976 | null |
2025-07-01 | Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms | Zain Taufique et.al. | 2507.00491 | null |
2025-07-01 | Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs | Mohammad Firas Sada et.al. | 2507.00418 | null |
2025-07-01 | Question Decomposition for Retrieval-Augmented Generation | Paul J. L. Ammann et.al. | 2507.00355 | null |
2025-06-24 | AdaDeDup: Adaptive Hybrid Data Pruning for Efficient Large-Scale Object Detection Training | Feiyang Kang et.al. | 2507.00049 | null |
2025-06-30 | Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model | Mu-Chi Chen et.al. | 2506.23635 | null |
2025-06-30 | Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset | Tim Puphal et.al. | 2506.23433 | null |
2025-06-29 | CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms | Faaiq Waqar et.al. | 2506.23405 | null |
2025-06-28 | FF-INT8: Efficient Forward-Forward DNN Training on Edge Devices with INT8 Precision | Jingxiao Ma et.al. | 2506.22771 | null |
2025-06-27 | Quantum-Classical Auxiliary Field Quantum Monte Carlo with Matchgate Shadows on Trapped Ion Quantum Computers | Luning Zhao et.al. | 2506.22408 | null |
2025-06-27 | MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism | Zheng Zhang et.al. | 2506.22175 | null |
2025-06-27 | MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators | Zheng Zhang et.al. | 2506.22169 | null |
2025-07-08 | BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting | Zipei Ma et.al. | 2506.22099 | null |
2025-06-27 | SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model | Shuhan Tan et.al. | 2506.21976 | null |
2025-06-23 | TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge | Zhiyuan Zhang et.al. | 2506.21618 | null |
2025-06-26 | SAM4D: Segment Anything in Camera and LiDAR Streams | Jianyun Xu et.al. | 2506.21547 | null |
2025-06-26 | Portable High-Performance Kernel Generation for a Computational Fluid Dynamics Code with DaCe | Måns I. Andersson et.al. | 2506.20994 | null |
2025-06-25 | Characterization and Mitigation of Training Instabilities in Microscaling Formats | Huangyuan Su et.al. | 2506.20752 | null |
2025-06-24 | MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models | Hoa La et.al. | 2506.20686 | null |
2025-06-25 | SuperSONIC: Cloud-Native Infrastructure for ML Inferencing | Dmitry Kondratyev et.al. | 2506.20657 | null |
2025-06-25 | Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking | Ben Kang et.al. | 2506.20381 | null |
2025-06-24 | Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification | Minghao Qin et.al. | 2506.19225 | null |
2025-06-23 | Let Your Video Listen to Your Music! | Xinyu Zhang et.al. | 2506.18881 | null |
2025-06-23 | Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano | Berk Yilmaz et.al. | 2506.18220 | null |
2025-06-22 | AMD Versal Implementations of FAM and SSCA Estimators | Carol Jingyi Li et.al. | 2506.18003 | null |
2025-06-20 | Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms | Kaushik Kulkarni et.al. | 2506.17471 | null |
2025-06-19 | VideoGAN-based Trajectory Proposal for Automated Vehicles | Annajoyce Mariani et.al. | 2506.16209 | null |
2025-06-19 | Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs | Xun Wang et.al. | 2506.16196 | null |
2025-06-19 | HetGPU: The pursuit of making binary compatibility towards GPUs | Yiwei Yang et.al. | 2506.15993 | null |
2025-06-18 | Early Attentive Sparsification Accelerates Neural Speech Transcription | Zifei Xu et.al. | 2506.15912 | null |
2025-06-18 | UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting | Kai He et.al. | 2506.15673 | null |
2025-06-18 | Engineering Supercomputing Platforms for Biomolecular Applications | Robert Welch et.al. | 2506.15585 | null |
2025-07-30 | Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention | Syed Haider Ali et.al. | 2506.15562 | null |
2025-06-17 | Align Your Flow: Scaling Continuous-Time Flow Map Distillation | Amirmojtaba Sabour et.al. | 2506.14603 | null |
2025-06-18 | Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Xuanchi Ren et.al. | 2506.09042 | null |
2025-06-10 | Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions | David Acuna et.al. | 2506.08927 | null |
2025-07-18 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et.al. | 2505.00704 | null |
2025-04-21 | LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception | Yuan-Hong Liao et.al. | 2504.15362 | null |
2025-04-15 | PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond | Minghua Liu et.al. | 2504.11451 | null |
2025-04-17 | VideoPanda: Video Panoramic Diffusion with Multi-view Attention | Kevin Xie et.al. | 2504.11389 | null |
2025-04-01 | Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control | NVIDIA et.al. | 2503.14492 | null |
2025-03-05 | GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control | Xuanchi Ren et.al. | 2503.03751 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-22 | DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models | Ruofan Liang et.al. | 2501.18590 | null |
2025-07-09 | Cosmos World Foundation Model Platform for Physical AI | NVIDIA et.al. | 2501.03575 | null |
2025-06-26 | InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models | Yifan Lu et.al. | 2412.03934 | null |
2025-04-01 | Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos | Hanxue Liang et.al. | 2412.03526 | null |
2024-11-14 | LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang et.al. | 2411.09595 | null |
2025-02-28 | ReMatching Dynamic Reconstruction Flow | Sara Oblak et.al. | 2411.00705 | null |
2024-10-26 | SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Xuanchi Ren et.al. | 2410.20030 | null |
2025-02-11 | SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes | Tianchang Shen et.al. | 2409.20562 | null |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-27 | UniCal: Unified Neural Sensor Calibration | Ze Yang et.al. | 2409.18953 | null |
2024-09-26 | Learning to Drive via Asymmetric Self-Play | Chris Zhang et.al. | 2409.18218 | null |
2024-09-15 | Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao et.al. | 2409.09788 | null |
2025-04-19 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-19 | Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Ruofan Liang et.al. | 2408.09702 | null |
2025-03-20 | Wolf: Dense Video Captioning with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-15 | SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation | Jordan Juravsky et.al. | 2407.10481 | null |
2024-10-10 | 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Nicolas Moenne-Loccoz et.al. | 2407.07090 | null |
2024-07-01 | fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Francis Williams et.al. | 2407.01781 | null |
2024-10-31 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-14 | L4GM: Large 4D Gaussian Reconstruction Model | Jiawei Ren et.al. | 2406.10324 | null |
2024-06-12 | UnO: Unsupervised Occupancy Fields for Perception and Forecasting | Ben Agro et.al. | 2406.08691 | null |
2024-06-12 | Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata | Dongsu Zhang et.al. | 2406.08292 | null |
2024-06-13 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-22 | Align Your Steps: Optimizing Sampling Schedules in Diffusion Models | Amirmojtaba Sabour et.al. | 2404.14507 | null |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2025-05-26 | Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves? | Yuan-Hong Liao et.al. | 2404.06510 | null |
2024-04-01 | QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | Sourav Biswas et.al. | 2404.01486 | null |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-01-22 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models | Koichi Namekata et.al. | 2401.11739 | null |
2023-12-28 | Compact Neural Graphics Primitives with Learned Hash Probing | Towaki Takikawa et.al. | 2312.17241 | null |
2024-01-03 | Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Huan Ling et.al. | 2312.13763 | null |
2023-12-11 | LightSim: Neural Lighting Simulation for Urban Scenes | Ava Pun et.al. | 2312.06654 | null |
2024-04-14 | Trajeglish: Traffic Modeling as Next-Token Prediction | Jonah Philion et.al. | 2312.04535 | null |
2024-06-25 | XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies | Xuanchi Ren et.al. | 2312.03806 | null |
2024-04-12 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570 | null |
2023-11-16 | Adaptive Shells for Efficient Neural Radiance Field Rendering | Zian Wang et.al. | 2311.10091 | null |
2023-11-09 | Real-Time Neural Rasterization for Large Scenes | Jeffrey Yunfan Liu et.al. | 2311.05607 | null |
2023-11-09 | Reconstructing Objects in-the-wild for Realistic Sensor Simulation | Ze Yang et.al. | 2311.05602 | null |
2023-11-07 | 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features | Chenfeng Xu et.al. | 2311.04391 | null |
2023-11-03 | EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Jiawei Yang et.al. | 2311.02077 | null |
2023-11-03 | Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang et.al. | 2311.02007 | null |
2023-11-02 | MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory | Enxu Li et.al. | 2311.01556 | null |
2023-11-17 | 4D-Former: Multimodal 4D Panoptic Segmentation | Ali Athar et.al. | 2311.01520 | null |
2023-11-02 | UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong et.al. | 2311.01448 | null |
2023-11-02 | CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation | Jingkang Wang et.al. | 2311.01447 | null |
2023-11-02 | Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation | Jay Sarva et.al. | 2311.01446 | null |
2023-11-02 | LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds | Anqi Joyce Yang et.al. | 2311.01444 | null |
2023-11-02 | Learning Realistic Traffic Agents in Closed-loop | Chris Zhang et.al. | 2311.01394 | null |
2024-04-01 | Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion | Lunjun Zhang et.al. | 2311.01017 | null |
2024-01-26 | ViR: Towards Efficient Vision Retention Backbones | Ali Hatamizadeh et.al. | 2310.19731 | null |
2023-10-20 | TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models | Tianshi Cao et.al. | 2310.13772 | null |
2023-09-11 | Towards Viewpoint Robustness in Bird’s Eye View Segmentation | Tzofi Klinghoffer et.al. | 2309.05192 | null |
2023-08-10 | Flexible Isosurface Extraction for Gradient-Based Mesh Optimization | Tianchang Shen et.al. | 2308.05371 | null |
2023-08-03 | UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang et.al. | 2308.01898 | null |
2023-08-02 | Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro et.al. | 2308.01471 | null |
2023-07-14 | DreamTeacher: Pretraining Image Backbones with Deep Generative Models | Daiqing Li et.al. | 2307.07487 | null |
2023-06-27 | Rethinking Closed-loop Training for Autonomous Driving | Chris Zhang et.al. | 2306.15713 | null |
2023-06-06 | ATT3D: Amortized Text-to-3D Object Synthesis | Jonathan Lorraine et.al. | 2306.07349 | null |
2023-06-09 | Neural Kernel Surface Reconstruction | Jiahui Huang et.al. | 2305.19590 | null |
2023-08-13 | Neural LiDAR Fields for Novel View Synthesis | Shengyu Huang et.al. | 2305.01643 | null |
2023-04-19 | NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models | Seung Wook Kim et.al. | 2304.09787 | null |
2023-12-28 | Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Andreas Blattmann et.al. | 2304.08818 | null |
2023-04-06 | Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes | Zian Wang et.al. | 2304.03266 | null |
2023-04-04 | Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe et.al. | 2304.01893 | null |
2023-03-25 | VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion | Yiming Li et.al. | 2302.12251 | null |
2023-02-09 | Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting | Viraj Prabhu et.al. | 2302.04832 | null |
2023-02-02 | Synthesizing Physical Character-Scene Interactions | Mohamed Hassan et.al. | 2302.00883 | null |
2023-01-31 | PADL: Language-Directed Physics-Based Character Control | Jordan Juravsky et.al. | 2301.13868 | null |
2023-03-25 | Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin et.al. | 2211.10440 | null |
2022-11-08 | GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting | Alexander Cui et.al. | 2211.02545 | null |
2022-10-12 | LION: Latent Point Diffusion Models for 3D Shape Generation | Xiaohui Zeng et.al. | 2210.06978 | null |
2022-10-06 | XDGAN: Multi-Modal 3D Shape Generation in 2D Space | Hassan Abu Alhaija et.al. | 2210.03007 | null |
2022-10-03 | Optimizing Data Collection for Machine Learning | Rafid Mahmood et.al. | 2210.01234 | null |
2022-09-26 | EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Ahmad Darkhalil et.al. | 2209.13064 | null |
2022-09-22 | GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images | Jun Gao et.al. | 2209.11163 | null |
2022-08-19 | Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion | Zian Wang et.al. | 2208.09480 | null |
2022-08-18 | MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation | Gopal Sharma et.al. | 2208.08580 | null |
2022-07-05 | Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention | Gary Leung et.al. | 2207.02126 | null |
2022-07-13 | How Much More Data Do I Need? Estimating Requirements for Downstream Tasks | Rafid Mahmood et.al. | 2207.01725 | null |
2022-06-19 | Scalable Neural Data Server: A Data Recommender for Transfer Learning | Tianshi Cao et.al. | 2206.09386 | null |
2022-06-16 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry | Wei-Chiu Ma et.al. | 2206.08365 | null |
2022-06-15 | Variable Bitrate Neural Fields | Towaki Takikawa et.al. | 2206.07707 | null |
2022-06-06 | Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps | Seung Wook Kim et.al. | 2206.02903 | null |
2022-05-05 | ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters | Xue Bin Peng et.al. | 2205.01906 | null |
2022-04-19 | M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation | Enze Xie et.al. | 2204.05088 | null |
2022-04-06 | AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis | Zhiqin Chen et.al. | 2204.03105 | null |
Autonomous Driving
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-07 | SMOL-MapSeg: Show Me One Label | Yunshuang Yuan et.al. | 2508.05501 | null |
2025-08-07 | Physical Adversarial Camouflage through Gradient Calibration and Regularization | Jiawei Liang et.al. | 2508.05414 | null |
2025-08-07 | DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model | Rui Yu et.al. | 2508.05402 | null |
2025-08-07 | ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models | Yatong Lan et.al. | 2508.05236 | null |
2025-08-07 | PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems | Qi Guo et.al. | 2508.05167 | null |
2025-08-07 | AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics | Stella Su et.al. | 2508.04955 | null |
2025-08-06 | Occupancy Learning with Spatiotemporal Memory | Ziyang Leng et.al. | 2508.04705 | null |
2025-08-06 | BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning | Ziyang Leng et.al. | 2508.04702 | null |
2025-08-06 | RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case | Baihui Xiao et.al. | 2508.04642 | null |
2025-08-06 | Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark | Xiao Wang et.al. | 2508.04260 | null |
2025-08-06 | DRIVE: Dynamic Rule Inference and Verified Evaluation for Constraint-Aware Autonomous Driving | Longling Geng et.al. | 2508.04066 | null |
2025-08-05 | LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences | Ao Liang et.al. | 2508.03692 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Veila: Panoramic LiDAR Generation from a Monocular RGB Image | Youquan Liu et.al. | 2508.03690 | null |
2025-08-05 | MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention | Qi Xie et.al. | 2508.03034 | null |
2025-08-04 | Context-aware Risk Assessment and Its Application in Autonomous Driving | Boyang Tian et.al. | 2508.02919 | null |
2025-08-04 | MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model | Tianheng Zhu et.al. | 2508.02858 | null |
2025-08-04 | mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera | Byeonggyu Park et.al. | 2508.02348 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-04 | Test-Time Model Adaptation for Quantized Neural Networks | Zeshuai Deng et.al. | 2508.02180 | null |
2025-08-04 | Beyond RGB and Events: Enhancing Object Detection under Adverse Lighting with Monocular Normal Maps | Mingjie Liu et.al. | 2508.02127 | null |
2025-08-04 | Mapillary Vistas Validation for Fine-Grained Traffic Signs: A Benchmark Revealing Vision-Language Model Limitations | Sparsh Garg et.al. | 2508.02047 | null |
2025-08-04 | Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving | Tianyuan Zhang et.al. | 2508.02028 | null |
2025-08-03 | Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving | Hunter Schofield et.al. | 2508.01922 | null |
2025-08-03 | StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding | Haolin Yang et.al. | 2508.01875 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-08-03 | LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving | Luqi Cheng et.al. | 2508.01704 | null |
2025-08-03 | Adverse Weather-Independent Framework Towards Autonomous Driving Perception through Temporal Correlation and Unfolded Regularization | Wei-Bin Kou et.al. | 2508.01583 | null |
2025-08-02 | A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding | Zhan Shi et.al. | 2508.01197 | null |
2025-08-01 | CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception | Chenyi Wang et.al. | 2508.01062 | null |
2025-08-01 | REACT: A Real-Time Edge-AI Based V2X Framework for Accident Avoidance in Autonomous Driving System | Fengze Yang et.al. | 2508.01057 | null |
2025-07-31 | Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems | Shiyao Sang et.al. | 2508.00947 | null |
2025-08-01 | Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR | Adwait Chandorkar et.al. | 2508.00744 | null |
2025-08-01 | Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving | Stefan Englmeier et.al. | 2508.00589 | null |
2025-08-01 | Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection | Marc Hölle et.al. | 2508.00587 | null |
2025-08-01 | Pro2Guard: Proactive Runtime Enforcement of LLM Agent Safety via Probabilistic Model Checking | Haoyu Wang et.al. | 2508.00500 | null |
2025-08-01 | Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence | Danzhen Fu et.al. | 2508.00299 | null |
2025-07-21 | AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks | Ahmet Melih Ince et.al. | 2508.00011 | null |
2025-07-31 | I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation | Jialei Chen et.al. | 2507.23683 | null |
2025-07-31 | DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation | Yuchen Zhou et.al. | 2507.23599 | null |
2025-08-02 | MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction | Zijian Dong et.al. | 2507.23597 | null |
2025-07-31 | A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving | Yi Zhang et.al. | 2507.23540 | null |
2025-07-31 | MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting | Xingyue Peng et.al. | 2507.23340 | null |
2025-07-31 | Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision | Qiang Lu et.al. | 2507.23331 | null |
2025-07-31 | FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models | Yiming Yang et.al. | 2507.23325 | null |
2025-08-02 | FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning | Jiajun Cao et.al. | 2507.23318 | null |
2025-08-04 | PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving | Xuewei Tang et.al. | 2507.23309 | null |
2025-07-30 | Causal-Inspired Multi-Agent Decision-Making via Graph Reinforcement Learning | Jing Wang et.al. | 2507.23080 | null |
2025-08-05 | Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints | Santosh Patapati et.al. | 2507.23064 | null |
2025-07-30 | Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation | Alexandru Buburuzan et.al. | 2507.23058 | null |
2025-08-07 | Bayesian Optimization applied for accelerated Virtual Validation of the Autonomous Driving Function | Satyesh Shanker Awasthi et.al. | 2507.22769 | null |
2025-07-30 | Social-Pose: Enhancing Trajectory Prediction with Human Body Pose | Yang Gao et.al. | 2507.22742 | null |
2025-07-30 | Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model | Daehee Park et.al. | 2507.22615 | null |
2025-07-30 | TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation | Jiuming Liu et.al. | 2507.22454 | null |
2025-07-30 | Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators | Kaustav Chakraborty et.al. | 2507.22389 | null |
2025-07-29 | Hierarchical Game-Based Multi-Agent Decision-Making for Autonomous Vehicles | Mushuang Liu et.al. | 2507.21941 | null |
2025-07-31 | MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors | Shouyi Lu et.al. | 2507.21872 | null |
2025-07-29 | SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking | Qianxiong Xu et.al. | 2507.21732 | null |
2025-07-29 | Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition | Ruiyang Hao et.al. | 2507.21610 | null |
2025-07-29 | SafeDriveRAG: Towards Safe Autonomous Driving with Knowledge Graph-based Retrieval-Augmented Generation | Hao Ye et.al. | 2507.21585 | null |
2025-07-30 | No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering | Linye Wei et.al. | 2507.21572 | null |
2025-07-29 | RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors | Tianhui Cai et.al. | 2507.21567 | null |
2025-07-29 | SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity | Xingyang Li et.al. | 2507.21499 | null |
2025-07-29 | MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving | Thomas Monninger et.al. | 2507.21423 | null |
2025-08-03 | Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy | Jicheng Yuan et.al. | 2507.21358 | null |
2025-07-25 | Seeing Beyond Frames: Zero-Shot Pedestrian Intention Prediction with Raw Temporal Video and Multimodal Cues | Pallavi Zambare et.al. | 2507.21161 | null |
2025-07-28 | GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction | Tianhao Li et.al. | 2507.20963 | null |
2025-07-25 | Event-Based De-Snowing for Autonomous Driving | Manasi Muglikar et.al. | 2507.20901 | null |
2025-07-28 | DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception | Weicheng Zheng et.al. | 2507.20879 | null |
2025-07-27 | Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars | Mattia Piccinini et.al. | 2507.20427 | null |
2025-07-27 | VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving | Levente Tempfli et.al. | 2507.20397 | null |
2025-07-27 | Solving Scene Understanding for Autonomous Navigation in Unstructured Environments | Naveen Mathews Renji et.al. | 2507.20389 | null |
2025-07-27 | VLMPlanner: Integrating Visual Language Models with Motion Planning | Zhipeng Tang et.al. | 2507.20342 | null |
2025-07-27 | MambaMap: Online Vectorized HD Map Construction using State Space Model | Ruizi Yang et.al. | 2507.20224 | null |
2025-07-27 | LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks | Fei Kong et.al. | 2507.20174 | null |
2025-07-27 | Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning | Ziyi Liang et.al. | 2507.20089 | null |
2025-07-26 | Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application | Tongjie Li et.al. | 2507.19974 | null |
2025-07-29 | DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes | Rishav Kumar et.al. | 2507.19912 | null |
2025-07-26 | Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA | Ahmed Abouelazm et.al. | 2507.19883 | null |
2025-07-26 | FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving | Tao Lian et.al. | 2507.19881 | null |
2025-07-30 | RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection | Xiaokai Bai et.al. | 2507.19856 | null |
2025-07-26 | A 4D Radar Camera Extrinsic Calibration Tool Based on 3D Uncertainty Perspective N Points | Chuan Cao et.al. | 2507.19829 | null |
2025-07-25 | PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction | Haichuan Li et.al. | 2507.19701 | null |
2025-07-25 | Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing | Haichuan Li et.al. | 2507.19691 | null |
2025-08-02 | GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting | Baijun Ye et.al. | 2507.19451 | null |
2025-07-25 | An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles | Matthias Weiß et.al. | 2507.19446 | null |
2025-07-25 | SDVDiag: A Modular Platform for the Diagnosis of Connected Vehicle Functions | Matthias Weiß et.al. | 2507.19403 | null |
2025-07-25 | BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving | Felix Brandstaetter et.al. | 2507.19370 | null |
2025-07-25 | LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences | Yusuke Hirota et.al. | 2507.19362 | null |
2025-07-25 | SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence | Viktar Dubovik et.al. | 2507.19321 | null |
2025-07-25 | CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception | Jiaru Zhong et.al. | 2507.19239 | null |
2025-07-25 | VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions | Haoang Lu et.al. | 2507.19188 | null |
2025-07-25 | Continual Learning-Based Unified Model for Unpaired Image Restoration Tasks | Kotha Kartheek et.al. | 2507.19184 | null |
2025-07-25 | Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL | Ahmed Abouelazm et.al. | 2507.19146 | null |
2025-07-31 | PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction | Yanghong Liu et.al. | 2507.19119 | null |
2025-07-25 | Fine-Grained Traffic Inference from Road to Lane via Spatio-Temporal Graph Node Generation | Shuhao Li et.al. | 2507.19089 | null |
2025-07-25 | HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback | Elham Soltani Kazemi et.al. | 2507.18921 | null |
2025-07-24 | Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving | Keshav Gupta et.al. | 2507.18763 | null |
2025-07-24 | Linear Memory SE(2) Invariant Attention | Ethan Pronovost et.al. | 2507.18597 | null |
2025-07-24 | GaussianFusionOcc: A Seamless Sensor Fusion Approach for 3D Occupancy Prediction Using 3D Gaussians | Tomislav Pavković et.al. | 2507.18522 | null |
2025-07-24 | Delving into Mapping Uncertainty for Mapless Trajectory Prediction | Zongzheng Zhang et.al. | 2507.18498 | null |
2025-07-24 | Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments | Xiao Yang et.al. | 2507.18484 | null |
2025-07-24 | CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting | Haoran Xu et.al. | 2507.18473 | null |
2025-07-24 | LONG3R: Long Sequence Streaming 3D Reconstruction | Zhuoguang Chen et.al. | 2507.18255 | null |
2025-07-24 | GenAI for Automotive Software Development: From Requirements to Wheels | Nenad Petrovic et.al. | 2507.18223 | null |
2025-07-24 | Goal-based Trajectory Prediction for improved Cross-Dataset Generalization | Daniel Grimm et.al. | 2507.18196 | null |
2025-07-24 | Policy Disruption in Reinforcement Learning: Adversarial Attack with Large Language Models and Critical State Identification | Junyong Jiang et.al. | 2507.18113 | null |
2025-07-23 | BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems | Malsha Ashani Mahawatta Dona et.al. | 2507.17722 | null |
2025-07-23 | Reusing Attention for One-stage Lane Topology Understanding | Yang Li et.al. | 2507.17617 | null |
2025-07-23 | InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling | Xiaoxue Chen et.al. | 2507.17613 | null |
2025-07-24 | PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving | Maciej K. Wozniak et.al. | 2507.17596 | null |
2025-07-23 | SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving | Chuang Chen et.al. | 2507.17479 | null |
2025-07-23 | VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization | Sania Waheed et.al. | 2507.17455 | null |
2025-07-23 | Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning | Joobin Jin et.al. | 2507.17418 | null |
2025-08-06 | DeMo++: Motion Decoupling for Autonomous Driving | Bozhou Zhang et.al. | 2507.17342 | null |
2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
2025-07-23 | HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study | Mandar Pitale et.al. | 2507.17118 | null |
2025-07-22 | SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction | Zaipeng Duan et.al. | 2507.17083 | null |
2025-07-22 | Few-Shot Learning in Video and 3D Object Detection: A Survey | Md Meftahul Ferdaus et.al. | 2507.17079 | null |
2025-07-22 | Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach | Adithya Mohan et.al. | 2507.17070 | null |
2025-07-22 | Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption | Keneni W. Tesema et.al. | 2507.16743 | null |
2025-07-22 | Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control | Zongzheng Zhang et.al. | 2507.16645 | null |
2025-07-22 | A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System | Lorenzo Gentilini et.al. | 2507.16621 | null |
2025-07-22 | VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences | Kai Deng et.al. | 2507.16443 | null |
2025-07-22 | A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization | Yifan Zhang et.al. | 2507.16177 | null |
2025-07-21 | Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity | Huiling Yang et.al. | 2507.15601 | null |
2025-07-21 | Robots for Kiwifruit Harvesting and Pollination | Jamie Bell et.al. | 2507.15484 | null |
2025-07-21 | VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving | Haichao Liu et.al. | 2507.15266 | null |
2025-07-20 | CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning | Pan Hu et.al. | 2507.14903 | null |
2025-07-23 | GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving | Chi Wan et.al. | 2507.14456 | null |
2025-07-18 | Preference-based Multi-Objective Reinforcement Learning | Ni Mu et.al. | 2507.14066 | null |
2025-07-18 | Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors | Jochen Wulf et.al. | 2507.14034 | null |
2025-07-18 | Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection | Yujian Mo et.al. | 2507.13899 | null |
2025-07-18 | Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation | Max van den Hoven et.al. | 2507.13857 | null |
2025-07-18 | One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion | Haoang Lu et.al. | 2507.13801 | null |
2025-07-18 | AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework | Yu Yao et.al. | 2507.13729 | null |
2025-07-17 | CaSTFormer: Causal Spatio-Temporal Transformer for Driving Intention Prediction | Sirui Wang et.al. | 2507.13425 | null |
2025-07-16 | From Binary to Semantic: Utilizing Large-Scale Binary Occupancy Data for 3D Semantic Occupancy Prediction | Chihiro Noguchi et.al. | 2507.13387 | null |
2025-07-17 | Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models | Arian Mousakhan et.al. | 2507.13162 | null |
2025-07-17 | Channel-wise Motion Features for Efficient Motion Segmentation | Riku Inoue et.al. | 2507.13082 | null |
2025-07-23 | LaViPlan: Language-Guided Visual Path Planning with RLVR | Hayeon Oh et.al. | 2507.12911 | null |
2025-07-17 | World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving | Yanchen Guan et.al. | 2507.12762 | null |
2025-07-17 | Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation | Yanchen Guan et.al. | 2507.12755 | null |
2025-07-16 | ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving | Yuhang Lu et.al. | 2507.12499 | null |
2025-07-16 | MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding | Renjie Li et.al. | 2507.12463 | null |
2025-07-16 | AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models | Santosh Vasa et.al. | 2507.12414 | null |
2025-07-21 | AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving | Jiawei Xu et.al. | 2507.12137 | null |
2025-07-16 | LidarPainter: One-Step Away From Any Lidar View To Novel Guidance | Yuzhou Ji et.al. | 2507.12114 | null |
2025-07-16 | Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics | Muleilan Pei et.al. | 2507.12083 | null |
2025-07-16 | IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving | Kanghyun Ryu et.al. | 2507.11940 | null |
2025-07-16 | Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers | Mohammed Hassanin et.al. | 2507.11852 | null |
2025-07-15 | Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Zhen Xu et.al. | 2507.11540 | null |
2025-07-15 | A Survey on Interpretability in Visual Recognition | Qiyang Wan et.al. | 2507.11099 | null |
2025-07-14 | RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding | Benjamin Stoler et.al. | 2507.10749 | null |
2025-07-14 | Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance | Kyungtae Han et.al. | 2507.10500 | null |
Traffic Simulation
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-07 | TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution | Zhikai Zhao et.al. | 2508.05616 | null |
2025-08-07 | Benchmarking Shortcutting Techniques for Multi-Robot-Arm Motion Planning | Philip Huang et.al. | 2508.05027 | null |
2025-08-06 | LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction | Md Zahidul Hasan et.al. | 2508.04847 | null |
2025-08-06 | BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning | Ziyang Leng et.al. | 2508.04702 | null |
2025-08-06 | Incorporating Stochastic Models of Controller Behavior into Kinodynamic Efficiently Adaptive State Lattices for Mobile Robot Motion Planning in Off-Road Environments | Eric R. Damm et.al. | 2508.04384 | null |
2025-08-06 | Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction | Yu Liu et.al. | 2508.04229 | null |
2025-08-06 | Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems | Luai Abuelsamen et.al. | 2508.04146 | null |
2025-08-05 | Constraint-Preserving Data Generation for Visuomotor Policy Learning | Kevin Lin et.al. | 2508.03944 | null |
2025-08-05 | Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions | Ergi Tushe et.al. | 2508.03541 | null |
2025-08-04 | X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio | Chenxu Zhang et.al. | 2508.02944 | null |
2025-08-04 | MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model | Tianheng Zhu et.al. | 2508.02858 | null |
2025-08-04 | Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering | Xu Wang et.al. | 2508.02362 | null |
2025-08-04 | Adaptive Lattice-based Motion Planning | Abhishek Dhar et.al. | 2508.02350 | null |
2025-08-04 | Framework for Robust Motion Planning of Tethered Multi-Robot Systems in Marine Environments | Markus Buchholz et.al. | 2508.02287 | null |
2025-08-04 | AID4AD: Aerial Image Data for Automated Driving Perception | Daniel Lengerer et.al. | 2508.02140 | null |
2025-08-03 | Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving | Hunter Schofield et.al. | 2508.01922 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-08-03 | A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction | Hua Yu et.al. | 2508.01585 | null |
2025-07-29 | A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles | Jiayuan Wang et.al. | 2508.00917 | null |
2025-08-01 | On Learning Closed-Loop Probabilistic Multi-Agent Simulator | Juanwu Lu et.al. | 2508.00384 | null |
2025-08-01 | TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps | Zehui Xu et.al. | 2508.00303 | null |
2025-07-31 | Data-Driven Motion Planning for Uncertain Nonlinear Systems | Babak Esmaeili et.al. | 2508.00154 | null |
2025-07-31 | OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction | Yang Gao et.al. | 2507.23657 | null |
2025-07-31 | A Framework for Ethical Decision-Making in Automated Vehicles through Human Reasons-based Supervision | Lucas Elbert Suryana et.al. | 2507.23308 | null |
2025-07-31 | Simulation-based planning of Motion Sequences for Automated Procedure Optimization in Multi-Robot Assembly Cells | Loris Schneider et.al. | 2507.23270 | null |
2025-08-01 | Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future | Guoping Xu et.al. | 2507.22792 | null |
2025-07-30 | Social-Pose: Enhancing Trajectory Prediction with Human Body Pose | Yang Gao et.al. | 2507.22742 | null |
2025-07-30 | Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model | Daehee Park et.al. | 2507.22615 | null |
2025-07-30 | Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators | Kaustav Chakraborty et.al. | 2507.22389 | null |
2025-07-27 | Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars | Mattia Piccinini et.al. | 2507.20427 | null |
2025-07-27 | VLMPlanner: Integrating Visual Language Models with Motion Planning | Zhipeng Tang et.al. | 2507.20342 | null |
2025-07-27 | PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks | Clinton Ansun Mo et.al. | 2507.20170 | null |
2025-07-25 | PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction | Haichuan Li et.al. | 2507.19701 | null |
2025-07-25 | RAKOMO: Reachability-Aware K-Order Markov Path Optimization for Quadrupedal Loco-Manipulation | Mattia Risiglione et.al. | 2507.19652 | null |
2025-07-25 | High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins | Lorenzo Cazzella et.al. | 2507.19173 | null |
2025-07-31 | PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction | Yanghong Liu et.al. | 2507.19119 | null |
2025-07-24 | Probabilistic Collision Risk Estimation through Gauss-Legendre Cubature and Non-Homogeneous Poisson Processes | Trent Weiss et.al. | 2507.18819 | null |
2025-07-24 | Delving into Mapping Uncertainty for Mapless Trajectory Prediction | Zongzheng Zhang et.al. | 2507.18498 | null |
2025-07-24 | Goal-based Trajectory Prediction for improved Cross-Dataset Generalization | Daniel Grimm et.al. | 2507.18196 | null |
2025-07-24 | DanceGraph: A Complementary Architecture for Synchronous Dancing Online | David Sinclair et.al. | 2507.18052 | null |
2025-07-23 | Safety Assurance for Quadrotor Kinodynamic Motion Planning | Theodoros Tavoulareas et.al. | 2507.17679 | null |
2025-07-23 | IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception | Haichuan Li et.al. | 2507.17445 | null |
2025-08-06 | DeMo++: Motion Decoupling for Autonomous Driving | Bozhou Zhang et.al. | 2507.17342 | null |
2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
2025-07-23 | Falconry-like palm landing by a flapping-wing drone based on the human gesture interaction and distance-aware flight planning | Kazuki Numazato et.al. | 2507.17144 | null |
2025-07-22 | RAPTAR: Radar Radiation Pattern Acquisition through Automated Collaborative Robotics | Maaz Qureshi et.al. | 2507.16988 | null |
2025-07-21 | Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection | Zihao Chen et.al. | 2507.16109 | null |
2025-07-21 | Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction | Shiyang Li et.al. | 2507.15832 | null |
2025-07-21 | Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs | Ruochu Yang et.al. | 2507.15782 | null |
2025-07-21 | Selective Densification for Rapid Motion Planning in High Dimensions with Narrow Passages | Lu Huang et.al. | 2507.15710 | null |
2025-07-21 | A Universal Vehicle-Trailer Navigation System with Neural Kinematics and Online Residual Learning | Yanbo Chen et.al. | 2507.15607 | null |
2025-07-21 | VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving | Haichao Liu et.al. | 2507.15266 | null |
2025-07-20 | Search-Based Autonomous Vehicle Motion Planning Using Game Theory | Pouya Panahandeh et.al. | 2507.15088 | null |
2025-07-20 | CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning | Pan Hu et.al. | 2507.14903 | null |
2025-07-18 | Context-Aware Behavior Learning with Heuristic Motion Memory for Underwater Manipulation | Markus Buchholz et.al. | 2507.14099 | null |
2025-07-18 | NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning | Qingyi Chen et.al. | 2507.13940 | null |
2025-07-18 | Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification | Sihang Wei et.al. | 2507.13613 | null |
2025-07-16 | InSyn: Modeling Complex Interactions for Pedestrian Trajectory Prediction | Kaiyuan Zhai et.al. | 2507.13397 | null |
2025-07-25 | Signal Temporal Logic Compliant Co-design of Planning and Control | Manas Sashank Juvvi et.al. | 2507.13225 | null |
2025-07-22 | Predictability-Aware Motion Prediction for Edge XR via High-Order Error-State Kalman Filtering | Ziyu Zhong et.al. | 2507.13179 | null |
2025-07-17 | Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning | Giwon Lee et.al. | 2507.12977 | null |
2025-07-17 | FFI-VTR: Lightweight and Robust Visual Teach and Repeat Navigation based on Feature Flow Indicator and Probabilistic Motion Planning | Jikai Wang et.al. | 2507.12800 | null |
2025-07-16 | MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding | Renjie Li et.al. | 2507.12463 | null |
2025-07-16 | Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios | Van-Hoang-Anh Phan et.al. | 2507.12449 | null |
2025-07-16 | Regrasp Maps for Sequential Manipulation Planning | Svetlana Levit et.al. | 2507.12407 | null |
2025-07-16 | Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics | Muleilan Pei et.al. | 2507.12083 | null |
2025-07-16 | IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving | Kanghyun Ryu et.al. | 2507.11940 | null |
2025-07-16 | A Fast Method for Planning All Optimal Homotopic Configurations for Tethered Robots and Its Extended Applications | Jinyuan Liu et.al. | 2507.11880 | null |
2025-07-15 | MPC-based Coarse-to-Fine Motion Planning for Robotic Object Transportation in Cluttered Environments | Chen Cai et.al. | 2507.11211 | null |
2025-07-15 | Enhancing Autonomous Manipulator Control with Human-in-loop for Uncertain Assembly Environments | Ashutosh Mishra et.al. | 2507.11006 | null |
2025-07-15 | OffsetCrust: Variable-Radius Offset Approximation with Power Diagrams | Zihan Zhao et.al. | 2507.10924 | null |
2025-07-15 | Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets | Savva Morozov et.al. | 2507.10878 | null |
2025-07-14 | A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments | Yuchen Wang et.al. | 2507.10792 | null |
2025-07-23 | Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis | Yue Ding et.al. | 2507.10382 | null |
2025-07-16 | TOP: Trajectory Optimization via Parallel Optimization towards Constant Time Complexity | Jiajun Yu et.al. | 2507.10290 | null |
2025-07-14 | MP-RBFN: Learning-based Vehicle Motion Primitives using Radial Basis Function Networks | Marc Kaufeld et.al. | 2507.10047 | null |
2025-07-22 | Active Probing with Multimodal Predictions for Motion Planning | Darshan Gadginmath et.al. | 2507.09822 | null |
2025-07-13 | Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions | Yuanhong Zheng et.al. | 2507.09446 | null |
2025-07-12 | Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields | Wondmgezahu Teshome et.al. | 2507.09383 | null |
2025-07-19 | Informed Hybrid Zonotope-based Motion Planning Algorithm | Peng Xie et.al. | 2507.09309 | null |
2025-07-12 | Integrating Planning and Predictive Control Using the Path Feasibility Governor | Shu Zhang et.al. | 2507.09134 | null |
2025-07-09 | Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination | Xishun Liao et.al. | 2507.08871 | null |
2025-07-14 | STRAP: Spatial-Temporal Risk-Attentive Vehicle Trajectory Prediction for Autonomous Driving | Xinyi Ning et.al. | 2507.08563 | null |
2025-07-11 | Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer | Francesco De Cristofaro et.al. | 2507.08365 | null |
2025-07-11 | Neural Parameter-varying Data-enabled Predictive Control of Cold Atmospheric Pressure Plasma Jets | Pegah GhafGhanbari et.al. | 2507.08259 | null |
2025-07-10 | GGMotion: Group Graph Dynamics-Kinematics Networks for Human Motion Prediction | Shuaijin Wan et.al. | 2507.07515 | null |
2025-07-10 | Towards Safe Autonomous Driving: A Real-Time Safeguarding Concept for Motion Planning Algorithms | Korbinian Moller et.al. | 2507.07444 | null |
2025-07-09 | When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior | Chengyuan Zhang et.al. | 2507.07012 | null |
2025-07-09 | Robust signal decompositions on the circle | Aral Kose et.al. | 2507.07007 | null |
2025-07-09 | ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture | Mingjin Zeng et.al. | 2507.06531 | null |
2025-07-08 | AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization | Deepak Raina et.al. | 2507.05979 | null |
2025-07-08 | DRO-EDL-MPC: Evidential Deep Learning-Based Distributionally Robust Model Predictive Control for Safe Autonomous Driving | Hyeongchan Ham et.al. | 2507.05710 | null |
2025-07-07 | From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving | Fabian Konstantinidis et.al. | 2507.05254 | null |
2025-07-07 | Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance | Tobias Demmler et.al. | 2507.05098 | null |
2025-07-07 | Unifying Robot Optimization: Monte Carlo Tree Search with Tensor Factorization | Teng Xue et.al. | 2507.04949 | null |
2025-07-25 | Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning | Giwon Lee et.al. | 2507.04790 | null |
2025-07-07 | LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction | Yixin Yan et.al. | 2507.04634 | null |
2025-07-06 | Free-Space Optical Communication-Driven NMPC Framework for Multi-Rotor Aerial Vehicles in Structured Inspection Scenarios | Giuseppe Silano et.al. | 2507.04443 | null |
2025-07-05 | Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic | Jianwei Tang et.al. | 2507.04062 | null |
2025-07-05 | Temporal Continual Learning with Prior Compensation for Human Motion Prediction | Jianwei Tang et.al. | 2507.04060 | null |
2025-07-05 | DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments | Qi Chen et.al. | 2507.03878 | null |
2025-07-05 | Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs | Ishan Khurjekar et.al. | 2507.03863 | null |
2025-07-04 | Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues | Hanfang Liang et.al. | 2507.03365 | null |
2025-07-03 | Trajectory Optimization for Differential Drive Mobile Manipulators via Topological Paths Search and Arc Length-Yaw Parameterization | Long Xu et.al. | 2507.02761 | null |
2025-07-03 | Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization | Caio Azevedo et.al. | 2507.02406 | null |
2025-07-03 | Path Planning using a One-shot-sampling Skeleton Map | Gabriel O. Flores-Aquino et.al. | 2507.02328 | null |
2025-07-02 | GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters | Wanjia Zhao et.al. | 2507.02085 | null |
2025-07-09 | Test-Time Scaling with Reflective Generative Model | Zixiao Wang et.al. | 2507.01951 | null |
2025-07-06 | AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction | Bin Rao et.al. | 2507.01801 | null |
2025-07-02 | Efficient Collision Detection for Long and Slender Robotic Links in Euclidean Distance Fields: Application to a Forestry Crane | Marc-Philip Ecker et.al. | 2507.01705 | null |
2025-07-02 | LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction | Muhammad Atta ur Rahman et.al. | 2507.01308 | null |
2025-07-01 | Search-Based Robot Motion Planning With Distance-Based Adaptive Motion Primitives | Benjamin Kraljusic et.al. | 2507.01198 | null |
2025-07-01 | ARIG: Autoregressive Interactive Head Generation for Real-time Conversations | Ying Guo et.al. | 2507.00472 | null |
2025-06-30 | Rethink 3D Object Detection from Physical World | Satoshi Tanaka et.al. | 2507.00190 | null |
2025-06-30 | Epona: Autoregressive Diffusion World Model for Autonomous Driving | Kaiwen Zhang et.al. | 2506.24113 | null |
2025-06-30 | STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems | Mingfei Cheng et.al. | 2506.23995 | null |
2025-06-29 | InfGen: Scenario Generation as Next Token Group Prediction | Zhenghao Peng et.al. | 2506.23316 | null |
2025-06-29 | Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models | Maarten Hugenholtz et.al. | 2506.23164 | null |
2025-06-28 | Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example | Bei Zhou et.al. | 2506.22894 | null |
2025-06-27 | Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD | Ruthvik Bokkasam et.al. | 2506.22111 | null |
2025-06-27 | A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments | Akshay Jaitly et.al. | 2506.21982 | null |
2025-06-27 | SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model | Shuhan Tan et.al. | 2506.21976 | null |
2025-07-14 | Ark: An Open-source Python-based Framework for Robot Learning | Magnus Dierking et.al. | 2506.21628 | null |
2025-06-26 | GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction | Muleilan Pei et.al. | 2506.21121 | null |
2025-06-25 | Near Time-Optimal Hybrid Motion Planning for Timber Cranes | Marc-Philip Ecker et.al. | 2506.20314 | null |
2025-06-24 | Trajectory Prediction in Dynamic Object Tracking: A Critical Study | Zhongping Dong et.al. | 2506.19341 | null |
2025-06-25 | AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation | Ziyan Zhao et.al. | 2506.19269 | null |
2025-08-04 | Faster Motion Planning via Restarts | Nancy Amato et.al. | 2506.19016 | null |
2025-06-23 | SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives | Yizhou Chen et.al. | 2506.18825 | null |
2025-06-23 | Design, fabrication and control of a cable-driven parallel robot | Dhruv Sorathiya et.al. | 2506.18526 | null |
2025-06-23 | Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances | Zhe Zhang et.al. | 2506.18410 | null |
2025-06-23 | Selective Social-Interaction via Individual Importance for Fast Human Trajectory Prediction | Yota Urano et.al. | 2506.18291 | null |
2025-06-23 | Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning | Yue Li et.al. | 2506.18234 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213 | null |
2025-06-20 | Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control | Albert H. Li et.al. | 2506.17184 | null |
2025-07-11 | Experimental Setup and Software Pipeline to Evaluate Optimization based Autonomous Multi-Robot Search Algorithms | Aditya Bhatt et.al. | 2506.16710 | null |