Updated on 2025.08.29
This page is maintained by Leheng Li and lists papers he is interested in. The source code of this site is available here.
3D
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-28 | Multi-View 3D Point Tracking | Frano Rajič et.al. | 2508.21060 | null |
2025-08-28 | ActLoc: Learning to Localize on the Move via Active Viewpoint Selection | Jiajie Li et.al. | 2508.20981 | null |
2025-08-28 | DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes | Yajiao Xiong et.al. | 2508.20965 | null |
2025-08-28 | PLUME: Procedural Layer Underground Modeling Engine | Gabriel Manuel Garcia et.al. | 2508.20926 | null |
2025-08-28 | Estimating 2D Keypoints of Surgical Tools Using Vision-Language Models with Low-Rank Adaptation | Krit Duangprom et.al. | 2508.20830 | null |
2025-08-28 | Surfel-based 3D Registration with Equivariant SE(3) Features | Xueyang Kang et.al. | 2508.20789 | null |
2025-08-28 | SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding | Jiawen Lin et.al. | 2508.20758 | null |
2025-08-28 | CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network | Reza Akbari Movahed et.al. | 2508.20734 | null |
2025-08-28 | Task-Oriented Edge-Assisted Cross-System Design for Real-Time Human-Robot Interaction in Industrial Metaverse | Kan Chen et.al. | 2508.20664 | null |
2025-08-28 | AvatarBack: Back-Head Generation for Complete 3D Avatars from Front-View Images | Shiqi Xin et.al. | 2508.20623 | null |
2025-08-28 | Optimization-Based Calibration for Intravascular Ultrasound Volume Reconstruction | Karl-Philippe Beaudet et.al. | 2508.20605 | null |
2025-08-28 | Embracing Aleatoric Uncertainty: Generating Diverse 3D Human Motion | Zheng Qin et.al. | 2508.20604 | null |
2025-08-28 | GLaRE: A Graph-based Landmark Region Embedding Network for Emotion Recognition | Debasis Maji et.al. | 2508.20579 | null |
2025-08-28 | Enhancing Pseudo-Boxes via Data-Level LiDAR-Camera Fusion for Unsupervised 3D Object Detection | Mingqian Ji et.al. | 2508.20530 | null |
2025-08-28 | Adam SLAM - the last mile of camera calibration with 3DGS | Matthieu Gendrin et.al. | 2508.20526 | null |
2025-08-28 | IAENet: An Importance-Aware Ensemble Model for 3D Point Cloud-Based Anomaly Detection | Xuanming Cao et.al. | 2508.20492 | null |
2025-08-28 | Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts | Zixuan Hu et.al. | 2508.20488 | null |
2025-08-28 | Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation | Jiusi Li et.al. | 2508.20471 | null |
2025-08-28 | Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation | Xiaochuan Li et.al. | 2508.20470 | null |
2025-08-28 | Prediction of Distant Metastasis for Head and Neck Cancer Patients Using Multi-Modal Tumor and Peritumoral Feature Fusion Network | Zizhao Tang et.al. | 2508.20469 | null |
2025-08-27 | MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces | Zhen Xuen Brandon Low et.al. | 2508.20256 | null |
2025-08-27 | Grounding Multimodal Large Language Models with Quantitative Skin Attributes: A Retrieval Study | Max Torop et.al. | 2508.20188 | null |
2025-08-27 | Is the medical image segmentation problem solved? A survey of current developments and future directions | Guoping Xu et.al. | 2508.20139 | null |
2025-08-26 | A Machine Learning Approach to Volumetric Computations of Solid Pulmonary Nodules | Yihan Zhou et.al. | 2508.20127 | null |
2025-08-27 | Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images | Changha Shin et.al. | 2508.20080 | null |
2025-08-27 | OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations | Peng-Hao Hsu et.al. | 2508.20063 | null |
2025-08-27 | Visio-Verbal Teleimpedance Interface: Enabling Semi-Autonomous Control of Physical Interaction via Eye Tracking and Speech | Henk H. A. Jekel et.al. | 2508.20037 | null |
2025-08-27 | Integrating SAM Supervision for 3D Weakly Supervised Point Cloud Segmentation | Lechun You et.al. | 2508.19909 | null |
2025-08-27 | Multispectral LiDAR data for extracting tree points in urban and suburban areas | Narges Takhtkeshha et.al. | 2508.19881 | null |
2025-08-27 | Multimodal Conditional MeshGAN for Personalized Aneurysm Growth Prediction | Long Chen et.al. | 2508.19862 | null |
2025-08-27 | MAPo : Motion-Aware Partitioning of Deformable 3D Gaussian Splatting for High-Fidelity Dynamic Scene Reconstruction | Han Jiao et.al. | 2508.19786 | null |
2025-08-27 | FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers | Yue Wu et.al. | 2508.19754 | null |
2025-08-27 | LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation | Yupeng Zhang et.al. | 2508.19699 | null |
2025-08-27 | SAT: Supervisor Regularization and Animation Augmentation for Two-process Monocular Texture 3D Human Reconstruction | Gangjian Zhang et.al. | 2508.19688 | null |
2025-08-27 | Beyond BEV: Optimizing Point-Level Tokens for Collaborative Perception | Yang Li et.al. | 2508.19638 | null |
2025-08-27 | Generalizing Monocular 3D Object Detection | Abhinav Kumar et.al. | 2508.19593 | null |
2025-08-27 | DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View | Tian Qiu et.al. | 2508.19508 | null |
2025-08-25 | 2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks | Utsav Ratna Tuladhar et.al. | 2508.19303 | null |
2025-08-25 | CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy | Cunmin Zhao et.al. | 2508.19300 | null |
2025-08-25 | Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation | Alexandros Gkillas et.al. | 2508.19290 | null |
2025-08-26 | VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space | Lin Li et.al. | 2508.19247 | null |
2025-08-26 | Articulate3D: Zero-Shot Text-Driven 3D Object Posing | Oishi Deb et.al. | 2508.19244 | null |
2025-08-26 | Style4D-Bench: A Benchmark Suite for 4D Stylization | Beiqi Chen et.al. | 2508.19243 | null |
2025-08-26 | LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding | Julian Ost et.al. | 2508.19204 | null |
2025-08-26 | Dual Enhancement on 3D Vision-Language Perception for Monocular 3D Visual Grounding | Yuzhen Li et.al. | 2508.19165 | null |
2025-08-26 | Random forest-based out-of-distribution detection for robust lung cancer segmentation | Aneesh Rangnekar et.al. | 2508.19112 | null |
2025-08-26 | GReAT: leveraging geometric artery data to improve wall shear stress assessment | Julian Suk et.al. | 2508.19030 | null |
2025-08-26 | RoofSeg: An edge-aware transformer-based network for end-to-end roof plane segmentation | Siyuan You et.al. | 2508.19003 | null |
2025-08-26 | Can we make NeRF-based visual localization privacy-preserving? | Maxime Pietrantoni et.al. | 2508.18971 | null |
2025-08-26 | PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads | Shashikant Verma et.al. | 2508.18944 | null |
2025-08-26 | ColorGS: High-fidelity Surgical Scene Reconstruction with Colored Gaussian Splatting | Qun Ji et.al. | 2508.18696 | null |
2025-08-26 | AgriChrono: A Multi-modal Dataset Capturing Crop Growth and Lighting Variability with a Field Robot | Jaehwan Jeong et.al. | 2508.18694 | null |
2025-08-26 | ROSE: Remove Objects with Side Effects in Videos | Chenxuan Miao et.al. | 2508.18633 | null |
2025-08-26 | SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis | Xiaohao Sun et.al. | 2508.18597 | null |
2025-08-25 | Real-time 3D Visualization of Radiance Fields on Light Field Displays | Jonghyun Kim et.al. | 2508.18540 | null |
2025-08-25 | Adaptive Visual Navigation Assistant in 3D RPGs | Kaijie Xu et.al. | 2508.18539 | null |
2025-08-25 | SAT-SKYLINES: 3D Building Generation from Satellite Imagery and Coarse Geometric Priors | Zhangyu Jin et.al. | 2508.18531 | null |
2025-08-25 | DoGFlow: Self-Supervised LiDAR Scene Flow via Cross-Modal Doppler Guidance | Ajinkya Khoche et.al. | 2508.18506 | null |
2025-08-25 | FastAvatar: Instant 3D Gaussian Splatting for Faces from Single Unconstrained Poses | Hao Liang et.al. | 2508.18389 | null |
2025-08-23 | SERES: Semantic-aware neural reconstruction from sparse views | Bo Xu et.al. | 2508.18314 | null |
2025-08-22 | Towards Training-Free Underwater 3D Object Detection from Sonar Point Clouds: A Comparison of Traditional and Deep Learning Approaches | M. Salman Shaukat et.al. | 2508.18293 | null |
2025-08-25 | ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models | Haitang Feng et.al. | 2508.18271 | null |
2025-08-25 | GSVisLoc: Generalizable Visual Localization for Gaussian Splatting Scene Representations | Fadi Khatib et.al. | 2508.18242 | null |
2025-08-21 | PriorFormer: A Transformer for Real-time Monocular 3D Human Pose Estimation with Versatile Geometric Priors | Mohamed Adjel et.al. | 2508.18238 | null |
2025-08-25 | Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance | Ayce Idil Aytekin et.al. | 2508.18213 | null |
2025-08-25 | EventTracer: Fast Path Tracing-based Event Stream Rendering | Zhenyang Li et.al. | 2508.18071 | null |
2025-08-25 | Topology Aware Neural Interpolation of Scalar Fields | Mohamed Kissi et.al. | 2508.17995 | null |
2025-08-25 | SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization | Junyuan Deng et.al. | 2508.17972 | null |
2025-08-25 | A holistic perception system of internal and external monitoring for ground autonomous vehicles: AutoTRUST paradigm | Alexandros Gkillas et.al. | 2508.17969 | null |
2025-08-25 | Beam Geometry and Input Dimensionality: Impact on Sparse-Sampling Artifact Correction for Clinical CT with U-Nets | Tina Dorosti et.al. | 2508.17961 | null |
2025-08-25 | EndoUFM: Utilizing Foundation Models for Monocular depth estimation of endoscopic images | Xinning Yao et.al. | 2508.17916 | null |
2025-08-25 | Camera Pose Refinement via 3D Gaussian Splatting | Lulu Hao et.al. | 2508.17876 | null |
2025-08-25 | HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation | Xiping Wang et.al. | 2508.17832 | null |
2025-08-25 | CubeDN: Real-time Drone Detection in 3D Space from Dual mmWave Radar Cubes | Yuan Fang et.al. | 2508.17831 | null |
2025-08-25 | MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting | Hanzhi Chang et.al. | 2508.17811 | null |
2025-08-25 | DroneKey: Drone 3D Pose Estimation in Image Sequences using Gated Key-representation and Pose-adaptive Learning | Seo-Bin Hwang et.al. | 2508.17746 | null |
2025-08-25 | MEVITA: Open-Source Bipedal Robot Assembled from E-Commerce Components via Sheet Metal Welding | Kento Kawaharazuka et.al. | 2508.17684 | null |
2025-08-28 | Generating Human-AI Collaborative Design Sequence for 3D Assets via Differentiable Operation Graph | Xiaoyang Huang et.al. | 2508.17645 | null |
2025-08-25 | Wound3DAssist: A Practical Framework for 3D Wound Assessment | Remi Chierchia et.al. | 2508.17635 | null |
2025-08-25 | GWM: Towards Scalable Gaussian World Models for Robotic Manipulation | Guanxing Lu et.al. | 2508.17600 | null |
2025-08-25 | TinyGiantVLM: A Lightweight Vision-Language Architecture for Spatial Reasoning under Resource Constraints | Vinh-Thuan Ly et.al. | 2508.17595 | null |
2025-08-25 | IDU: Incremental Dynamic Update of Existing 3D Virtual Environments with New Imagery Data | Meida Chen et.al. | 2508.17579 | null |
2025-08-24 | Random-phase Gaussian Wave Splatting for Computer-generated Holography | Brian Chao et.al. | 2508.17480 | null |
2025-08-24 | Investigating Domain Gaps for Indoor 3D Object Detection | Zijing Zhao et.al. | 2508.17439 | null |
2025-08-20 | Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels | Long Le et.al. | 2508.17437 | null |
2025-08-24 | MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling | Haoyu Wang et.al. | 2508.17404 | null |
2025-08-26 | PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation | Xiaoyang Hao et.al. | 2508.17239 | null |
2025-08-24 | 4D Visual Pre-training for Robot Learning | Chengkai Hou et.al. | 2508.17230 | null |
2025-08-24 | VROOM - Visual Reconstruction over Onboard Multiview | Yajat Yadav et.al. | 2508.17172 | null |
2025-08-23 | DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method | Qingwen Zhang et.al. | 2508.17054 | null |
2025-08-23 | PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models | Xianjing Cheng et.al. | 2508.17050 | null |
2025-08-23 | M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments | Dmitry Yudin et.al. | 2508.17044 | null |
2025-08-23 | DualReg: Dual-Space Filtering and Reinforcement for Rigid Registration | Jiayi Li et.al. | 2508.17034 | null |
2025-08-23 | Fiducial Marker Splatting for High-Fidelity Robotics Simulations | Diram Tabaa et.al. | 2508.17012 | null |
2025-08-23 | A Survey of Deep Learning-based Point Cloud Denoising | Jinxi Wang et.al. | 2508.17011 | null |
2025-08-23 | Align 3D Representation and Text Embedding for 3D Content Personalization | Qi Song et.al. | 2508.16932 | null |
2025-08-23 | Structural Energy-Guided Sampling for View-Consistent Text-to-3D | Qing Zhang et.al. | 2508.16917 | null |
2025-08-23 | MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation | Prerit Gupta et.al. | 2508.16911 | null |
2025-08-23 | Relative Navigation and Dynamic Target Tracking for Autonomous Underwater Proximity Operations | David Baxter et.al. | 2508.16901 | null |
2025-08-23 | Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network | Pouya Shiri et.al. | 2508.16897 | null |
2025-08-23 | A Workflow for Map Creation in Autonomous Vehicle Simulations | Zubair Islam et.al. | 2508.16856 | null |
2025-08-22 | Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes | Xinhao Xiang et.al. | 2508.16812 | null |
2025-08-21 | BrainPath: Generating Subject-Specific Brain Aging Trajectories | Yifan Li et.al. | 2508.16667 | null |
2025-08-22 | MV-RAG: Retrieval Augmented Multiview Diffusion | Yosef Dayani et.al. | 2508.16577 | null |
2025-08-22 | Real-time 3D Light-field Viewing with Eye-tracking on Conventional Displays | Trung Hieu Pham et.al. | 2508.16535 | null |
2025-08-26 | Comparative Analysis of UAV Path Planning Algorithms for Efficient Navigation in Urban 3D Environments | Hichem Cheriet et.al. | 2508.16515 | null |
2025-08-22 | On Kinodynamic Global Planning in a Simplicial Complex Environment: A Mixed Integer Approach | Otobong Jerome et.al. | 2508.16511 | null |
2025-08-22 | Arbitrary-Scale 3D Gaussian Super-Resolution | Huimin Zeng et.al. | 2508.16467 | null |
2025-08-25 | HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images | Anilkumar Swamy et.al. | 2508.16465 | null |
2025-08-22 | HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction | Sara Rojas et.al. | 2508.16433 | null |
2025-08-22 | SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather | Edoardo Palladin et.al. | 2508.16408 | null |
2025-08-22 | Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars | NVIDIA et.al. | 2508.16401 | null |
2025-08-22 | Self-Validated Learning for Particle Separation: A Correctness-Based Self-Training Framework Without Human Labels | Philipp D. Lösel et.al. | 2508.16224 | null |
2025-08-22 | 4D Virtual Imaging Platform for Dynamic Joint Assessment via Uni-Plane X-ray and 2D-3D Registration | Hao Tang et.al. | 2508.16138 | null |
2025-08-22 | Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables | Wontae Kim et.al. | 2508.16121 | null |
2025-08-22 | A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection | Qifeng Liu et.al. | 2508.16069 | null |
2025-08-22 | Advances and Trends in the 3D Reconstruction of the Shape and Motion of Animals | Ziqi Li et.al. | 2508.16062 | null |
2025-08-22 | NeuralMeshing: Complete Object Mesh Extraction from Casual Captures | Floris Erich et.al. | 2508.16026 | null |
2025-08-21 | Self-Aligning EPM Connector: A Versatile Solution for Adaptive and Multi-Modal Interfaces | Bingchao Wang et.al. | 2508.16008 | null |
2025-08-21 | GelSLAM: A Real-time, High-Fidelity, and Robust 3D Tactile SLAM System | Hung-Jui Huang et.al. | 2508.15990 | null |
2025-08-21 | UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation | Zhaodong Jiang et.al. | 2508.15972 | null |
2025-08-21 | Text-Driven 3D Hand Motion Generation from Sign Language Data | Léore Bensabath et.al. | 2508.15902 | null |
2025-08-21 | Active Prostate Phantom with Multiple Chambers | Sizhe Tian et.al. | 2508.15873 | null |
2025-08-21 | SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass | Yanxu Meng et.al. | 2508.15769 | null |
2025-08-21 | ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling | Jinhyung Park et.al. | 2508.15767 | null |
2025-08-21 | CM2LoD3: Reconstructing LoD3 Building Models Using Semantic Conflict Maps | Franz Hanke et.al. | 2508.15672 | null |
2025-08-25 | Hessian-Based Lightweight Neural Network HessNet for State-of-the-Art Brain Vessel Segmentation on a Minimal Training Dataset | Alexandra Bernadotte et.al. | 2508.15660 | null |
2025-08-21 | Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance | Shuchao Pang et.al. | 2508.15650 | null |
2025-08-21 | Fast globally optimal Truncated Least Squares point cloud registration with fixed rotation axis | Ivo Ivanov et.al. | 2508.15613 | null |
2025-08-21 | Deep Equilibrium Convolutional Sparse Coding for Hyperspectral Image Denoising | Jin Ye et.al. | 2508.15553 | null |
2025-08-21 | MExECON: Multi-view Extended Explicit Clothed humans Optimized via Normal integration | Fulden Ece Uğur et.al. | 2508.15500 | null |
2025-08-21 | Enhancing Novel View Synthesis from extremely sparse views with SfM-free 3D Gaussian Splatting Framework | Zongqi He et.al. | 2508.15457 | null |
2025-08-25 | DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians | Cong Wang et.al. | 2508.15376 | null |
2025-08-21 | Image-Conditioned 3D Gaussian Splat Quantization | Xinshuang Liu et.al. | 2508.15372 | null |
2025-08-21 | RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features | Olga Matykina et.al. | 2508.15353 | null |
2025-08-21 | Mag-Match: Magnetic Vector Field Features for Map Matching and Registration | William McDonald et.al. | 2508.15300 | null |
2025-08-21 | BasketLiDAR: The First LiDAR-Camera Multimodal Dataset for Professional Basketball MOT | Ryunosuke Hayashi et.al. | 2508.15299 | null |
2025-08-21 | Collaborative Multi-Modal Coding for High-Quality 3D Generation | Ziang Cao et.al. | 2508.15228 | null |
2025-08-25 | MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion | Xuyang Chen et.al. | 2508.15169 | null |
2025-08-21 | Reliable Multi-view 3D Reconstruction for ‘Just-in-time’ Edge Environments | Md. Nurul Absur et.al. | 2508.15158 | null |
2025-08-21 | Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors | Jeonghyun Noh et.al. | 2508.15151 | null |
2025-08-20 | Virtual Community: An Open World for Humans, Robots, and Society | Qinhong Zhou et.al. | 2508.14893 | null |
2025-08-20 | Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds | Jia Lu et.al. | 2508.14892 | null |
2025-08-20 | GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects | Licheng Shen et.al. | 2508.14891 | null |
2025-08-22 | MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds | Bingquan Dai et.al. | 2508.14879 | null |
2025-08-20 | Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization | Canyu Zhao et.al. | 2508.14811 | null |
2025-08-20 | Fusing Monocular RGB Images with AIS Data to Create a 6D Pose Estimation Dataset for Marine Vessels | Fabian Holst et.al. | 2508.14767 | null |
2025-08-20 | GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting | Jiaxin Wei et.al. | 2508.14717 | null |
2025-08-20 | GeMS: Efficient Gaussian Splatting for Extreme Motion Blur | Gopi Raju Matta et.al. | 2508.14682 | null |
2025-08-20 | UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling | Peiming Li et.al. | 2508.14604 | null |
2025-08-20 | Safety-Critical Learning for Long-Tail Events: The TUM Traffic Accident Dataset | Walter Zimmer et.al. | 2508.14567 | null |
2025-08-20 | GOGS: High-Fidelity Geometry and Relighting for Glossy Objects via Gaussian Surfels | Xingyuan Yang et.al. | 2508.14563 | null |
2025-08-20 | Making Pose Representations More Expressive and Disentangled via Residual Vector Quantization | Sukhyun Jeong et.al. | 2508.14561 | null |
2025-08-20 | From Slices to Structures: Unsupervised 3D Reconstruction of Female Pelvic Anatomy from Freehand Transvaginal Ultrasound | Max Krähenmann et.al. | 2508.14552 | null |
2025-08-20 | LookOut: Real-World Humanoid Egocentric Navigation | Boxiao Pan et.al. | 2508.14466 | null |
2025-08-20 | D^3-Talker: Dual-Branch Decoupled Deformation Fields for Few-Shot 3D Talking Head Synthesis | Yuhang Guo et.al. | 2508.14449 | null |
2025-08-20 | Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian Splatting | Gyusam Chang et.al. | 2508.14443 | null |
2025-08-20 | HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation | Bing Han et.al. | 2508.14431 | null |
2025-08-20 | Learning Point Cloud Representations with Pose Continuity for Depth-Based Category-Level 6D Object Pose Estimation | Zhujun Li et.al. | 2508.14358 | null |
2025-08-19 | Pixels to Play: A Foundation Model for 3D Gameplay | Yuguang Yue et.al. | 2508.14295 | null |
2025-08-21 | GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting | Elena Alegret et.al. | 2508.14278 | null |
2025-08-19 | Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning | Said Djafar Said et.al. | 2508.14276 | null |
2025-08-19 | SLAM-based Safe Indoor Exploration Strategy | Omar Mostafa et.al. | 2508.14235 | null |
2025-08-19 | RynnEC: Bringing MLLMs into Embodied World | Ronghao Dang et.al. | 2508.14160 | null |
2025-08-19 | Automated surgical planning with nnU-Net: delineation of the anatomy in hepatobiliary phase MRI | Karin A. Olthof et.al. | 2508.14133 | null |
2025-08-18 | 3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models | Jolanta Mozyrska et.al. | 2508.14122 | null |
2025-08-19 | LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos | Chin-Yang Lin et.al. | 2508.14041 | null |
2025-08-19 | Distilled-3DGS: Distilled 3D Gaussian Splatting | Lintao Xiang et.al. | 2508.14037 | null |
2025-08-19 | GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation | Ken Deng et.al. | 2508.14036 | null |
2025-08-19 | Online 3D Gaussian Splatting Modeling with Novel View Selection | Byeonggwon Lee et.al. | 2508.14014 | null |
2025-08-19 | ResPlan: A Large-Scale Vector-Graph Dataset of 17,000 Residential Floor Plans | Mohamed Abouagour et.al. | 2508.14006 | null |
2025-08-19 | Self-Supervised Sparse Sensor Fusion for Long Range Perception | Edoardo Palladin et.al. | 2508.13995 | null |
2025-08-19 | Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment | Samuel Seligardi et.al. | 2508.13989 | null |
2025-08-19 | OmViD: Omni-supervised active learning for video action detection | Aayush Rana et.al. | 2508.13983 | null |
2025-08-19 | ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving | Xianda Guo et.al. | 2508.13977 | null |
2025-08-19 | Augmenting cobots for sheet-metal SMEs with 3D object recognition and localisation | Martijn Cramer et.al. | 2508.13964 | null |
2025-08-19 | Real-Time, Population-Based Reconstruction of 3D Bone Models via Very-Low-Dose Protocols | Yiqun Lin et.al. | 2508.13947 | null |
2025-08-19 | PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis | Chunji Lv et.al. | 2508.13911 | null |
2025-08-21 | Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction | Niklas Bubeck et.al. | 2508.13826 | null |
2025-08-19 | Is-NeRF: In-scattering Neural Radiance Field for Blurred Images | Nan Luo et.al. | 2508.13808 | null |
2025-08-19 | Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing | Feng-Lin Liu et.al. | 2508.13797 | null |
2025-08-19 | VisionLaw: Inferring Interpretable Intrinsic Dynamics from Visual Observations via Bilevel Optimization | Jiajing Lin et.al. | 2508.13792 | null |
2025-08-19 | Shape-from-Template with Generalised Camera | Agniva Sengupta et.al. | 2508.13791 | null |
2025-08-19 | Blast Hole Seeking and Dipping – The Navigation and Perception Framework in a Mine Site Inspection Robot | Liyang Liu et.al. | 2508.13785 | null |
2025-08-19 | Deep Biomechanically-Guided Interpolation for Keypoint-Based Brain Shift Registration | Tiago Assis et.al. | 2508.13762 | null |
2025-08-19 | Unleashing Semantic and Geometric Priors for 3D Scene Completion | Shiyuan Chen et.al. | 2508.13601 | null |
2025-08-19 | The 9th AI City Challenge | Zheng Tang et.al. | 2508.13564 | null |
2025-08-19 | Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics | Yuchen Yang et.al. | 2508.13562 | null |
2025-08-22 | FLAIR: Frequency and Locality-Aware Implicit Neural Representations | Sukhun Ko et.al. | 2508.13544 | null |
2025-08-19 | EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors | Shikun Zhang et.al. | 2508.13537 | null |
2025-08-19 | FAMNet: Integrating 2D and 3D Features for Micro-expression Recognition via Multi-task Learning and Hierarchical Attention | Liangyu Fu et.al. | 2508.13483 | null |
2025-08-18 | Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction | Sedigheh Dargahi et.al. | 2508.13340 | null |
2025-08-18 | InnerGS: Internal Scenes Rendering via Factorized 3D Gaussian Splatting | Shuxin Liang et.al. | 2508.13287 | null |
2025-08-17 | PreSem-Surf: RGB-D Surface Reconstruction with Progressive Semantic Modeling and SG-MLP Pre-Rendering Mechanism | Yuyan Ye et.al. | 2508.13228 | null |
2025-08-18 | 4DNeX: Feed-Forward 4D Generative Modeling Made Easy | Zhaoxi Chen et.al. | 2508.13154 | null |
2025-08-18 | IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion | Wenhao Hu et.al. | 2508.13153 | null |
2025-08-24 | Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping | Siddharth Khandelwal et.al. | 2508.13065 | null |
2025-08-18 | IntelliCap: Intelligent Guidance for Consistent View Sampling | Ayaka Yasunaga et.al. | 2508.13043 | null |
2025-08-18 | Multi-Phase Automated Segmentation of Dental Structures in CBCT Using a Lightweight Auto3DSeg and SegResNet Implementation | Dominic LaBella et.al. | 2508.12962 | null |
2025-08-18 | MaskSem: Semantic-Guided Masking for Learning 3D Hybrid High-Order Motion Representation | Wei Wei et.al. | 2508.12948 | null |
2025-08-18 | Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models | Jianshu Zeng et.al. | 2508.12945 | null |
2025-08-18 | CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction | Zhiwei Ning et.al. | 2508.12917 | null |
2025-08-18 | CTFlow: Video-Inspired Latent Flow Matching for 3D CT Synthesis | Jiayi Wang et.al. | 2508.12900 | null |
2025-08-18 | MCTR: Midpoint Corrected Triangulation for Autonomous Racing via Digital Twin Simulation in CARLA | Junhao Ye et.al. | 2508.12729 | null |
2025-08-18 | Quantifying and Alleviating Co-Adaptation in Sparse-View 3D Gaussian Splatting | Kangjie Chen et.al. | 2508.12720 | null |
2025-08-18 | Neural Rendering for Sensor Adaptation in 3D Object Detection | Felix Embacher et.al. | 2508.12695 | null |
2025-08-18 | Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection | Zhongyao Li et.al. | 2508.12684 | null |
2025-08-18 | Stable Diffusion-Based Approach for Human De-Occlusion | Seung Young Noh et.al. | 2508.12663 | null |
2025-08-18 | DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video | Hao Wen et.al. | 2508.12644 | null |
2025-08-18 | Synthesizing Accurate and Realistic T1-weighted Contrast-Enhanced MR Images using Posterior-Mean Rectified Flow | Bastian Brandstötter et.al. | 2508.12640 | null |
2025-08-19 | WIPES: Wavelet-based Visual Primitives | Wenhao Zhang et.al. | 2508.12615 | null |
2025-08-17 | Segmenting Thalamic Nuclei: T1 Maps Provide a Reliable and Efficient Solution | Anqi Feng et.al. | 2508.12508 | null |
2025-08-17 | FractMorph: A Fractional Fourier-Based Multi-Domain Transformer for Deformable Image Registration | Shayan Kebriti et.al. | 2508.12445 | null |
2025-08-21 | TiP4GEN: Text to Immersive Panorama 4D Scene Generation | Ke Xing et.al. | 2508.12415 | null |
2025-08-19 | SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes | Jun Zeng et.al. | 2508.12410 | null |
2025-08-17 | Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR | Fatemeh Ghorbani Lohesara et.al. | 2508.12336 | null |
2025-08-17 | Semi-Infinite Programming for Collision-Avoidance in Optimal and Model Predictive Control | Yunfan Gao et.al. | 2508.12335 | null |
2025-08-17 | Improving Densification in 3D Gaussian Splatting for High-Fidelity Rendering | Xiaobin Deng et.al. | 2508.12313 | null |
2025-08-17 | In vivo 3D ultrasound computed tomography of musculoskeletal tissues with generative neural physics | Zhijun Zeng et.al. | 2508.12226 | null |
2025-08-17 | Splat Feature Solver | Butian Xiong et.al. | 2508.12216 | null |
2025-08-16 | RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis | Wenqing Wang et.al. | 2508.12163 | null |
2025-08-16 | VELVET-Med: Vision and Efficient Language Pre-training for Volumetric Imaging Tasks in Medicine | Ziyang Zhang et.al. | 2508.12108 | null |
2025-08-16 | Enhancing 3D point accuracy of laser scanner through multi-stage convolutional neural network for applications in construction | Qinyuan Fan et.al. | 2508.12089 | null |
2025-08-16 | VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models | Haidong Xu et.al. | 2508.12081 | null |
2025-08-16 | OASIS: Real-Time Opti-Acoustic Sensing for Intervention Systems in Unstructured Environments | Amy Phung et.al. | 2508.12071 | null |
2025-08-16 | InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes | Hongyuan Liu et.al. | 2508.12015 | null |
2025-08-16 | UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding | Yueming Xu et.al. | 2508.11952 | null |
2025-08-16 | Transferable Class Statistics and Multi-scale Feature Approximation for 3D Object Detection | Hao Peng et.al. | 2508.11951 | null |
2025-08-16 | OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation | Jilei Mao et.al. | 2508.11898 | null |
2025-08-16 | ComplicitSplat: Downstream Models are Vulnerable to Blackbox Attacks by 3D Gaussian Splat Camouflages | Matthew Hull et.al. | 2508.11854 | null |
2025-08-15 | Towards Understanding 3D Vision: the Role of Gaussian Curvature | Sherlon Almeida da Silva et.al. | 2508.11825 | null |
2025-08-15 | CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion | Zhe Zhu et.al. | 2508.11603 | null |
2025-08-15 | Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting | Simona Kocour et.al. | 2508.11431 | null |
2025-08-15 | RMFAT: Recurrent Multi-scale Feature Atmospheric Turbulence Mitigator | Zhiming Liu et.al. | 2508.11409 | null |
2025-08-15 | G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration | Ramil Khafizov et.al. | 2508.11379 | null |
2025-08-15 | AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis | Zonglin Wu et.al. | 2508.11375 | null |
2025-08-15 | HOID-R1: Reinforcement Learning for Open-World Human-Object Interaction Detection Reasoning with Multimodal Large Language Model | Zhenhao Zhang et.al. | 2508.11350 | null |
2025-08-15 | Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking | Haonan Zhang et.al. | 2508.11323 | null |
2025-08-15 | Unifying Scale-Aware Depth Prediction and Perceptual Priors for Monocular Endoscope Pose Estimation and Tissue Reconstruction | Muzammil Khan et.al. | 2508.11282 | null |
2025-08-15 | Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds | Pei He et.al. | 2508.11265 | null |
2025-08-15 | Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2508.11256 | null |
2025-08-15 | StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation | Seungmi Lee et.al. | 2508.11203 | null |
2025-08-15 | CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector | Abhinav Kumar et.al. | 2508.11185 | null |
2025-08-14 | HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing | Xinjie Gao et.al. | 2508.11106 | null |
2025-08-14 | Data-Driven Abdominal Phenotypes of Type 2 Diabetes in Lean, Overweight, and Obese Cohorts | Lucas W. Remedios et.al. | 2508.11063 | null |
2025-08-14 | Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset | Wentao Mo et.al. | 2508.11058 | null |
2025-08-20 | 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation | Nikolaos Gkanatsios et.al. | 2508.11002 | null |
2025-08-12 | Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction | Cheng Chen et.al. | 2508.10936 | null |
2025-08-18 | HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffusion Model | Qi Liu et.al. | 2508.10935 | null |
2025-08-12 | ViPE: Video Pose Engine for 3D Geometric Perception | Jiahui Huang et.al. | 2508.10934 | null |
2025-08-14 | Quantum Visual Fields with Neural Amplitude Encoding | Shuteng Wang et.al. | 2508.10900 | null |
2025-08-14 | Puppeteer: Rig and Animate Your 3D Models | Chaoyue Song et.al. | 2508.10898 | null |
2025-08-14 | Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning | Mengyuan Liu et.al. | 2508.10897 | null |
2025-08-14 | STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer | Yushi Lan et.al. | 2508.10893 | null |
2025-08-14 | TexVerse: A Universe of 3D Objects with High-Resolution Textures | Yibo Zhang et.al. | 2508.10868 | null |
2025-08-14 | An Efficient Model-Driven Groupwise Approach for Atlas Construction | Ziwei Zou et.al. | 2508.10743 | null |
2025-08-14 | Novel View Synthesis using DDIM Inversion | Sehajdeep Singh et.al. | 2508.10688 | null |
2025-08-14 | Physics-Informed Joint Multi-TE Super-Resolution with Implicit Neural Representation for Robust Fetal T2 Mapping | Busra Bulut et.al. | 2508.10680 | null |
2025-08-14 | DIVA-VQA: Detecting Inter-frame Variations in UGC Video Quality | Xinyi Wang et.al. | 2508.10605 | null |
2025-08-14 | SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving | Philipp Wolters et.al. | 2508.10567 | null |
2025-08-15 | PTQAT: A Hybrid Parameter-Efficient Quantization Algorithm for 3D Perception Tasks | Xinhao Wang et.al. | 2508.10557 | null |
2025-08-14 | Multi-Sample Anti-Aliasing and Constrained Optimization for 3D Gaussian Splatting | Zheng Zhou et.al. | 2508.10507 | null |
2025-08-14 | STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes | Keishi Ishihara et.al. | 2508.10427 | null |
2025-08-14 | SC-Lane: Slope-aware and Consistent Road Height Estimation Framework for 3D Lane Detection | Chaesong Park et.al. | 2508.10411 | null |
2025-08-14 | Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models | Hyundo Lee et.al. | 2508.10382 | null |
2025-08-14 | VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation | Ryota Tanaka et.al. | 2508.10281 | null |
2025-08-14 | Deep Learning for Crack Detection: A Review of Learning Paradigms, Generalizability, and Datasets | Xinan Zhang et.al. | 2508.10256 | null |
2025-08-13 | EntropyGS: An Efficient Entropy Coding on 3D Gaussian Splatting | Yuning Huang et.al. | 2508.10227 | null |
2025-08-13 | B-repLer: Semantic B-rep Latent Editor using Large Language Models | Yilin Liu et.al. | 2508.10201 | null |
2025-08-18 | From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation | Ke Niu et.al. | 2508.10118 | null |
2025-08-13 | A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation | Shuting He et.al. | 2508.09977 | null |
2025-08-13 | PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image | Geonhee Sim et.al. | 2508.09973 | null |
2025-08-13 | LIA-X: Interpretable Latent Portrait Animator | Yaohui Wang et.al. | 2508.09959 | null |
2025-08-13 | E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras | Chaoran Feng et.al. | 2508.09912 | null |
2025-08-13 | HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics | Weiqi Li et.al. | 2508.09858 | null |
2025-08-13 | Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes | Yuekun Wu et.al. | 2508.09855 | null |
2025-08-13 | ARI3D: A Software for Interactive Quantification of Regions in X-Ray CT 3D Images | Jan Phillipp Albrecht et.al. | 2508.09849 | null |
2025-08-13 | RayletDF: Raylet Distance Fields for Generalizable 3D Surface Reconstruction from Point Clouds or Gaussians | Shenxing Wei et.al. | 2508.09830 | null |
2025-08-13 | TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos | Jinxi Li et.al. | 2508.09811 | null |
2025-08-13 | Automated Segmentation of Coronal Brain Tissue Slabs for 3D Neuropathology | Jonathan Williams Ramirez et.al. | 2508.09805 | null |
2025-08-13 | MUJICA: Reforming SISR Models for PBR Material Super-Resolution via Cross-Map Attention | Xin Du et.al. | 2508.09802 | null |
2025-08-13 | Surg-InvNeRF: Invertible NeRF for 3D tracking and reconstruction in surgical vision | Gerardo Loza et.al. | 2508.09681 | null |
2025-08-13 | GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors | Xingyilang Yin et.al. | 2508.09667 | null |
2025-08-13 | Noise-adapted Neural Operator for Robust Non-Line-of-Sight Imaging | Lianfang Wang et.al. | 2508.09655 | null |
2025-08-13 | TOTNet: Occlusion-Aware Temporal Tracking for Robust Ball Detection in Sports Videos | Hao Xu et.al. | 2508.09650 | null |
2025-08-13 | The Brain Resection Multimodal Image Registration (ReMIND2Reg) 2025 Challenge | Reuben Dorent et.al. | 2508.09649 | null |
2025-08-13 | Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors | Giorgos Karvounas et.al. | 2508.09629 | null |
2025-08-14 | Semantic-aware DropSplat: Adaptive Pruning of Redundant Gaussians for 3D Aerial-View Segmentation | Xu Tang et.al. | 2508.09626 | null |
2025-08-13 | MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography | Daniel Barco et.al. | 2508.09616 | null |
2025-08-13 | DualPhys-GS: Dual Physically-Guided 3D Gaussian Splatting for Underwater Scene Reconstruction | Jiachen Li et.al. | 2508.09610 | null |
2025-08-15 | SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing | Heyi Sun et.al. | 2508.09597 | null |
2025-08-13 | CaRoBio: 3D Cable Routing with a Bio-inspired Gripper Fingernail | Jiahui Zuo et.al. | 2508.09558 | null |
2025-08-14 | Iterative Volume Fusion for Asymmetric Stereo Matching | Yuanting Gao et.al. | 2508.09543 | null |
2025-08-13 | SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite Images | Xuejun Huang et.al. | 2508.09479 | null |
2025-08-13 | CitySeg: A 3D Open Vocabulary Semantic Segmentation Foundation Model in City-scale Scenarios | Jialei Xu et.al. | 2508.09470 | null |
2025-08-13 | DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation | Haoxiang Shi et.al. | 2508.09444 | null |
2025-08-13 | Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving | Guangxun Zhu et.al. | 2508.09404 | null |
2025-08-12 | X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents | Guoxian Song et.al. | 2508.09383 | null |
2025-08-12 | Gradient-Direction-Aware Density Control for 3D Gaussian Splatting | Zheng Zhou et.al. | 2508.09239 | null |
2025-08-12 | Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices | Ya Zou et.al. | 2508.09136 | null |
2025-08-13 | GeoVLA: Empowering 3D Representations in Vision-Language-Action Models | Lin Sun et.al. | 2508.09071 | null |
2025-08-12 | A new dataset and comparison for multi-camera frame synthesis | Conall Daly et.al. | 2508.09068 | null |
2025-08-12 | VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception | Fuhao Chang et.al. | 2508.09061 | null |
2025-08-12 | DASC: Depth-of-Field Aware Scene Complexity Metric for 3D Visualization on Light Field Display | Kamran Akbar et.al. | 2508.08928 | null |
2025-08-12 | Masked Clustering Prediction for Unsupervised Point Cloud Pre-training | Bin Ren et.al. | 2508.08910 | null |
2025-08-12 | GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments | Lin Zeng et.al. | 2508.08867 | null |
2025-08-12 | DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI | Bo-Hsun Chen et.al. | 2508.08831 | null |
2025-08-12 | 3DFroMLLM: 3D Prototype Generation only from Pretrained Multimodal LLMs | Noor Ahmed et.al. | 2508.08821 | null |
2025-08-12 | MonoPartNeRF: Human Reconstruction from Monocular Video via Part-Based Neural Radiance Fields | Yao Lu et.al. | 2508.08798 | null |
2025-08-12 | SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA) | Trong-Thuan Nguyen et.al. | 2508.08781 | null |
2025-08-12 | ROD: RGB-Only Fast and Efficient Off-road Freespace Detection | Tong Sun et.al. | 2508.08697 | null |
2025-08-14 | Yan: Foundational Interactive Video Generation | Deheng Ye et.al. | 2508.08601 | null |
2025-08-12 | RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space | Jingyun Liang et.al. | 2508.08588 | null |
2025-08-12 | Bio-Generative Design Morphology with Radiolaria: An application of a Nature-Based Generative Shape Grammar for Geometrical Design of Space Frames | Michael Kleiss et.al. | 2508.08572 | null |
2025-08-12 | Revisiting the City Tower Project: Geometric Principles and Structural Morphology in the Works of Louis I. Kahn and Anne Tyng | Aysan Mokhtarimousavi et.al. | 2508.08561 | null |
2025-08-11 | Empowering Children to Create AI-Enabled Augmented Reality Experiences | Lei Zhang et.al. | 2508.08467 | null |
2025-08-11 | Enhanced Liver Tumor Detection in CT Images Using 3D U-Net and Bat Algorithm for Hyperparameter Optimization | Nastaran Ghorbani et.al. | 2508.08452 | null |
2025-08-11 | ImageDDI: Image-enhanced Molecular Motif Sequence Representation for Drug-Drug Interaction Prediction | Yuqin He et.al. | 2508.08338 | null |
2025-08-11 | Learning an Implicit Physics Model for Image-based Fluid Simulation | Emily Yue-Ting Jia et.al. | 2508.08254 | null |
2025-08-11 | ReferSplat: Referring Segmentation in 3D Gaussian Splatting | Shuting He et.al. | 2508.08252 | null |
2025-08-11 | LL3M: Large Language 3D Modelers | Sining Lu et.al. | 2508.08228 | null |
2025-08-11 | SAGOnline: Segment Any Gaussians Online | Wentao Sun et.al. | 2508.08219 | null |
2025-08-11 | Spatial-ORMLLM: Improve Spatial Relation Understanding in the Operating Room with Multimodal Large Language Model | Peiqi He et.al. | 2508.08199 | null |
2025-08-11 | Emergent morphogenesis via planar fabrication enabled by a reduced model of composites | Yupeng Zhang et.al. | 2508.08198 | null |
2025-08-12 | 3D Human Mesh Estimation from Single View RGBD | Ozhan Suat et.al. | 2508.08178 | null |
2025-08-13 | CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data | Chongke Bi et.al. | 2508.08173 | null |
2025-08-11 | FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting | Yitong Yang et.al. | 2508.08136 | null |
2025-08-11 | GRASPTrack: Geometry-Reasoned Association via Segmentation and Projection for Multi-Object Tracking | Xudong Han et.al. | 2508.08117 | null |
2025-08-11 | 3D Plant Root Skeleton Detection and Extraction | Jiakai Lin et.al. | 2508.08094 | null |
2025-08-11 | Matrix-3D: Omnidirectional Explorable 3D World Generation | Zhongqi Yang et.al. | 2508.08086 | null |
2025-08-11 | S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix | Peng Dai et.al. | 2508.08048 | null |
2025-08-11 | Aerial Target Encirclement and Interception with Noisy Range Observations | Fen Liu et.al. | 2508.08046 | null |
2025-08-11 | TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation | Huawei Sun et.al. | 2508.08038 | null |
2025-08-11 | Mitigating Biases in Surgical Operating Rooms with Geometry | Tony Danjun Wang et.al. | 2508.08028 | null |
2025-08-11 | TrackOR: Towards Personalized Intelligent Operating Rooms Through Robust Tracking | Tony Danjun Wang et.al. | 2508.07968 | null |
2025-08-11 | Safeguarding Generative AI Applications in Preclinical Imaging through Hybrid Anomaly Detection | Jakub Binda et.al. | 2508.07923 | null |
2025-08-11 | Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models | Johanna P. Müller et.al. | 2508.07903 | null |
2025-08-11 | NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction | Tianle Zeng et.al. | 2508.07897 | null |
2025-08-11 | Autonomous Navigation of Cloud-Controlled Quadcopters in Confined Spaces Using Multi-Modal Perception and LLM-Driven High Semantic Reasoning | Shoaib Ahmmad et.al. | 2508.07885 | null |
2025-08-11 | Vertex Features for Neural Global Illumination | Rui Su et.al. | 2508.07852 | null |
2025-08-11 | Tracking Any Point Methods for Markerless 3D Tissue Tracking in Endoscopic Stereo Images | Konrad Reuter et.al. | 2508.07851 | null |
2025-08-11 | CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving | Qi Xiang et.al. | 2508.07838 | null |
2025-08-11 | DiTVR: Zero-Shot Diffusion Transformer for Video Restoration | Sicheng Gao et.al. | 2508.07811 | null |
2025-08-11 | Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning | Bao Li et.al. | 2508.07804 | null |
2025-08-11 | MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks | Yushen Xu et.al. | 2508.07803 | null |
2025-08-11 | Forecasting Continuous Non-Conservative Dynamical Systems in SO(3) | Lennart Bastian et.al. | 2508.07775 | null |
2025-08-13 | Multi-view Normal and Distance Guidance Gaussian Splatting for Surface Reconstruction | Bo Jia et.al. | 2508.07701 | null |
2025-08-11 | Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing | Weitao Wang et.al. | 2508.07700 | null |
2025-08-11 | GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions | Helong Huang et.al. | 2508.07650 | null |
2025-08-11 | Breaking Down and Building Up: Mixture of Skill-Based Vision-and-Language Navigation Agents | Tianyi Ma et.al. | 2508.07642 | null |
2025-08-11 | End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy | Zifan Wang et.al. | 2508.07611 | null |
2025-08-12 | Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring | Ludan Zhang et.al. | 2508.07552 | null |
2025-08-11 | CoT-Pose: Chain-of-Thought Reasoning for 3D Pose Generation from Abstract Prompts | Junuk Cha et.al. | 2508.07540 | null |
2025-08-10 | Novel View Synthesis with Gaussian Splatting: Impact on Photogrammetry Model Accuracy and Resolution | Pranav Chougule et.al. | 2508.07483 | null |
2025-08-10 | CharacterShot: Controllable and Consistent 4D Character Animation | Junyao Gao et.al. | 2508.07409 | null |
2025-08-10 | DIP-GS: Deep Image Prior For Gaussian Splatting Sparse View Recovery | Rajaei Khatib et.al. | 2508.07372 | null |
2025-08-10 | GS4Buildings: Prior-Guided Gaussian Splatting for 3D Building Reconstruction | Qilin Zhang et.al. | 2508.07355 | null |
2025-08-10 | Navigation and Exploration with Active Inference: from Biology to Industry | Daria de Tinguy et.al. | 2508.07269 | null |
2025-08-10 | Fading the Digital Ink: A Universal Black-Box Attack Framework for 3DGS Watermarking Systems | Qingyuan Zeng et.al. | 2508.07263 | null |
2025-08-12 | Understanding Dynamic Scenes in Ego Centric 4D Point Clouds | Junsheng Huang et.al. | 2508.07251 | null |
2025-08-10 | 3D Gaussian Representations with Motion Trajectory Field for Dynamic Scene Reconstruction | Xuesong Li et.al. | 2508.07182 | null |
2025-08-10 | CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion | Xiaotong Lin et.al. | 2508.07162 | null |
2025-08-09 | DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit | Aiden Swann et.al. | 2508.07118 | null |
2025-08-09 | AugLift: Boosting Generalization in Lifting-based 3D Human Pose Estimation | Nikolai Warner et.al. | 2508.07112 | null |
2025-08-09 | Communication-Efficient Multi-Agent 3D Detection via Hybrid Collaboration | Yue Hu et.al. | 2508.07092 | null |
2025-08-09 | ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting | Sandro Papais et.al. | 2508.07089 | null |
2025-08-09 | TeSO: Representing and Compressing 3D Point Cloud Scenes with Textured Surfel Octree | Yueyu Hu et.al. | 2508.07083 | null |
2025-08-09 | SAGCNet: Spatial-Aware Graph Completion Network for Missing Slice Imputation in Population CMR Imaging | Junkai Liu et.al. | 2508.07041 | null |
2025-08-09 | 3DGS-VBench: A Comprehensive Video Quality Evaluation Benchmark for 3DGS Compression | Yuke Xing et.al. | 2508.07038 | null |
2025-08-12 | HiMat: DiT-based Ultra-High Resolution SVBRDF Generation | Zixiong Wang et.al. | 2508.07011 | null |
2025-08-09 | Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments | Gian Mario Favero et.al. | 2508.07006 | null |
2025-08-09 | EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events | Siyu Chen et.al. | 2508.07003 | null |
2025-08-09 | Evaluating Fisheye-Compatible 3D Gaussian Splatting Methods on Real Images Beyond 180 Degree Field of View | Ulas Gunes et.al. | 2508.06968 | null |
2025-08-09 | Hybrid Machine Learning Framework for Predicting Geometric Deviations from 3D Surface Metrology | Hamidreza Samadi et.al. | 2508.06845 | null |
2025-08-09 | Edge Detection for Organ Boundaries via Top Down Refinement and SubPixel Upsampling | Aarav Mehta et.al. | 2508.06805 | null |
2025-08-09 | DiffUS: Differentiable Ultrasound Rendering from Volumetric Imaging | Noe Bertramo et.al. | 2508.06768 | null |
2025-08-09 | VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions | Yash Garg et.al. | 2508.06757 | null |
2025-08-08 | Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video | Jixuan He et.al. | 2508.06715 | null |
2025-08-08 | Fourier Optics and Deep Learning Methods for Fast 3D Reconstruction in Digital Holography | Justin London et.al. | 2508.06703 | null |
2025-08-08 | CoDe-NeRF: Neural Rendering via Dynamic Coefficient Decomposition | Wenpeng Xing et.al. | 2508.06632 | null |
2025-08-08 | LightSwitch: Multi-view Relighting with Material-guided Diffusion | Yehonathan Litman et.al. | 2508.06494 | null |
2025-08-08 | MotionSwap | Om Patil et.al. | 2508.06430 | null |
2025-08-08 | FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation | Wenbin Teng et.al. | 2508.06392 | null |
2025-08-08 | ViPro-2: Unsupervised State Estimation via Integrated Dynamics for Guiding Video Prediction | Patrick Takenaka et.al. | 2508.06335 | null |
2025-08-08 | L2Calib: $SE(3)$-Manifold Reinforcement Learning for Robust Extrinsic Calibration with Degenerate Motion Resilience | Baorun Li et.al. | 2508.06330 | null |
2025-08-08 | Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? | Xin Ci Wong et.al. | 2508.06327 | null |
2025-08-08 | Real-Time 3D Vision-Language Embedding Mapping | Christian Rauch et.al. | 2508.06291 | null |
2025-08-08 | Situationally-aware Path Planning Exploiting 3D Scene Graphs | Saad Ejaz et.al. | 2508.06283 | null |
2025-08-08 | XAG-Net: A Cross-Slice Attention and Skip Gating Network for 2.5D Femur MRI Segmentation | Byunghyun Ko et.al. | 2508.06258 | null |
2025-08-08 | PA-HOI: A Physics-Aware Human and Object Interaction Dataset | Ruiyan Wang et.al. | 2508.06205 | null |
2025-08-08 | AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly Detection | Zhaopeng Gu et.al. | 2508.06203 | null |
2025-08-08 | UW-3DGS: Underwater 3D Reconstruction with Physics-Aware Gaussian Splatting | Wenpeng Xing et.al. | 2508.06169 | null |
2025-08-08 | Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation | YoungChan Choi et.al. | 2508.06136 | null |
2025-08-12 | GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving | Jian Wang et.al. | 2508.06113 | null |
2025-08-08 | MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment | Gui Zou et.al. | 2508.06104 | null |
2025-08-08 | Towards MR-Based Trochleoplasty Planning | Michael Wehrli et.al. | 2508.06076 | null |
2025-08-08 | LV-Net: Anatomy-aware lateral ventricle shape modeling with a case study on Alzheimer’s disease, the Australian Imaging Biomarkers and Lifestyle flagship study of ageing | Wonjung Park et.al. | 2508.06055 | null |
2025-08-08 | Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts | Kiran Chhatre et.al. | 2508.06032 | null |
2025-08-08 | ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors | Minsu Kim et.al. | 2508.06014 | null |
2025-08-08 | AnimateScene: Camera-controllable Animation in Any Scene | Qingyang Liu et.al. | 2508.05982 | null |
2025-08-08 | A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image | Yanxing Liang et.al. | 2508.05950 | null |
2025-08-08 | Enhancing Construction Site Analysis and Understanding with 3D Segmentation | Sri Ramana Saketh Vasanthawada et.al. | 2508.05922 | null |
2025-08-07 | HOLODECK 2.0: Vision-Language-Guided 3D World Generation with Editing | Zixuan Bian et.al. | 2508.05899 | null |
2025-08-07 | MZEN: Multi-Zoom Enhanced NeRF for 3-D Reconstruction with Unknown Camera Poses | Jong-Ik Park et.al. | 2508.05819 | null |
2025-08-07 | Optimization-Free Style Transfer for 3D Gaussian Splats | Raphael Du Sablon et.al. | 2508.05813 | null |
2025-08-07 | MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss | Can Zhao et.al. | 2508.05772 | null |
2025-08-07 | GAP: Gaussianize Any Point Clouds with Text Guidance | Weiqi Zhang et.al. | 2508.05631 | null |
2025-08-07 | Physically Controllable Relighting of Photographs | Chris Careaga et.al. | 2508.05626 | null |
2025-08-07 | Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity | Yuhan Zhang et.al. | 2508.05609 | null |
2025-08-07 | Robust adaptive fuzzy sliding mode control for trajectory tracking of cylindrical manipulator | Van Cuong Pham et.al. | 2508.05584 | null |
2025-08-07 | Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis | Kunyu Feng et.al. | 2508.05580 | null |
2025-08-07 | Point cloud segmentation for 3D Clothed Human Layering | Davide Garavaso et.al. | 2508.05531 | null |
2025-08-07 | Head Anchor Enhanced Detection and Association for Crowded Pedestrian Tracking | Zewei Wu et.al. | 2508.05514 | null |
2025-08-07 | MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips | Shibo Wang et.al. | 2508.05506 | null |
2025-08-07 | Symmetry Understanding of 3D Shapes via Chirality Disentanglement | Weikang Wang et.al. | 2508.05505 | null |
2025-08-07 | Computational Design and Fabrication of Modular Robots with Untethered Control | Manas Bhargava et.al. | 2508.05410 | null |
2025-08-07 | CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation | Hamza Kalisch et.al. | 2508.05375 | null |
2025-08-07 | 3DGabSplat: 3D Gabor Splatting for Frequency-adaptive Radiance Field Rendering | Junyu Zhou et.al. | 2508.05343 | null |
2025-08-08 | CF3: Compact and Fast 3D Feature Fields | Hyunjoon Lee et.al. | 2508.05254 | null |
2025-08-07 | Coarse-to-Fine Joint Registration of MR and Ultrasound Images via Imaging Style Transfer | Junyi Wang et.al. | 2508.05240 | null |
2025-08-07 | EndoMatcher: Generalizable Endoscopic Image Matcher via Multi-Domain Pre-training for Robot-Assisted Surgery | Bingyu Yang et.al. | 2508.05205 | null |
2025-08-07 | Refining Gaussian Splatting: A Volumetric Densification Approach | Mohamed Abdul Gafoor et.al. | 2508.05187 | null |
2025-08-07 | Learning to See and Act: Task-Aware View Planning for Robotic Manipulation | Yongjie Bai et.al. | 2508.05186 | null |
2025-08-07 | FCBV-Net: Category-Level Robotic Garment Smoothing via Feature-Conditioned Bimanual Value Prediction | Mohammed Daba et.al. | 2508.05153 | null |
2025-08-07 | FedGIN: Federated Learning with Dynamic Global Intensity Non-linear Augmentation for Organ Segmentation using Multi-modal Images | Sachin Dudda Nagaraju et.al. | 2508.05137 | null |
2025-08-07 | A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding | Mahmoud Chick Zaouali et.al. | 2508.05064 | null |
2025-08-07 | DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion | Yifeng Huang et.al. | 2508.05060 | null |
2025-08-07 | MAG-Nav: Language-Driven Object Navigation Leveraging Memory-Reserved Active Grounding | Weifan Zhang et.al. | 2508.05021 | null |
2025-08-07 | Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion | Shenglun Chen et.al. | 2508.04984 | null |
2025-08-07 | UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS | Zhihao Guo et.al. | 2508.04968 | null |
2025-08-07 | Laplacian Analysis Meets Dynamics Modelling: Gaussian Splatting for 4D Reconstruction | Yifan Zhou et.al. | 2508.04966 | null |
2025-08-07 | Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting | Zijian Wang et.al. | 2508.04965 | null |
2025-08-06 | CryoGS: Gaussian Splatting for Cryo-EM Homogeneous Reconstruction | Suyi Chen et.al. | 2508.04929 | null |
2025-08-06 | LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction | Md Zahidul Hasan et.al. | 2508.04847 | null |
2025-08-06 | Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models | Mehrdad Moradi et.al. | 2508.04818 | null |
2025-08-05 | Neural Field-Based 3D Surface Reconstruction of Microstructures from Multi-Detector Signals in Scanning Electron Microscopy | Shuo Chen et.al. | 2508.04728 | null |
2025-08-06 | Occupancy Learning with Spatiotemporal Memory | Ziyang Leng et.al. | 2508.04705 | null |
2025-08-06 | BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning | Ziyang Leng et.al. | 2508.04702 | null |
2025-08-06 | MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics | Ye Pan et.al. | 2508.04687 | null |
2025-08-06 | PixCuboid: Room Layout Estimation from Multi-view Featuremetric Alignment | Gustav Hanning et.al. | 2508.04659 | null |
2025-08-06 | OmniDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment | Tongfan Guan et.al. | 2508.04611 | null |
2025-08-06 | $NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything | Lingfeng Zhang et.al. | 2508.04598 | null |
2025-08-06 | Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline | Linqing Zhao et.al. | 2508.04597 | null |
2025-08-06 | LA-CaRe-CNN: Cascading Refinement CNN for Left Atrial Scar Segmentation | Franz Thaler et.al. | 2508.04553 | null |
2025-08-06 | Surf3R: Rapid Surface Reconstruction from Sparse RGB Views in Seconds | Haodong Zhu et.al. | 2508.04508 | null |
2025-08-06 | MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos | Daisheng Jin et.al. | 2508.04505 | null |
2025-08-06 | 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation | Shuzhou Yang et.al. | 2508.04467 | null |
2025-08-06 | Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models | Yinan Yu et.al. | 2508.04406 | null |
2025-08-06 | RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization | Yanyan Li et.al. | 2508.04335 | null |
2025-08-07 | Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research | Ke Li et.al. | 2508.04326 | null |
2025-08-06 | MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction | Yaopeng Lou et.al. | 2508.04297 | null |
2025-08-06 | PKSS-Align: Robust Point Cloud Registration on Pre-Kendall Shape Space | Chenlei Lv et.al. | 2508.04286 | null |
2025-08-06 | PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction | Muhua Zhu et.al. | 2508.04236 | null |
2025-08-06 | SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition | Jiahui Li et.al. | 2508.04224 | null |
2025-08-06 | Small Lesions-aware Bidirectional Multimodal Multiscale Fusion Network for Lung Disease Classification | Jianxun Yu et.al. | 2508.04205 | null |
2025-08-06 | IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control | Lijuan Liu et.al. | 2508.04147 | null |
2025-08-06 | DET-GS: Depth- and Edge-Aware Regularization for High-Fidelity 3D Gaussian Splatting | Zexu Huang et.al. | 2508.04099 | null |
2025-08-06 | Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework | Yi-Ting Chen et.al. | 2508.04090 | null |
2025-08-06 | RLGS: Reinforcement Learning-Based Adaptive Hyperparameter Tuning for Gaussian Splatting | Zhan Li et.al. | 2508.04078 | null |
2025-08-06 | Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation | Jiayi He et.al. | 2508.04049 | null |
2025-08-06 | JanusNet: Hierarchical Slice-Block Shuffle and Displacement for Semi-Supervised 3D Multi-Organ Segmentation | Zheng Zhang et.al. | 2508.03997 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Veila: Panoramic LiDAR Generation from a Monocular RGB Image | Youquan Liu et.al. | 2508.03690 | null |
2025-08-05 | Inland-LOAM: Voxel-Based Structural Semantic Mapping for Inland Waterways | Zhongbi Luo et.al. | 2508.03672 | null |
2025-08-05 | OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World | Katherine Liu et.al. | 2508.03669 | null |
2025-08-06 | Uni3R: Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images | Xiangyu Sun et.al. | 2508.03643 | null |
2025-08-05 | FPG-NAS: FLOPs-Aware Gated Differentiable Neural Architecture Search for Efficient 6DoF Pose Estimation | Nassim Ali Ousalah et.al. | 2508.03618 | null |
2025-08-05 | CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models | Ana Lawry Aguila et.al. | 2508.03594 | null |
2025-08-05 | Spatial Imputation Drives Cross-Domain Alignment for EEG Classification | Hongjun Liu et.al. | 2508.03437 | null |
2025-08-05 | WaMo: Wavelet-Enhanced Multi-Frequency Trajectory Analysis for Fine-Grained Text-Motion Retrieval | Junlong Ren et.al. | 2508.03343 | null |
2025-08-05 | Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion | Wentao Qu et.al. | 2508.03252 | null |
2025-08-05 | Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing | Hongyu Shen et.al. | 2508.03227 | null |
2025-08-05 | Monocular Depth Estimation with Global-Aware Discretization and Local Context Modeling | Heng Wu et.al. | 2508.03186 | null |
2025-08-05 | Duplex-GS: Proxy-Guided Weighted Blending for Real-Time Order-Independent Gaussian Splatting | Weihang Liu et.al. | 2508.03180 | null |
2025-08-05 | H3R: Hybrid Multi-view Correspondence for Generalizable 3D Reconstruction | Heng Jia et.al. | 2508.03118 | null |
2025-08-05 | Point2Act: Efficient 3D Distillation of Multimodal LLMs for Zero-Shot Context-Aware Grasping | Sang Min Kim et.al. | 2508.03099 | null |
2025-08-05 | RobustGS: Unified Boosting of Feedforward 3D Gaussian Splatting under Low-Quality Conditions | Anran Wu et.al. | 2508.03077 | null |
2025-08-05 | SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation | Bo Zhang et.al. | 2508.03069 | null |
2025-08-05 | A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation | Tongxu Zhang et.al. | 2508.03057 | null |
2025-08-05 | SA-3DGS: A Self-Adaptive Compression Method for 3D Gaussian Splatting | Liheng Zhang et.al. | 2508.03017 | null |
2025-08-05 | ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion | Meng Zhou et.al. | 2508.03008 | null |
2025-08-05 | GACL: Grounded Adaptive Curriculum Learning with Active Task and Performance Monitoring | Linji Wang et.al. | 2508.02988 | null |
2025-08-04 | Evaluation of 3D Counterfactual Brain MRI Generation | Pengwei Sun et.al. | 2508.02880 | null |
2025-08-04 | MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model | Tianheng Zhu et.al. | 2508.02858 | null |
2025-08-04 | GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing | Mikołaj Zieliński et.al. | 2508.02831 | null |
2025-08-04 | PyCAT4: A Hierarchical Vision Transformer-based Framework for 3D Human Pose Estimation | Zongyou Yang et.al. | 2508.02806 | null |
2025-08-04 | PMGS: Reconstruction of Projectile Motion across Large Spatiotemporal Spans via 3D Gaussian Splatting | Yijun Xu et.al. | 2508.02660 | null |
2025-08-04 | RL-U$^2$Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart Segmentation | Jierui Qu et.al. | 2508.02557 | null |
2025-08-04 | Uncertainty-Aware Perception-Based Control for Autonomous Racing | Jelena Trisovic et.al. | 2508.02494 | null |
2025-08-05 | Low-Frequency First: Eliminating Floating Artifacts in 3D Gaussian Splatting | Jianchao Wang et.al. | 2508.02493 | null |
2025-08-06 | GR-Gaussian: Graph-Based Radiative Gaussian Splatting for Sparse-View CT Reconstruction | Yikuang Yuluo et.al. | 2508.02408 | null |
2025-08-04 | Correspondence-Free Fast and Robust Spherical Point Pattern Registration | Anik Sarker et.al. | 2508.02339 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-04 | ASDR: Exploiting Adaptive Sampling and Data Reuse for CIM-based Instant Neural Rendering | Fangxin Liu et.al. | 2508.02304 | null |
2025-08-04 | Unleashing the Temporal Potential of Stereo Event Cameras for Continuous-Time 3D Object Detection | Jae-Young Kang et.al. | 2508.02288 | null |
2025-08-04 | SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion | Rui Qian et.al. | 2508.02261 | null |
2025-08-04 | GaussianCross: Cross-modal Self-supervised 3D Representation Learning via Gaussian Splatting | Lei Yao et.al. | 2508.02172 | null |
2025-08-04 | Unified Category-Level Object Detection and Pose Estimation from RGB Images using 3D Prototypes | Tom Fischer et.al. | 2508.02157 | null |
2025-08-04 | ScrewSplat: An End-to-End Method for Articulated Object Recognition | Seungyeon Kim et.al. | 2508.02146 | null |
2025-08-04 | VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling | Yuru Xiao et.al. | 2508.02129 | null |
2025-08-04 | REACT-KD: Region-Aware Cross-modal Topological Knowledge Distillation for Interpretable Medical Image Classification | Hongzhao Chen et.al. | 2508.02104 | null |
2025-08-04 | StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion | Haoxin Yang et.al. | 2508.02056 | null |
2025-08-04 | Protego: User-Centric Pose-Invariant Privacy Protection Against Face Recognition-Induced Digital Footprint Exposure | Ziling Wang et.al. | 2508.02034 | null |
2025-08-04 | On-the-Fly Object-aware Representative Point Selection in Point Cloud | Xiaoyu Zhang et.al. | 2508.01980 | null |
2025-08-04 | From Photons to Physics: Autonomous Indoor Drones and the Future of Objective Property Assessment | Petteri Teikari et.al. | 2508.01965 | null |
2025-08-03 | Less is More: AMBER-AFNO – a New Benchmark for Lightweight 3D Medical Image Segmentation | Andrea Dosi et.al. | 2508.01941 | null |
2025-08-03 | MUTE-DSS: A Digital-Twin-Based Decision Support System for Minimizing Underwater Radiated Noise in Ship Voyage Planning | Akash Venkateshwaran et.al. | 2508.01907 | null |
2025-08-03 | Beyond Vulnerabilities: A Survey of Adversarial Attacks as Both Threats and Defenses in Computer Vision Systems | Zhongliang Guo et.al. | 2508.01845 | null |
2025-08-03 | OmniEvent: Unified Event Representation Learning | Weiqi Yan et.al. | 2508.01842 | null |
2025-08-03 | Diffusion-based 3D Hand Motion Recovery with Intuitive Physics | Yufei Zhang et.al. | 2508.01835 | null |
2025-08-03 | Skip priors and add graph-based anatomical information, for point-based Couinaud segmentation | Xiaotong Zhang et.al. | 2508.01785 | null |
2025-08-05 | VPN: Visual Prompt Navigation | Shuo Feng et.al. | 2508.01766 | null |
2025-08-03 | AG$^2$aussian: Anchor-Graph Structured Gaussian Splatting for Instance-Level 3D Scene Understanding and Editing | Zhaonan Wang et.al. | 2508.01740 | null |
2025-08-03 | OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping | Danyang Li et.al. | 2508.01723 | null |
2025-08-03 | LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving | Luqi Cheng et.al. | 2508.01704 | null |
2025-08-03 | Register Anything: Estimating “Corresponding Prompts” for Segment Anything Model | Shiqi Huang et.al. | 2508.01697 | null |
2025-08-03 | DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing | Yufeng Chi et.al. | 2508.01684 | null |
2025-08-03 | DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding | Hanqing Wang et.al. | 2508.01651 | null |
2025-08-03 | StrandDesigner: Towards Practical Strand Generation with Sketch Guidance | Na Zhang et.al. | 2508.01650 | null |
2025-08-03 | Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection | Hanxi Li et.al. | 2508.01591 | null |
2025-08-03 | A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction | Hua Yu et.al. | 2508.01585 | null |
2025-08-03 | Deeply Supervised Multi-Task Autoencoder for Biological Brain Age estimation using three dimensional T$_1$-weighted magnetic resonance imaging | Mehreen Kanwal et.al. | 2508.01565 | null |
2025-08-03 | Adaptive LiDAR Scanning: Harnessing Temporal Cues for Efficient 3D Object Detection via Multi-Modal Fusion | Sara Shoouri et.al. | 2508.01562 | null |
2025-08-02 | Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning | Jack Zeng et.al. | 2508.01522 | null |
2025-08-02 | EfficientGFormer: Multimodal Brain Tumor Segmentation via Pruned Graph-Augmented Transformer | Fatemeh Ziaeetabar et.al. | 2508.01465 | null |
2025-08-02 | Can3Tok: Canonical 3D Tokenization and Latent Modeling of Scene-Level 3D Gaussians | Quankai Gao et.al. | 2508.01464 | null |
2025-08-02 | Uncertainty-Aware Segmentation Quality Prediction via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation | Sikha O K et.al. | 2508.01460 | null |
2025-08-05 | 3DRot: 3D Rotation Augmentation for RGB-Based 3D Tasks | Shitian Yang et.al. | 2508.01423 | null |
2025-08-02 | ReMu: Reconstructing Multi-layer 3D Clothed Human from Image Layers | Onat Vuran et.al. | 2508.01381 | null |
2025-08-02 | P3P Made Easy | Seong Hun Lee et.al. | 2508.01312 | null |
2025-08-02 | C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor | Haoquan Lu et.al. | 2508.01311 | null |
2025-08-02 | CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis | Alec Sargood et.al. | 2508.01292 | null |
2025-08-02 | Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching | Chuang-Wei Liu et.al. | 2508.01275 | null |
2025-08-05 | MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh | Shuangkang Fang et.al. | 2508.01242 | null |
2025-08-02 | OCSplats: Observation Completeness Quantification and Label Noise Separation in 3DGS | Han Ling et.al. | 2508.01239 | null |
2025-08-02 | Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system | Jiyong Kim et.al. | 2508.01230 | null |
2025-08-02 | MoGaFace: Momentum-Guided and Texture-Aware Gaussian Avatars for Consistent Facial Geometry | Yujian Liu et.al. | 2508.01218 | null |
2025-08-02 | Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization? | Bolei Chen et.al. | 2508.01216 | null |
2025-08-02 | A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding | Zhan Shi et.al. | 2508.01197 | null |
2025-08-02 | Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning | Xinhang Wan et.al. | 2508.01184 | null |
2025-08-02 | No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views | Ranran Huang et.al. | 2508.01171 | null |
2025-08-02 | DELTAv2: Accelerating Dense 3D Tracking | Tuan Duc Ngo et.al. | 2508.01170 | null |
2025-08-02 | OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding | Dianyi Yang et.al. | 2508.01150 | null |
2025-08-02 | Design of Q8bot: A Miniature, Low-Cost, Dynamic Quadruped Built with Zero Wires | Yufeng Wu et.al. | 2508.01149 | null |
2025-08-02 | UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation | Chaitanya Patel et.al. | 2508.01126 | null |
2025-08-01 | DreamSat-2.0: Towards a General Single-View Asteroid 3D Reconstruction | Santiago Diaz et.al. | 2508.01079 | null |
2025-08-01 | Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation | Fenghe Tang et.al. | 2508.01064 | null |
2025-08-01 | Structured Spectral Graph Learning for Anomaly Classification in 3D Chest CT Scans | Theo Di Piazza et.al. | 2508.01045 | null |
2025-08-01 | 3D Reconstruction via Incremental Structure From Motion | Muhammad Zeeshan et.al. | 2508.01019 | null |
2025-08-01 | Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection | Cheng-You Lu et.al. | 2508.01014 | null |
2025-08-01 | Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF | Massoud Pourmandi et.al. | 2508.00967 | null |
2025-07-31 | Investigating Crossing Perception in 3D Graph Visualisation | Ying Zhang et.al. | 2508.00950 | null |
2025-08-01 | IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation | Wenxuan Guo et.al. | 2508.00823 | null |
2025-08-01 | Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning | Alexander Nikitas Dimopoulos et.al. | 2508.00822 | null |
2025-08-01 | GECO: Geometrically Consistent Embedding with Lightspeed Inference | Regine Hartwig et.al. | 2508.00746 | null |
2025-08-01 | Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR | Adwait Chandorkar et.al. | 2508.00744 | null |
2025-08-04 | DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior | Junzhe Lu et.al. | 2508.00599 | null |
2025-08-01 | OmniUnet: A Multimodal Network for Unstructured Terrain Segmentation on Planetary Rovers Using RGB, Depth, and Thermal Imagery | Raul Castilla-Arquillo et.al. | 2508.00580 | null |
2025-08-04 | LesiOnTime – Joint Temporal and Clinical Modeling for Small Breast Lesion Segmentation in Longitudinal DCE-MRI | Mohammed Kamran et.al. | 2508.00496 | null |
2025-08-01 | HyPCV-Former: Hyperbolic Spatio-Temporal Transformer for 3D Point Cloud Video Anomaly Detection | Jiaping Cao et.al. | 2508.00473 | null |
2025-08-01 | Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation | Nan Xiang et.al. | 2508.00428 | null |
2025-08-01 | Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting | Seunggeun Chi et.al. | 2508.00427 | null |
2025-08-01 | Sari Sandbox: A Virtual Retail Store Environment for Embodied AI Agents | Janika Deborah Gajo et.al. | 2508.00400 | null |
2025-08-01 | Occlusion-robust Stylization for Drawing-based 3D Animation | Sunjae Yoon et.al. | 2508.00398 | null |
2025-08-01 | SparseRecon: Neural Implicit Surface Reconstruction from Sparse Views with Feature and Depth Consistencies | Liang Han et.al. | 2508.00366 | null |
2025-08-01 | Stable at Any Speed: Speed-Driven Multi-Object Tracking with Learnable Kalman Filtering | Yan Gong et.al. | 2508.00358 | null |
2025-08-01 | Omni-Scan: Creating Visually-Accurate Digital Twin Object Models Using a Bimanual Robot with Handover and Gaussian Splat Merging | Tianshuang Qiu et.al. | 2508.00354 | null |
2025-08-01 | AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer | Jin Lyu et.al. | 2508.00298 | null |
2025-08-01 | Towards Robust Semantic Correspondence: A Benchmark and Insights | Wenyue Chong et.al. | 2508.00272 | null |
2025-08-05 | Multimodal Referring Segmentation: A Survey | Henghui Ding et.al. | 2508.00265 | null |
2025-08-01 | PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting | Wentao Sun et.al. | 2508.00259 | null |
2025-08-01 | Weakly Supervised Intracranial Aneurysm Detection and Segmentation in MR angiography via Multi-task UNet with Vesselness Prior | Erin Rainville et.al. | 2508.00235 | null |
2025-07-31 | Robust 3D Object Detection using Probabilistic Point Clouds from Single-Photon LiDARs | Bhavya Goyal et.al. | 2508.00169 | null |
2025-07-31 | GEPAR3D: Geometry Prior-Assisted Learning for 3D Tooth Segmentation | Tomasz Szczepański et.al. | 2508.00155 | null |
2025-07-31 | Stress-Aware Resilient Neural Training | Ashkan Shakarami et.al. | 2508.00098 | null |
2025-07-31 | Punching Bag vs. Punching Person: Motion Transferability in Videos | Raiyaan Abdullah et.al. | 2508.00085 | null |
2025-07-31 | Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis | Bowen Zhang et.al. | 2507.23785 | null |
2025-07-31 | Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions | Li Siyao et.al. | 2507.23778 | null |
2025-07-31 | SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting | Di Li et.al. | 2507.23772 | null |
2025-08-05 | Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic | Liu Li et.al. | 2507.23763 | null |
2025-07-31 | Enhanced Velocity Field Modeling for Gaussian Video Reconstruction | Zhenyang Li et.al. | 2507.23704 | null |
2025-07-31 | Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents | Shaofei Cai et.al. | 2507.23698 | null |
2025-07-31 | High-resolution eikonal imaging and uncertainty quantification of the Kilauea caldera | Angela F. Gao et.al. | 2507.23692 | null |
2025-07-31 | I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation | Jialei Chen et.al. | 2507.23683 | null |
2025-07-31 | Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes | Xiaohan Li et.al. | 2507.23677 | null |
2025-07-31 | DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation | Yuchen Zhou et.al. | 2507.23599 | null |
2025-08-02 | MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction | Zijian Dong et.al. | 2507.23597 | null |
2025-07-31 | Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization | Maxime Pietrantoni et.al. | 2507.23569 | null |
2025-07-31 | 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection | Yung-Hsu Yang et.al. | 2507.23567 | null |
2025-08-01 | H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation | Hongzhe Bi et.al. | 2507.23523 | null |
2025-07-31 | Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion | Mutian Xu et.al. | 2507.23483 | null |
2025-07-31 | FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction | Donghyun Lee et.al. | 2507.23480 | null |
2025-07-31 | 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding | Ting Huang et.al. | 2507.23478 | null |
2025-07-31 | NeRF Is a Valuable Assistant for 3D Gaussian Splatting | Shuangkang Fang et.al. | 2507.23374 | null |
2025-07-31 | MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting | Xingyue Peng et.al. | 2507.23340 | null |
2025-08-01 | Training-free Geometric Image Editing on Diffusion Models | Hanshen Zhu et.al. | 2507.23300 | null |
2025-07-31 | iLRM: An Iterative Large 3D Reconstruction Model | Gyeongjin Kang et.al. | 2507.23277 | null |
2025-07-31 | GSFusion: Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting | Jaeseok Park et.al. | 2507.23273 | null |
2025-07-31 | Towards Affordable Tumor Segmentation and Visualization for 3D Breast MRI Using SAM2 | Solha Kang et.al. | 2507.23272 | null |
2025-07-30 | Details Matter for Indoor Open-vocabulary 3D Instance Segmentation | Sanghun Jung et.al. | 2507.23134 | null |
2025-07-30 | Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation | Zheyuan Zhang et.al. | 2507.23110 | null |
2025-07-30 | Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation | Alexandru Buburuzan et.al. | 2507.23058 | null |
2025-07-30 | Adaptive Time-step Training for Enhancing Spike-Based Neural Radiance Fields | Ranxi Lin et.al. | 2507.23033 | null |
2025-07-30 | Learning to Prune Branches in Modern Tree-Fruit Orchards | Abhinav Jain et.al. | 2507.23015 | null |
2025-07-30 | Robust and Efficient 3D Gaussian Splatting for Urban Scene Reconstruction | Zhensheng Yuan et.al. | 2507.23006 | null |
2025-07-30 | Viser: Imperative, Web-based 3D Visualization in Python | Brent Yi et.al. | 2507.22885 | null |
2025-07-30 | DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion | Qingcheng Zhao et.al. | 2507.22825 | null |
2025-07-30 | Wall Shear Stress Estimation in Abdominal Aortic Aneurysms: Towards Generalisable Neural Surrogate Models | Patryk Rygiel et.al. | 2507.22817 | null |
2025-07-30 | Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques | Weide Liu et.al. | 2507.22791 | null |
2025-07-30 | Social-Pose: Enhancing Trajectory Prediction with Human Body Pose | Yang Gao et.al. | 2507.22742 | null |
2025-07-30 | A Linear N-Point Solver for Structure and Motion from Asynchronous Tracks | Hang Su et.al. | 2507.22733 | null |
2025-07-30 | Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints | Thuy Tran et.al. | 2507.22699 | null |
2025-07-30 | Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation | Hongbin Lin et.al. | 2507.22668 | null |
2025-07-30 | trAIce3D: A Prompt-Driven Transformer Based U-Net for Semantic Segmentation of Microglial Cells from Large-Scale 3D Microscopy Images | MohammadAmin Alamalhoda et.al. | 2507.22635 | null |
2025-07-30 | Estimating 2D Camera Motion with Hybrid Motion Basis | Haipeng Li et.al. | 2507.22480 | null |
2025-07-30 | UAVScenes: A Multi-Modal Dataset for UAVs | Sijie Wang et.al. | 2507.22412 | null |
2025-07-30 | UFV-Splatter: Pose-Free Feed-Forward 3D Gaussian Splatting Adapted to Unfavorable Views | Yuki Fujimura et.al. | 2507.22342 | null |
2025-07-30 | A Segmentation Framework for Accurate Diagnosis of Amyloid Positivity without Structural Images | Penghan Zhu et.al. | 2507.22336 | null |
2025-07-29 | Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception | Christian Ellis et.al. | 2507.22194 | null |
2025-07-29 | Enhancing efficiency in paediatric brain tumour segmentation using a pathologically diverse single-center clinical dataset | A. Piffer et.al. | 2507.22152 | null |
2025-07-29 | Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos | Ziren Gong et.al. | 2507.22052 | null |
2025-07-29 | ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports | Mohammed Baharoon et.al. | 2507.22030 | null |
2025-07-29 | Cardiac-CLIP: A Vision-Language Foundation Model for 3D Cardiac CT Images | Yutao Hu et.al. | 2507.22024 | null |
2025-07-29 | XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation | Raju Ningappa Mulawade et.al. | 2507.22020 | null |
2025-07-29 | DISCOVERSE: Efficient Robot Simulation in Complex High-Fidelity Environments | Yufei Jia et.al. | 2507.21981 | null |
2025-07-29 | PanoSplatt3R: Leveraging Perspective Pretraining for Generalized Unposed Wide-Baseline Panorama Reconstruction | Jiahui Ren et.al. | 2507.21960 | null |
2025-07-31 | MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors | Shouyi Lu et.al. | 2507.21872 | null |
2025-07-29 | VidFuncta: Towards Generalizable Neural Representations for Ultrasound Videos | Julia Wolleb et.al. | 2507.21863 | null |
2025-07-29 | HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels | HunyuanWorld Team et.al. | 2507.21809 | null |
2025-07-29 | AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion | Zhishu Liu et.al. | 2507.21778 | null |
2025-07-29 | Multi-UAV Deployment in Obstacle-Cluttered Environments with LOS Connectivity | Yuda Chen et.al. | 2507.21772 | null |
2025-07-30 | No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering | Linye Wei et.al. | 2507.21572 | null |
2025-07-29 | Multi-View Reconstruction with Global Context for 3D Anomaly Detection | Yihan Sun et.al. | 2507.21555 | null |
2025-07-29 | LITE: A Learning-Integrated Topological Explorer for Multi-Floor Indoor Environments | Junhao Chen et.al. | 2507.21517 | null |
2025-07-29 | ST-DAI: Single-shot 2.5D Spatial Transcriptomics with Intra-Sample Domain Adaptive Imputation for Cost-efficient 3D Reconstruction | Jiahe Qian et.al. | 2507.21516 | null |
2025-07-29 | BANG: Dividing 3D Assets via Generative Exploded Dynamics | Longwen Zhang et.al. | 2507.21493 | null |
2025-07-29 | Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval | Zhichuan Wang et.al. | 2507.21489 | null |
2025-07-28 | Top2Pano: Learning to Generate Indoor Panoramas from Top-Down View | Zitong Zhang et.al. | 2507.21371 | null |
2025-08-03 | Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy | Jicheng Yuan et.al. | 2507.21358 | null |
2025-07-28 | DEM-NeRF: A Neuro-Symbolic Method for Scientific Discovery through Physics-Informed Simulation | Wenkai Tan et.al. | 2507.21350 | null |
2025-07-28 | GLCP: Global-to-Local Connectivity Preservation for Tubular Structure Segmentation | Feixiang Zhou et.al. | 2507.21328 | null |
2025-07-28 | VoluMe – Authentic 3D Video Calls from Live Gaussian Splat Prediction | Martin de La Gorce et.al. | 2507.21311 | null |
2025-07-28 | Fluidically Innervated Lattices Make Versatile and Durable Tactile Sensors | Annan Zhang et.al. | 2507.21225 | null |
2025-08-03 | Reconstructing 4D Spatial Intelligence: A Survey | Yukang Cao et.al. | 2507.21045 | null |
2025-07-28 | GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction | Tianhao Li et.al. | 2507.20963 | null |
2025-07-28 | $S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping | Ruoyu Fan et.al. | 2507.20854 | null |
2025-07-28 | An Efficient Machine Learning Framework for Forest Height Estimation from Multi-Polarimetric Multi-Baseline SAR data | Francesca Razzano et.al. | 2507.20798 | null |
2025-07-28 | KASportsFormer: Kinematic Anatomy Enhanced Transformer for 3D Human Pose Estimation on Short Sports Scene Video | Zhuoer Yin et.al. | 2507.20763 | null |
2025-07-28 | Methods for the Segmentation of Reticular Structures Using 3D LiDAR Data: A Comparative Evaluation | Francisco J. Soler Mora et.al. | 2507.20589 | null |
2025-07-28 | M-Net: MRI Brain Tumor Sequential Segmentation Network via Mesh-Cast | Jiacheng Lu et.al. | 2507.20582 | null |
2025-07-28 | Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation | Hyung Kyu Kim et.al. | 2507.20568 | null |
2025-07-28 | MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization | Hyung Kyu Kim et.al. | 2507.20562 | null |
2025-07-28 | Uni-Mapper: Unified Mapping Framework for Multi-modal LiDARs in Complex and Dynamic Environments | Gilhwan Kang et.al. | 2507.20538 | null |
2025-07-28 | Enhancing Spatial Reasoning through Visual and Textual Thinking | Xun Liang et.al. | 2507.20529 | null |
2025-07-28 | GaRe: Relightable 3D Gaussian Splatting for Outdoor Scenes from Unconstrained Photo Collections | Haiyang Bai et.al. | 2507.20512 | null |
2025-07-28 | Automated 3D-GS Registration and Fusion via Skeleton Alignment and Gaussian-Adaptive Features | Shiyang Liu et.al. | 2507.20480 | null |
2025-07-29 | From Gallery to Wrist: Realistic 3D Bracelet Insertion in Videos | Chenjian Gao et.al. | 2507.20331 | null |
2025-07-27 | Decomposing Densification in Gaussian Splatting for Faster 3D Scene Reconstruction | Binxiao Huang et.al. | 2507.20239 | null |
2025-07-27 | NeuroVoxel-LM: Language-Aligned 3D Perception via Dynamic Voxelization and Meta-Embedding | Shiyu Liu et.al. | 2507.20110 | null |
2025-07-26 | High-Speed Event Vision-Based Tactile Roller Sensor for Large Surface Measurements | Akram Khairi et.al. | 2507.19914 | null |
2025-07-30 | RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection | Xiaokai Bai et.al. | 2507.19856 | null |
2025-07-26 | Taking Language Embedded 3D Gaussian Splatting into the Wild | Yuze Wang et.al. | 2507.19830 | null |
2025-07-25 | GSCache: Real-Time Radiance Caching for Volume Path Tracing using 3D Gaussian Splatting | David Bauer et.al. | 2507.19718 | null |
2025-07-25 | DINO-SLAM: DINO-informed RGB-D SLAM for Neural Implicit and Explicit Representations | Ziren Gong et.al. | 2507.19474 | null |
2025-07-25 | Fast Learning of Non-Cooperative Spacecraft 3D Models through Primitive Initialization | Pol Francesch Huc et.al. | 2507.19459 | null |
2025-07-25 | NerT-CA: Efficient Dynamic Reconstruction from Sparse-view X-ray Coronary Angiography | Kirsten W. H. Maas et.al. | 2507.19328 | null |
2025-07-25 | 3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering | Wei-Hsing Huang et.al. | 2507.19133 | null |
2025-07-25 | Gaussian Set Surface Reconstruction through Per-Gaussian Optimization | Zhentao Huang et.al. | 2507.18923 | null |
2025-07-24 | SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time | Yun Chen et.al. | 2507.18713 | null |
2025-07-24 | Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping | Chong Cheng et.al. | 2507.18541 | null |
2025-07-24 | G2S-ICP SLAM: Geometry-aware Gaussian Splatting ICP SLAM | Gyuhyeon Pak et.al. | 2507.18344 | null |
2025-07-24 | LONG3R: Long Sequence Streaming 3D Reconstruction | Zhuoguang Chen et.al. | 2507.18255 | null |
2025-07-24 | PS-GS: Gaussian Splatting for Multi-View Photometric Stereo | Yixiao Chen et.al. | 2507.18231 | null |
2025-07-24 | High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details | Jun Zhou et.al. | 2507.18023 | null |
2025-07-24 | Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners | Kostas Karakontis et.al. | 2507.17519 | null |
2025-07-23 | Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field | Yuzhe Zhu et.al. | 2507.17351 | null |
2025-07-23 | Temporal Smoothness-Aware Rate-Distortion Optimized 4D Gaussian Splatting | Hyeongmin Lee et.al. | 2507.17336 | null |
2025-07-24 | PolarAnything: Diffusion-based Polarimetric Image Synthesis | Kailong Zhang et.al. | 2507.17268 | null |
2025-07-22 | StreamME: Simplify 3D Gaussian Avatar within Live Stream | Luchuan Song et.al. | 2507.17029 | null |
2025-07-22 | VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences | Kai Deng et.al. | 2507.16443 | null |
2025-07-22 | Sparse-View 3D Reconstruction: Recent Advances and Open Challenges | Tanveer Younis et.al. | 2507.16406 | null |
2025-07-22 | Dens3R: A Foundation Model for 3D Geometry Prediction | Xianze Fang et.al. | 2507.16290 | null |
2025-07-22 | LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence Images | Guichen Huang et.al. | 2507.16144 | null |
2025-07-21 | Appearance Harmonization via Bilateral Grid Prediction with Transformers for 3DGS | Jisu Shin et.al. | 2507.15748 | null |
2025-07-21 | DWTGS: Rethinking Frequency Regularization for Sparse-view 3D Gaussian Splatting | Hung Nguyen et.al. | 2507.15690 | null |
2025-07-21 | Hi$^2$-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing | Boni Hu et.al. | 2507.15683 | null |
2025-07-21 | Gaussian Splatting with Discretized SDF for Relightable Assets | Zuo-Liang Zhu et.al. | 2507.15629 | null |
2025-07-28 | SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting | Zihui Gao et.al. | 2507.15602 | null |
2025-07-21 | ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting | Ruijie Zhu et.al. | 2507.15454 | null |
2025-07-25 | GCC: A 3DGS Inference Architecture with Gaussian-Wise and Cross-Stage Conditional Processing | Minnan Pei et.al. | 2507.15300 | null |
2025-07-20 | 3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline | Kaishva Chintan Shah et.al. | 2507.14924 | null |
2025-07-20 | Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction | Xiufeng Huang et.al. | 2507.14921 | null |
2025-07-20 | An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks | Xinyi Wu et.al. | 2507.14798 | null |
2025-07-30 | Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey | Jiahui Zhang et.al. | 2507.14501 | null |
2025-07-19 | Adaptive 3D Gaussian Splatting Video Streaming: Visual Saliency-Aware Tiling and Meta-Learning-Based Bitrate Adaptation | Han Gong et.al. | 2507.14454 | null |
2025-07-19 | Adaptive 3D Gaussian Splatting Video Streaming | Han Gong et.al. | 2507.14432 | null |
2025-08-01 | C-DOG: Multi-View Multi-instance Feature Association Using Connected δ-Overlap Graphs | Yung-Hong Sun et.al. | 2507.14095 | null |
2025-07-18 | TimeNeRF: Building Generalizable Neural Radiance Fields across Time from Few-Shot Input Views | Hsiang-Hui Hung et.al. | 2507.13929 | null |
2025-07-18 | Neural-GASh: A CGA-based neural radiance prediction pipeline for real-time shading | Efstratios Geronikolakis et.al. | 2507.13917 | null |
2025-07-21 | PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations | Yu Wei et.al. | 2507.13891 | null |
2025-07-18 | EPSilon: Efficient Point Sampling for Lightening of Hybrid-based 3D Avatar Generation | Seungjun Moon et.al. | 2507.13648 | null |
2025-07-18 | Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation | Masahiro Ogawa et.al. | 2507.13628 | null |
2025-07-19 | AutoPartGen: Autogressive 3D Part Generation and Discovery | Minghao Chen et.al. | 2507.13346 | null |
2025-07-16 | VolSegGS: Segmentation and Tracking in Dynamic Volumetric Scenes via Deformable 3D Gaussians | Siyuan Yao et.al. | 2507.12667 | null |
2025-07-16 | NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting | Kuangshi Ai et.al. | 2507.12621 | null |
2025-07-21 | Wavelet-GS: 3D Gaussian Splatting with Wavelet Decomposition | Beizhen Zhao et.al. | 2507.12498 | null |
2025-07-19 | SpatialTrackerV2: 3D Point Tracking Made Easy | Yuxi Xiao et.al. | 2507.12462 | null |
2025-07-16 | Revealing the Ancient Beauty: Digital Reconstruction of Temple Tiles using Computer Vision | Arkaprabha Basu et.al. | 2507.12195 | null |
2025-07-16 | DoRF: Doppler Radiance Fields for Robust Human Activity Recognition Using Wi-Fi | Navid Hasanzadeh et.al. | 2507.12132 | null |
2025-07-16 | BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images | Davide Di Nucci et.al. | 2507.12095 | null |
2025-07-16 | SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation | Beining Xu et.al. | 2507.12027 | null |
2025-07-16 | HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing | Tielong Wang et.al. | 2507.11971 | null |
2025-07-16 | Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark | Jingqian Wu et.al. | 2507.11931 | null |
2025-07-16 | CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning | Peiwen Xia et.al. | 2507.11834 | null |
2025-07-15 | Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Zhen Xu et.al. | 2507.11540 | null |
2025-07-21 | Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling | Hayeon Kim et.al. | 2507.11061 | null |
2025-07-14 | ScaffoldAvatar: High-Fidelity Gaussian Avatars with Patch Expressions | Shivangi Aneja et.al. | 2507.10542 | null |
2025-07-14 | Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry | Geyou Zhang et.al. | 2507.10009 | null |
2025-07-19 | 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving | Yixun Zhang et.al. | 2507.09993 | null |
2025-07-14 | VoxelRF: Voxelized Radiance Field for Fast Wireless Channel Modeling | Zihang Zeng et.al. | 2507.09987 | null |
2025-07-11 | From images to properties: a NeRF-driven framework for granular material parameter inversion | Cheng-Hsi Hsiao et.al. | 2507.09005 | null |
2025-07-11 | An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan | Mengyuan Liu et.al. | 2507.08690 | null |
2025-07-11 | Adaptive Framework for Ambient Intelligence in Rehabilitation Assistance | Gábor Baranyi et.al. | 2507.08624 | null |
2025-07-11 | Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT | Wei Zhang et.al. | 2507.08448 | null |
2025-07-11 | RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting | Ji Hyun Seo et.al. | 2507.08434 | null |
2025-07-11 | CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations | Wenbo Cui et.al. | 2507.08262 | null |
2025-07-10 | Temporally Consistent Amodal Completion for 3D Human-Object Interaction Reconstruction | Hyungjun Doh et.al. | 2507.08137 | null |
2025-07-18 | RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration | Chong Cheng et.al. | 2507.08136 | null |
2025-07-10 | Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions | Longfei Li et.al. | 2507.07978 | null |
2025-07-10 | RTR-GS: 3D Gaussian Splatting for Inverse Rendering with Radiance Transfer and Reflection | Yongyang Zhou et.al. | 2507.07733 | null |
Diffusion
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-28 | First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge | Fahad Shamshad et.al. | 2508.21072 | null |
2025-08-28 | Dress&Dance: Dress up and Dance as You Like It - Technical Preview | Jun-Kun Chen et.al. | 2508.21070 | null |
2025-08-28 | OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning | Yuan Gong et.al. | 2508.21066 | null |
2025-08-28 | Mixture of Contexts for Long Video Generation | Shengqu Cai et.al. | 2508.21058 | null |
2025-08-28 | HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning | Zhi Su et.al. | 2508.21043 | null |
2025-08-28 | FW-GAN: Frequency-Driven Handwriting Synthesis with Wave-Modulated MLP Generator | Huynh Tong Dang Khoa et.al. | 2508.21040 | null |
2025-08-28 | Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets | Dale Decatur et.al. | 2508.21032 | null |
2025-08-28 | System size and event shape dependence of particle-identified balance functions in proton-proton collisions at $\sqrt{s}=13$ TeV | Subash Chandra Behera et.al. | 2508.21030 | null |
2025-08-28 | POSE: Phased One-Step Adversarial Equilibrium for Video Diffusion Models | Jiaxiang Cheng et.al. | 2508.21019 | null |
2025-08-28 | Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance | Luozhijie Jin et.al. | 2508.21016 | null |
2025-08-28 | Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees | Yaniv Hassidof et.al. | 2508.21001 | null |
2025-08-28 | RANGAN: GAN-empowered Anomaly Detection in 5G Cloud RAN | Douglas Liao et.al. | 2508.20985 | null |
2025-08-28 | Random attractors and nonergodic attractors for diffusions with degeneracies | Yuri Bakhtin et.al. | 2508.20968 | null |
2025-08-28 | Very high-energy gamma-ray and neutrino emission from hadronic interaction in compact binary millisecond pulsars | Vittoria Vecchiotti et.al. | 2508.20952 | null |
2025-08-28 | Lattice Random Walk Discretisations of Stochastic Differential Equations | Samuel Duffield et.al. | 2508.20883 | null |
2025-08-28 | Understanding and evaluating computer vision models through the lens of counterfactuals | Pushkar Shukla et.al. | 2508.20881 | null |
2025-08-28 | Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement | Shrishti Saha Shetu et.al. | 2508.20859 | null |
2025-08-28 | Uniform error analysis of a rectangular Morley finite element method on a Shishkin mesh for a 4th-order singularly perturbed boundary value problem | Xiangyun Meng et.al. | 2508.20857 | null |
2025-08-28 | Learning Primitive Embodied World Models: Towards Scalable Robotic Learning | Qiao Sun et.al. | 2508.20840 | null |
2025-08-28 | High-Resolution Atomic Magnetometer-Based Imaging of Integrated Circuits and Batteries | Dominic Hunter et.al. | 2508.20834 | null |
2025-08-28 | Distinct Spatiotemporal Dynamics of Thermoelectric Transport Across Superconducting Transition | Rajae Malek et.al. | 2508.20792 | null |
2025-08-28 | Prediction of sulphate hazes in the lower Venus atmosphere | Peter Woitke et.al. | 2508.20790 | null |
2025-08-28 | Evaluating Compositional Generalisation in VLMs and Diffusion Models | Beth Pearson et.al. | 2508.20783 | null |
2025-08-28 | Unleashing Uncertainty: Efficient Machine Unlearning for Generative AI | Christoforos N. Spartalis et.al. | 2508.20773 | null |
2025-08-28 | Anomalous diffusion and run-and-tumble motion of a chemotactic particle in low dimensions | Jacopo Romano et.al. | 2508.20756 | null |
2025-08-28 | Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning | Yibin Wang et.al. | 2508.20751 | null |
2025-08-28 | A two-state generalisation of the strong collision model | Ola Kenji Forslund et.al. | 2508.20727 | null |
2025-08-28 | EEGDM: Learning EEG Representation with Latent Diffusion Model | Shaocong Wang et.al. | 2508.20705 | null |
2025-08-28 | Agent-based model of information diffusion in the limit order book trading | Mateusz Wilinski et.al. | 2508.20672 | null |
2025-08-28 | “Humor, Art, or Misinformation?”: A Multimodal Dataset for Intent-Aware Synthetic Image Detection | Anastasios Skoularikis et.al. | 2508.20670 | null |
2025-08-28 | Amadeus: Autoregressive Model with Bidirectional Attribute Modelling for Symbolic Music | Hongju Su et.al. | 2508.20665 | null |
2025-08-28 | VarDiU: A Variational Diffusive Upper Bound for One-Step Diffusion Distillation | Leyang Wang et.al. | 2508.20646 | null |
2025-08-28 | CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models | Ayan Banerjee et.al. | 2508.20640 | null |
2025-08-28 | EmoCAST: Emotional Talking Portrait via Emotive Text Description | Yiguo Jiang et.al. | 2508.20615 | null |
2025-08-28 | Revisiting the Privacy Risks of Split Inference: A GAN-Based Data Reconstruction Attack via Progressive Feature Optimization | Yixiang Qiu et.al. | 2508.20613 | null |
2025-08-28 | Physics Informed Generative Models for Magnetic Field Images | Aye Phyu Phyu Aung et.al. | 2508.20612 | null |
2025-08-28 | GENRE-CMR: Generalizable Deep Learning for Diverse Multi-Domain Cardiac MRI Reconstruction | Kian Anvari Hamedani et.al. | 2508.20600 | null |
2025-08-28 | Disruptive Attacks on Face Swapping via Low-Frequency Perceptual Perturbations | Mengxiao Huang et.al. | 2508.20595 | null |
2025-08-28 | FastFit: Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models | Zheng Chong et.al. | 2508.20586 | null |
2025-08-28 | Persode: Personalized Visual Journaling with Episodic Memory-Aware AI Agent | Seokho Jin et.al. | 2508.20585 | null |
2025-08-28 | SimShear: Sim-to-Real Shear-based Tactile Servoing | Kipp McAdam Freud et.al. | 2508.20561 | null |
2025-08-28 | Equilibria of aggregation-diffusion models with nonlinear potentials | Francesco Bozzola et.al. | 2508.20523 | null |
2025-08-28 | Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent | En Ci et.al. | 2508.20505 | null |
2025-08-28 | Run-and-tumble particle with diffusion: boundary local times and the zero-diffusion limit | Paul C Bressloff et.al. | 2508.20473 | null |
2025-08-28 | Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation | Jiusi Li et.al. | 2508.20471 | null |
2025-08-28 | Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models | Desen Sun et.al. | 2508.20424 | null |
2025-08-28 | AWorld: Orchestrating the Training Recipe for Agentic AI | Chengyue Yu et.al. | 2508.20404 | null |
2025-08-28 | Mean Field Game with Reflected Jump Diffusion Dynamics: A Linear Programming Approach | Zongxia Liang et.al. | 2508.20388 | null |
2025-08-28 | Do triangles matter? Replicating hypergraph disease dynamics with lower-order interactions | Eugene Tan et.al. | 2508.20380 | null |
2025-08-28 | Audio-Guided Visual Editing with Complex Multi-Modal Prompts | Hyeonyu Kim et.al. | 2508.20379 | null |
2025-08-28 | Numerical Method for Space-Time Fractional Diffusion: A Stochastic Approach | Tengteng Cui et.al. | 2508.20361 | null |
2025-08-28 | Artificial neural network solver for Fokker-Planck and Koopman eigenfunctions | Max Kreider et.al. | 2508.20339 | null |
2025-08-27 | Score-Based Diffusion Models in Infinite Dimensions: A Malliavin Calculus Perspective | Ehsan Mirafzali et.al. | 2508.20316 | null |
2025-08-27 | Efficient ion re-acceleration in laboratory-produced interpenetrating collisionless shocks | W. Yao et.al. | 2508.20303 | null |
2025-08-27 | Out-of-time-order correlators bridge classical transport and quantum dynamics | Sophia N. Fricke et.al. | 2508.20235 | null |
2025-08-27 | Velocity Spectrum Imaging using velocity encoding preparation pulses | Luis Hernandez-Garcia et.al. | 2508.20218 | null |
2025-08-27 | InfinityHuman: Towards Long-Term Audio-Driven Human | Xiaodi Li et.al. | 2508.20210 | null |
2025-08-27 | The structure of the giant radio fossil in the Ophiuchus galaxy cluster | Simona Giacintucci et.al. | 2508.20190 | null |
2025-08-27 | SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization | Yang Su et.al. | 2508.20182 | null |
2025-08-27 | Nonlinear diffusion in relativistic kinetic theory | Simone Calogero et.al. | 2508.20147 | null |
2025-08-27 | MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation | Kang-Hyun Lee et.al. | 2508.20138 | null |
2025-08-27 | Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning | Jinhao Liang et.al. | 2508.20095 | null |
2025-08-27 | AudioStory: Generating Long-Form Narrative Audio with Large Language Models | Yuxin Guo et.al. | 2508.20088 | null |
2025-08-27 | Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies | Zhixuan Liang et.al. | 2508.20072 | null |
2025-08-27 | A unique solution to overcome the barriers to planetesimal formation at low dust-to-gas ratio | H. Meheut et.al. | 2508.20070 | null |
2025-08-27 | Neural Conditional Simulation for Complex Spatial Processes | Julia Walchessen et.al. | 2508.20067 | null |
2025-08-27 | Joint Analysis of HI Absorption Zeeman Measurements and the Morphology of Filamentary HI Emission | Marta Nowotka et.al. | 2508.20065 | null |
2025-08-27 | Wave coarsening drives time crystallization in active solids | Jonas Veenstra et.al. | 2508.20052 | null |
2025-08-27 | GS: Generative Segmentation via Label Diffusion | Yuhao Chen et.al. | 2508.20020 | null |
2025-08-27 | Diffusion Language Models Know the Answer Before Decoding | Pengxiang Li et.al. | 2508.19982 | null |
2025-08-27 | The Information Dynamics of Generative Diffusion | Luca Ambrogioni et.al. | 2508.19897 | null |
2025-08-27 | Quantum latent distributions in deep generative models | Omar Bacarreza et.al. | 2508.19857 | null |
2025-08-28 | Ego-centric Predictive Model Conditioned on Hand Trajectories | Binjie Zhang et.al. | 2508.19852 | null |
2025-08-27 | Physics-Informed DeepONet Coupled with FEM for Convective Transport in Porous Media with Sharp Gaussian Sources | Erdi Kara et.al. | 2508.19847 | null |
2025-08-27 | Exotic rheology of materials with active rearrangements | Aondoyima Ioratim-Uba et.al. | 2508.19844 | null |
2025-08-27 | Not Every Gift Comes in Gold Paper or with a Red Ribbon: Exploring Color Perception in Text-to-Image Models | Shay Shomer Chai et.al. | 2508.19791 | null |
2025-08-27 | StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation | Xiuchao Wu et.al. | 2508.19789 | null |
2025-08-27 | Fast 3D Diffusion for Scalable Granular Media Synthesis | Muhammad Moeeze Hassan et.al. | 2508.19752 | null |
2025-08-27 | Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy | Binhui Zhang et.al. | 2508.19750 | null |
2025-08-27 | MC for Gastroretentive Drug Delivery | Sebastian Lotter et.al. | 2508.19739 | null |
2025-08-27 | Synthetic Image Detection via Spectral Gaps of QC-RBIM Nishimori Bethe-Hessian Operators | V. S. Usatyuk et.al. | 2508.19698 | null |
2025-08-27 | MnBr$_2$ on the graphene on Ir(110) substrate: growth, structure, and super-moiré | Affan Safeer et.al. | 2508.19694 | null |
2025-08-27 | Atomistic insights into hydrogen migration in IGZO from machine-learning interatomic potential: linking atomic diffusion to device performance | Hyunsung Cho et.al. | 2508.19674 | null |
2025-08-27 | Multi-value Probabilistic Computing with current-controlled Skyrmion Diffusion | Thomas B. Winkler et.al. | 2508.19623 | null |
2025-08-27 | IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation | Qizhe Fan et.al. | 2508.19604 | null |
2025-08-27 | Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction | Dat Nguyen Cong et.al. | 2508.19581 | null |
2025-08-28 | Interact-Custom: Customized Human Object Interaction Image Generation | Zhu Xu et.al. | 2508.19575 | null |
2025-08-27 | Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era | Dawei Li et.al. | 2508.19570 | null |
2025-08-27 | MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery | Yu-Wei Zhang et.al. | 2508.19555 | null |
2025-08-27 | Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding | Bowen Sun et.al. | 2508.19529 | null |
2025-08-27 | MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment | Zhiting Gao et.al. | 2508.19527 | null |
2025-08-27 | Functionally-graded drug delivery systems with binding reactions: analytical and stochastic approaches for the fraction of drug released | Obi A. Carwood et.al. | 2508.19510 | null |
2025-08-27 | DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View | Tian Qiu et.al. | 2508.19508 | null |
2025-08-27 | Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery | Xiangxu Wang et.al. | 2508.19499 | null |
2025-08-27 | Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks | Muhammad Ahmed Mohsin et.al. | 2508.19495 | null |
2025-08-26 | MRExtrap: Longitudinal Aging of Brain MRIs using Linear Modeling in Latent Space | Jaivardhan Kapoor et.al. | 2508.19482 | null |
2025-08-26 | Shining light on degeneracies and uncertainties in quantifying both exchange and restriction with time-dependent diffusion MRI using Bayesian inference | Maëliss Jallais et.al. | 2508.19478 | null |
2025-08-26 | Hydrodynamic Limit of the Symmetric Zero-Range Process with Slow Boundary | Oslenne Araújo et.al. | 2508.19447 | null |
2025-08-26 | On Surjectivity of Neural Networks: Can you elicit any behavior from your model? | Haozhe Jiang et.al. | 2508.19445 | null |
2025-08-26 | Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization | Paimon Goulart et.al. | 2508.19443 | null |
2025-08-26 | Quantification of mobile ions in perovskite solar cells with thermally activated ion current measurements | Moritz C. Schmidt et.al. | 2508.19403 | null |
2025-08-26 | DETNO: A Diffusion-Enhanced Transformer Neural Operator for Long-Term Traffic Forecasting | Owais Ahmad et.al. | 2508.19389 | null |
2025-08-26 | Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs | Supratik Sarkar et.al. | 2508.19366 | null |
2025-08-28 | MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation | Ming Chen et.al. | 2508.19320 | null |
2025-08-26 | Disorder-induced proximate quantum spin ice phase in Pr$_2$Sn$_2$O$_7$ | Yi Luo et.al. | 2508.19248 | null |
2025-08-26 | Articulate3D: Zero-Shot Text-Driven 3D Object Posing | Oishi Deb et.al. | 2508.19244 | null |
2025-08-26 | MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation | Hao Shi et.al. | 2508.19236 | null |
2025-08-26 | VibeVoice Technical Report | Zhiliang Peng et.al. | 2508.19205 | null |
2025-08-26 | LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding | Julian Ost et.al. | 2508.19204 | null |
2025-08-26 | Planning-Query-Guided Model Generation for Model-Based Deformable Object Manipulation | Alex LaGrassa et.al. | 2508.19199 | null |
2025-08-26 | All-in-One Slider for Attribute Manipulation in Diffusion Models | Weixin Ye et.al. | 2508.19195 | null |
2025-08-26 | MDD: a Mask Diffusion Detector to Protect Speaker Verification Systems from Adversarial Perturbations | Yibo Bai et.al. | 2508.19180 | null |
2025-08-26 | Stoch-IDENT: New Method and Mathematical Analysis for Identifying SPDEs from Data | Jianbo Cui et.al. | 2508.19177 | null |
2025-08-26 | RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration | Yan Chen et.al. | 2508.19154 | null |
2025-08-26 | Saddle Hierarchy in Dense Associative Memory | Robin Thériault et.al. | 2508.19151 | null |
2025-08-26 | Alloyed cementite (Fe-Ni-Cr)$_3$C: structure and hyperfine field from DFT calculations and experimental comparison | Lyudmila V. Dobysheva et.al. | 2508.19148 | null |
2025-08-26 | Lattice vacancy migration barriers in Fe-Ni alloys, and why Ni atoms diffuse slowly: An ab initio study | Adam M. Fisher et.al. | 2508.19124 | null |
2025-08-26 | Composition and Alignment of Diffusion Models using Constrained Learning | Shervin Khalafi et.al. | 2508.19104 | null |
2025-08-26 | Evaluation of in vitro antibacterial activity and phytochemical profile of aqueous leaf extract of Asystasia variabilis | R Wijerathna et.al. | 2508.19049 | null |
2025-08-26 | In-vitro Anti-bacterial Activity of Methanol and Aqueous Crude Extracts of Horsfieldia iryaghedhi | RMHKK Rajapaksha et.al. | 2508.19025 | null |
2025-08-28 | STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems | Gary Simethy et.al. | 2508.19011 | null |
2025-08-26 | Detection of Diffuse Radio Emission inside the Supernova Remnant G338.3-0.0 associated with the Gamma-ray Source HESS J1640-465 | Moaz Abdelmaguid et.al. | 2508.18999 | null |
2025-08-26 | Krylov-Veretennikov desomposition for measure-valued processes induced by SDEs with interaction on Riemannian manifolds | Andrey Dorogovtsev et.al. | 2508.18995 | null |
2025-08-26 | Junctional-Fluctuation-Mediated Fluidisation of Multi-Phase Field Epithelial Monolayers | James N. Graham et.al. | 2508.18987 | null |
2025-08-26 | Vanishing Angular Viscosity Limit For Micropolar Fluid Model In $\mathbb{R}_+^2$: Boundary Layer And Optimal Convergence Rate | Yinghui Wang et.al. | 2508.18980 | null |
2025-08-26 | Linear approximations of large deviations: Cubic diffusion test | Pelerine Tsobgni Nyawo et.al. | 2508.18977 | null |
2025-08-26 | Generative AI in Map-Making: A Technical Exploration and Its Implications for Cartographers | Claudio Affolter et.al. | 2508.18959 | null |
2025-08-26 | Energy-Based Flow Matching for Generating 3D Molecular Structure | Wenyin Zhou et.al. | 2508.18949 | null |
2025-08-26 | Stochastic Forces Enhance Tracer Diffusion in Non-motile Active Matter | Henry Alston et.al. | 2508.18882 | null |
2025-08-26 | Experimental investigation of turbulence and turbulent thermal diffusion in strongly inhomogeneous and anisotropic forced convection | E. Zarbib et.al. | 2508.18865 | null |
2025-08-26 | Super and Weak Poincaré Inequalities for Sticky-Reflected Diffusion Processes | Feng-Yu Wang et.al. | 2508.18846 | null |
2025-08-26 | Single-Photon Detection in Few-Layer NbSe$_2$ Superconducting Nanowires | Lucio Zugliani et.al. | 2508.18843 | null |
2025-08-26 | Quantum-Circuit-Based Visual Fractal Image Generation in Qiskit and Analytics | Hillol Biswas et.al. | 2508.18835 | null |
2025-08-26 | On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation | Adrian Meise et.al. | 2508.18833 | null |
2025-08-26 | Asymptotic limit of a vector-valued Allen-Cahn equation for phase transition dynamics | Huan Dong et.al. | 2508.18754 | null |
2025-08-26 | Joint Time-Position Statistics and Fisher Information in Drift-Diffusion Molecular Channels | Yun-Feng Lo et.al. | 2508.18680 | null |
2025-08-26 | ROSE: Remove Objects with Side Effects in Videos | Chenxuan Miao et.al. | 2508.18633 | null |
2025-08-26 | Wan-S2V: Audio-Driven Cinematic Video Generation | Xin Gao et.al. | 2508.18621 | null |
2025-08-26 | SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis | Xiaohao Sun et.al. | 2508.18597 | null |
2025-08-26 | Search for the radiative decay of the cosmic neutrino background through spectral measurements of the cosmic infrared background using PRIMA | Yuji Takeuchi et.al. | 2508.18590 | null |
2025-08-25 | Controllable Single-shot Animation Blending with Temporal Conditioning | Eleni Tselepi et.al. | 2508.18525 | null |
2025-08-25 | VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results | Sizhuo Ma et.al. | 2508.18445 | null |
2025-08-25 | Phase-Field Model of Freeze Casting | Kaihua Ji et.al. | 2508.18416 | null |
2025-08-25 | Hillas meets Eddington: the case for blazars as ultra-high-energy neutrino sources | Xavier Rodrigues et.al. | 2508.18345 | null |
2025-08-25 | ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models | Haitang Feng et.al. | 2508.18271 | null |
2025-08-25 | SafeBimanual: Diffusion-based Trajectory Optimization for Safe Bimanual Manipulation | Haoyuan Deng et.al. | 2508.18268 | null |
2025-08-25 | Diffusiophoretic corner flows | Dobromir Nowak et.al. | 2508.18233 | null |
2025-08-25 | Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance | Ayce Idil Aytekin et.al. | 2508.18213 | null |
2025-08-25 | New shell-model calculations of the $\delta_C$ correction to superallowed $0^+\rightarrow 0^+$ nuclear $\beta$ decay and standard-model implications | L. Xayavong et.al. | 2508.18189 | null |
2025-08-25 | SpotEdit: Evaluating Visually-Guided Image Editing Methods | Sara Ghazanfari et.al. | 2508.18159 | null |
2025-08-25 | Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation | Haijian Ma et.al. | 2508.18148 | null |
2025-08-25 | Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem | Zhicong Tang et.al. | 2508.18095 | null |
2025-08-26 | Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation | Yaqi Li et.al. | 2508.18032 | null |
2025-08-25 | HD 28471: a near-resonant compact multiplanet system with a possible cold giant planet | A. T. Stevenson et.al. | 2508.18000 | null |
2025-08-26 | Solute dispersion in axially strained tube flows: Large-time asymptotics and Ornstein-Uhlenbeck Gaussian profiles | Prabakaran Rajamanickam et.al. | 2508.17982 | null |
2025-08-25 | Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech | Dimme de Groot et.al. | 2508.17980 | null |
2025-08-26 | Generative Feature Imputing – A Technique for Error-resilient Semantic Communication | Jianhao Huang et.al. | 2508.17957 | null |
2025-08-25 | Nodal error behind discrepancies between coupled cluster and diffusion Monte Carlo: AcOH dimer case study | S. Lambie et.al. | 2508.17937 | null |
2025-08-25 | Parallel Nodal Interior-Penalty Discontinuous Galerkin Methods for the Subsonic Compressible Navier-Stokes Equations: Applications to Vortical Flows and VIV Problems | Spiros Zafeiris et.al. | 2508.17917 | null |
2025-08-25 | Quasi-likelihood inference for SDE with mixed-effects observed at high frequency | Maud Delattre et.al. | 2508.17910 | null |
2025-08-25 | Local Well-Posedness of the Cahn-Hilliard-Biot System | Helmut Abels et.al. | 2508.17893 | null |
2025-08-27 | Vocoder-Projected Feature Discriminator | Takuhiro Kaneko et.al. | 2508.17874 | null |
2025-08-25 | FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation | Takuhiro Kaneko et.al. | 2508.17868 | null |
2025-08-25 | Diffusion-Based Data Augmentation for Medical Image Segmentation | Maham Nazir et.al. | 2508.17844 | null |
2025-08-25 | Threshold Diffusions | Lina Ji et.al. | 2508.17812 | null |
2025-08-25 | CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation | Mingyue Yang et.al. | 2508.17760 | null |
2025-08-25 | SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling | Fanjiang Ye et.al. | 2508.17756 | null |
2025-08-25 | DiffusionGS: Generative Search with Query Conditioned Diffusion in Kuaishou | Qinyao Li et.al. | 2508.17754 | null |
2025-08-25 | Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework | Koichiro Kamide et.al. | 2508.17726 | null |
2025-08-25 | Instant Preference Alignment for Text-to-Image Diffusion Models | Yang Li et.al. | 2508.17718 | null |
2025-08-25 | CATformer: Contrastive Adversarial Transformer for Image Super-Resolution | Qinyi Tian et.al. | 2508.17708 | null |
2025-08-25 | On the Edge of Memorization in Diffusion Models | Sam Buchanan et.al. | 2508.17689 | null |
2025-08-25 | Calculating the power spectrum in stochastic inflation by Monte Carlo simulation and least squares curve fitting | Koichi Miyamoto et.al. | 2508.17654 | null |
2025-08-27 | ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion | Nima Kondori et.al. | 2508.17631 | null |
2025-08-25 | Effects of Near-Field Hydrodynamic Interactions on Bacterial Dynamics Near a Solid Surface | Baopi Liu et.al. | 2508.17626 | null |
2025-08-25 | Steering When Necessary: Flexible Steering Large Language Models with Backtracking | Jinwei Gan et.al. | 2508.17621 | null |
2025-08-25 | Preference Trajectory Modeling via Flow Matching for Sequential Recommendation | Li Li et.al. | 2508.17618 | null |
2025-08-25 | JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on | Aowen Wang et.al. | 2508.17614 | null |
2025-08-25 | HotSpotter - Patterned Species Instance Recognition | Jonathan P. Crall et.al. | 2508.17605 | null |
2025-08-25 | GWM: Towards Scalable Gaussian World Models for Robotic Manipulation | Guanxing Lu et.al. | 2508.17600 | null |
2025-08-25 | HERO: Hierarchical Extrapolation and Refresh for Efficient World Models | Quanjian Song et.al. | 2508.17588 | null |
2025-08-24 | Controllability of a system of non-autonomous degenerate coupled parabolic equations | Alfredo S. Gamboa et.al. | 2508.17546 | null |
2025-08-24 | Universal scaling of higher-order cumulants in quantum isotropic spin chains | Shixian Jiang et.al. | 2508.17535 | null |
2025-08-24 | Learning Reaction-Diffusion Kinetics from Mechanical Information | Royal C. Ihuaenyi et.al. | 2508.17523 | null |
2025-08-24 | Variational Shape Inference for Grasp Diffusion on SE(3) | S. Talha Bukhari et.al. | 2508.17482 | null |
2025-08-24 | T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation | Kaiyue Sun et.al. | 2508.17472 | null |
2025-08-24 | A Synthetic Dataset for Manometry Recognition in Robotic Applications | Pedro Antonio Rabelo Saraiva et.al. | 2508.17468 | null |
2025-08-24 | Bias Amplification in Stable Diffusion’s Representation of Stigma Through Skin Tones and Their Homogeneity | Kyra Wilson et.al. | 2508.17465 | null |
2025-08-24 | Disentangled Geometry and Appearance for Efficient Multi-View Surface Reconstruction and Rendering | Qitong Zhang et.al. | 2508.17436 | null |
2025-08-24 | An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing | Zihan Liang et.al. | 2508.17435 | null |
2025-08-24 | TinySR: Pruning Diffusion for Real-World Image Super-Resolution | Linwei Dong et.al. | 2508.17434 | null |
2025-08-24 | Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling | Haochen You et.al. | 2508.17426 | null |
2025-08-24 | Asteroid Rotation Periods: Statistical Analysis in the Diameter-Spin Distribution | Maryam Nastaran et.al. | 2508.17415 | null |
2025-08-24 | MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling | Haoyu Wang et.al. | 2508.17404 | null |
2025-08-24 | Stability and uniqueness of bounded weak solutions to triangular degenerate cross-diffusion systems | Xiuqing Chen et.al. | 2508.17379 | null |
2025-08-24 | ShaLa: Multimodal Shared Latent Space Modelling | Jiali Cui et.al. | 2508.17376 | null |
2025-08-24 | Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation | Guoqing Zhang et.al. | 2508.17364 | null |
2025-08-24 | DiCache: Let Diffusion Model Determine Its Own Cache | Jiazi Bu et.al. | 2508.17356 | null |
2025-08-24 | ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation | Yuxuan Song et.al. | 2508.17345 | null |
2025-08-24 | Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing | Tristan S. W. Stevens et.al. | 2508.17326 | null |
2025-08-24 | An improved nonlocal electron heat transport model for magnetized plasmas | Z. H. Chen et.al. | 2508.17309 | null |
2025-08-24 | PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing | Peilin Xiong et.al. | 2508.17302 | null |
2025-08-24 | FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising | Zhihao Chen et.al. | 2508.17299 | null |
2025-08-24 | 4D Visual Pre-training for Robot Learning | Chengkai Hou et.al. | 2508.17230 | null |
2025-08-24 | Multi-Metric Preference Alignment for Generative Speech Restoration | Junan Zhang et.al. | 2508.17229 | null |
2025-08-24 | Effects of Geometric configuration in relativistic isobaric collisions at $\sqrt{s_{NN}}=200$ GeV | Akash Das et.al. | 2508.17227 | null |
2025-08-24 | MMCIG: Multimodal Cover Image Generation for Text-only Documents and Its Dataset Construction via Pseudo-labeling | Hyeyeon Kim et.al. | 2508.17199 | null |
2025-08-23 | Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities | Yili Jin et.al. | 2508.17163 | null |
2025-08-23 | SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks | Zhenliang Gan et.al. | 2508.17121 | null |
2025-08-23 | CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference | Luben M. C. Cabezas et.al. | 2508.17077 | null |
2025-08-23 | LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening | Halid Abdulrahim Kadi et.al. | 2508.17070 | null |
2025-08-23 | SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation | Peng Hu et.al. | 2508.17062 | null |
2025-08-23 | PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models | Xianjing Cheng et.al. | 2508.17050 | null |
2025-08-23 | Styleclone: Face Stylization with Diffusion Based Data Augmentation | Neeraj Matiyali et.al. | 2508.17045 | null |
2025-08-23 | A Novel Local Focusing Mechanism for Deepfake Detection Generalization | Mingliang Li et.al. | 2508.17029 | null |
2025-08-23 | Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation | Konstantina Nikolaidou et.al. | 2508.17017 | null |
2025-08-23 | An improved lattice Boltzmann method with a novel conservative boundary scheme for viscoelastic fluid flows | Yuan Yu et.al. | 2508.16997 | null |
2025-08-23 | Score Matching on Large Geometric Graphs for Cosmology Generation | Diana-Alexandra Onutu et.al. | 2508.16990 | null |
2025-08-23 | HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching | Liang Feng et.al. | 2508.16984 | null |
2025-08-23 | Shape optimization problems with random coefficients via the penalty method | Xiaowei Pang et.al. | 2508.16961 | null |
2025-08-23 | RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze | Ruicheng Zhang et.al. | 2508.16956 | null |
2025-08-23 | Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model | Fan Ding et.al. | 2508.16947 | null |
2025-08-23 | Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter | Lei Jiang et.al. | 2508.16939 | null |
2025-08-23 | HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation | Sizhe Shan et.al. | 2508.16930 | null |
2025-08-23 | Structural Energy-Guided Sampling for View-Consistent Text-to-3D | Qing Zhang et.al. | 2508.16917 | null |
2025-08-23 | Remarks on the three-dimensional Navier-Stokes equations with Lions’ exponent forced by space-time white noise | Kazuo Yamazaki et.al. | 2508.16906 | null |
2025-08-23 | Enhanced shape recovery in advection–diffusion problems via a novel ADMM-based CCBM optimization | Elmehdi Cherrat et.al. | 2508.16898 | null |
2025-08-23 | Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network | Pouya Shiri et.al. | 2508.16897 | null |
2025-08-23 | Delta-SVD: Efficient Compression for Personalized Text-to-Image Models | Tangyuan Zhang et.al. | 2508.16863 | null |
2025-08-23 | Subtleties of UV-crosslinking in microfluidic particle fabrication: UV dosage and intensity matter | Sabrina Marnoto et.al. | 2508.16862 | null |
2025-08-23 | Intelligent Shanghai Typhoon Model (ISTM): A generative probabilistic emulator for typhoon hybrid modeling | Zeyi Niu et.al. | 2508.16851 | null |
2025-08-23 | NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows | Denis Tarasov et.al. | 2508.16845 | null |
2025-08-22 | A Fluctuating Hydrodynamics Model for Nanoscale Surfactant-laden Interfaces | John B. Bell et.al. | 2508.16820 | null |
2025-08-22 | Two-Step Bose-Einstein Condensation of an ideal Magnetized Charged Bosonic gas under neutron star-like conditions | Amanda Castillo Ayon et.al. | 2508.16799 | null |
2025-08-22 | TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling | Yuancheng Wang et.al. | 2508.16790 | null |
2025-08-22 | Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data | Stefania L. Moroianu et.al. | 2508.16783 | null |
2025-08-26 | Characterising the short-orbital period X-ray transient Swift J1910.2-0546 | J. M. Corral-Santana et.al. | 2508.16775 | null |
2025-08-22 | Spontaneous spiral patterns etched on Germanium | Yilin Wong et.al. | 2508.16764 | null |
2025-08-22 | A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers | Marco N. Bochernitsan et.al. | 2508.16752 | null |
2025-08-22 | Hamiltonian Simulation for Advection-Diffusion Equation with arbitrary transport field | Niladri Gomes et.al. | 2508.16728 | null |
2025-08-22 | MV-RAG: Retrieval Augmented Multiview Diffusion | Yosef Dayani et.al. | 2508.16577 | null |
2025-08-22 | Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution | Tainyi Zhang et.al. | 2508.16557 | null |
2025-08-22 | Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning | Xuan Zhang et.al. | 2508.16524 | null |
2025-08-22 | Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation | Zhijian Zhou et.al. | 2508.16521 | null |
2025-08-22 | ARSP: Automated Repair of Verilog Designs via Semantic Partitioning | Bingkun Yao et.al. | 2508.16517 | null |
2025-08-22 | Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation | Chun-Peng Chang et.al. | 2508.16512 | null |
2025-08-22 | Underdamped Langevin MCMC with third order convergence | Maximilian Scott et.al. | 2508.16485 | null |
2025-08-22 | Large-scale concentration and relaxation for mean-field Langevin particle systems | Songbo Wang et.al. | 2508.16428 | null |
2025-08-22 | Multiscale Growth Kinetics of Model Biomolecular Condensates Under Passive and Active Conditions | Tamizhmalar Sundararajan et.al. | 2508.16398 | null |
2025-08-22 | Parrondo paradox in quantum image encryption | Łukasz Pawela et.al. | 2508.16382 | null |
2025-08-22 | Observation of negative orbital torque from Vanadium | Nikhil Vijayan et.al. | 2508.16339 | null |
2025-08-22 | A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions | Nishant Jain et.al. | 2508.16306 | null |
2025-08-22 | Towards Diagnostic Quality Flat-Panel Detector CT Imaging Using Diffusion Models | Hélène Corbaz et.al. | 2508.16252 | null |
2025-08-22 | Numerical solution of the time fractional nonlinear Fisher-KPP diffusion-reaction equation using the local domain boundary element method | Theodore V. Gortsas et.al. | 2508.16241 | null |
2025-08-22 | UniEM-3M: A Universal Electron Micrograph Dataset for Microstructural Segmentation and Generation | Nan Wang et.al. | 2508.16239 | null |
2025-08-22 | PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting | Hohyun Na et.al. | 2508.16217 | null |
2025-08-22 | OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models | Huanpeng Chu et.al. | 2508.16212 | null |
2025-08-22 | Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers | Shikang Zheng et.al. | 2508.16211 | null |
2025-08-22 | Competition and Attraction Improve Model Fusion | João Abrantes et.al. | 2508.16204 | null |
2025-08-22 | FuXi-TC: A generative framework integrating deep learning and physics-based models for improved tropical cyclone forecasts | Shan Guo et.al. | 2508.16168 | null |
2025-08-22 | Transport Properties of QGP within a Bayesian Holographic QCD Model | Bing Chen et.al. | 2508.16167 | null |
2025-08-22 | RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution | Haodong He et.al. | 2508.16158 | null |
2025-08-22 | On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models | Yi Zhang et.al. | 2508.16154 | null |
2025-08-22 | Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design | Ayyüce Begüm Bektaş et.al. | 2508.16097 | null |
2025-08-22 | Two-flow Feedback Multi-scale Progressive Generative Adversarial Network | Sun Weikai et.al. | 2508.16089 | null |
2025-08-22 | A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection | Qifeng Liu et.al. | 2508.16069 | null |
2025-08-21 | Clinically-Informed Preprocessing Improves Stroke Segmentation in Low-Resource Settings | Juampablo E. Heras Rivera et.al. | 2508.16004 | null |
2025-08-21 | Multiscale Analysis of a Kinetic Model of Confined Suspensions of Self-Propelled Rods | Leonid Berlyand et.al. | 2508.16003 | null |
2025-08-21 | Universal Fluctuations in the Tail Probability for d=2 Random Walks in Space-Time Random Environments | Franscesca Ark et.al. | 2508.15999 | null |
2025-08-21 | Diverse Signer Avatars with Manual and Non-Manual Feature Modelling for Sign Language Production | Mohamed Ilyes Lakhal et.al. | 2508.15988 | null |
2025-08-21 | UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation | Zhaodong Jiang et.al. | 2508.15972 | null |
2025-08-21 | Physical blowups via buffered time change in a mean-field neural network | Nikolaos Papadopoulos et.al. | 2508.15961 | null |
2025-08-21 | Structure-Preserving Medical Image Generation from a Latent Graph Representation | Kevin Arias et.al. | 2508.15920 | null |
2025-08-21 | Text-Driven 3D Hand Motion Generation from Sign Language Data | Léore Bensabath et.al. | 2508.15902 | null |
2025-08-21 | Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning | Yijun Liu et.al. | 2508.15874 | null |
2025-08-21 | CineScale: Free Lunch in High-Resolution Cinematic Visual Generation | Haonan Qiu et.al. | 2508.15774 | null |
2025-08-21 | Scaling Group Inference for Diverse and High-Quality Generation | Gaurav Parmar et.al. | 2508.15773 | null |
2025-08-21 | Visual Autoregressive Modeling for Instruction-Guided Image Editing | Qingyang Mao et.al. | 2508.15772 | null |
2025-08-21 | Waver: Wave Your Way to Lifelike Video Generation | Yifu Zhang et.al. | 2508.15761 | null |
2025-08-21 | Skyrmion Lattice Order Controlled by Confinement Geometry | Raphael Gruber et.al. | 2508.15758 | null |
2025-08-21 | Spatial Super-Infection and Co-Infection Dynamics in Networks | Alyssa Yu et.al. | 2508.15740 | null |
2025-08-21 | Probability Density from Latent Diffusion Models for Out-of-Distribution Detection | Joonas Järve et.al. | 2508.15737 | null |
2025-08-21 | The Status of the Astrophysical Parameters of Upper Main Sequence Stars | Lukas Kueß et.al. | 2508.15722 | null |
2025-08-21 | WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception | Zhiheng Liu et.al. | 2508.15720 | null |
2025-08-21 | Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation | Nikita Kachaev et.al. | 2508.15663 | null |
2025-08-21 | When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding | Pengcheng Fang et.al. | 2508.15641 | null |
2025-08-21 | Are Virtual DES Images a Valid Alternative to the Real Ones? | Ana C. Perre et.al. | 2508.15594 | null |
2025-08-21 | Lattice distortions and non-sluggish diffusion in BCC refractory high entropy alloys | Jingfeng Zhang et.al. | 2508.15558 | null |
2025-08-21 | Dream 7B: Diffusion Large Language Models | Jiacheng Ye et.al. | 2508.15487 | null |
2025-08-21 | Reevaluating Anomalous Electric Fields at the Air-Water Interface: A Surface-Specific Spectroscopic Survey | Joseph C. Shirley et.al. | 2508.15422 | null |
2025-08-21 | Speckle suppression in digital in-line holographic microscopy through liquid crystal dynamic scattering | Emilia Wdowiak et.al. | 2508.15419 | null |
2025-08-21 | Numerical Analysis of Unsupervised Learning Approaches for Parameter Identification in PDEs | Siyu Cen et.al. | 2508.15381 | null |
2025-08-21 | Diffusion-driven pattern formation in an opinion dynamical network model | Tim Mauch et.al. | 2508.15377 | null |
2025-08-21 | Performance Analysis of RIS-Aided High-Mobility Wireless Systems | Hanwen Hu et.al. | 2508.15375 | null |
2025-08-22 | Analytical Theory of Chiral Active Particle Transport in a Fluctuating Density Field | Jayam Joshi et.al. | 2508.15366 | null |
2025-08-21 | The effect of multi-occupancy traps on the diffusion and retention of multiple hydrogen isotopes in irradiated tungsten and vanadium | Sanjeet Kaur et.al. | 2508.15341 | null |
2025-08-21 | Discovering correlations between metal foam thermal characteristics and non-Fourier behavior | Anna Fehér et.al. | 2508.15340 | null |
2025-08-21 | Interface fluctuations for 1D stochastic Allen-Cahn equation – singular regime | Weijun Xu et.al. | 2508.15319 | null |
2025-08-21 | VideoEraser: Concept Erasure in Text-to-Video Diffusion Models | Naen Xu et.al. | 2508.15314 | null |
2025-08-21 | HIP: Model-Agnostic Hypergraph Influence Prediction via Distance-Centrality Fusion and Neural ODEs | Su-Su Zhang et.al. | 2508.15312 | null |
2025-08-21 | Modeling Long-term User Behaviors with Diffusion-driven Multi-interest Network for CTR Prediction | Weijiang Lai et.al. | 2508.15311 | null |
2025-08-21 | Contribution of Globular Clusters to Diffuse Gamma-ray Emission from Galactic Plane | Jiayin He et.al. | 2508.15295 | null |
2025-08-21 | Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing | Ruilin Zhou et.al. | 2508.15267 | null |
2025-08-21 | Pathology-Informed Latent Diffusion Model for Anomaly Detection in Lymph Node Metastasis | Jiamu Wang et.al. | 2508.15236 | null |
2025-08-21 | Pretrained Diffusion Models Are Inherently Skipped-Step Samplers | Wenju Xu et.al. | 2508.15233 | null |
2025-08-21 | Collaborative Multi-Modal Coding for High-Quality 3D Generation | Ziang Cao et.al. | 2508.15228 | null |
2025-08-21 | GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design | Wen-Fan Wang et.al. | 2508.15227 | null |
2025-08-21 | A rutile-based homologous series Na(PtO$_2$)$_{2n+1}$ discovered by computationally assisted high-pressure synthesis | Yasuhito Kobayashi et.al. | 2508.15223 | null |
2025-08-21 | See it. Say it. Sorted: Agentic System for Compositional Diagram Generation | Hantao Zhang et.al. | 2508.15222 | null |
2025-08-21 | Obstacle-tuned transition from chaotic to coherent vortex flows and odd diffusion in chiral active fluids | Joscha Mecke et.al. | 2508.15210 | null |
2025-08-21 | Quantum Differential Equation Solvers with Low State Preparation Cost: Eliminating the Time Dependence in Dissipative Equations | Gengzhi Yang et.al. | 2508.15170 | null |
2025-08-21 | MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion | Xuyang Chen et.al. | 2508.15169 | null |
2025-08-21 | Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors | Jeonghyun Noh et.al. | 2508.15151 | null |
2025-08-21 | Electron-Ion Equilibration in the Merging Galaxy Cluster Abell 665 | Christian Norseth et.al. | 2508.15138 | null |
2025-08-24 | Side Effects of Erasing Concepts from Diffusion Models | Shaswati Saha et.al. | 2508.15124 | null |
2025-08-20 | Microstructural and preliminary optical and microwave characterization of erbium doped CaMoO$_4$ thin films | Ignas Masiulionis et.al. | 2508.15122 | null |
2025-08-24 | CurveFlow: Curvature-Guided Flow Matching for Image Generation | Yan Luo et.al. | 2508.15093 | null |
2025-08-20 | Sampling by averaging: A multiscale approach to score estimation | Paula Cordero-Encinar et.al. | 2508.15069 | null |
2025-08-20 | Asymptotic analysis on narrow tubes: narrow escape problems and diffusion processes | Wen-Tai Hsu et.al. | 2508.15060 | null |
2025-08-20 | Correlating Particle Acceleration Rates with Plasma Conditions in Colliding Wind Binaries | Gislaine B Cordeiro et.al. | 2508.15059 | null |
2025-08-20 | An MRI Atlas of the Human Fetal Brain: Reference and Segmentation Tools for Fetal Brain MRI Analysis | Mahdi Bagheri et.al. | 2508.15034 | null |
2025-08-20 | Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement | Chunming He et.al. | 2508.15027 | null |
2025-08-20 | TAIGen: Training-Free Adversarial Image Generation via Diffusion Models | Susim Roy et.al. | 2508.15020 | null |
2025-08-20 | Probing Magnetic Properties of RuO$_{2}$ Heterostructures Through the Ferromagnetic Layer | Frank M. Abel et.al. | 2508.15004 | null |
2025-08-20 | LyLA-Therm: Lyapunov-based Langevin Adaptive Thermodynamic Neural Network Controller | Saiedeh Akbari et.al. | 2508.14989 | null |
2025-08-20 | Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System | Joydeep Chandra et.al. | 2508.14976 | null |
2025-08-20 | Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI | Oliver Welin Odeback et.al. | 2508.14950 | null |
2025-08-19 | Inference Time Debiasing Concepts in Diffusion Models | Lucas S. Kupssinskü et.al. | 2508.14933 | null |
2025-08-19 | TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation | Jiacheng Xie et.al. | 2508.14932 | null |
2025-08-20 | Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs | Haokun Lin et.al. | 2508.14896 | null |
2025-08-20 | Virtual Community: An Open World for Humans, Robots, and Society | Qinhong Zhou et.al. | 2508.14893 | null |
2025-08-20 | Squeezed Diffusion Models | Jyotirmai Singh et.al. | 2508.14871 | null |
2025-08-20 | Critical trajectories in kinetic geometry | Helge Dietert et.al. | 2508.14868 | null |
2025-08-20 | Universal winding properties of chiral active motion | Ion Santra et.al. | 2508.14862 | null |
2025-08-20 | Physics-Informed ML Exploration of Structure-Transport Relationships in Hard Carbon | Nikhil Rampal et.al. | 2508.14849 | null |
2025-08-20 | TransLight: Image-Guided Customized Lighting Control with Generative Decoupling | Zongming Li et.al. | 2508.14814 | null |
2025-08-20 | Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization | Canyu Zhao et.al. | 2508.14811 | null |
2025-08-20 | Cross-Modality Controlled Molecule Generation with Diffusion Language Model | Yunzhe Zhang et.al. | 2508.14748 | null |
2025-08-20 | Modeling the impact of temperature and bird migration on the spread of West Nile virus | Pride Duve et.al. | 2508.14740 | null |
2025-08-20 | GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting | Jiaxin Wei et.al. | 2508.14717 | null |
2025-08-20 | The heating and cooling of 2D electrons at low temperatures | A. K. Jain et.al. | 2508.14694 | null |
2025-08-20 | Virtual Multiplex Staining for Histological Images using a Marker-wise Conditioned Diffusion Model | Hyun-Jic Oh et.al. | 2508.14681 | null |
2025-08-21 | Phase space transport, quasilinear diffusion and locality in phase velocity | Didier Bénisti et.al. | 2508.14657 | null |
2025-08-20 | AnchorSync: Global Consistency Optimization for Long Video Editing | Zichi Liu et.al. | 2508.14609 | null |
2025-08-20 | Call Option Price using Pearson Diffusion Processes | Tapan Kar et.al. | 2508.14577 | null |
2025-08-20 | Minimizing Task-Oriented Age of Information for Remote Monitoring with Pre-Identification | Shuying Gan et.al. | 2508.14575 | null |
2025-08-20 | EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement | Bin Wen et.al. | 2508.14525 | null |
2025-08-20 | SATURN: Autoregressive Image Generation Guided by Scene Graphs | Thanh-Nhan Vo et.al. | 2508.14502 | null |
2025-08-20 | Multimode Fiber Imaging Based on Hydrogel Fiber | Lele He et.al. | 2508.14501 | null |
2025-08-20 | DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion | Moyu Zhang et.al. | 2508.14500 | null |
2025-08-20 | Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration | Haoran Bai et.al. | 2508.14483 | null |
2025-08-20 | DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing | Weitao Wang et.al. | 2508.14465 | null |
2025-08-20 | Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering | Shanlin Sun et.al. | 2508.14461 | null |
2025-08-20 | Early Evolution of the Cavity and Core of a Coronal Mass Ejection in the Inner Corona | Shuting Li et.al. | 2508.14455 | null |
2025-08-20 | FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy | Yijin Chen et.al. | 2508.14441 | null |
2025-08-20 | MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion | Fei Peng et.al. | 2508.14440 | null |
2025-08-20 | Weakly-Convex Regularization for Magnetic Resonance Image Denoising | Akash Prabakar et.al. | 2508.14438 | null |
2025-08-20 | FOCUS: Frequency-Optimized Conditioning of DiffUSion Models for mitigating catastrophic forgetting during Test-Time Adaptation | Gabriel Tjio et.al. | 2508.14437 | null |
2025-08-20 | HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation | Bing Han et.al. | 2508.14431 | null |
2025-08-20 | Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states | Samarth Gupta et.al. | 2508.14413 | null |
2025-08-20 | A Real-world Display Inverse Rendering Dataset | Seokjun Choi et.al. | 2508.14411 | null |
2025-08-20 | CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities | Yue Gong et.al. | 2508.14405 | null |
2025-08-20 | Img2ST-Net: Efficient High-Resolution Spatial Omics Prediction from Whole Slide Histology Images via Fully Convolutional Image-to-Image Learning | Junchao Zhu et.al. | 2508.14393 | null |
2025-08-20 | Physics-Constrained Diffusion Reconstruction with Posterior Correction for Quantitative and Fast PET Imaging | Yucun Hou et.al. | 2508.14364 | null |
2025-08-20 | Organ-Agents: Virtual Human Physiology Simulator via LLMs | Rihao Chang et.al. | 2508.14357 | null |
2025-08-20 | SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion | Junwei Su et.al. | 2508.14352 | null |
2025-08-20 | A Non-Asymptotic Convergent Analysis for Scored-Based Graph Generative Model via a System of Stochastic Differential Equations | Junwei Su et.al. | 2508.14351 | null |
2025-08-20 | Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation | Lingkai Kong et.al. | 2508.14342 | null |
2025-08-20 | Modeling oxygen-void interactions in uranium nitride | Mohamed AbdulHameed et.al. | 2508.14329 | null |
2025-08-20 | MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation | Guile Wu et.al. | 2508.14327 | null |
2025-08-20 | Modeling of silver transport in cubic SiC: Integrating molecular dynamics, bounds averaging, and uncertainty quantification | Mohamed AbdulHameed et.al. | 2508.14325 | null |
2025-08-19 | Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning | Said Djafar Said et.al. | 2508.14276 | null |
2025-08-19 | Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation | Minyi Huang et.al. | 2508.14236 | null |
2025-08-19 | CO Adsorption Sites on Interstellar Water Ices Explored with Machine Learning Potentials. Binding energy distributions and snowline | Giulia M. Bovolenta et.al. | 2508.14219 | null |
2025-08-19 | A well-balanced gas-kinetic scheme with adaptive mesh refinement for shallow water equations | Gaocheng Liu et.al. | 2508.14216 | null |
2025-08-19 | Nonadiabatic force matching for alchemical free-energy estimation | Jorge L. Rosa-Raíces et.al. | 2508.14179 | null |
2025-08-19 | DPad: Efficient Diffusion Language Models with Suffix Dropout | Xinhua Chen et.al. | 2508.14148 | null |
2025-08-18 | 3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models | Jolanta Mozyrska et.al. | 2508.14122 | null |
2025-08-19 | InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing | Shaoshu Yang et.al. | 2508.14033 | null |
2025-08-19 | Electrochemical response of biological membranes to localized currents and external electric fields | Joshua B. Fernandes et.al. | 2508.14001 | null |
2025-08-19 | Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment | Samuel Seligardi et.al. | 2508.13989 | null |
2025-08-20 | Towards a general diffusion-based information quality assessment model | Anthony Lopes Temporao et.al. | 2508.13927 | null |
2025-08-19 | Learning to See Through Flare | Xiaopeng Peng et.al. | 2508.13907 | null |
2025-08-19 | Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation | Thanh Nguyen et.al. | 2508.13904 | null |
2025-08-19 | Diffusion-Driven High-Dimensional Variable Selection | Minjie Wang et.al. | 2508.13890 | null |
2025-08-19 | Toward Deployable Multi-Robot Collaboration via a Symbolically-Guided Decision Transformer | Rathnam Vidushika Rasanji et.al. | 2508.13877 | null |
2025-08-19 | SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation | Paul Grimal et.al. | 2508.13866 | null |
2025-08-19 | Stochastic synaptic dynamics under learning | Jakob Stubenrauch et.al. | 2508.13846 | null |
2025-08-19 | UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion | Zihan Liang et.al. | 2508.13843 | null |
2025-08-20 | Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction | Niklas Bubeck et.al. | 2508.13826 | null |
2025-08-19 | COCO: Cognitive Operating System with Continuous Oversight for Multi-Agent Workflow Reliability | Churong Liang et.al. | 2508.13815 | null |
2025-08-19 | Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs | Juncheng Xie et.al. | 2508.13805 | null |
2025-08-19 | Elementary Monte Carlo model of the anisotropic recrystallization and antiripening under intensive stirring and high supersaturations | Serhii Abakumov et.al. | 2508.13799 | null |
2025-08-19 | Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing | Feng-Lin Liu et.al. | 2508.13797 | null |
2025-08-19 | DegDiT: Controllable Audio Generation with Dynamic Event Graph Guided Diffusion Transformer | Yisu Liu et.al. | 2508.13786 | null |
2025-08-19 | Comparing Conditional Diffusion Models for Synthesizing Contrast-Enhanced Breast MRI from Pre-Contrast Images | Sebastian Ibarra et.al. | 2508.13776 | null |
2025-08-19 | Eliminating Rasterization: Direct Vector Floor Plan Generation with DiffPlanner | Shidong Wang et.al. | 2508.13738 | null |
2025-08-19 | Simulation of Impact-induced seismic shaking on asteroid (25143) Itokawa to address its resurfacing process | Sunho Jin et.al. | 2508.13727 | null |
2025-08-19 | Unravelling disorder in kagome Yb$_{0.5}$Co$_3$Ge$_3$ | A. Korshunov et.al. | 2508.13719 | null |
2025-08-19 | Diffuse-Layer Capacitance at the Potential of Zero Charge in Binary Mixtures | Yuki Uematsu et.al. | 2508.13691 | null |
2025-08-19 | PHECT: A lightweight computation tool for pulsar halo emission | Kun Fang et.al. | 2508.13667 | null |
2025-08-19 | Calibrated Semantic Diffusion: A p-Laplacian Synthesis with Learnable Dissipation, Quantified Constants, and Graph-Aware Calibration | Faruk Alpay et.al. | 2508.13658 | null |
2025-08-19 | Personalized Subgraph Federated Learning with Sheaf Collaboration | Wenfei Liang et.al. | 2508.13642 | null |
2025-08-19 | V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task | Jikai Chen et.al. | 2508.13634 | null |
2025-08-19 | Text2Weight: Bridging Natural Language and Neural Network Weight Spaces | Bowen Tian et.al. | 2508.13633 | null |
2025-08-20 | DiffIER: Optimizing Diffusion Models with Iterative Error Reduction | Ao Chen et.al. | 2508.13628 | null |
2025-08-19 | Bridging Clear and Adverse Driving Conditions | Yoel Shapiro et.al. | 2508.13592 | null |
2025-08-19 | Temporal-Conditional Referring Video Object Segmentation with Noise-Free Text-to-Video Diffusion Model | Ruixin Zhang et.al. | 2508.13584 | null |
2025-08-19 | Overcoming Quantum Resistivity Scaling in Nanoscale Interconnects Using Delafossite PdCoO2 | Seoung-Hun Kang et.al. | 2508.13573 | null |
2025-08-19 | A stability-enhanced nonstandard finite difference framework for solving one and two-dimensional nonlocal differential equations | Shweta Kumari et.al. | 2508.13542 | null |
2025-08-20 | 2D Gaussians Meet Visual Tokenizer | Yiang Shi et.al. | 2508.13515 | null |
2025-08-19 | A Monte Carlo simulation on the scattering coefficients of solar radio wave propagation | Jiazhen Gan et.al. | 2508.13494 | null |
2025-08-19 | The Lévy flight foraging hypothesis: comparison between stationary distributions and anomalous diffusion | Serena Dipierro et.al. | 2508.13487 | null |
2025-08-19 | EventTSF: Event-Aware Non-Stationary Time Series Forecasting | Yunfeng Ge et.al. | 2508.13434 | null |
2025-08-19 | Hyperactive Magnetar Eruptions: Giant Flares, Baryon Ejections, and FRBs | Ashley Bransgrove et.al. | 2508.13419 | null |
2025-08-18 | Counterfactual Probabilistic Diffusion with Expert Models | Wenhao Mu et.al. | 2508.13355 | null |
2025-08-18 | Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction | Sedigheh Dargahi et.al. | 2508.13340 | null |
2025-08-18 | Resistive diffusion and radiative cooling effects in magnetized oblique shocks | R. Datta et.al. | 2508.13310 | null |
2025-08-18 | GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis | Sirshapan Mitra et.al. | 2508.13300 | null |
2025-08-18 | Field-level Reconstruction from Foreground-Contaminated 21-cm Maps | Shu-Fan Chen et.al. | 2508.13265 | null |
2025-08-18 | 4DNeX: Feed-Forward 4D Generative Modeling Made Easy | Zhaoxi Chen et.al. | 2508.13154 | null |
2025-08-18 | MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models | Haoyu He et.al. | 2508.13148 | null |
2025-08-18 | Some semi-decoupled algorithms with optimal convergence for a four-field linear thermo-poroelastic model | Ziliang Li et.al. | 2508.13109 | null |
2025-08-18 | Precise Action-to-Video Generation Through Visual Action Prompts | Yuang Wang et.al. | 2508.13104 | null |
2025-08-18 | Denoising diffusion models for inverse design of inflatable structures with programmable deformations | Sara Karimi et.al. | 2508.13097 | null |
2025-08-18 | DMS: Diffusion-Based Multi-Baseline Stereo Generation for Improving Self-Supervised Depth Estimation | Zihua Liu et.al. | 2508.13091 | null |
2025-08-18 | ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset | Qingwen Zeng et.al. | 2508.13078 | null |
2025-08-18 | From Transthoracic to Transesophageal: Cross-Modality Generation using LoRA Diffusion | Emmanuel Oladokun et.al. | 2508.13077 | null |
2025-08-18 | Reinforced Context Order Recovery for Adaptive Reasoning and Planning | Long Ma et.al. | 2508.13070 | null |
2025-08-18 | Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping | Siddharth Khandelwal et.al. | 2508.13065 | null |
2025-08-19 | PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models | Pengcheng Huang et.al. | 2508.13021 | null |
2025-08-18 | EgoTwin: Dreaming Body and View in First Person | Jingqiao Xiu et.al. | 2508.13013 | null |
2025-08-18 | Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model | Xianglong He et.al. | 2508.13009 | null |
2025-08-18 | Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs | Jose L. Bonilla et.al. | 2508.12987 | null |
2025-08-18 | The Leibenson process | Viorel Barbu et.al. | 2508.12979 | null |
2025-08-18 | Compact Attention: Exploiting Structured Spatio-Temporal Sparsity for Fast Video Generation | Qirui Li et.al. | 2508.12969 | null |
2025-08-18 | Self-Consistent Heating of the Magnetically Closed Solar Corona: Generation of Nanoflares, Thermodynamic Response of the Plasma and Observational Signatures | Craig D. Johnston et.al. | 2508.12952 | null |
2025-08-18 | Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models | Jianshu Zeng et.al. | 2508.12945 | null |
2025-08-19 | Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data | Kyriaki-Margarita Bintsi et.al. | 2508.12942 | null |
2025-08-18 | 7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models | Elena Izzo et.al. | 2508.12919 | null |
2025-08-18 | FoleySpace: Vision-Aligned Binaural Spatial Audio Generation | Lei Zhao et.al. | 2508.12918 | null |
2025-08-18 | S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models | Chubin Chen et.al. | 2508.12880 | null |
2025-08-18 | E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model | Ronghao Lin et.al. | 2508.12854 | null |
2025-08-18 | Strongly correlated stochastic systems | Marco Biroli et.al. | 2508.12818 | null |
2025-08-18 | Next Visual Granularity Generation | Yikai Wang et.al. | 2508.12811 | null |
2025-08-18 | Wavy Transformer | Satoshi Noguchi et.al. | 2508.12787 | null |
2025-08-18 | Right and Wrong Ansätze for Nonlinear Waves in Stochastic PDEs | C. H. S. Hamster et.al. | 2508.12786 | null |
2025-08-18 | Leveraging Diffusion Models for Stylization using Multiple Style Images | Dan Ruta et.al. | 2508.12784 | null |
2025-08-18 | TURB-Scalar. A large database of passive scalar fields advected by 2D Navier-Stokes in the turbulent inverse cascade regime | Chiara Calascibetta et.al. | 2508.12762 | null |
2025-08-18 | Effects of Defects on Thermal Transport across Solid/Solid Heterogeneous Interfaces | Ershuai Yin et.al. | 2508.12744 | null |
2025-08-18 | Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score | Syed Muhmmad Israr et.al. | 2508.12718 | null |
2025-08-18 | Hyperparameter Optimization in the Estimation of PDE and Delay-PDE models from data | Oliver Mai et.al. | 2508.12715 | null |
2025-08-18 | Asymmetric Diffusion Recommendation Model | Yongchun Zhu et.al. | 2508.12706 | null |
2025-08-18 | Deadline-Aware Bandwidth Allocation for Semantic Generative Communication with Diffusion Models | Jinhyuk Choi et.al. | 2508.12701 | null |
2025-08-18 | MixCache: Mixture-of-Cache for Video Diffusion Transformer Acceleration | Yuanxin Wei et.al. | 2508.12691 | null |
2025-08-18 | WP-CLIP: Leveraging CLIP to Predict Wölfflin’s Principles in Visual Art | Abhijay Ghildyal et.al. | 2508.12668 | null |
2025-08-18 | Stable Diffusion-Based Approach for Human De-Occlusion | Seung Young Noh et.al. | 2508.12663 | null |
2025-08-18 | Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery | Jiyeon Kang et.al. | 2508.12650 | null |
2025-08-18 | Cognitive Structure Generation: From Educational Priors to Policy Optimization | Hengnian Gu et.al. | 2508.12647 | null |
2025-08-18 | ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving | Can Cui et.al. | 2508.12603 | null |
2025-08-19 | A Tale of Two Sightlines: Comparison of Hydrocarbon Dust Absorption Bands toward Cygnus OB2-12 and the Galactic Center | Yvonne J. Pendleton et.al. | 2508.12601 | null |
2025-08-17 | Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference | Denis Blessing et.al. | 2508.12511 | null |
2025-08-17 | Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality | Yanming Xiu et.al. | 2508.12498 | null |
2025-08-19 | Portable Laser-Pumped Rb Atomic Clock with Digital Circuits | Qiang Hao et.al. | 2508.12437 | null |
2025-08-17 | Spin decoherence dynamics of Er$^{3+}$ in CeO$_2$ film | Sagar Kumar Seth et.al. | 2508.12429 | null |
2025-08-17 | TiP4GEN: Text to Immersive Panorama 4D Scene Generation | Ke Xing et.al. | 2508.12415 | null |
2025-08-17 | Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position | Zhixin Xie et.al. | 2508.12398 | null |
2025-08-17 | DeCoT: Decomposing Complex Instructions for Enhanced Text-to-Image Generation with Large Language Models | Xiaochuan Lin et.al. | 2508.12396 | null |
2025-08-17 | Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models | Xun Su et.al. | 2508.12361 | null |
2025-08-17 | Topological Dissipation as the Missing Link in Multiscale Polymer Dynamics | Xu-Ze Zhang et.al. | 2508.12359 | null |
2025-08-17 | Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data | Ahmet H. Güzel et.al. | 2508.12356 | null |
2025-08-17 | Semantic Discrepancy-aware Detector for Image Forgery Identification | Ziye Wang et.al. | 2508.12341 | null |
2025-08-17 | Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR | Fatemeh Ghorbani Lohesara et.al. | 2508.12336 | null |
2025-08-17 | Sketchar: Supporting Character Design and Illustration Prototyping Using Generative AI | Long Ling et.al. | 2508.12333 | null |
2025-08-17 | Steering chiral active Brownian motion via stochastic position-orientation resetting | Amir Shee et.al. | 2508.12223 | null |
2025-08-17 | Distribution Matching via Generalized Consistency Models | Sagar Shrestha et.al. | 2508.12222 | null |
2025-08-17 | Self-Guided Action Diffusion | Rhea Malhotra et.al. | 2508.12189 | null |
2025-08-16 | Critical Importance of Grain Boundaries to the Conductivity of Polycrystalline Molecular Crystals | Shujit Chandra Paul et.al. | 2508.12172 | null |
2025-08-16 | Belief-Conditioned One-Step Diffusion: Real-Time Trajectory Planning with Just-Enough Sensing | Gokul Puthumanaillam et.al. | 2508.12166 | null |
2025-08-16 | A Systematic Particle Filter for Estimating Time-Varying Parameters in Advection-Diffusion Equations with Source Terms | Andrea Arnold et.al. | 2508.12155 | null |
2025-08-16 | Demystifying Foreground-Background Memorization in Diffusion Models | Jimmy Z. Di et.al. | 2508.12148 | null |
2025-08-16 | Relativistic quintuple-zeta basis sets for the s block | Marten L. Reitsma et.al. | 2508.12144 | null |
2025-08-16 | DualFit: A Two-Stage Virtual Try-On via Warping and Synthesis | Minh Tran et.al. | 2508.12131 | null |
2025-08-16 | Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion | Songwei Liu et.al. | 2508.12094 | null |
2025-08-16 | Strong overlap of deterministic and stochastic dynamics in a super-diffusive regime | Muhammad Tayyab et.al. | 2508.12091 | null |
2025-08-16 | Generic Event Boundary Detection via Denoising Diffusion | Jaejun Hwang et.al. | 2508.12084 | null |
2025-08-16 | Content Accuracy and Quality Aware Resource Allocation Based on LP-Guided DRL for ISAC-Driven AIGC Networks | Ningzhe Shi et.al. | 2508.12079 | null |
2025-08-16 | Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization | Kousuke Nakano et.al. | 2508.12033 | null |
2025-08-16 | Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems | Szymon Pawlonka et.al. | 2508.12026 | null |
2025-08-16 | Virtual Trading in Multi-Settlement Electricity Markets | Agostino Capponi et.al. | 2508.11979 | null |
2025-08-16 | UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding | Yueming Xu et.al. | 2508.11952 | null |
2025-08-19 | Assessment of Using Synthetic Data in Brain Tumor Segmentation | Aditi Jahagirdar et.al. | 2508.11922 | null |
2025-08-16 | SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress | Lingyun Zhang et.al. | 2508.11904 | null |
2025-08-16 | OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation | Jilei Mao et.al. | 2508.11898 | null |
2025-08-16 | Simulation of heavy quarkonium equilibration in the quark-gluon plasma | Shouxing Zhao et.al. | 2508.11897 | null |
2025-08-16 | SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System | Truong Thanh Hung Nguyen et.al. | 2508.11873 | null |
2025-08-15 | Serendipitous discovery of a young cluster of galaxies at $z \sim 0.5$ projected next to the nearby tadpole galaxy KUG 1138+327 | Q. Daniel Wang et.al. | 2508.11819 | null |
2025-08-15 | FairTabGen: Unifying Counterfactual and Causal Fairness in Synthetic Tabular Data Generation | Nitish Nagesh et.al. | 2508.11810 | null |
2025-08-15 | LoRAtorio: An intrinsic approach to LoRA Skill Composition | Niki Foteinopoulou et.al. | 2508.11624 | null |
2025-08-15 | Dataset Creation for Visual Entailment using Generative AI | Rob Reijtenbach et.al. | 2508.11605 | null |
2025-08-15 | CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion | Zhe Zhu et.al. | 2508.11603 | null |
2025-08-15 | Low barrier ZrO$_x$-based Josephson junctions | Jaehong Choi et.al. | 2508.11593 | null |
2025-08-15 | Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model | Zuo Zuo et.al. | 2508.11550 | null |
2025-08-15 | Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series | Juhi Soni et.al. | 2508.11528 | null |
2025-08-15 | CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models | Xiaoxue Wu et.al. | 2508.11484 | null |
2025-08-15 | SPG: Style-Prompting Guidance for Style-Specific Content Creation | Qian Liang et.al. | 2508.11476 | null |
2025-08-15 | DPI-SPR: A Differentiable Physical Inversion for Shadow Profile Reconstruction Framework in Forward Scatter Radar | ShuQi Lei et.al. | 2508.11470 | null |
2025-08-15 | Simulation-based inference using splitting schemes for partially observed diffusions in chemical reaction networks | Petar Jovanovski et.al. | 2508.11438 | null |
2025-08-15 | MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation | Qian Liang et.al. | 2508.11433 | null |
2025-08-15 | Wavelength dependence of laser pulse filamentation around atomic resonances | Gabor Demeter et.al. | 2508.11417 | null |
2025-08-15 | The Effect of Flow Parameters and Wall Models on Gas-Surface Interactions: A Numerical Investigation of dsmcFoam | M. B. Agir et.al. | 2508.11403 | null |
2025-08-15 | Pairwise correlations of global times in one-dimensional Brownian motion under stochastic resetting | Yihao Wang et.al. | 2508.11387 | null |
2025-08-15 | AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis | Zonglin Wu et.al. | 2508.11375 | null |
2025-08-15 | GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition | Md Asgor Hossain Reaj et.al. | 2508.11334 | null |
2025-08-15 | Noise Matters: Optimizing Matching Noise for Diffusion Classifiers | Yanghao Wang et.al. | 2508.11330 | null |
2025-08-18 | TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation | Yilin Mi et.al. | 2508.11284 | null |
2025-08-15 | Probing the Representational Power of Sparse Autoencoders in Vision Models | Matthew Lyle Olson et.al. | 2508.11277 | null |
2025-08-15 | Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception | Junjie Wang et.al. | 2508.11256 | null |
2025-08-15 | FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation | MengChao Wang et.al. | 2508.11255 | null |
2025-08-15 | Graph Neural Diffusion via Generalized Opinion Dynamics | Asela Hevapathige et.al. | 2508.11249 | null |
2025-08-15 | Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering | Changjian Wang et.al. | 2508.11247 | null |
2025-08-15 | Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension | Zhenhao Li et.al. | 2508.11211 | null |
2025-08-15 | StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation | Seungmi Lee et.al. | 2508.11203 | null |
2025-08-15 | NGC 2392 and NGC 4361: Spectroscopic Diagnostics of Planetary Nebula Evolution | Atul Kumar Singh et.al. | 2508.11202 | null |
2025-08-15 | Statistical Properties of Current Noise Induced by Electron-Phonon Scattering in Metallic Carbon Nanotubes | Aina Sumiyoshi et.al. | 2508.11201 | null |
2025-08-15 | Representation Quantization for Collaborative Filtering Augmentation | Yunze Luo et.al. | 2508.11194 | null |
2025-08-15 | Semi-supervised Image Dehazing via Expectation-Maximization and Bidirectional Brownian Bridge Diffusion Models | Bing Liu et.al. | 2508.11165 | null |
2025-08-15 | LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction | Maoquan Zhang et.al. | 2508.11153 | null |
2025-08-15 | Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation | Bing Liu et.al. | 2508.11134 | null |
2025-08-15 | SQ-A: A Collision Triggered Starburst in Intra-Group Medium of Stephan’s Quintet | C. K. Xu et.al. | 2508.11124 | null |
2025-08-14 | Diffusion is a code repair operator and generator | Mukul Singh et.al. | 2508.11110 | null |
2025-08-14 | HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing | Xinjie Gao et.al. | 2508.11106 | null |
2025-08-14 | GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning | Kelin Yu et.al. | 2508.11049 | null |
2025-08-14 | A porous medium equation with spatially inhomogeneous absorption. Part II: Large time behavior | Razvan Gabriel Iagar et.al. | 2508.11046 | null |
2025-08-14 | 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation | Nikolaos Gkanatsios et.al. | 2508.11002 | null |
2025-08-14 | Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling | Tejomay Kishor Padole et.al. | 2508.10995 | null |
2025-08-14 | Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models | Basile Lewandowski et.al. | 2508.10993 | null |
2025-08-14 | The extended molecular gas of the Circinus galaxy and NGC 1097 as seen by APEX | Akhil Lasrado et.al. | 2508.10982 | null |
2025-08-14 | EVCtrl: Efficient Control Adapter for Visual Generation | Zixiang Yang et.al. | 2508.10963 | null |
2025-08-13 | From Promise to Practical Reality: Transforming Diffusion MRI Analysis with Fast Deep Learning Enhancement | Xinyi Wang et.al. | 2508.10950 | null |
2025-08-14 | Exchange-driven self-diffusion of nanoscale crystalline parahydrogen clusters on graphite | K. M. Kolevski et.al. | 2508.10883 | null |
2025-08-14 | A Survey on Diffusion Language Models | Tianyi Li et.al. | 2508.10875 | null |
2025-08-14 | Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation | Harold Haodong Chen et.al. | 2508.10858 | null |
2025-08-16 | Object Fidelity Diffusion for Remote Sensing Image Generation | Ziqi Ye et.al. | 2508.10801 | null |
2025-08-14 | Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior | Zhenning Shi et.al. | 2508.10779 | null |
2025-08-14 | Video-BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation | Youping Gu et.al. | 2508.10774 | null |
2025-08-14 | AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences | Jieyu Li et.al. | 2508.10771 | null |
2025-08-14 | Formation and protection of an Eu-Ir surface compound below hexagonal boron nitride | Alaa Mohammed Idris Bakhit et.al. | 2508.10746 | null |
2025-08-14 | A Kinetic Theory Approach to Ordered Fluids | José A. Carrillo et.al. | 2508.10744 | null |
2025-08-14 | Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs | Xiangqi Jin et.al. | 2508.10736 | null |
2025-08-14 | Exploiting Discriminative Codebook Prior for Autoregressive Image Generation | Longxiang Tang et.al. | 2508.10719 | null |
2025-08-14 | NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale | NextStep Team et.al. | 2508.10711 | null |
2025-08-14 | CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation | Joohyeon Lee et.al. | 2508.10710 | null |
2025-08-14 | Probabilistic Forecasting Method for Offshore Wind Farm Cluster under Typhoon Conditions: a Score-Based Conditional Diffusion Model | Jinhua He et.al. | 2508.10705 | null |
2025-08-14 | Effective permeability conditions for diffusive transport through impermeable membranes with gaps | Molly Brennan et.al. | 2508.10694 | null |
2025-08-14 | Novel View Synthesis using DDIM Inversion | Sehajdeep Singh et.al. | 2508.10688 | null |
2025-08-14 | MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control | Yuchen Zhu et.al. | 2508.10684 | null |
2025-08-14 | Hybrid Generative Fusion for Efficient and Privacy-Preserving Face Recognition Dataset Generation | Feiran Li et.al. | 2508.10672 | null |
2025-08-14 | Geospatial Diffusion for Land Cover Imperviousness Change Forecasting | Debvrat Varshney et.al. | 2508.10649 | null |
2025-08-14 | Increasing the Utility of Synthetic Images through Chamfer Guidance | Nicola Dall’Asen et.al. | 2508.10631 | null |
2025-08-14 | A Unified Framework from Boltzmann Transport to Proton Treatment Planning | Andreas E. Kyprianou et.al. | 2508.10596 | null |
2025-08-14 | HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis | Shiyu Liu et.al. | 2508.10566 | null |
2025-08-14 | Projected Coupled Diffusion for Test-Time Constrained Joint Generation | Hao Luan et.al. | 2508.10531 | null |
2025-08-14 | EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba | Quang Nguyen et.al. | 2508.10522 | null |
2025-08-15 | KDPE: A Kernel Density Estimation Strategy for Diffusion Policy Trajectory Selection | Andrea Rosasco et.al. | 2508.10511 | null |
2025-08-14 | A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection | Yangjie Xiao et.al. | 2508.10509 | null |
2025-08-14 | TweezeEdit: Consistent and Efficient Image Editing with Path Regularization | Jianda Mao et.al. | 2508.10498 | null |
2025-08-14 | A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation | Jiulin Li et.al. | 2508.10494 | null |
2025-08-14 | Jamming of active particles in narrow pores: Implications for ratchet effect and diffusion coefficient | Šimon Pajger et.al. | 2508.10483 | null |
2025-08-14 | NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer | Shanyuan Liu et.al. | 2508.10424 | null |
2025-08-14 | Extracting a stochastic model for predator-prey dynamic of turbulence and zonal flows with limited data | J. C. Huang et.al. | 2508.10408 | null |
2025-08-14 | Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models | Eunseo Koh et.al. | 2508.10407 | null |
2025-08-14 | PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection | Haibin Sun et.al. | 2508.10397 | null |
2025-08-14 | EDIS: A Simulation Software for Dynamic Ion Intercalation/Deintercalation Processes in Electrode Materials | Liqi Wang et.al. | 2508.10384 | null |
2025-08-14 | Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models | Hyundo Lee et.al. | 2508.10382 | null |
2025-08-14 | A Semantic-Aware Framework for Safe and Intent-Integrative Assistance in Upper-Limb Exoskeletons | Yu Chen et.al. | 2508.10378 | null |
2025-08-14 | Scalable Modeling of Nonlinear Network Dynamics in Neurodegenerative Disease | Daniel Semchin et.al. | 2508.10343 | null |
2025-08-14 | ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver | Wenxuan Song et.al. | 2508.10333 | null |
2025-08-14 | Cross-view Generalized Diffusion Model for Sparse-view CT Reconstruction | Jixiang Chen et.al. | 2508.10313 | null |
2025-08-14 | DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space Exploration | Arkapravo Ghosh et.al. | 2508.10303 | null |
2025-08-14 | Influence Maximization in Multi-layer Social Networks Based on Differentiated Graph Embeddings | Ronghua Lin et.al. | 2508.10289 | null |
2025-08-14 | High Fidelity Text to Image Generation with Contrastive Alignment and Structural Guidance | Danyi Gao et.al. | 2508.10280 | null |
2025-08-14 | A Spectral Solver to Capture Unsteady Dynamics in the Aerospike Nozzle Wake | Zachary Pyle et.al. | 2508.10275 | null |
2025-08-14 | Non-Decaying Solutions to the 2D Dissipative Quasi-Geostrophic Equations | David M. Ambrose et.al. | 2508.10254 | null |
2025-08-13 | Run-and-tumble dynamics with non-reciprocal transitions between three velocity states | Julio C. R. Romo-Cruz et.al. | 2508.10213 | null |
2025-08-13 | Diffusive Braking of Penetrative Convection in Stably-Stratified Fluids | Bradley W. Hindman et.al. | 2508.10174 | null |
2025-08-13 | Predicting First-Passage Dynamics in Disordered Systems Exactly: Application to Sparse Networks | Daniel Marris et.al. | 2508.10140 | null |
2025-08-13 | The Perturbation Theory Approach to Stability in the Scattered Disk | Matthew Belyakov et.al. | 2508.10119 | null |
2025-08-13 | Constrained Decoding of Diffusion LLMs with Context-Free Grammars | Niels Mündler et.al. | 2508.10111 | null |
2025-08-13 | Quantum circuit simulation with a local time-dependent variational principle | Aaron Sander et.al. | 2508.10096 | null |
2025-08-13 | Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design | Yuhao Sun et.al. | 2508.10065 | null |
2025-08-13 | Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation | Junyan Ye et.al. | 2508.09987 | null |
2025-08-13 | Story2Board: A Training-Free Approach for Expressive Storyboard Generation | David Dinkevich et.al. | 2508.09983 | null |
2025-08-13 | Masquerade: Learning from In-the-wild Human Videos using Data-Editing | Marion Lepert et.al. | 2508.09976 | null |
2025-08-13 | PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image | Geonhee Sim et.al. | 2508.09973 | null |
2025-08-13 | Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models | Luca Eyring et.al. | 2508.09968 | null |
2025-08-13 | Stable Diffusion Models are Secretly Good at Visual In-Context Learning | Trevine Oorloff et.al. | 2508.09949 | null |
2025-08-13 | AST-n: A Fast Sampling Approach for Low-Dose CT Reconstruction using Diffusion Models | Tomás de la Sotta et.al. | 2508.09943 | null |
2025-08-13 | Quo Vadis Handwritten Text Generation for Handwritten Text Recognition? | Vittorio Pippi et.al. | 2508.09936 | null |
2025-08-13 | Active Particle Diffusion in Convection Roll Arrays | Pulak Kumar Ghosh et.al. | 2508.09924 | null |
2025-08-14 | Prototype-Guided Diffusion: Visual Conditioning without External Memory | Bilal Faye et.al. | 2508.09922 | null |
2025-08-13 | Hybrid Quantum-Classical Latent Diffusion Models for Medical Image Generation | Kübra Yeter-Aydeniz et.al. | 2508.09903 | null |
2025-08-13 | Binary Mixtures in Linear Convection Arrays | Pulak Kumar Ghosh et.al. | 2508.09902 | null |
2025-08-13 | Exploring the Physics of the Plasma Liner Experiment: A Multi-dimensional Study with FLASH, OSIRIS, and HELIOS | E. C. Hansen et.al. | 2508.09895 | null |
2025-08-13 | Marketron Through the Looking Glass: From Equity Dynamics to Option Pricing in Incomplete Markets | Igor Halperin et.al. | 2508.09863 | null |
2025-08-13 | HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics | Weiqi Li et.al. | 2508.09858 | null |
2025-08-13 | Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance | Dhruvraj Singh Rawat et.al. | 2508.09847 | null |
2025-08-13 | On the Generalization Limits of Quantum Generative Adversarial Networks with Pure State Generators | Jasmin Frkatovic et.al. | 2508.09844 | null |
2025-08-13 | Speed Always Wins: A Survey on Efficient Architectures for Large Language Models | Weigao Sun et.al. | 2508.09834 | null |
2025-08-13 | Physical Autoregressive Model for Robotic Manipulation without Action Pretraining | Zijian Song et.al. | 2508.09822 | null |
2025-08-13 | Feature Impact Analysis on Top Long-Jump Performances with Quantile Random Forest and Explainable AI Techniques | Qi Gan et.al. | 2508.09810 | null |
2025-08-13 | Condition number for finite element discretisation of nonlocal PDE systems with applications to biology | Olusegun E. Adebayo et.al. | 2508.09781 | null |
2025-08-13 | Impacts of the duration and intensity of grazing cycle on vegetation population dynamics in semi-arid ecosystems with seasonal succession | Junhong Gan et.al. | 2508.09760 | null |
2025-08-13 | Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection | Zhiqiu Zhang et.al. | 2508.09746 | null |
2025-08-13 | MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers | Qianru Qiu et.al. | 2508.09709 | null |
2025-08-13 | Hydrodynamic approximations for driven dense colloidal mixtures in narrow pores | Frantisek Slanina et.al. | 2508.09686 | null |
2025-08-13 | Anomalous Transport of Elongated Particles in Oscillatory Vortical Flows | Shiyuan Hu et.al. | 2508.09677 | null |
2025-08-13 | GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors | Xingyilang Yin et.al. | 2508.09667 | null |
2025-08-13 | NegFaceDiff: The Power of Negative Context in Identity-Conditioned Diffusion for Synthetic Face Generation | Eduarda Caldeira et.al. | 2508.09661 | null |
2025-08-13 | Asymptotic-analysis-inspired boundary conditions aiming at eliminating polymer diffusive instability | Ming Dong et.al. | 2508.09635 | null |
2025-08-15 | Preacher: Paper-to-Video Agentic System | Jingwei Liu et.al. | 2508.09632 | null |
2025-08-13 | MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography | Daniel Barco et.al. | 2508.09616 | null |
2025-08-13 | Global uniform regularity for the 3D incompressible MHD equations with slip boundary condition near a background magnetic field | Jincheng Gao et.al. | 2508.09609 | null |
2025-08-13 | Images Speak Louder Than Scores: Failure Mode Escape for Enhancing Generative Quality | Jie Shao et.al. | 2508.09598 | null |
2025-08-13 | Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion | Jiwon Kim et.al. | 2508.09575 | null |
2025-08-13 | Zeolitic imidazolate framework glasses emit white light | Zhencai Li et.al. | 2508.09552 | null |
2025-08-13 | Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification | Haowen Wang et.al. | 2508.09550 | null |
2025-08-13 | Boron Clusters for Metal-Free Water Splitting | Masaya Fujioka et.al. | 2508.09538 | null |
2025-08-13 | Ehrenfest Dynamics with Spontaneous Localization | Anderson A. Tomaz et.al. | 2508.09526 | null |
2025-08-13 | Generation of Indian Sign Language Letters, Numbers, and Words | Ajeet Kumar Yadav et.al. | 2508.09522 | null |
2025-08-13 | A hyperbolic finite difference scheme for anisotropic diffusion equations: preserving the discrete maximum principle | Tokuhiro Eto et.al. | 2508.09509 | null |
2025-08-13 | Stingrays in the radio sky: Two unusual diffuse radio relic sources in the direction of the Magellanic Stream | Zachary J Smeaton et.al. | 2508.09495 | null |
2025-08-13 | SARE: Semantic-Aware Reconstruction Error for Generalizable Diffusion-Generated Image Detection | Ju Yeon Kang et.al. | 2508.09487 | null |
2025-08-13 | CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection | Zhipeng Yuan et.al. | 2508.09477 | null |
2025-08-14 | From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts | Yuji Wang et.al. | 2508.09476 | null |
2025-08-13 | Leveraging Failed Samples: A Few-Shot and Training-Free Framework for Generalized Deepfake Detection | Shibo Yao et.al. | 2508.09475 | null |
2025-08-13 | Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy | Hao Yu et.al. | 2508.09461 | null |
2025-08-13 | RASR: Retrieval-Augmented Super Resolution for Practical Reference-based Image Restoration | Jiaqi Yan et.al. | 2508.09449 | null |
2025-08-13 | DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation | Haoxiang Shi et.al. | 2508.09444 | null |
2025-08-13 | Scaling behaviour of rotating convection in a spherical shell with different Prandtl numbers | Wei Fan et.al. | 2508.09416 | null |
2025-08-13 | Dynamos driven by top-heavy double-diffusive convection in the strong-field regime | Wei Fan et.al. | 2508.09410 | null |
2025-08-12 | Understanding Dementia Speech Alignment with Diffusion-Based Image Generation | Mansi et.al. | 2508.09385 | null |
2025-08-12 | X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents | Guoxian Song et.al. | 2508.09383 | null |
2025-08-12 | UltraLight Med-Vision Mamba for Classification of Neoplastic Progression in Tubular Adenomas | Aqsa Sultana et.al. | 2508.09339 | null |
2025-08-12 | Lung-DDPM+: Efficient Thoracic CT Image Synthesis using Diffusion Probabilistic Model | Yifan Jiang et.al. | 2508.09327 | null |
2025-08-12 | Quantum correction to the Langevin cross section in resonant-exchange processes | I. Simbotin et.al. | 2508.09302 | null |
2025-08-12 | Evolution of a Long-Lived Deep-Seated Main-Sequence Magnetic Field During White Dwarf Cooling | Matias Castro-Tapia et.al. | 2508.09268 | null |
2025-08-12 | TFZ: Topology-Preserving Compression of 2D Symmetric and Asymmetric Second-Order Tensor Fields | Nathaniel Gorski et.al. | 2508.09235 | null |
2025-08-12 | GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction | Fan Ding et.al. | 2508.09227 | null |
2025-08-12 | Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models | Wen Wang et.al. | 2508.09138 | null |
2025-08-12 | Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices | Ya Zou et.al. | 2508.09136 | null |
2025-08-13 | Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer | Zixin Yin et.al. | 2508.09131 | null |
2025-08-13 | Robust quantum computational advantage with programmable 3050-photon Gaussian boson sampling | Hua-Liang Liu et.al. | 2508.09092 | null |
2025-08-13 | Direct Measurement of Electron Heating in Electron-Only Reconnection in a Laboratory Mini-Magnetosphere | Lucas Rovige et.al. | 2508.09086 | null |
2025-08-12 | Rankin-Selberg integrals for $\mathrm{GSpin}$ groups with application to the global Gan-Gross-Prasad conjecture | Pan Yan et.al. | 2508.09066 | null |
2025-08-12 | Per-Query Visual Concept Learning | Ori Malca et.al. | 2508.09045 | null |
2025-08-12 | Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks | Maxim Divilkovskiy et.al. | 2508.09029 | null |
2025-08-12 | Envisioning Generative Artificial Intelligence in Cartography and Mapmaking | Yuhao Kang et.al. | 2508.09028 | null |
2025-08-12 | TaoCache: Structure-Maintained Video Generation Acceleration | Zhentao Fan et.al. | 2508.08978 | null |
2025-08-12 | Urban-STA4CLC: Urban Theory-Informed Spatio-Temporal Attention Model for Predicting Post-Disaster Commercial Land Use Change | Ziyi Guo et.al. | 2508.08976 | null |
2025-08-12 | Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation | Soo-Whan Chung et.al. | 2508.08953 | null |
2025-08-12 | Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation | Ao Ma et.al. | 2508.08949 | null |
2025-08-12 | EGGCodec: A Robust Neural Encodec Framework for EGG Reconstruction and F0 Extraction | Rui Feng et.al. | 2508.08924 | null |
2025-08-12 | When and How Ultrasound Enhances Nanoparticle Diffusion in Hydrogels: A Stick-and-Release Mechanism | Pablo M. Blanco et.al. | 2508.08918 | null |
2025-08-12 | Sound Signal Synthesis with Auxiliary Classifier GAN, COVID-19 cough as an example | Yahya Sherif Solayman Mohamed Saleh et.al. | 2508.08892 | null |
2025-08-12 | Transient Noise Removal via Diffusion-based Speech Inpainting | Mordehay Moradi et.al. | 2508.08890 | null |
2025-08-12 | DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI | Bo-Hsun Chen et.al. | 2508.08831 | null |
2025-08-12 | Geometry-Aware Global Feature Aggregation for Real-Time Indirect Illumination | Meng Gai et.al. | 2508.08826 | null |
2025-08-12 | TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models | Yuqi Peng et.al. | 2508.08812 | null |
2025-08-12 | Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space | Luis S. Luevano et.al. | 2508.08808 | null |
2025-08-12 | Anomalous Sodium Insertion in Highly Oriented Graphite: Thermodynamics, Kinetics and Evidence for Two-Sided Intercalation | Chuanhai Gan et.al. | 2508.08806 | null |
2025-08-14 | Measurement-Based Quantum Diffusion Models | Xinyu Liu et.al. | 2508.08799 | null |
2025-08-12 | DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation | Tianyu Xiong et.al. | 2508.08783 | null |
2025-08-12 | Patient-Adaptive Focused Transmit Beamforming using Cognitive Ultrasound | Wessel L. van Nierop et.al. | 2508.08782 | null |
2025-08-12 | Exploring Palette based Color Guidance in Diffusion Models | Qianru Qiu et.al. | 2508.08754 | null |
2025-08-12 | Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models | Ruofeng Yang et.al. | 2508.08735 | null |
2025-08-13 | A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models | Lingzhe Zhang et.al. | 2508.08712 | null |
2025-08-12 | Towards Safe Imitation Learning via Potential Field-Guided Flow Matching | Haoran Ding et.al. | 2508.08707 | null |
2025-08-12 | SafeFix: Targeted Model Repair via Controlled Image Generation | Ouyang Xu et.al. | 2508.08701 | null |
2025-08-12 | Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos | Qi Zheng et.al. | 2508.08700 | null |
2025-08-12 | DiffVolume: Diffusion Models for Volume Generation in Limit Order Books | Zhuohan Wang et.al. | 2508.08698 | null |
2025-08-12 | Detecting Sterile Neutrino Dark Matter at MeV Gamma-Ray Observatories | Subaru Fujisawa et.al. | 2508.08695 | null |
2025-08-12 | Expert-Guided Diffusion Planner for Auto-bidding | Yunshan Peng et.al. | 2508.08687 | null |
2025-08-12 | In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality | Chenrui Liu et.al. | 2508.08673 | null |
2025-08-12 | Nonlinear dynamics of reaction-diffusion wave trains under large and fully nonlocalized modulations | Joannis Alexopoulos et.al. | 2508.08637 | null |
2025-08-14 | Yan: Foundational Interactive Video Generation | Deheng Ye et.al. | 2508.08601 | null |
2025-08-12 | RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space | Jingyun Liang et.al. | 2508.08588 | null |
2025-08-12 | Unlocking the Potential of Diffusion Priors in Blind Face Restoration | Yunqi Miao et.al. | 2508.08556 | null |
2025-08-12 | UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction | Dahai Yu et.al. | 2508.08551 | null |
2025-08-12 | Fluorescence time profile measurement of LAB based liquid scintillator in response to medium relativistic ion particles | Xiaojie Luo et.al. | 2508.08546 | null |
2025-08-12 | Transition to Petschek Reconnection in Subrelativistic Pair Plasmas: Implications for Particle Acceleration | Adam Robbins et.al. | 2508.08533 | null |
2025-08-11 | SynLLM: A Comparative Analysis of Large Language Models for Medical Tabular Synthetic Data Generation via Prompt Engineering | Arshia Ilaty et.al. | 2508.08529 | null |
2025-08-11 | Control-affine Schrödinger Bridge and Generalized Bohm Potential | Alexis M. H. Teter et.al. | 2508.08511 | null |
2025-08-11 | CObL: Toward Zero-Shot Ordinal Layering without User Prompting | Aneel Damaraju et.al. | 2508.08498 | null |
2025-08-11 | MuGa-VTON: Multi-Garment Virtual Try-On via Diffusion Transformers with Prompt Customization | Ankan Deria et.al. | 2508.08488 | null |
2025-08-11 | MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling | Qian Wang et.al. | 2508.08487 | null |
2025-08-11 | Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features | Pallabee Das et.al. | 2508.08458 | null |
2025-08-11 | Hot Jupiter formation in dense stellar clusters: A Monte Carlo model applied to 47 Tucanae | J. A. Wirth et.al. | 2508.08406 | null |
2025-08-11 | Wave Propagation Dynamics via Lattice Difference Equations | Eddy Kwessi et.al. | 2508.08387 | null |
2025-08-11 | Spatiotemporally Consistent Indoor Lighting Estimation with Diffusion Priors | Mutian Tong et.al. | 2508.08384 | null |
2025-08-11 | Exponentially Improved Constant in Quantum Solution Extraction | Gumaro Rendon et.al. | 2508.08375 | null |
2025-08-11 | StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation | Shuyuan Tu et.al. | 2508.08248 | null |
2025-08-12 | Cut2Next: Generating Next Shot via In-Context Tuning | Jingwen He et.al. | 2508.08244 | null |
2025-08-13 | BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion | Qiayuan Liao et.al. | 2508.08241 | null |
2025-08-11 | OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution | Zhiqiang Wu et.al. | 2508.08227 | null |
2025-08-11 | Learning User Preferences for Image Generation Model | Wenyi Mo et.al. | 2508.08220 | null |
2025-08-11 | Reinforcement Learning in Vision: A Survey | Weijia Wu et.al. | 2508.08189 | null |
2025-08-13 | CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data | Chongke Bi et.al. | 2508.08173 | null |
2025-08-11 | ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction | Chaojun Ni et.al. | 2508.08170 | null |
2025-08-11 | An effective potential for generative modelling with active matter | Adrian Baule et.al. | 2508.08146 | null |
2025-08-11 | Reproducing and Extending Brownian Motion in Optical Trap: A Computational Reimplementation of Volpe and Volpe (2013) | Eyad I. B Hamid et.al. | 2508.08138 | null |
2025-08-11 | FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting | Yitong Yang et.al. | 2508.08136 | null |
2025-08-11 | Optimal Dividend, Reinsurance, and Capital Injection Strategies for an Insurer with Two Collaborating Business Lines | Tim J. Boonen et.al. | 2508.08130 | null |
2025-08-11 | Learned Regularization for Microwave Tomography | Bowen Tong et.al. | 2508.08114 | null |
2025-08-11 | TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning | Junzhe Xu et.al. | 2508.08098 | null |
2025-08-11 | Fast and Generalizable parameter-embedded Neural Operators for Lithium-Ion Battery Simulation | Amir Ali Panahi et.al. | 2508.08087 | null |
2025-08-11 | Matrix-3D: Omnidirectional Explorable 3D World Generation | Zhongqi Yang et.al. | 2508.08086 | null |
2025-08-12 | Why Bohmian velocity might not be the only quantum velocity and the role of quantum diffusion flux in super-luminal wave packets | Charalampos Antonakos et.al. | 2508.08065 | null |
2025-08-11 | S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix | Peng Dai et.al. | 2508.08048 | null |
2025-08-12 | Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation | Fangyuan Mao et.al. | 2508.07981 | null |
2025-08-11 | Well-posedness for a fourth-order nonisothermal tumor growth model of Caginalp type | Giulia Cavalleri et.al. | 2508.07979 | null |
2025-08-12 | Adaptive Multiple Access and Service Placement for Generative Diffusion Models | Hamidreza Mazandarani et.al. | 2508.07978 | null |
2025-08-11 | Deep imaging of the galaxy Malin 2 shows new faint structures and a candidate satellite dwarf galaxy | Junais et.al. | 2508.07930 | null |
2025-08-11 | Score Augmentation for Diffusion Models | Liang Hou et.al. | 2508.07926 | null |
2025-08-11 | Generative Video Matting | Yongtao Ge et.al. | 2508.07905 | null |
2025-08-11 | Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models | Johanna P. Müller et.al. | 2508.07903 | null |
2025-08-12 | Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation | Bowen Xue et.al. | 2508.07901 | null |
2025-08-11 | NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction | Tianle Zeng et.al. | 2508.07897 | null |
2025-08-11 | Deep Learning-Based Desikan-Killiany Parcellation of the Brain Using Diffusion MRI | Yousef Sadegheih et.al. | 2508.07815 | null |
2025-08-11 | DiTVR: Zero-Shot Diffusion Transformer for Video Restoration | Sicheng Gao et.al. | 2508.07811 | null |
2025-08-11 | MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks | Yushen Xu et.al. | 2508.07803 | null |
2025-08-11 | Generative Inversion for Property-Targeted Materials Design: Application to Shape Memory Alloys | Cheng Li et.al. | 2508.07798 | null |
2025-08-11 | Feynman-Kac formula for general time dependent stochastic parabolic equation on a bounded domain and applications | Yaozhong Hu et.al. | 2508.07793 | null |
2025-08-13 | AgentWorld: An Interactive Simulation Platform for Scene Construction and Mobile Robotic Manipulation | Yizheng Zhang et.al. | 2508.07770 | null |
2025-08-11 | Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation | Xiaoyan Liu et.al. | 2508.07769 | null |
2025-08-11 | Sea-Undistort: A Dataset for Through-Water Image Restoration in High Resolution Airborne Bathymetric Mapping | Maximilian Kromer et.al. | 2508.07760 | null |
2025-08-11 | Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild | Haoran Wang et.al. | 2508.07759 | null |
2025-08-11 | Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion | Minseo Kim et.al. | 2508.07755 | null |
2025-08-11 | Grouped Speculative Decoding for Autoregressive Image Generation | Junhyuk So et.al. | 2508.07747 | null |
2025-08-11 | Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder? | Hui-Peng Du et.al. | 2508.07711 | null |
2025-08-11 | Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing | Weitao Wang et.al. | 2508.07700 | null |
2025-08-11 | DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework | Wenzhuo Ma et.al. | 2508.07682 | null |
2025-08-11 | LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering | Xiaohang Zhan et.al. | 2508.07647 | null |
2025-08-11 | X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning | Jian Ma et.al. | 2508.07607 | null |
2025-08-11 | LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation | Wenhui Song et.al. | 2508.07603 | null |
2025-08-11 | ShoulderShot: Generating Over-the-Shoulder Dialogue Videos | Yuang Zhang et.al. | 2508.07597 | null |
2025-08-11 | Procedural Mixture Sets | Hendrik Rommeswinkel et.al. | 2508.07588 | null |
2025-08-12 | From Platform Migration to Cultural Integration: the Ingress and Diffusion of #wlw from TikTok to RedNote in Queer Women Communities | Ziqi Pan et.al. | 2508.07579 | null |
2025-08-11 | UniFlow: Unifying Speech Front-End Tasks via Continuous Generative Modeling | Ziqian Wang et.al. | 2508.07558 | null |
2025-08-11 | Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation | Minghao Yin et.al. | 2508.07557 | null |
2025-08-11 | Physics-informed Multiresolution Wavelet Neural Network Method for Solving Partial Differential Equations | Feng Han et.al. | 2508.07546 | null |
2025-08-11 | Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing | Joonghyuk Shin et.al. | 2508.07519 | null |
2025-08-10 | Forecasting solar power output in Ibadan: A machine learning approach leveraging weather data and system specifications | Obarotu Peter Urhuerhi et.al. | 2508.07462 | null |
2025-08-10 | Unified Semiclassical Theory of Nonlinear Hall Effect: Bridging Ballistic and Diffusive Transport Regime | Xinyu Liu et.al. | 2508.07445 | null |
2025-08-10 | Robust, fast, and adaptive splitting schemes for nonlinear doubly-degenerate diffusion equations | Ayesha Javed et.al. | 2508.07420 | null |
2025-08-10 | CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization | Youqi Wang et.al. | 2508.07413 | null |
2025-08-10 | Conditional splitting probabilities for hidden-state inference in drift-diffusive processes | Emir Sezik et.al. | 2508.07386 | null |
2025-08-10 | Supercritical fluids as a distinct state of matter characterized by sub-short-range structural order | Sha Jin et.al. | 2508.07385 | null |
2025-08-10 | SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal | Tingyu Yang et.al. | 2508.07346 | null |
2025-08-10 | CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation | Fangtai Wu et.al. | 2508.07341 | null |
2025-08-10 | Linear-Quadratic Mean Field Games with Common Noise: A Direct Approach | Wenyu Cong et.al. | 2508.07271 | null |
2025-08-10 | Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers | Xin Ma et.al. | 2508.07246 | null |
2025-08-10 | Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation | Chu Zhao et.al. | 2508.07243 | null |
2025-08-10 | HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation | Xuepeng Liu et.al. | 2508.07225 | null |
2025-08-10 | Neural Bridge Processes | Jian Xu et.al. | 2508.07220 | null |
2025-08-10 | Explainability-in-Action: Enabling Expressive Manipulation and Tacit Understanding by Bending Diffusion Models in ComfyUI | Ahmed M. Abuzuraiq et.al. | 2508.07183 | null |
2025-08-10 | CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion | Xiaotong Lin et.al. | 2508.07162 | null |
2025-08-10 | SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models | Ruolin Yang et.al. | 2508.07149 | null |
2025-08-10 | Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction | Yu Liu et.al. | 2508.07146 | null |
2025-08-10 | SketchConcept: Sketching-based Concept Recomposition for Product Design using Generative AI | Runlin Duan et.al. | 2508.07141 | null |
2025-08-10 | Canvas3D: Empowering Precise Spatial Control for Image Generation with Constraints from a 3D Virtual Canvas | Runlin Duan et.al. | 2508.07135 | null |
2025-08-10 | On the geometric Brownian motion with state-dependent variable exponent diffusion term | Mustafa Avci et.al. | 2508.07130 | null |
2025-08-10 | Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays | Gregory Schuit et.al. | 2508.07128 | null |
2025-08-10 | Modelling Human Skin Morphology and Simulating Transdermal Transport of 50 Chemicals | Milana Tesfamarian et.al. | 2508.07123 | null |
2025-08-09 | DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit | Aiden Swann et.al. | 2508.07118 | null |
2025-08-09 | Whisfusion: Parallel ASR Decoding via a Diffusion Transformer | Taeyoun Kwon et.al. | 2508.07048 | null |
2025-08-09 | A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling | Tiantian He et.al. | 2508.07032 | null |
2025-08-09 | Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities | Anindya Bijoy Das et.al. | 2508.07031 | null |
2025-08-09 | Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings | Mao Li et.al. | 2508.07017 | null |
2025-08-12 | HiMat: DiT-based Ultra-High Resolution SVBRDF Generation | Zixiong Wang et.al. | 2508.07011 | null |
2025-08-09 | Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments | Gian Mario Favero et.al. | 2508.07006 | null |
2025-08-09 | Mechanism of Anisotropic Crystallization and Phase Transitions under Van der Waals Squeezing | Yuxiang Gao et.al. | 2508.06992 | null |
2025-08-09 | WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering | Yixin Zhu et.al. | 2508.06982 | null |
2025-08-09 | Structure-Preserving Digital Twins via Conditional Neural Whitney Forms | Brooks Kinch et.al. | 2508.06981 | null |
2025-08-09 | CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing | Weiyan Xie et.al. | 2508.06937 | null |
2025-08-09 | Unveiling the Puzzle of Brittleness in Single Crystal Iridium | Qing Cheng et.al. | 2508.06929 | null |
2025-08-09 | AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning | Shihao Yuan et.al. | 2508.06924 | null |
2025-08-09 | Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing | Shichao Ma et.al. | 2508.06916 | null |
2025-08-09 | MultiRef: Controllable Image Generation with Multiple Visual References | Ruoxi Chen et.al. | 2508.06905 | null |
2025-08-09 | Text to Speech System for Meitei Mayek Script | Gangular Singh Irengbam et.al. | 2508.06870 | null |
2025-08-09 | Speech Enhancement based on cascaded two flow | Seonggyu Lee et.al. | 2508.06842 | null |
2025-08-09 | FlowSE: Flow Matching-based Speech Enhancement | Seonggyu Lee et.al. | 2508.06840 | null |
2025-08-09 | Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models | Shiqian Zhao et.al. | 2508.06837 | null |
2025-08-09 | A Score-based Diffusion Model Approach for Adaptive Learning of Stochastic Partial Differential Equation Solutions | Toan Huynh et.al. | 2508.06834 | null |
2025-08-09 | Efficient data-driven regression for reduced-order modeling of spatial pattern formation | Alessandro Alla et.al. | 2508.06833 | null |
2025-08-09 | Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation | Xiao Huang et.al. | 2508.06806 | null |
2025-08-09 | D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning | Shu-Ang Yu et.al. | 2508.06804 | null |
2025-08-09 | GaN/InN HEMT based UV photodetector on SiC with hexagonal boron nitride passivation | Mustafa Kilin et.al. | 2508.06782 | null |
2025-08-08 | Topology Generation of UAV Covert Communication Networks: A Graph Diffusion Approach with Incentive Mechanism | Xin Tang et.al. | 2508.06746 | null |
2025-08-08 | Design of high-mobility p-type GaN via the piezomobility tensor | Jie-Cheng Chen et.al. | 2508.06723 | null |
2025-08-08 | Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video | Jixuan He et.al. | 2508.06715 | null |
2025-08-08 | LightSwitch: Multi-view Relighting with Material-guided Diffusion | Yehonathan Litman et.al. | 2508.06494 | null |
2025-08-08 | Weak approximation of stochastic differential equations with sticky boundary conditions | Akash Sharma et.al. | 2508.06487 | null |
2025-08-08 | SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning | Lingkun Long et.al. | 2508.06447 | null |
2025-08-08 | SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation | Guido Manni et.al. | 2508.06429 | null |
2025-08-08 | 4D operando X-ray nano-holo-tomography reveals multiscale chemomechanics in Silicon-Graphite anode | Victor Vanpeene et.al. | 2508.06413 | null |
2025-08-08 | FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation | Wenbin Teng et.al. | 2508.06392 | null |
2025-08-08 | Diffuse measures and nonlinear parabolic equations | Francesco Petitta et.al. | 2508.06384 | null |
2025-08-08 | ActivityDiff: A diffusion model with Positive and Negative Activity Guidance for De Novo Drug Design | Renyi Zhou et.al. | 2508.06364 | null |
2025-08-08 | Quantum Algorithm for Estimating Intrinsic Geometry | Nhat A. Nghiem et.al. | 2508.06355 | null |
2025-08-08 | Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? | Xin Ci Wong et.al. | 2508.06327 | null |
2025-08-08 | OM2P: Offline Multi-Agent Mean-Flow Policy | Zhuoran Li et.al. | 2508.06269 | null |
2025-08-08 | ADPro: a Test-time Adaptive Diffusion Policy for Robot Manipulation via Manifold and Initial Noise Constraints | Zezeng Li et.al. | 2508.06266 | null |
2025-08-08 | Tanaka formula for SDEs driven by fractional Brownian motion | Tommi Sottinen et.al. | 2508.06261 | null |
2025-08-08 | Low dimensional dynamics of a sparse balanced synaptic network of quadratic integrate-and-fire neurons | Maria V. Ageeva et.al. | 2508.06253 | null |
2025-08-08 | Light-Addressable Smart Nanostructures via Resonant Nanoheating | Victor Tabouillot et.al. | 2508.06215 | null |
2025-08-08 | Inverse Source Problems for the Time-Fractional Evolution Equation | Rahmonov Askar Ahmadovich et.al. | 2508.06209 | null |
2025-08-08 | Clinically-guided Data Synthesis for Laryngeal Lesion Detection | Chiara Baldini et.al. | 2508.06182 | null |
2025-08-08 | Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation | Ojonugwa Oluwafemi Ejiga Peter et.al. | 2508.06170 | null |
2025-08-08 | Sharp non-existence threshold for a parabolic Hardy-Hénon equation with quasilinear diffusion | Razvan Gabriel Iagar et.al. | 2508.06164 | null |
2025-08-08 | Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment | Zhenbang Du et.al. | 2508.06160 | null |
2025-08-08 | Revealing the Staging Structural Evolution and Li (De)Intercalation Kinetics in Graphite Anodes via Machine Learning Potential | Liqi Wang et.al. | 2508.06156 | null |
2025-08-08 | VISTAR: A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation | Kaiyuan Jiang et.al. | 2508.06152 | null |
2025-08-08 | Improving Diagnostic Accuracy for Oral Cancer with inpainting Synthesis Lesions Generated Using Diffusion Models | Yong Oh Lee et.al. | 2508.06151 | null |
2025-08-08 | DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera | Shaohua Pan et.al. | 2508.06139 | null |
2025-08-08 | GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving | Jian Wang et.al. | 2508.06113 | null |
2025-08-08 | MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment | Gui Zou et.al. | 2508.06104 | null |
2025-08-08 | UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization | Yachun Mi et.al. | 2508.06101 | null |
2025-08-08 | MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows | Xiquan Li et.al. | 2508.06098 | null |
2025-08-08 | E-React: Towards Emotionally Controlled Synthesis of Human Reactions | Chen Zhu et.al. | 2508.06093 | null |
2025-08-08 | SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment | Yanxiao Sun et.al. | 2508.06082 | null |
2025-08-08 | DreamVE: Unified Instruction-based Image and Video Editing | Bin Xia et.al. | 2508.06080 | null |
2025-08-08 | Towards MR-Based Trochleoplasty Planning | Michael Wehrli et.al. | 2508.06076 | null |
2025-08-08 | Radio continuum and H I 21-cm line observations of a nearby luminous infrared galaxy IRAS 17526+3253 | Jianfeng Wu et.al. | 2508.06075 | null |
2025-08-08 | Real-time physics-informed reconstruction of transient fields using sensor guidance and higher-order time differentiation | Hong-Kyun Noh et.al. | 2508.06070 | null |
2025-08-08 | ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Generation | Daniel Lee et.al. | 2508.06065 | null |
2025-08-08 | NEP: Autoregressive Image Editing via Next Editing Token Prediction | Huimin Wu et.al. | 2508.06044 | null |
2025-08-08 | Bayesian Radio Map Estimation: Fundamentals and Implementation via Diffusion Models | Tien Ngoc Ha et.al. | 2508.06037 | null |
2025-08-08 | InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow | Yiming Gong et.al. | 2508.06033 | null |
2025-08-08 | Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts | Kiran Chhatre et.al. | 2508.06032 | null |
2025-08-08 | Improved Sub-Visible Particle Classification in Flow Imaging Microscopy via Generative AI-Based Image Synthesis | Utku Ozbulak et.al. | 2508.06021 | null |
2025-08-08 | Vacuum Dealloyed Brass as Li-Metal Battery Current Collector: Effect of Zinc and Porosity | Eric V Woods et.al. | 2508.06015 | null |
2025-08-08 | ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors | Minsu Kim et.al. | 2508.06014 | null |
2025-08-08 | KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training | Kai Zhang et.al. | 2508.06001 | null |
2025-08-08 | Global solutions in $L^{p}_{v}L^{\infty}_{x}$ for the Boltzmann equation in bounded domains | Dingqun Deng et.al. | 2508.05985 | null |
2025-08-08 | Revisiting $\mu$SR Studies of Ion Dynamics in the Light of Extended Kubo-Toyabe Model | Takashi U. Ito et.al. | 2508.05968 | null |
2025-08-08 | Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents | Han Lin et.al. | 2508.05954 | null |
2025-08-08 | A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image | Yanxing Liang et.al. | 2508.05950 | null |
2025-08-08 | Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution | Zhanyi Sun et.al. | 2508.05941 | null |
2025-08-08 | Reverse Diffusion Sequential Monte Carlo Samplers | Luhuan Wu et.al. | 2508.05926 | null |
2025-08-08 | Fast, Convex and Conditioned Network for Multi-Fidelity Vectors and Stiff Univariate Differential Equations | Siddharth Rout et.al. | 2508.05921 | null |
2025-08-07 | Measurement of All Flavor PeV Neutrino Flux using Combined Datasets from IceCube | Emre Yildizci et.al. | 2508.05886 | null |
2025-08-07 | Emerging ultra-wide band gap semiconductors for future high-frequency electronics | Emily M. Garrity et.al. | 2508.05823 | null |
2025-08-07 | FineDialFact: A benchmark for Fine-grained Dialogue Fact Verification | Xiangyan Chen et.al. | 2508.05782 | null |
2025-08-07 | MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss | Can Zhao et.al. | 2508.05772 | null |
2025-08-07 | UnGuide: Learning to Forget with LoRA-Guided Diffusion Models | Agnieszka Polowczyk et.al. | 2508.05755 | null |
2025-08-07 | Quantum Reservoir GAN | Hikaru Wakaura et.al. | 2508.05716 | null |
2025-08-07 | High multiplicity and global structure of coexistence states in a predator-prey model with saturation | Kousuke Kuto et.al. | 2508.05714 | null |
2025-08-07 | Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation | Yue Liao et.al. | 2508.05635 | null |
2025-08-07 | GAP: Gaussianize Any Point Clouds with Text Guidance | Weiqi Zhang et.al. | 2508.05631 | null |
2025-08-07 | Latent Space Diffusion for Topology Optimization | Aaron Lutheran et.al. | 2508.05624 | null |
2025-08-07 | Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision | Luozheng Qin et.al. | 2508.05606 | null |
2025-08-07 | Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations | Hanzeng Guo et.al. | 2508.05598 | null |
2025-08-07 | Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis | Yifan Wang et.al. | 2508.05572 | null |
2025-08-07 | MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips | Shibo Wang et.al. | 2508.05506 | null |
2025-08-07 | Heat and super-diffusive melting fronts in unsaturated porous media | Eirik G. Flekkøy et.al. | 2508.05451 | null |
2025-08-07 | Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI | Krzysztof Janowicz et.al. | 2508.05432 | null |
2025-08-07 | MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow | Md Atik Ahamed et.al. | 2508.05411 | null |
2025-08-07 | UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation | Wonjun Kang et.al. | 2508.05399 | null |
2025-08-07 | Real-Time Iteration Scheme for Diffusion Policy | Yufei Duan et.al. | 2508.05396 | null |
2025-08-09 | Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms | Jie Xiao et.al. | 2508.05387 | null |
2025-08-07 | Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising | Xiaoxi Cui et.al. | 2508.05352 | null |
2025-08-07 | Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties | Susmita Chowdhury et.al. | 2508.05330 | null |
2025-08-07 | Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting | Frank Ruis et.al. | 2508.05323 | null |
2025-08-07 | Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces | Mathias Rose Bjare et.al. | 2508.05306 | null |
2025-08-07 | SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens | Nikita Dragunov et.al. | 2508.05305 | null |
2025-08-07 | An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods | Emil Løvbak et.al. | 2508.05303 | null |
2025-08-07 | Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection | Xiaoyang Zhang et.al. | 2508.05271 | null |
2025-08-07 | B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding | Changho Choi et.al. | 2508.05269 | null |
2025-08-07 | SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion | Xiaoyang Zhang et.al. | 2508.05264 | null |
2025-08-07 | ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models | Yatong Lan et.al. | 2508.05236 | null |
2025-08-07 | Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces | Joly Romain et.al. | 2508.05220 | null |
2025-08-07 | An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling | Junming Duan et.al. | 2508.05166 | null |
2025-08-07 | RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer | Fangyu Du et.al. | 2508.05115 | null |
2025-08-07 | PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation | Jingxuan He et.al. | 2508.05091 | null |
2025-08-07 | MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design | Hao Li et.al. | 2508.05076 | null |
2025-08-07 | Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation | Yongfu Zha et.al. | 2508.05074 | null |
2025-08-07 | FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer | Jian Zhu et.al. | 2508.05069 | null |
2025-08-07 | DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion | Yifeng Huang et.al. | 2508.05060 | null |
2025-08-07 | Observation of Super-ballistic Brownian Motion in Liquid | Jason Boynewicz et.al. | 2508.05031 | null |
2025-08-07 | Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere | Jeehyun Yang et.al. | 2508.05007 | null |
2025-08-07 | Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity | Fubao Xi et.al. | 2508.04997 | null |
2025-08-08 | REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers | Yuepeng Jiang et.al. | 2508.04996 | null |
2025-08-07 | Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression | Zheng Chen et.al. | 2508.04979 | null |
2025-08-06 | Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids | Cal J. Rising et.al. | 2508.04930 | null |
2025-08-06 | Taxonomy of Faults in Attention-Based Neural Networks | Sigma Jahan et.al. | 2508.04925 | null |
2025-08-08 | Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model | Luis Morales-Navarro et.al. | 2508.04902 | null |
2025-08-06 | The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models | Leo Zhang et.al. | 2508.04884 | null |
2025-08-06 | Unified Flow Matching for Long Horizon Event Forecasting | Xiao Shou et.al. | 2508.04843 | null |
2025-08-06 | Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off | Seungyong Lee et.al. | 2508.04825 | null |
2025-08-06 | Delay-constrained re-entry governs large-scale brain seizures and other network pathologies | Paul Triebkorn et.al. | 2508.04824 | null |
2025-08-06 | Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models | Mehrdad Moradi et.al. | 2508.04818 | null |
2025-08-06 | Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach | Anderson O. Calixto et.al. | 2508.04809 | null |
2025-08-06 | Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture | Bernard Parent et.al. | 2508.04806 | null |
2025-08-06 | ACM Multimedia Grand Challenge on ENT Endoscopy Analysis | Trong-Thuan Nguyen et.al. | 2508.04801 | null |
2025-08-08 | Quantum-impurity sensing of altermagnetic order | V. A. S. V. Bittencourt et.al. | 2508.04788 | null |
2025-08-06 | Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC) | Nan Li et.al. | 2508.04745 | null |
2025-08-06 | A colossal dielectric response of HfxZr1-xO2 nanoparticles | Oleksandr S. Pylypchuk et.al. | 2508.04697 | null |
2025-08-06 | Diffusion in a $d$-dimensional rough potential | Jacob Jeffries et.al. | 2508.04674 | null |
2025-08-06 | HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models | Young D. Kwon et.al. | 2508.04663 | null |
2025-08-06 | Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics | Lars Torbjørn Stutzer et.al. | 2508.04647 | null |
2025-08-06 | A unified model for linear responses of physical networks | José M. Ortiz-Tavárez et.al. | 2508.04616 | null |
2025-08-06 | Multitask Learning with Stochastic Interpolants | Hugo Negrel et.al. | 2508.04605 | null |
2025-08-07 | A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI | Nicola Casali et.al. | 2508.04588 | null |
2025-08-06 | Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming | A. Tarik Leblebici et.al. | 2508.04570 | null |
2025-08-06 | DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling | Yijie Li et.al. | 2508.04568 | null |
2025-08-06 | TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning | Yunbi Liu et.al. | 2508.04565 | null |
2025-08-06 | Drone Detection with Event Cameras | Gabriele Magrini et.al. | 2508.04564 | null |
2025-08-06 | One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose | Jinxi Liu et.al. | 2508.04559 | null |
2025-08-06 | Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis | Angang Zhang et.al. | 2508.04551 | null |
2025-08-06 | MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning | Quang-Trung Truong et.al. | 2508.04549 | null |
2025-08-06 | X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids | P. G. Heighway et.al. | 2508.04525 | null |
2025-08-06 | $β$-Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes | José A. S. Laranjeira et.al. | 2508.04506 | null |
2025-08-06 | QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution | Bowen Chai et.al. | 2508.04485 | null |
2025-08-06 | Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model | Hongxu Chen et.al. | 2508.04472 | null |
2025-08-06 | 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation | Shuzhou Yang et.al. | 2508.04467 | null |
2025-08-06 | Case Studies of Generative Machine Learning Models for Dynamical Systems | Nachiket U. Bapat et.al. | 2508.04459 | null |
2025-08-06 | Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach | Alvaro Garrido Perez et.al. | 2508.04435 | null |
2025-08-06 | Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis | Ethan Dack et.al. | 2508.04429 | null |
2025-08-06 | Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations | Nick Vogeley et.al. | 2508.04364 | null |
2025-08-06 | Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting | Eberhard Bänsch et.al. | 2508.04360 | null |
2025-08-06 | From Split to Share: Private Inference with Distributed Feature Sharing | Zihan Liu et.al. | 2508.04346 | null |
2025-08-06 | Performative Market Making | Charalampos Kleitsikas et.al. | 2508.04344 | null |
2025-08-06 | TempFlow-GRPO: When Timing Matters for GRPO in Flow Models | Xiaoxuan He et.al. | 2508.04324 | null |
2025-08-06 | Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation | Miquel Cantallops et.al. | 2508.04319 | null |
2025-08-06 | Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations | Margaux Boxho et.al. | 2508.04318 | null |
2025-08-06 | Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions | Yuga Iguchi et.al. | 2508.04287 | null |
2025-08-06 | S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge | JinYi Yoon et.al. | 2508.04271 | null |
2025-08-06 | Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications | Vladislav Pimanov et.al. | 2508.04261 | null |
2025-08-06 | High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting | Zhiren Ma et.al. | 2508.04259 | null |
2025-08-06 | Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions | Nikolaos A. Burger et.al. | 2508.04244 | null |
2025-08-06 | PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction | Muhua Zhu et.al. | 2508.04236 | null |
2025-08-06 | DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification | Saifullah Saifullah et.al. | 2508.04233 | null |
2025-08-06 | Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction | Yu Liu et.al. | 2508.04229 | null |
2025-08-06 | LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation | Kangrui Cen et.al. | 2508.04228 | null |
2025-08-06 | DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models | Saifullah Saifullah et.al. | 2508.04208 | null |
2025-08-06 | A background-free signal of jet-induced diffusion wake in quark-gluon plasma | Zhong Yang et.al. | 2508.04194 | null |
2025-08-06 | Deeper Inside Deep ViT | Sungrae Hong et.al. | 2508.04181 | null |
2025-08-06 | Quasi-Clique Discovery via Energy Diffusion | Yu Zhang et.al. | 2508.04174 | null |
2025-08-06 | Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles | Mathis Guéneau et.al. | 2508.04154 | null |
2025-08-06 | IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control | Lijuan Liu et.al. | 2508.04147 | null |
2025-08-06 | Polynomial-time sampling despite disorder chaos | Eric Ma et.al. | 2508.04133 | null |
2025-08-06 | Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation | Maximilian Ulmer et.al. | 2508.04122 | null |
2025-08-06 | Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework | Yi-Ting Chen et.al. | 2508.04090 | null |
2025-08-06 | Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes | Pierre Collet et.al. | 2508.04089 | null |
2025-08-06 | Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows | Murray Cutforth et.al. | 2508.04084 | null |
2025-08-06 | POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model | Huipeng Gu et.al. | 2508.04082 | null |
2025-08-06 | Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion | Fangmin Zhao et.al. | 2508.04055 | null |
2025-08-06 | Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation | Jiayi He et.al. | 2508.04049 | null |
2025-08-06 | Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws | L. Miguel Rodrigues et.al. | 2508.04023 | null |
2025-08-07 | S$^2$Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation | Weilun Feng et.al. | 2508.04016 | null |
2025-08-06 | Constructing Generalized Sample Transition Probabilities with Biased Simulations | Yanbin Wang et.al. | 2508.03977 | null |
2025-08-05 | Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm | Lin Zhang et.al. | 2508.03955 | null |
2025-08-05 | Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model | Shen Zhu et.al. | 2508.03925 | null |
2025-08-05 | Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations | R. R. Ashurov et.al. | 2508.03859 | null |
2025-08-05 | VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations | Yifei Zong et.al. | 2508.03839 | null |
2025-08-05 | HPSv3: Towards Wide-Spectrum Human Preference Score | Yuhang Ma et.al. | 2508.03789 | null |
2025-08-05 | LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation | Jianxiong Gao et.al. | 2508.03694 | null |
2025-08-05 | LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences | Ao Liang et.al. | 2508.03692 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Veila: Panoramic LiDAR Generation from a Monocular RGB Image | Youquan Liu et.al. | 2508.03690 | null |
2025-08-05 | OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World | Katherine Liu et.al. | 2508.03669 | null |
2025-08-05 | Rigidity for graph product von Neumann algebras | Camille Horbez et.al. | 2508.03662 | null |
2025-08-05 | DiWA: Diffusion Policy Adaptation with World Models | Akshay L Chandra et.al. | 2508.03645 | null |
2025-08-05 | Likelihood Matching for Diffusion Models | Lei Qian et.al. | 2508.03636 | null |
2025-08-05 | Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion | Shoji Mori et.al. | 2508.03624 | null |
2025-08-05 | Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions | Robert Richardson et.al. | 2508.03617 | null |
2025-08-05 | CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models | Ana Lawry Aguila et.al. | 2508.03594 | null |
2025-08-05 | Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection | Long Qian et.al. | 2508.03539 | null |
2025-08-05 | X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations | Silvia Pellegrini et.al. | 2508.03536 | null |
2025-08-05 | CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation | Kaishen Yuan et.al. | 2508.03535 | null |
2025-08-05 | LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation | Lianwei Yang et.al. | 2508.03485 | null |
2025-08-05 | When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models | Dasol Choi et.al. | 2508.03483 | null |
2025-08-05 | Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models | Hyungjin Kim et.al. | 2508.03481 | null |
2025-08-05 | VideoGuard: Protecting Video Content from Unauthorized Editing | Junjie Cao et.al. | 2508.03480 | null |
2025-08-05 | Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation | Zijun Zhan et.al. | 2508.03464 | null |
2025-08-06 | READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation | Haotian Wang et.al. | 2508.03457 | null |
2025-08-05 | Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws | Haruki Takemura et.al. | 2508.03455 | null |
2025-08-05 | RAAG: Ratio Aware Adaptive Guidance | Shangwen Zhu et.al. | 2508.03442 | null |
2025-08-05 | Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN | Shivangi Nigam et.al. | 2508.03415 | null |
2025-08-05 | SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models | Pingchuan Ma et.al. | 2508.03402 | null |
2025-08-05 | Delay-facilitated self-assembly in compartmentalized systems | Severin Angerpointner et.al. | 2508.03383 | null |
2025-08-05 | Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration | Ni Tang et.al. | 2508.03373 | null |
2025-08-05 | A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design | Xinyu Jin et.al. | 2508.03370 | null |
2025-08-05 | GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images | Yifei Sun et.al. | 2508.03357 | null |
2025-08-05 | Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises | Nikos I. Kavallaris et.al. | 2508.03354 | null |
2025-08-06 | Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation | Xunzhi Xiang et.al. | 2508.03334 | null |
2025-08-05 | Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation | Peiyu Wang et.al. | 2508.03320 | null |
2025-08-05 | Thermal Metamaterials for Enhanced Non-Fourier Heat Transport | Harry Mclean et.al. | 2508.03316 | null |
2025-08-05 | The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations | Xinqiu Chen et.al. | 2508.03311 | null |
2025-08-05 | Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation | Jun Luo et.al. | 2508.03300 | null |
2025-08-05 | Investigation on deep learning-based galaxy image translation models | Hengxin Ruan et.al. | 2508.03291 | null |
2025-08-07 | Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the Lp Lq Maximal Regularity Setting | Ken Furukawa et.al. | 2508.03288 | null |
2025-08-07 | Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension | Bao-Ngoc Tran et.al. | 2508.03268 | null |
2025-08-05 | Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation | Gang Dai et.al. | 2508.03256 | null |
2025-08-05 | V.I.P.: Iterative Online Preference Distillation for Efficient Video Diffusion Models | Jisoo Kim et.al. | 2508.03254 | null |
2025-08-05 | Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion | Wentao Qu et.al. | 2508.03252 | null |
2025-08-06 | FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles | Xingchao Yang et.al. | 2508.03241 | null |
2025-08-05 | BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models | Yu Pan et.al. | 2508.03221 | null |
2025-08-05 | Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level | Amir Seginer et.al. | 2508.03220 | null |
2025-08-05 | Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance | Eliot Beyler et.al. | 2508.03210 | null |
2025-08-05 | Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models | Muhammed Saeed et.al. | 2508.03199 | null |
2025-08-05 | An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys | Qianxi Zhu et.al. | 2508.03163 | null |
2025-08-05 | SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance | Yanshu Wang et.al. | 2508.03143 | null |
2025-08-05 | UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying | Chengyu Bai et.al. | 2508.03142 | null |
2025-08-05 | Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations | Igor G. Vladimirov et.al. | 2508.03135 | null |
2025-08-05 | Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback | Jingyi Chen et.al. | 2508.03123 | null |
2025-08-05 | Power System Voltage Stability Boundary: Computational Results and Applications | Zhenyao Li et.al. | 2508.03119 | null |
2025-08-05 | T2UE: Generating Unlearnable Examples from Text Descriptions | Xingjun Ma et.al. | 2508.03091 | null |
2025-08-05 | MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation | Youran Zhou et.al. | 2508.03083 | null |
2025-08-05 | Multi-human Interactive Talking Dataset | Zeyu Zhu et.al. | 2508.03050 | null |
2025-08-05 | Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling | Ruixing Zhang et.al. | 2508.03042 | null |
2025-08-05 | Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations | Dimitri Breda et.al. | 2508.03040 | null |
2025-08-05 | MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention | Qi Xie et.al. | 2508.03034 | null |
2025-08-05 | LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning | Jie Lin et.al. | 2508.03024 | null |
2025-08-05 | Generating Light-based Fingerprints for Indoor Localization | Hsun-Yu Lee et.al. | 2508.03011 | null |
2025-08-05 | Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models | Fan Yang et.al. | 2508.03006 | null |
2025-08-05 | Diffusion Models with Adaptive Negative Sampling Without External Resources | Alakh Desai et.al. | 2508.02973 | null |
2025-08-05 | Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver | Jonathan Patsenker et.al. | 2508.02964 | null |
2025-08-04 | X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio | Chenxu Zhang et.al. | 2508.02944 | null |
2025-08-04 | Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators | Sourojit Ghosh et.al. | 2508.02937 | null |
2025-08-06 | A nonstandard finite difference scheme for an SEIQR epidemiological PDE model | Achraf Zinihi et.al. | 2508.02928 | null |
2025-08-04 | Goal-Oriented Adaptive Finite Element Multilevel Quasi-Monte Carlo | Joakim Beck et.al. | 2508.02925 | null |
2025-08-04 | How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution | Minh-Hai Nguyen et.al. | 2508.02923 | null |
2025-08-04 | RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation | Mehrdad Moradi et.al. | 2508.02903 | null |
2025-08-04 | REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport | Farzad Beizaee et.al. | 2508.02889 | null |
2025-08-04 | Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters | Tara Dacunha et.al. | 2508.02837 | null |
2025-08-04 | DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework | Tongchun Zuo et.al. | 2508.02807 | null |
2025-08-04 | NASIM: Revealing the low surface brightness Universe from legacy VISTA data | Elham Saremi et.al. | 2508.02780 | null |
2025-08-04 | D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss | Guowei Zou et.al. | 2508.02644 | null |
2025-08-04 | CAK: Emergent Audio Effects from Minimal Deep Learning | Austin Rockman et.al. | 2508.02643 | null |
2025-08-04 | Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters | Pranshu Maan et.al. | 2508.02638 | null |
2025-08-04 | ReMoMask: Retrieval-Augmented Masked Motion Generation | Zhengdao Li et.al. | 2508.02605 | null |
2025-08-04 | Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction | Yuerong Song et.al. | 2508.02558 | null |
2025-08-04 | From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC | Jingsong Liu et.al. | 2508.02528 | null |
2025-08-06 | xDeepServe: Model-as-a-Service on Huawei CloudMatrix384 | Ao Xiao et.al. | 2508.02520 | null |
2025-08-04 | QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots | Sheng Wu et.al. | 2508.02512 | null |
2025-08-04 | Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference | Lars Dingeldein et.al. | 2508.02509 | null |
2025-08-04 | Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation | Khoa Tuan Nguyen et.al. | 2508.02482 | null |
2025-08-04 | PoseGuard: Pose-Guided Generation with Safety Guardrails | Kongxin Wang et.al. | 2508.02476 | null |
2025-08-04 | Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn$_3$Sn(0001) noncollinear antiferromagnetic films | Surya N. Panda et.al. | 2508.02415 | null |
2025-08-04 | Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion | Yimeng Liu et.al. | 2508.02409 | null |
2025-08-04 | Inference-time Scaling for Diffusion-based Audio Super-resolution | Yizhu Jin et.al. | 2508.02391 | null |
2025-08-04 | Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction | Matus Krajcovic et.al. | 2508.02376 | null |
2025-08-04 | Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory | Marian Lupascu et.al. | 2508.02363 | null |
2025-08-04 | Qwen-Image Technical Report | Chenfei Wu et.al. | 2508.02324 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-05 | LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training | Sikui Zhang et.al. | 2508.02308 | null |
2025-08-05 | Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor | Xiaoliu Guan et.al. | 2508.02240 | null |
2025-08-04 | Abstract Formulation of Mean-Field Models and Propagation of Chaos | Tau Shean Lim et.al. | 2508.02224 | null |
2025-08-04 | A theory of strange metals | Simone Fratini et.al. | 2508.02221 | null |
2025-08-04 | Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference | Yuxuan Song et.al. | 2508.02193 | null |
2025-08-04 | DreamPainter: Image Background Inpainting for E-commerce Scenarios | Sijie Zhao et.al. | 2508.02155 | null |
2025-08-04 | AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models | Die Chen et.al. | 2508.02151 | null |
2025-08-04 | VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling | Yuru Xiao et.al. | 2508.02129 | null |
2025-08-04 | AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation | Zhiwen Li et.al. | 2508.02107 | null |
2025-08-04 | Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis | Kaiyang Ji et.al. | 2508.02106 | null |
2025-08-04 | “Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch | Yiqing Xu et.al. | 2508.02093 | null |
2025-08-04 | Unsupervised Multi-channel Speech Dereverberation via Diffusion | Yulun Wu et.al. | 2508.02071 | null |
2025-08-04 | “Set It Up”: Functional Object Arrangement with Compositional Generative Models | Yiqing Xu et.al. | 2508.02068 | null |
2025-08-04 | StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion | Haoxin Yang et.al. | 2508.02056 | null |
2025-08-04 | Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation | Yuli Liu et.al. | 2508.02050 | null |
2025-08-04 | Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction | Hui Xie et.al. | 2508.02043 | null |
2025-08-04 | Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging | XuHao Yu et.al. | 2508.02025 | null |
2025-08-04 | Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths | Le Tri Dat et.al. | 2508.02024 | null |
2025-08-05 | Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type | Pierluigi Colli et.al. | 2508.02021 | null |
2025-08-04 | Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention | Kyungmin Jo et.al. | 2508.02004 | null |
2025-08-04 | Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization | Yu Lei et.al. | 2508.02002 | null |
2025-08-04 | Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids | Toma Yoneya et.al. | 2508.01991 | null |
2025-08-04 | Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion | Shutong Qiao et.al. | 2508.01987 | null |
2025-08-04 | Diffusion models for inverse problems | Hyungjin Chung et.al. | 2508.01975 | null |
2025-08-03 | Distributed games with jumps: An $α$-potential game approach | Xin Guo et.al. | 2508.01929 | null |
2025-08-03 | On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis | Siamak Kazemzadeh Hannani et.al. | 2508.01890 | null |
2025-08-03 | DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization | Siran Peng et.al. | 2508.01873 | null |
2025-08-05 | Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures | Fanze Kong et.al. | 2508.01854 | null |
2025-08-03 | Diffusion-based 3D Hand Motion Recovery with Intuitive Physics | Yufei Zhang et.al. | 2508.01835 | null |
2025-08-03 | Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder | Runxuan Yang et.al. | 2508.01796 | null |
2025-08-03 | Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus | Peng Gao et.al. | 2508.01794 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-08-03 | Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting | Rui Ding et.al. | 2508.01761 | null |
2025-08-03 | Dynamic Coupling of Infiltration-Soil Moisture Feedback: Emergent Vegetation Patterns in a Water-Vegetation Model | Juan Yan et.al. | 2508.01755 | null |
2025-08-03 | Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design | Xiangwang Hou et.al. | 2508.01745 | null |
2025-08-05 | Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization | Xin Ding et.al. | 2508.01725 | null |
2025-08-03 | ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models | Haoyue Tan et.al. | 2508.01719 | null |
2025-08-03 | Versatile Transition Generation with Image-to-Video Diffusion | Zuhao Yang et.al. | 2508.01698 | null |
2025-08-03 | DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing | Yufeng Chi et.al. | 2508.01684 | null |
2025-08-03 | DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding | Hanqing Wang et.al. | 2508.01651 | null |
2025-08-03 | StrandDesigner: Towards Practical Strand Generation with Sketch Guidance | Na Zhang et.al. | 2508.01650 | null |
2025-08-03 | Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization | Shoya Sasaki et.al. | 2508.01640 | null |
2025-08-03 | VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation | Xuanran Zhai et.al. | 2508.01622 | null |
2025-08-03 | LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding | Xuanzhao Dong et.al. | 2508.01617 | null |
2025-08-03 | TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data | Yandong Yan et.al. | 2508.01615 | null |
2025-08-03 | Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models | Haoran Dai et.al. | 2508.01605 | null |
2025-08-03 | Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment | Lubin Gan et.al. | 2508.01602 | null |
2025-08-03 | CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation | Sung-Wook Lee et.al. | 2508.01600 | null |
2025-08-03 | Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching | Juyan Zhang et.al. | 2508.01597 | null |
2025-08-03 | A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation | Hua Yu et.al. | 2508.01590 | null |
2025-08-03 | Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences | Euihyun Kim et.al. | 2508.01589 | null |
2025-08-03 | Diffusion Models for Future Networks and Communications: A Comprehensive Survey | Nguyen Cong Luong et.al. | 2508.01586 | null |
2025-08-03 | Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation | Lei Xie et.al. | 2508.01577 | null |
2025-08-03 | Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature | Xiao-Jie Wang et.al. | 2508.01567 | null |
2025-08-03 | MGCR-Net: Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection | Chengming Wang et.al. | 2508.01555 | null |
2025-08-02 | A Reward-Directed Diffusion Framework for Generative Design Optimization | Hadi Keramati et.al. | 2508.01509 | null |
2025-08-02 | Instruction-based Time Series Editing | Jiaxing Qiu et.al. | 2508.01504 | null |
2025-08-02 | The role of zealots in the spread of linguistic traits | Vivian Dornelas et.al. | 2508.01500 | null |
2025-08-02 | TreeDiff: AST-Guided Code Generation with Diffusion LLMs | Yiming Zeng et.al. | 2508.01473 | null |
2025-08-02 | Regression Augmentation With Data-Driven Segmentation | Shayan Alahyari et.al. | 2508.01455 | null |
2025-08-02 | Physically-based Lighting Augmentation for Robotic Manipulation | Shutong Jin et.al. | 2508.01442 | null |
2025-08-02 | Viscosity Stabilized Plug-and-Play Reconstruction | Arghya Sinha et.al. | 2508.01441 | null |
2025-08-02 | Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling | Le Trong Thanh Bui et.al. | 2508.01436 | null |
2025-08-02 | Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas? | Tarian Fu et.al. | 2508.01408 | null |
2025-08-02 | StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints | Lingxiao Chen et.al. | 2508.01335 | null |
2025-08-05 | Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion | Konstantinos Moutselos et.al. | 2508.01334 | null |
2025-08-02 | LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points | Xuemiao Zhang et.al. | 2508.01317 | null |
2025-08-02 | CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis | Alec Sargood et.al. | 2508.01292 | null |
2025-08-02 | PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation | Zonglei Jing et.al. | 2508.01272 | null |
2025-08-02 | Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling | Lexiao Zou et.al. | 2508.01264 | null |
2025-08-02 | NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection | Jiazhen Yan et.al. | 2508.01248 | null |
2025-08-02 | Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model | Jing Gao et.al. | 2508.01246 | null |
2025-08-02 | Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal | Xiangqi Liu et.al. | 2508.01241 | null |
2025-08-02 | SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches | Cheng Tan et.al. | 2508.01237 | null |
2025-08-02 | Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system | Jiyong Kim et.al. | 2508.01230 | null |
2025-08-02 | StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling | Yuanlin Yang et.al. | 2508.01215 | null |
2025-08-02 | Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory | Nabin Upadhya Dhakal et.al. | 2508.01194 | null |
2025-08-02 | DELTAv2: Accelerating Dense 3D Tracking | Tuan Duc Ngo et.al. | 2508.01170 | null |
2025-08-02 | RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots | Jing Tang et.al. | 2508.01165 | null |
2025-08-02 | LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation | Xinyu Yan et.al. | 2508.01152 | null |
2025-08-02 | Personalized Safety Alignment for Text-to-Image Diffusion Models | Yu Lei et.al. | 2508.01151 | null |
2025-08-02 | Dataset Condensation with Color Compensation | Huyu Wu et.al. | 2508.01139 | null |
2025-08-01 | Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models | Jinsong Li et.al. | 2508.00819 | null |
2025-08-01 | Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding | Rui Chen et.al. | 2508.00800 | null |
2025-08-01 | Video Generators are Robot Policies | Junbang Liang et.al. | 2508.00795 | null |
2025-08-01 | SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation | Kien T. Pham et.al. | 2508.00782 | null |
2025-08-01 | Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data | Timur Sattarov et.al. | 2508.00758 | null |
2025-08-01 | LeakyCLIP: Extracting Training Data from CLIP | Yunhao Chen et.al. | 2508.00756 | null |
2025-08-01 | SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation | Prerana Ramkumar et.al. | 2508.00750 | null |
2025-08-01 | AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation | Le Wang et.al. | 2508.00733 | null |
2025-08-01 | YOLO-Count: Differentiable Object Counting for Text-to-Image Generation | Guanning Zeng et.al. | 2508.00728 | null |
2025-08-01 | Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls | Elisa Affili et.al. | 2508.00713 | null |
2025-08-01 | D3: Training-Free AI-Generated Video Detection Using Second-Order Features | Chende Zheng et.al. | 2508.00701 | null |
2025-08-01 | On-Device Diffusion Transformer Policy for Efficient Robot Manipulation | Yiming Wu et.al. | 2508.00697 | null |
2025-08-01 | Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network | Young-ho Cho et.al. | 2508.00692 | null |
2025-08-01 | Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators | Albert Matveev et.al. | 2508.00643 | null |
2025-08-01 | Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification | Luisa Gallée et.al. | 2508.00639 | null |
2025-08-01 | DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior | Junzhe Lu et.al. | 2508.00599 | null |
2025-08-01 | Wukong Framework for Not Safe For Work Detection in Text-to-Image systems | Mingrui Liu et.al. | 2508.00591 | null |
2025-08-01 | Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints | Jens U. Kreber et.al. | 2508.00558 | null |
2025-08-01 | DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification | Chihan Huang et.al. | 2508.00552 | null |
2025-08-01 | Video Color Grading via Look-Up Table Generation | Seunghyun Shin et.al. | 2508.00548 | null |
2025-08-01 | HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning | Carlo Alessi et.al. | 2508.00491 | null |
2025-08-01 | LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer | Yuzhuo Chen et.al. | 2508.00477 | null |
2025-08-01 | A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces | Leonidas Akritidis et.al. | 2508.00472 | null |
2025-08-01 | Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution | Yiwen Wang et.al. | 2508.00471 | null |
2025-08-01 | AutoDebias: Automated Framework for Debiasing Text-to-Image Models | Hongyi Cai et.al. | 2508.00445 | null |
2025-08-01 | SDMatte: Grafting Diffusion Models for Interactive Matting | Longfei Huang et.al. | 2508.00443 | null |
2025-08-01 | Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection | Sumin Seo et.al. | 2508.00438 | null |
2025-08-01 | Accurate Latent Inversion for Generative Image Steganography via Rectified Flow | Yuqi Qian et.al. | 2508.00434 | null |
2025-08-01 | Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation | Nan Xiang et.al. | 2508.00428 | null |
2025-08-01 | Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting | Seunggeun Chi et.al. | 2508.00427 | null |
2025-08-01 | Collimated QED Cascades with Curved Plasma Mirror | Xuesong Geng et.al. | 2508.00417 | null |
2025-08-01 | DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space | Junyu Chen et.al. | 2508.00413 | null |
2025-08-01 | Sortblock: Similarity-Aware Feature Reuse for Diffusion Model | Hanqi Chen et.al. | 2508.00412 | null |
2025-08-01 | Predictive information criterion for jump diffusion processes | Yuma Uehara et.al. | 2508.00411 | null |
2025-08-01 | Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency | Xi Xue et.al. | 2508.00397 | null |
2025-08-01 | Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization | Yoonhyuk Choi et.al. | 2508.00357 | null |
2025-08-01 | BOOD: Boundary-based Out-Of-Distribution Data Generation | Qilin Liao et.al. | 2508.00350 | null |
2025-08-01 | Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak | SK Injamul Hoque et.al. | 2508.00339 | null |
2025-08-01 | Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems | Surya Narayan Maharana et.al. | 2508.00329 | null |
2025-08-01 | Steering Guidance for Personalized Text-to-Image Diffusion Models | Sunghyun Park et.al. | 2508.00319 | null |
2025-08-01 | GV-VAD: Exploring Video Generation for Weakly-Supervised Video Anomaly Detection | Suhang Cai et.al. | 2508.00312 | null |
2025-08-01 | TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps | Zehui Xu et.al. | 2508.00303 | null |
2025-08-01 | Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence | Danzhen Fu et.al. | 2508.00299 | null |
2025-08-01 | AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer | Jin Lyu et.al. | 2508.00298 | null |
2025-08-01 | TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models | Christian Simon et.al. | 2508.00289 | null |
2025-08-01 | UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents | Jianqiang Xiao et.al. | 2508.00288 | null |
2025-08-01 | Towards Robust Semantic Correspondence: A Benchmark and Insights | Wenyue Chong et.al. | 2508.00272 | null |
2025-08-01 | Jet Image Generation in High Energy Physics Using Diffusion Models | Victor D. Martinez et.al. | 2508.00250 | null |
2025-07-31 | Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b | Thomas Konings et.al. | 2508.00177 | null |
2025-07-31 | DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission | Fupei Guo et.al. | 2508.00172 | null |
2025-07-31 | World Consistency Score: A Unified Metric for Video Generation Quality | Akshat Rakheja et.al. | 2508.00144 | null |
2025-07-31 | Entanglement spreading and emergent locality in Brownian SYK chains | Onkar Parrikar et.al. | 2508.00060 | null |
2025-07-31 | Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion | Tong Nie et.al. | 2508.00037 | null |
2025-07-31 | Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis | Bowen Zhang et.al. | 2507.23785 | null |
2025-07-31 | SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions | Jessica Bader et.al. | 2507.23784 | null |
2025-07-31 | General diffusions on metric graphs as limits of time-space Markov Chains | Alexis Anagnostakis et.al. | 2507.23724 | null |
2025-07-31 | DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching | Emery Pierson et.al. | 2507.23715 | null |
2025-07-31 | CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation | Zhaoyue Xu et.al. | 2507.23693 | null |
2025-07-31 | UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration | Zihan Cheng et.al. | 2507.23685 | null |
2025-07-31 | I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation | Jialei Chen et.al. | 2507.23683 | null |
2025-07-31 | Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics | Alexis Béjar-López et.al. | 2507.23680 | null |
2025-07-31 | DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data | Rabeya Tus Sadia et.al. | 2507.23676 | null |
2025-07-31 | One-Step Flow Policy Mirror Descent | Tianyi Chen et.al. | 2507.23675 | null |
2025-07-31 | Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis | Kunpeng Qiu et.al. | 2507.23652 | null |
2025-07-31 | A stochastic heat equation with non-locally Lipschitz coefficients | Le Chen et.al. | 2507.23637 | null |
2025-07-31 | DivControl: Knowledge Diversion for Controllable Image Generation | Yucheng Xie et.al. | 2507.23620 | null |
2025-08-02 | MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction | Zijian Dong et.al. | 2507.23597 | null |
2025-07-31 | Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization | Michael L. Li et.al. | 2507.23576 | null |
2025-08-01 | H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation | Hongzhe Bi et.al. | 2507.23523 | null |
2025-07-31 | Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings | K. V. Nikolaev et.al. | 2507.23513 | null |
2025-07-31 | Emergence of long-range non-equilibrium correlations in free liquid diffusion | Marco Bussoletti et.al. | 2507.23507 | null |
2025-07-31 | Digital literacy interventions can boost humans in discerning deepfakes | Dominique Geissler et.al. | 2507.23492 | null |
2025-07-31 | Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion | Mutian Xu et.al. | 2507.23483 | null |
2025-07-31 | Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models | Long Chen et.al. | 2507.23443 | null |
2025-07-31 | Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories | Lemar Abdi et.al. | 2507.23411 | null |
2025-07-31 | An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients | Yuan-Yuan Huang et.al. | 2507.23408 | null |
2025-07-31 | UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries | Yijie Zhu et.al. | 2507.23372 | null |
2025-07-31 | IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025 | Radu-Andrei Bourceanu et.al. | 2507.23357 | null |
2025-07-31 | Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads | Yingjie Zhou et.al. | 2507.23343 | null |
2025-07-31 | EMU and the DRAGNs I: A Catalogue of DRAGNs | Ray P. Norris et.al. | 2507.23337 | null |
2025-07-31 | Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions | Kristen C. Dage et.al. | 2507.23332 | null |
2025-07-31 | The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models | Alfio Ferrara et.al. | 2507.23313 | null |
2025-07-31 | PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving | Xuewei Tang et.al. | 2507.23309 | null |
2025-08-01 | Training-free Geometric Image Editing on Diffusion Models | Hanshen Zhu et.al. | 2507.23300 | null |
2025-07-31 | UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing | Hao Tang et.al. | 2507.23278 | null |
2025-07-31 | PixNerd: Pixel Neural Field Diffusion | Shuai Wang et.al. | 2507.23268 | null |
2025-07-31 | Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas | Lei Xie et.al. | 2507.23245 | null |
2025-07-31 | BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks | Zhuoyin Dai et.al. | 2507.23236 | null |
2025-07-31 | Adversarial-Guided Diffusion for Multimodal LLM Attacks | Chengwei Xia et.al. | 2507.23202 | null |
2025-07-30 | X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention | Xiaochen Zhao et.al. | 2507.23143 | null |
2025-07-30 | Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations | Jin Kunwoo Lee et.al. | 2507.23102 | null |
2025-07-30 | Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems | Jonathan Monsalve et.al. | 2507.23065 | null |
2025-07-30 | Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation | Alexandru Buburuzan et.al. | 2507.23058 | null |
2025-07-30 | Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube | Alejandra Granados et.al. | 2507.23040 | null |
2025-07-30 | Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction | Giuseppe Cartella et.al. | 2507.23021 | null |
2025-07-30 | Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods | Siwoo Park et.al. | 2507.23010 | null |
2025-07-30 | LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis | Jamil Fayyad et.al. | 2507.23001 | null |
2025-07-29 | Neural Autoregressive Modeling of Brain Aging | Ridvan Yesiloglu et.al. | 2507.22954 | null |
2025-07-30 | AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS | Hai Ling et.al. | 2507.22880 | null |
2025-07-30 | Robust Contract with Career Concerns | Tan Gan et.al. | 2507.22852 | null |
2025-07-30 | Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication | Yidong Ren et.al. | 2507.22851 | null |
2025-07-30 | DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion | Qingcheng Zhao et.al. | 2507.22825 | null |
2025-07-30 | Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit | Md. Sad Abdullah Sami et.al. | 2507.22803 | null |
2025-07-31 | G-Core: A Simple, Scalable and Balanced RLHF Trainer | Junyu Wu et.al. | 2507.22789 | null |
2025-07-30 | DO-EM: Density Operator Expectation Maximization | Adit Vishnu et.al. | 2507.22786 | null |
2025-08-01 | Next Tokens Denoising for Speech Synthesis | Yanqing Liu et.al. | 2507.22746 | null |
2025-07-30 | Zero-Shot Image Anomaly Detection Using Generative Foundation Models | Lemar Abdi et.al. | 2507.22692 | null |
2025-07-30 | LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing | Federico Girella et.al. | 2507.22627 | null |
2025-07-30 | Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions | Yiting Qu et.al. | 2507.22617 | null |
2025-07-30 | Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model | Daehee Park et.al. | 2507.22615 | null |
2025-07-30 | ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning | Xiefan Guo et.al. | 2507.22604 | null |
2025-07-30 | Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice | Aaqib Zahoor et.al. | 2507.22589 | null |
2025-07-30 | DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement | Chang Huang et.al. | 2507.22501 | null |
2025-07-30 | LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning | Xiang Li et.al. | 2507.22499 | null |
2025-07-30 | Visual Language Models as Zero-Shot Deepfake Detectors | Viacheslav Pirogov et.al. | 2507.22469 | null |
2025-07-30 | TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation | Jiuming Liu et.al. | 2507.22454 | null |
2025-07-30 | GVD: Guiding Video Diffusion Model for Scalable Video Distillation | Kunyang Li et.al. | 2507.22360 | null |
2025-07-29 | Trade-offs in Image Generation: How Do Different Dimensions Interact? | Sicheng Zhang et.al. | 2507.22100 | null |
2025-07-29 | X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again | Zigang Geng et.al. | 2507.22058 | null |
2025-07-30 | See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs | Ziyun Dai et.al. | 2507.22003 | null |
2025-07-29 | Enhancing Generalization in Data-free Quantization via Mixup-class Prompting | Jiwoong Park et.al. | 2507.21947 | null |
2025-07-29 | Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is | Ahmed B Mustafa et.al. | 2507.21820 | null |
2025-07-29 | Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection | Yanxing Liu et.al. | 2507.21816 | null |
2025-07-29 | MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE | Junzhe Li et.al. | 2507.21802 | null |
2025-07-29 | APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing | Sangmin Han et.al. | 2507.21690 | null |
2025-07-29 | GuidPaint: Class-Guided Image Inpainting with Diffusion Models | Qimin Wang et.al. | 2507.21627 | null |
2025-07-29 | Locally Controlled Face Aging with Latent Diffusion Models | Lais Isabelle Alves dos Santos et.al. | 2507.21600 | null |
2025-07-29 | Neural network enabled wide field-of-view imaging with hyperbolic metalenses | Joel Yeo et.al. | 2507.21562 | null |
2025-07-29 | Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance | Mengling Xu et.al. | 2507.21529 | null |
2025-07-29 | BANG: Dividing 3D Assets via Generative Exploded Dynamics | Longwen Zhang et.al. | 2507.21493 | null |
2025-07-29 | Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training | Sodtavilan Odonchimed et.al. | 2507.21452 | null |
2025-07-30 | Multimodal LLMs as Customized Reward Models for Text-to-Image Generation | Shijie Zhou et.al. | 2507.21391 | null |
2025-07-28 | Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation | I-Hsiang Chen et.al. | 2507.21367 | null |
2025-07-28 | A Contrastive Diffusion-based Network (CDNet) for Time Series Classification | Yaoyu Zhang et.al. | 2507.21357 | null |
2025-07-28 | HDR Environment Map Estimation with Latent Diffusion Models | Jack Hilliard et.al. | 2507.21261 | null |
2025-07-28 | Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors | Amartya Banerjee et.al. | 2507.21260 | null |
2025-07-28 | Learning from Limited and Imperfect Data | Harsh Rangwani et.al. | 2507.21205 | null |
2025-08-01 | Flow Matching Policy Gradients | David McAllister et.al. | 2507.21053 | null |
2025-07-29 | JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1 | Xinhan Di et.al. | 2507.20987 | null |
2025-07-28 | Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision | Xiao Fang et.al. | 2507.20976 | null |
Industry
Publish Date | Title | Authors | PDF | Code |
---|---|---|---|---|
2025-08-28 | Leveraging Generative Models for Real-Time Query-Driven Text Summarization in Large-Scale Web Search | Zeyu Xiong et.al. | 2508.20559 | null |
2025-08-28 | Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation | Jiusi Li et.al. | 2508.20471 | null |
2025-08-28 | MedFoundationHub: A Lightweight and Secure Toolkit for Deploying Medical Vision Language Foundation Models | Xiao Li et.al. | 2508.20345 | null |
2025-08-26 | APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration | Shaobo Ma et.al. | 2508.19087 | null |
2025-08-26 | TaiBai: A fully programmable brain-inspired processor with topology-aware efficiency | Qianpeng Li et.al. | 2508.18961 | null |
2025-08-26 | ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive | Xinhao Luo et.al. | 2508.18850 | null |
2025-08-26 | Strata: Hierarchical Context Caching for Long Context Language Model Serving | Zhiqiang Xie et.al. | 2508.18572 | null |
2025-08-25 | Practical GPU Choices for Earth Observation: ResNet-50 Training Throughput on Integrated, Laptop, and Cloud Accelerators | Ritvik Chaturvedi et.al. | 2508.18206 | null |
2025-08-24 | A Synthetic Dataset for Manometry Recognition in Robotic Applications | Pedro Antonio Rabelo Saraiva et.al. | 2508.17468 | null |
2025-08-24 | MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models | Krishna Teja Chitty-Venkata et.al. | 2508.17467 | null |
2025-08-23 | DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method | Qingwen Zhang et.al. | 2508.17054 | null |
2025-08-23 | A Novel Local Focusing Mechanism for Deepfake Detection Generalization | Mingliang Li et.al. | 2508.17029 | null |
2025-08-22 | GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI’s Open-Weight Mixture of Experts Model | Deepak Kumar et.al. | 2508.16700 | null |
2025-08-17 | GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems | Louie Sinadjan et.al. | 2508.16639 | null |
2025-08-22 | GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving | Qunyou Liu et.al. | 2508.16449 | null |
2025-08-22 | Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars | NVIDIA et.al. | 2508.16401 | null |
2025-08-27 | Hybrid Classical-Quantum Supercomputing: A demonstration of a multi-user, multi-QPU and multi-GPU environment | Mateusz Slysz et.al. | 2508.16297 | null |
2025-08-22 | Bare-Metal RISC-V + NVDLA SoC for Efficient Deep Learning Inference | Vineet Kumar et.al. | 2508.16095 | null |
2025-08-22 | A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection | Qifeng Liu et.al. | 2508.16069 | null |
2025-08-21 | graph framework: A Domain Specific Compiler for Building Physics Applications | M. Cianciosa et.al. | 2508.15967 | null |
2025-08-17 | Better Together: Leveraging Multiple Digital Twins for Deployment Optimization of Airborne Base Stations | Mauro Belgiovine et.al. | 2508.15816 | null |
2025-08-25 | DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians | Cong Wang et.al. | 2508.15376 | null |
2025-08-20 | Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds | Jia Lu et.al. | 2508.14892 | null |
2025-08-20 | Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach | Qiao Zhang et.al. | 2508.14848 | null |
2025-08-20 | FakeHunter: Multimodal Step-by-Step Reasoning for Explainable Video Forensics | Chen Chen et.al. | 2508.14581 | null |
2025-08-25 | NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model | NVIDIA et.al. | 2508.14444 | null |
2025-08-19 | The 9th AI City Challenge | Zheng Tang et.al. | 2508.13564 | null |
2025-08-18 | Optimizing Allreduce Operations for Heterogeneous Architectures with Multiple Processes per GPU | Michael Adams et.al. | 2508.13397 | null |
2025-08-18 | X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms | Yueming Yuan et.al. | 2508.13337 | null |
2025-07-28 | Sustainable AI Training via Hardware-Software Co-Design on NVIDIA, AMD, and Emerging GPU Architectures | Yashasvi Makin et.al. | 2508.13163 | null |
2025-08-18 | CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction | Zhiwei Ning et.al. | 2508.12917 | null |
2025-08-17 | CarelessWhisper: Turning Whisper into a Causal Streaming Model | Tomer Krichli et.al. | 2508.12301 | null |
2025-08-17 | TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform | Jun Liu et.al. | 2508.12279 | null |
2025-08-17 | ATLAS: AI-Native Receiver Test-and-Measurement by Leveraging AI-Guided Search | Mauro Belgiovine et.al. | 2508.12204 | null |
2025-08-16 | Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization | Kousuke Nakano et.al. | 2508.12033 | null |
2025-08-18 | Visual Perception Engine: Fast and Flexible Multi-Head Inference for Robotic Vision Tasks | Jakub Łucki et.al. | 2508.11584 | null |
2025-08-15 | Efficient GPU-Centered Singular Value Decomposition Using the Divide-and-Conquer Method | Shifang Liu et.al. | 2508.11467 | null |
2025-08-15 | Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking | Haonan Zhang et.al. | 2508.11323 | null |
2025-08-14 | EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training | Hasibul Jamil et.al. | 2508.11035 | null |
2025-08-12 | ViPE: Video Pose Engine for 3D Geometric Perception | Jiahui Huang et.al. | 2508.10934 | null |
2025-08-13 | GPU accelerated MHD in the DISPATCH framework using directive-based programming | Michael Haahr et.al. | 2508.09568 | null |
2025-08-13 | UWBa at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval | Ladislav Lenc et.al. | 2508.09517 | null |
2025-08-13 | Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving | Guangxun Zhu et.al. | 2508.09404 | null |
2025-08-07 | Camel: Energy-Aware LLM Inference on Resource-Constrained Devices | Hao Xu et.al. | 2508.09173 | null |
2025-08-12 | Profiling Large Language Model Inference on Apple Silicon: A Quantization Perspective | Afsara Benazir et.al. | 2508.08531 | null |
2025-08-11 | Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson – Extended | Abhinaba Chakraborty et.al. | 2508.08430 | null |
2025-08-10 | Weather-Driven Agricultural Decision-Making Using Digital Twins Under Imperfect Conditions | Tamim Ahmed et.al. | 2508.08326 | null |
2025-08-11 | Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions | Bangsheng Tang et.al. | 2508.08192 | null |
2025-08-11 | TLV-HGNN: Thinking Like a Vertex for Memory-efficient HGNN Inference | Dengke Han et.al. | 2508.07796 | null |
2025-08-10 | An Experimental Exploration of In-Memory Computing for Multi-Layer Perceptrons | Pedro Carrinho et.al. | 2508.07317 | null |
2025-08-09 | The Fused Kernel Library: A C++ API to Develop Highly-Efficient GPU Libraries | Oscar Amoros et.al. | 2508.07071 | null |
2025-08-27 | From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving | Antonio Guillen-Perez et.al. | 2508.07029 | null |
2025-08-09 | A Portable Multi-GPU Solver for Collisional Plasmas with Coulombic Interactions | James Almgren-Bell et.al. | 2508.06771 | null |
2025-08-02 | PiKV: KV Cache Management System for Mixture of Experts | Dong Liu et.al. | 2508.06526 | null |
2025-08-08 | MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows | Xiquan Li et.al. | 2508.06098 | null |
2025-08-07 | CleanUpBench: Embodied Sweeping and Grasping Benchmark | Wenbo Li et.al. | 2508.05543 | null |
2025-08-07 | MedMambaLite: Hardware-Aware Mamba for Medical Image Classification | Romina Aalishah et.al. | 2508.05049 | null |
2025-08-07 | CSRAP: Enhanced Canvas Attention Scheduling for Real-Time Mission Critical Perception | Md Iftekharul Islam Sakib et.al. | 2508.04976 | null |
2025-08-07 | Real-Time Doppler and Ionospheric Dispersion Correction Techniques for Arbitrary Waveforms Utilizing GPU Compute | Daniel J. Vickers et.al. | 2508.04951 | null |
2025-08-05 | AIC CTU@FEVER 8: On-premise fact checking through long context RAG | Herbert Ullrich et.al. | 2508.04390 | null |
2025-08-06 | A Foundation Model for DAS Signal Recognition and Visual Prompt Tuning of the Pre-trained Model for Downstream Tasks | Kun Gui et.al. | 2508.04316 | null |
2025-08-11 | Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems | Luai Abuelsamen et.al. | 2508.04146 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Understanding the Landscape of Ampere GPU Memory Errors | Zhu Zhu et.al. | 2508.03513 | null |
2025-08-05 | Full-History Graphs with Edge-Type Decoupled Networks for Temporal Reasoning | Osama Mohammed et.al. | 2508.03251 | null |
2025-08-04 | MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models | Wenyuan Liu et.al. | 2508.02343 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-04 | CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis | Yuzhuang Xu et.al. | 2508.02322 | null |
2025-08-04 | GPU in the Blind Spot: Overlooked Security Risks in Transportation | Sefatun-Noor Puspa et.al. | 2508.01995 | null |
2025-08-03 | Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving | Hunter Schofield et.al. | 2508.01922 | null |
2025-08-02 | A Parallel Algorithm for Finding Robust Spanners in Large Social Networks | Arindam Khanda et.al. | 2508.01485 | null |
2025-08-01 | Hestia: Hierarchical Next-Best-View Exploration for Systematic Intelligent Autonomous Data Collection | Cheng-You Lu et.al. | 2508.01014 | null |
2025-08-01 | Optimal Scheduling Algorithms for LLM Inference: Theory and Practice | Agrim Bari et.al. | 2508.01002 | null |
2025-07-29 | Forecasting LLM Inference Performance via Hardware-Agnostic Analytical Modeling | Rajeev Patwari et.al. | 2508.00904 | null |
2025-08-12 | Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving | Stefan Englmeier et.al. | 2508.00589 | null |
2025-08-09 | DGEMM without FP64 Arithmetic – Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme | Daichi Mukunoki et.al. | 2508.00441 | null |
2025-08-01 | On Learning Closed-Loop Probabilistic Multi-Agent Simulator | Juanwu Lu et.al. | 2508.00384 | null |
2025-08-01 | Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization | Belman Jahir Rodriguez et.al. | 2508.00307 | null |
2025-07-31 | FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction | Donghyun Lee et.al. | 2507.23480 | null |
2025-07-31 | InterfO-RAN: Real-Time In-band Cellular Uplink Interference Detection with GPU-Accelerated dApps | Neagin Neasamoni Santhi et.al. | 2507.23177 | null |
2025-07-30 | On the Sustainability of AI Inferences in the Edge | Ghazal Sobhani et.al. | 2507.23093 | null |
2025-07-30 | Early Goal-Guided Multi-Scale Fusion for Real-Time Vision-Language Driving | Santosh Patapati et.al. | 2507.23042 | null |
2025-07-28 | Lightweight Transformer-Driven Segmentation of Hotspots and Snail Trails in Solar PV Thermal Imagery | Deepak Joshi et.al. | 2507.20680 | null |
2025-07-27 | SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer Pansharpening | Zeyu Xia et.al. | 2507.20311 | null |
2025-07-26 | Racing to Idle: Energy Efficiency of Matrix Multiplication on Heterogeneous CPU and GPU Architectures | Mufakir Qamar Ansari et.al. | 2507.20063 | null |
2025-07-26 | A Fast Parallel Median Filtering Algorithm Using Hierarchical Tiling | Louis Sugy et.al. | 2507.19926 | null |
2025-08-02 | GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting | Baijun Ye et.al. | 2507.19451 | null |
2025-07-25 | TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability | Mohammad Aflah Khan et.al. | 2507.19419 | null |
2025-07-25 | LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences | Yusuke Hirota et.al. | 2507.19362 | null |
2025-07-25 | SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models | Zhen Wan et.al. | 2507.19361 | null |
2025-07-25 | High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins | Lorenzo Cazzella et.al. | 2507.19173 | null |
2025-07-24 | SaLF: Sparse Local Fields for Multi-Sensor Rendering in Real-Time | Yun Chen et.al. | 2507.18713 | null |
2025-07-24 | Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping | Chong Cheng et.al. | 2507.18541 | null |
2025-07-24 | Building an Accelerated OpenFOAM Proof-of-Concept Application using Modern C++ | Giulio Malenza et.al. | 2507.18268 | null |
2025-07-26 | MultiKernelBench: A Multi-Platform Benchmark for Kernel Generation | Zhongzhen Wen et.al. | 2507.17773 | null |
2025-07-23 | BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems | Malsha Ashani Mahawatta Dona et.al. | 2507.17722 | null |
2025-07-24 | Terrain-Aware Adaptation for Two-Dimensional UAV Path Planners | Kostas Karakontis et.al. | 2507.17519 | null |
2025-07-25 | HuNavSim 2.0: An Enhanced Human Navigation Simulator for Human-Aware Robot Navigation | Miguel Escudero-Jiménez et.al. | 2507.17317 | null |
2025-07-23 | GPU Benchmark through QPE Emulator with cuQuantum for Practical Quantum Applications | Takaki Akiba et.al. | 2507.17175 | null |
2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
2025-07-23 | Model Compression Engine for Wearable Devices Skin Cancer Diagnosis | Jacob M. Delgado-López et.al. | 2507.17125 | null |
2025-07-23 | Computer Vision for Real-Time Monkeypox Diagnosis on Embedded Systems | Jacob M. Delgado-López et.al. | 2507.17123 | null |
2025-07-22 | Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems | Imran Latif et.al. | 2507.16781 | null |
2025-07-22 | AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase | Andrei-Leonard Nicusan et.al. | 2507.16710 | null |
2025-07-22 | VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences | Kai Deng et.al. | 2507.16443 | null |
2025-07-21 | MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition | Hanwen Liu et.al. | 2507.15914 | null |
2025-07-30 | GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis | Guoxi Liu et.al. | 2507.15230 | null |
2025-07-19 | Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall | Shayan Rokhva et.al. | 2507.14662 | null |
2025-07-16 | GPU-Accelerated Interpretable Generalization for Rapid Cyberattack Detection and Forensics | Shu-Ting Huang et.al. | 2507.14222 | null |
2025-08-12 | CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning | Xiaoya Li et.al. | 2507.14111 | null |
2025-07-23 | Photonic Fabric Platform for AI Accelerators | Jing Ding et.al. | 2507.14000 | null |
2025-07-18 | Leveraging Multi-Instance GPUs through moldable task scheduling | Jorge Villarrubia et.al. | 2507.13601 | null |
2025-07-17 | Performance Portable Gradient Computations Using Source Transformation | Kim Liegeois et.al. | 2507.13204 | null |
2025-07-16 | MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding | Renjie Li et.al. | 2507.12463 | null |
2025-07-16 | HyDRA: A Hybrid Dual-Mode Network for Closed- and Open-Set RFFI with Optimized VMD | Hanwen Liu et.al. | 2507.12133 | null |
2025-07-16 | PoTPTQ: A Two-step Power-of-Two Post-training for LLMs | Xinyu Wang et.al. | 2507.11959 | null |
2025-07-15 | MIRAGE: KV Cache Optimization through Parameter Remapping for Multi-tenant LLM Serving | Ruihao Li et.al. | 2507.11507 | null |
2025-07-15 | MMStencil: Optimizing High-order Stencils on Multicore CPU using Matrix Unit | Yinuo Wang et.al. | 2507.11067 | null |
2025-07-15 | Standards-Compliant DM-RS Allocation via Temporal Channel Prediction for Massive MIMO Systems | Sehyun Ryu et.al. | 2507.11064 | null |
2025-07-15 | Modernizing CNN-based Weather Forecast Model towards Higher Computational Efficiency | Minjong Cheon et.al. | 2507.10893 | null |
2025-07-21 | Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks | Aaron Jarmusch et.al. | 2507.10789 | null |
2025-07-14 | A Benchmarking Framework for AI models in Automotive Aerodynamics | Kaustubh Tangsali et.al. | 2507.10747 | null |
2025-07-14 | Quantize-then-Rectify: Efficient VQ-VAE Training | Borui Zhang et.al. | 2507.10547 | null |
2025-07-30 | Designing quantum chemistry algorithms with just-in-time compilation | Xiaojie Wu et.al. | 2507.09772 | null |
2025-07-13 | GeoWarp: An automatically differentiable and GPU-accelerated implicit MPM framework for geomechanics based on NVIDIA Warp | Yidong Zhao et.al. | 2507.09435 | null |
2025-07-12 | Factorization-free Orthogonal Projection onto the Positive Semidefinite Cone with Composite Polynomial Filtering | Shucheng Kang et.al. | 2507.09165 | null |
2025-07-10 | Vidyut3d: a GPU accelerated fluid solver for non-equilibrium plasmas on adaptive grids | Hariswaran Sitaraman et.al. | 2507.08200 | null |
2025-07-10 | GPUHammer: Rowhammer Attacks on GPU Memories are Practical | Chris S. Lin et.al. | 2507.08166 | null |
2025-07-03 | Collective Communication Profiling of Modern-day Machine Learning Workloads | Jit Gupta et.al. | 2507.07117 | null |
2025-07-09 | StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception | Marcel Vosshans et.al. | 2507.06687 | null |
2025-07-09 | EA: An Event Autoencoder for High-Speed Vision Sensing | Riadul Islam et.al. | 2507.06459 | null |
2025-07-08 | CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation | Kushal Gajjar et.al. | 2507.06013 | null |
2025-07-07 | Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model | Mengyao Xu et.al. | 2507.05513 | null |
2025-07-07 | Robotic System with AI for Real Time Weed Detection, Canopy Aware Spraying, and Droplet Pattern Evaluation | Inayat Rasool et.al. | 2507.05432 | null |
2025-07-23 | Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms | Zhiyi Hu et.al. | 2507.04786 | null |
2025-07-05 | ArmGS: Composite Gaussian Appearance Refinement for Modeling Dynamic Urban Environments | Guile Wu et.al. | 2507.03886 | null |
2025-07-24 | Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps | Chong Cheng et.al. | 2507.03737 | null |
2025-07-03 | NVIDIA GPU Confidential Computing Demystified | Zhongshu Gu et.al. | 2507.02770 | null |
2025-07-03 | Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources | Roopkatha Banerjee et.al. | 2507.02295 | null |
2025-07-02 | SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan | Fumikazu Konishi et.al. | 2507.02124 | null |
2025-07-02 | Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization | Giuseppe Ruggeri et.al. | 2507.01676 | null |
2025-06-20 | PyTorch-based Geometric Learning with Non-CUDA Processing Units: Experiences from Intel Gaudi-v2 HPUs | Fanchen Bu et.al. | 2507.01031 | null |
2025-07-01 | Anatomy of High-Performance Column-Pivoted QR Decomposition | Maksim Melnichenko et.al. | 2507.00976 | null |
2025-07-01 | Twill: Scheduling Compound AI Systems on Heterogeneous Mobile Edge Platforms | Zain Taufique et.al. | 2507.00491 | null |
2025-07-01 | Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs | Mohammad Firas Sada et.al. | 2507.00418 | null |
2025-07-01 | Question Decomposition for Retrieval-Augmented Generation | Paul J. L. Ammann et.al. | 2507.00355 | null |
2025-06-24 | AdaDeDup: Adaptive Hybrid Data Pruning for Efficient Large-Scale Object Detection Training | Feiyang Kang et.al. | 2507.00049 | null |
2025-06-30 | Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model | Mu-Chi Chen et.al. | 2506.23635 | null |
2025-06-30 | Risk-Based Filtering of Valuable Driving Situations in the Waymo Open Motion Dataset | Tim Puphal et.al. | 2506.23433 | null |
2025-06-29 | CMOS+X: Stacking Persistent Embedded Memories based on Oxide Transistors upon GPGPU Platforms | Faaiq Waqar et.al. | 2506.23405 | null |
2025-06-28 | FF-INT8: Efficient Forward-Forward DNN Training on Edge Devices with INT8 Precision | Jingxiao Ma et.al. | 2506.22771 | null |
2025-06-27 | Quantum-Classical Auxiliary Field Quantum Monte Carlo with Matchgate Shadows on Trapped Ion Quantum Computers | Luning Zhao et.al. | 2506.22408 | null |
2025-06-27 | MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism | Zheng Zhang et.al. | 2506.22175 | null |
2025-06-27 | MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators | Zheng Zhang et.al. | 2506.22169 | null |
2025-07-08 | BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting | Zipei Ma et.al. | 2506.22099 | null |
2025-06-27 | SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model | Shuhan Tan et.al. | 2506.21976 | null |
2025-06-23 | TrajTok: Technical Report for 2025 Waymo Open Sim Agents Challenge | Zhiyuan Zhang et.al. | 2506.21618 | null |
2025-06-26 | SAM4D: Segment Anything in Camera and LiDAR Streams | Jianyun Xu et.al. | 2506.21547 | null |
2025-06-26 | Portable High-Performance Kernel Generation for a Computational Fluid Dynamics Code with DaCe | Måns I. Andersson et.al. | 2506.20994 | null |
2025-06-25 | Characterization and Mitigation of Training Instabilities in Microscaling Formats | Huangyuan Su et.al. | 2506.20752 | null |
2025-06-24 | MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models | Hoa La et.al. | 2506.20686 | null |
2025-06-25 | SuperSONIC: Cloud-Native Infrastructure for ML Inferencing | Dmitry Kondratyev et.al. | 2506.20657 | null |
2025-06-25 | Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking | Ben Kang et.al. | 2506.20381 | null |
2025-06-24 | Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification | Minghao Qin et.al. | 2506.19225 | null |
2025-06-23 | Let Your Video Listen to Your Music! | Xinyu Zhang et.al. | 2506.18881 | null |
2025-06-23 | Cross-Architecture Knowledge Distillation (KD) for Retinal Fundus Image Anomaly Detection on NVIDIA Jetson Nano | Berk Yilmaz et.al. | 2506.18220 | null |
2025-06-22 | AMD Versal Implementations of FAM and SSCA Estimators | Carol Jingyi Li et.al. | 2506.18003 | null |
2025-06-20 | Code Generation for Near-Roofline Finite Element Actions on GPUs from Symbolic Variational Forms | Kaushik Kulkarni et.al. | 2506.17471 | null |
2025-06-19 | VideoGAN-based Trajectory Proposal for Automated Vehicles | Annajoyce Mariani et.al. | 2506.16209 | null |
2025-06-19 | Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs | Xun Wang et.al. | 2506.16196 | null |
2025-06-19 | HetGPU: The pursuit of making binary compatibility towards GPUs | Yiwei Yang et.al. | 2506.15993 | null |
2025-06-18 | Early Attentive Sparsification Accelerates Neural Speech Transcription | Zifei Xu et.al. | 2506.15912 | null |
2025-06-18 | UniRelight: Learning Joint Decomposition and Synthesis for Video Relighting | Kai He et.al. | 2506.15673 | null |
2025-06-18 | Engineering Supercomputing Platforms for Biomolecular Applications | Robert Welch et.al. | 2506.15585 | null |
2025-07-30 | Automated MRI Tumor Segmentation using hybrid U-Net with Transformer and Efficient Attention | Syed Haider Ali et.al. | 2506.15562 | null |
2025-06-17 | Align Your Flow: Scaling Continuous-Time Flow Map Distillation | Amirmojtaba Sabour et.al. | 2506.14603 | null |
2025-06-18 | Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Xuanchi Ren et.al. | 2506.09042 | null |
2025-06-10 | Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions | David Acuna et.al. | 2506.08927 | null |
2025-07-18 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et.al. | 2505.00704 | null |
2025-04-21 | LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception | Yuan-Hong Liao et.al. | 2504.15362 | null |
2025-04-15 | PARTFIELD: Learning 3D Feature Fields for Part Segmentation and Beyond | Minghua Liu et.al. | 2504.11451 | null |
2025-04-17 | VideoPanda: Video Panoramic Diffusion with Multi-view Attention | Kevin Xie et.al. | 2504.11389 | null |
2025-04-01 | Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control | NVIDIA et.al. | 2503.14492 | null |
2025-03-05 | GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control | Xuanchi Ren et.al. | 2503.03751 | null |
2025-03-03 | Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models | Jay Zhangjie Wu et.al. | 2503.01774 | null |
2025-03-22 | DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models | Ruofan Liang et.al. | 2501.18590 | null |
2025-07-09 | Cosmos World Foundation Model Platform for Physical AI | NVIDIA et.al. | 2501.03575 | null |
2025-06-26 | InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models | Yifan Lu et.al. | 2412.03934 | null |
2025-04-01 | Feed-Forward Bullet-Time Reconstruction of Dynamic Scenes from Monocular Videos | Hanxue Liang et.al. | 2412.03526 | null |
2024-11-14 | LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models | Zhengyi Wang et.al. | 2411.09595 | null |
2025-02-28 | ReMatching Dynamic Reconstruction Flow | Sara Oblak et.al. | 2411.00705 | null |
2024-10-26 | SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Xuanchi Ren et.al. | 2410.20030 | null |
2025-02-11 | SpaceMesh: A Continuous Representation for Learning Manifold Surface Meshes | Tianchang Shen et.al. | 2409.20562 | null |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-27 | UniCal: Unified Neural Sensor Calibration | Ze Yang et.al. | 2409.18953 | null |
2024-09-26 | Learning to Drive via Asymmetric Self-Play | Chris Zhang et.al. | 2409.18218 | null |
2024-09-15 | Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao et.al. | 2409.09788 | null |
2025-04-19 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-19 | Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Ruofan Liang et.al. | 2408.09702 | null |
2025-03-20 | Wolf: Dense Video Captioning with a World Summarization Framework | Boyi Li et.al. | 2407.18908 | null |
2024-07-15 | SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation | Jordan Juravsky et.al. | 2407.10481 | null |
2024-10-10 | 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Nicolas Moenne-Loccoz et.al. | 2407.07090 | null |
2024-07-01 | fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence | Francis Williams et.al. | 2407.01781 | null |
2024-10-31 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-14 | L4GM: Large 4D Gaussian Reconstruction Model | Jiawei Ren et.al. | 2406.10324 | null |
2024-06-12 | UnO: Unsupervised Occupancy Fields for Perception and Forecasting | Ben Agro et.al. | 2406.08691 | null |
2024-06-12 | Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata | Dongsu Zhang et.al. | 2406.08292 | null |
2024-06-13 | DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas et.al. | 2406.04426 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-22 | Align Your Steps: Optimizing Sampling Schedules in Diffusion Models | Amirmojtaba Sabour et.al. | 2404.14507 | null |
2024-04-16 | RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting | Ashkan Mirzaei et.al. | 2404.10765 | null |
2025-05-26 | Can Large Vision-Language Models Correct Semantic Grounding Errors By Themselves? | Yuan-Hong Liao et.al. | 2404.06510 | null |
2024-04-01 | QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | Sourav Biswas et.al. | 2404.01486 | null |
2024-03-22 | LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie et.al. | 2403.15385 | null |
2024-03-22 | Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks | Aqeel Anwar et.al. | 2403.15370 | null |
2024-01-22 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models | Koichi Namekata et.al. | 2401.11739 | null |
2023-12-28 | Compact Neural Graphics Primitives with Learned Hash Probing | Towaki Takikawa et.al. | 2312.17241 | null |
2024-01-03 | Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Huan Ling et.al. | 2312.13763 | null |
2023-12-11 | LightSim: Neural Lighting Simulation for Urban Scenes | Ava Pun et.al. | 2312.06654 | null |
2024-04-14 | Trajeglish: Traffic Modeling as Next-Token Prediction | Jonah Philion et.al. | 2312.04535 | null |
2024-06-25 | XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies | Xuanchi Ren et.al. | 2312.03806 | null |
2024-04-12 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570 | null |
2023-11-16 | Adaptive Shells for Efficient Neural Radiance Field Rendering | Zian Wang et.al. | 2311.10091 | null |
2023-11-09 | Real-Time Neural Rasterization for Large Scenes | Jeffrey Yunfan Liu et.al. | 2311.05607 | null |
2023-11-09 | Reconstructing Objects in-the-wild for Realistic Sensor Simulation | Ze Yang et.al. | 2311.05602 | null |
2023-11-07 | 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features | Chenfeng Xu et.al. | 2311.04391 | null |
2023-11-03 | EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Jiawei Yang et.al. | 2311.02077 | null |
2023-11-03 | Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang et.al. | 2311.02007 | null |
2023-11-02 | MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory | Enxu Li et.al. | 2311.01556 | null |
2023-11-17 | 4D-Former: Multimodal 4D Panoptic Segmentation | Ali Athar et.al. | 2311.01520 | null |
2023-11-02 | UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong et.al. | 2311.01448 | null |
2023-11-02 | CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation | Jingkang Wang et.al. | 2311.01447 | null |
2023-11-02 | Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation | Jay Sarva et.al. | 2311.01446 | null |
2023-11-02 | LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds | Anqi Joyce Yang et.al. | 2311.01444 | null |
2023-11-02 | Learning Realistic Traffic Agents in Closed-loop | Chris Zhang et.al. | 2311.01394 | null |
2024-04-01 | Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion | Lunjun Zhang et.al. | 2311.01017 | null |
2024-01-26 | ViR: Towards Efficient Vision Retention Backbones | Ali Hatamizadeh et.al. | 2310.19731 | null |
2023-10-20 | TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models | Tianshi Cao et.al. | 2310.13772 | null |
2023-09-11 | Towards Viewpoint Robustness in Bird’s Eye View Segmentation | Tzofi Klinghoffer et.al. | 2309.05192 | null |
2023-08-10 | Flexible Isosurface Extraction for Gradient-Based Mesh Optimization | Tianchang Shen et.al. | 2308.05371 | null |
2023-08-03 | UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang et.al. | 2308.01898 | null |
2023-08-02 | Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro et.al. | 2308.01471 | null |
2023-07-14 | DreamTeacher: Pretraining Image Backbones with Deep Generative Models | Daiqing Li et.al. | 2307.07487 | null |
2023-06-27 | Rethinking Closed-loop Training for Autonomous Driving | Chris Zhang et.al. | 2306.15713 | null |
2023-06-06 | ATT3D: Amortized Text-to-3D Object Synthesis | Jonathan Lorraine et.al. | 2306.07349 | null |
2023-06-09 | Neural Kernel Surface Reconstruction | Jiahui Huang et.al. | 2305.19590 | null |
2023-08-13 | Neural LiDAR Fields for Novel View Synthesis | Shengyu Huang et.al. | 2305.01643 | null |
2023-04-19 | NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models | Seung Wook Kim et.al. | 2304.09787 | null |
2023-12-28 | Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Andreas Blattmann et.al. | 2304.08818 | null |
2023-04-06 | Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes | Zian Wang et.al. | 2304.03266 | null |
2023-04-04 | Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe et.al. | 2304.01893 | null |
2023-03-25 | VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion | Yiming Li et.al. | 2302.12251 | null |
2023-02-09 | Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting | Viraj Prabhu et.al. | 2302.04832 | null |
2023-02-02 | Synthesizing Physical Character-Scene Interactions | Mohamed Hassan et.al. | 2302.00883 | null |
2023-01-31 | PADL: Language-Directed Physics-Based Character Control | Jordan Juravsky et.al. | 2301.13868 | null |
2023-03-25 | Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin et.al. | 2211.10440 | null |
2022-11-08 | GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting | Alexander Cui et.al. | 2211.02545 | null |
2022-10-12 | LION: Latent Point Diffusion Models for 3D Shape Generation | Xiaohui Zeng et.al. | 2210.06978 | null |
2022-10-06 | XDGAN: Multi-Modal 3D Shape Generation in 2D Space | Hassan Abu Alhaija et.al. | 2210.03007 | null |
2022-10-03 | Optimizing Data Collection for Machine Learning | Rafid Mahmood et.al. | 2210.01234 | null |
2022-09-26 | EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Ahmad Darkhalil et.al. | 2209.13064 | null |
2022-09-22 | GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images | Jun Gao et.al. | 2209.11163 | null |
2022-08-19 | Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion | Zian Wang et.al. | 2208.09480 | null |
2022-08-18 | MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation | Gopal Sharma et.al. | 2208.08580 | null |
2022-07-05 | Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention | Gary Leung et.al. | 2207.02126 | null |
2022-07-13 | How Much More Data Do I Need? Estimating Requirements for Downstream Tasks | Rafid Mahmood et.al. | 2207.01725 | null |
2022-06-19 | Scalable Neural Data Server: A Data Recommender for Transfer Learning | Tianshi Cao et.al. | 2206.09386 | null |
2022-06-16 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry | Wei-Chiu Ma et.al. | 2206.08365 | null |
2022-06-15 | Variable Bitrate Neural Fields | Towaki Takikawa et.al. | 2206.07707 | null |
2022-06-06 | Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps | Seung Wook Kim et.al. | 2206.02903 | null |
2022-05-05 | ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters | Xue Bin Peng et.al. | 2205.01906 | null |
2022-04-19 | M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation | Enze Xie et.al. | 2204.05088 | null |
2022-04-06 | AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis | Zhiqin Chen et.al. | 2204.03105 | null |
Autonomous Driving
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-28 | DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes | Yajiao Xiong et.al. | 2508.20965 | null |
2025-08-28 | Surfel-based 3D Registration with Equivariant SE(3) Features | Xueyang Kang et.al. | 2508.20789 | null |
2025-08-28 | SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer | Fachri Najm Noer Kartiman et.al. | 2508.20762 | null |
2025-08-28 | UTA-Sign: Unsupervised Thermal Video Augmentation via Event-Assisted Traffic Signage Sketching | Yuqi Han et.al. | 2508.20594 | null |
2025-08-28 | Adaptive Dual Uncertainty Optimization: Boosting Monocular 3D Object Detection under Test-Time Shifts | Zixuan Hu et.al. | 2508.20488 | null |
2025-08-28 | Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation | Jiusi Li et.al. | 2508.20471 | null |
2025-08-27 | Streamlining the Development of Active Learning Methods in Real-World Object Detection | Moussa Kassem Sbeyti et.al. | 2508.19906 | null |
2025-08-27 | Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities | Imad Ali Shah et.al. | 2508.19905 | null |
2025-08-27 | Generalizing Monocular 3D Object Detection | Abhinav Kumar et.al. | 2508.19593 | null |
2025-08-25 | Efficient Model-Based Purification Against Adversarial Attacks for LiDAR Segmentation | Alexandros Gkillas et.al. | 2508.19290 | null |
2025-08-26 | Interpretable Decision-Making for End-to-End Autonomous Driving | Mona Mirzaie et.al. | 2508.18898 | null |
2025-08-26 | EMind: A Foundation Model for Multi-task Electromagnetic Signals Understanding | Luqing Luo et.al. | 2508.18785 | null |
2025-08-20 | GM-Skip: Metric-Guided Transformer Block Skipping for Efficient Vision-Language Models | Lianming Huang et.al. | 2508.18227 | null |
2025-08-25 | EventTracer: Fast Path Tracing-based Event Stream Rendering | Zhenyang Li et.al. | 2508.18071 | null |
2025-08-25 | Integration of Computer Vision with Adaptive Control for Autonomous Driving Using ADORE | Abu Shad Ahammed et.al. | 2508.17985 | null |
2025-08-25 | Enhanced Drift-Aware Computer Vision Architecture for Autonomous Driving | Md Shahi Amran Hossain et.al. | 2508.17975 | null |
2025-08-25 | Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction | Yunxiang Liu et.al. | 2508.17797 | null |
2025-08-23 | A Rapid Iterative Trajectory Planning Method for Automated Parking through Differential Flatness | Zhouheng Li et.al. | 2508.17038 | null |
2025-08-23 | A Survey of Deep Learning-based Point Cloud Denoising | Jinxi Wang et.al. | 2508.17011 | null |
2025-08-23 | Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model | Fan Ding et.al. | 2508.16947 | null |
2025-08-22 | Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation | Guangyu Sun et.al. | 2508.16568 | null |
2025-08-22 | Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation | Chun-Peng Chang et.al. | 2508.16512 | null |
2025-08-22 | SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather | Edoardo Palladin et.al. | 2508.16408 | null |
2025-08-22 | MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction | Ziyang Yan et.al. | 2508.15653 | null |
2025-08-23 | ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors | Kaiyuan Tan et.al. | 2508.15529 | null |
2025-08-21 | RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features | Olga Matykina et.al. | 2508.15353 | null |
2025-08-21 | RATopo: Improving Lane Topology Reasoning via Redundancy Assignment | Han Li et.al. | 2508.15272 | null |
2025-08-21 | Adversarial Agent Behavior Learning in Autonomous Driving Using Deep Reinforcement Learning | Arjun Srinivasan et.al. | 2508.15207 | null |
2025-08-25 | MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion | Xuyang Chen et.al. | 2508.15169 | null |
2025-08-28 | Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving | Dianzhao Li et.al. | 2508.14926 | null |
2025-08-20 | Multiscale Video Transformers for Class Agnostic Segmentation in Autonomous Driving | Leila Cheshmi et.al. | 2508.14729 | null |
2025-08-20 | MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation | Guile Wu et.al. | 2508.14327 | null |
2025-08-19 | ROVR-Open-Dataset: A Large-Scale Depth Dataset for Autonomous Driving | Xianda Guo et.al. | 2508.13977 | null |
2025-08-19 | Unleashing Semantic and Geometric Priors for 3D Scene Completion | Shiyuan Chen et.al. | 2508.13601 | null |
2025-08-25 | Bridging Clear and Adverse Driving Conditions | Yoel Shapiro et.al. | 2508.13592 | null |
2025-08-19 | Generative Model-Based Feature Attention Module for Video Action Analysis | Guiqin Wang et.al. | 2508.13565 | null |
2025-08-19 | CORENet: Cross-Modal 4D Radar Denoising Network with LiDAR Supervision for Autonomous Driving | Fuyang Liu et.al. | 2508.13485 | null |
2025-08-19 | Structured Prompting and Multi-Agent Knowledge Distillation for Traffic Video Interpretation and Risk Inference | Yunxiang Yang et.al. | 2508.13439 | null |
2025-08-18 | Incremental Generalized Hybrid A* | Sidharth Talia et.al. | 2508.13392 | null |
2025-08-18 | Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving | Minhao Xiong et.al. | 2508.13305 | null |
2025-08-18 | SpotVLM: Cloud-edge Collaborative Real-time VLM based on Context Transfer | Chen Qian et.al. | 2508.12638 | null |
2025-08-18 | ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving | Can Cui et.al. | 2508.12603 | null |
2025-08-17 | An Initial Study of Bird’s-Eye View Generation for Autonomous Vehicles using Cross-View Transformers | Felipe Carlos dos Santos et.al. | 2508.12520 | null |
2025-08-17 | LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving | Nan Song et.al. | 2508.12404 | null |
2025-08-17 | DoppDrive: Doppler-Driven Temporal Aggregation for Improved Radar Object Detection | Yuval Haitman et.al. | 2508.12330 | null |
2025-08-17 | TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform | Jun Liu et.al. | 2508.12279 | null |
2025-08-16 | InstDrive: Instance-Aware 3D Gaussian Splatting for Driving Scenes | Hongyuan Liu et.al. | 2508.12015 | null |
2025-08-16 | Saliency-Based Attention Shifting: A Framework for Improving Driver Situational Awareness of Out-of-Label Hazards | Yousra Shleibik et.al. | 2508.11887 | null |
2025-08-16 | Data Shift of Object Detection in Autonomous Driving | Lida Xu et.al. | 2508.11868 | null |
2025-08-15 | Relative Position Matters: Trajectory Prediction and Planning with Polar Representation | Bozhou Zhang et.al. | 2508.11492 | null |
2025-08-15 | Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving | Bozhou Zhang et.al. | 2508.11488 | null |
2025-08-15 | EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback | Jiayue Jin et.al. | 2508.11453 | null |
2025-08-15 | ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving | Jingyu Li et.al. | 2508.11428 | null |
2025-08-15 | Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking | Haonan Zhang et.al. | 2508.11323 | null |
2025-08-15 | A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving | Jialin Li et.al. | 2508.11218 | null |
2025-08-14 | CSNR and JMIM Based Spectral Band Selection for Reducing Metamerism in Urban Driving | Jiarong Li et.al. | 2508.10962 | null |
2025-08-18 | HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffusion Model | Qi Liu et.al. | 2508.10935 | null |
2025-08-14 | Towards Powerful and Practical Patch Attacks for 2D Object Detection in Autonomous Driving | Yuxin Cao et.al. | 2508.10600 | null |
2025-08-14 | SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving | Philipp Wolters et.al. | 2508.10567 | null |
2025-08-14 | Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies | Ayushman Sarkar et.al. | 2508.10523 | null |
2025-08-14 | STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes | Keishi Ishihara et.al. | 2508.10427 | null |
2025-08-14 | From Pixel to Mask: A Survey of Out-of-Distribution Segmentation | Wenjie Zhao et.al. | 2508.10309 | null |
2025-08-13 | BridgeTA: Bridging the Representation Gap in Knowledge Distillation via Teacher Assistant for Bird’s Eye View Map Segmentation | Beomjun Kim et.al. | 2508.09599 | null |
2025-08-13 | Offline Auto Labeling: BAAS | Stefan Haag et.al. | 2508.09585 | null |
2025-08-13 | Waymo-3DSkelMo: A Multi-Agent 3D Skeletal Motion Dataset for Pedestrian Interaction Modeling in Autonomous Driving | Guangxun Zhu et.al. | 2508.09404 | null |
2025-08-12 | VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception | Fuhao Chang et.al. | 2508.09061 | null |
2025-08-12 | A Pseudo Global Fusion Paradigm-Based Cross-View Network for LiDAR-Based Place Recognition | Jintao Cheng et.al. | 2508.08917 | null |
2025-08-21 | ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction | Chaojun Ni et.al. | 2508.08170 | null |
2025-08-18 | TRIDE: A Text-assisted Radar-Image weather-aware fusion network for Depth Estimation | Huawei Sun et.al. | 2508.08038 | null |
2025-08-11 | CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving | Qi Xiang et.al. | 2508.07838 | null |
2025-08-11 | Risk Map As Middleware: Towards Interpretable Cooperative End-to-end Autonomous Driving for Risk-Aware Planning | Mingyue Lei et.al. | 2508.07686 | null |
2025-08-11 | Progressive Bird’s Eye View Perception for Safety-Critical Autonomous Driving: A Comprehensive Survey | Yan Gong et.al. | 2508.07560 | null |
2025-08-12 | Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring | Ludan Zhang et.al. | 2508.07552 | null |
2025-08-10 | Noise-Aware Generative Microscopic Traffic Simulation | Vindula Jayawardana et.al. | 2508.07453 | null |
2025-08-09 | An Evolutionary Game-Theoretic Merging Decision-Making Considering Social Acceptance for Autonomous Driving | Haolin Liu et.al. | 2508.07080 | null |
2025-08-27 | From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving | Antonio Guillen-Perez et.al. | 2508.07029 | null |
2025-08-09 | WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering | Yixin Zhu et.al. | 2508.06982 | null |
2025-08-08 | Robust-Sub-Gaussian Model Predictive Control for Safe Ultrasound-Image-Guided Robotic Spinal Surgery | Yunke Ao et.al. | 2508.06744 | null |
2025-08-15 | IRL-VLA: Training an Vision-Language-Action Policy via Reward World Model | Anqing Jiang et.al. | 2508.06571 | null |
2025-08-20 | MetAdv: A Unified and Interactive Adversarial Testing Platform for Autonomous Driving | Aishan Liu et.al. | 2508.06534 | null |
2025-08-02 | RMT-PPAD: Real-time Multi-task Learning for Panoptic Perception in Autonomous Driving | Jiayuan Wang et.al. | 2508.06529 | null |
2025-08-12 | GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving | Jian Wang et.al. | 2508.06113 | null |
2025-08-08 | ME$^3$-BEV: Mamba-Enhanced Deep Reinforcement Learning for End-to-End Autonomous Driving with BEV-Perception | Siyi Lu et.al. | 2508.06074 | null |
2025-08-07 | VISTA: Vision-Language Imitation of Situational Thinking and Attention for Human-Like Driver Focus in Dynamic Environments | Kaiser Hamid et.al. | 2508.05852 | null |
2025-08-07 | SMOL-MapSeg: Show Me One Label | Yunshuang Yuan et.al. | 2508.05501 | null |
2025-08-07 | Physical Adversarial Camouflage through Gradient Calibration and Regularization | Jiawei Liang et.al. | 2508.05414 | null |
2025-08-07 | DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model | Rui Yu et.al. | 2508.05402 | null |
2025-08-07 | ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models | Yatong Lan et.al. | 2508.05236 | null |
2025-08-07 | PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems | Qi Guo et.al. | 2508.05167 | null |
2025-08-07 | AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics | Stella Su et.al. | 2508.04955 | null |
2025-08-06 | Occupancy Learning with Spatiotemporal Memory | Ziyang Leng et.al. | 2508.04705 | null |
2025-08-06 | BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning | Ziyang Leng et.al. | 2508.04702 | null |
2025-08-06 | RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case | Baihui Xiao et.al. | 2508.04642 | null |
2025-08-06 | Segment Any Vehicle: Semantic and Visual Context Driven SAM and A Benchmark | Xiao Wang et.al. | 2508.04260 | null |
2025-08-06 | DRIVE: Dynamic Rule Inference and Verified Evaluation for Constraint-Aware Autonomous Driving | Longling Geng et.al. | 2508.04066 | null |
2025-08-05 | LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences | Ao Liang et.al. | 2508.03692 | null |
2025-08-05 | La La LiDAR: Large-Scale Layout Generation from LiDAR Data | Youquan Liu et.al. | 2508.03691 | null |
2025-08-05 | Veila: Panoramic LiDAR Generation from a Monocular RGB Image | Youquan Liu et.al. | 2508.03690 | null |
2025-08-13 | MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention | Qi Xie et.al. | 2508.03034 | null |
2025-08-04 | Context-aware Risk Assessment and Its Application in Autonomous Driving | Boyang Tian et.al. | 2508.02919 | null |
2025-08-04 | MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model | Tianheng Zhu et.al. | 2508.02858 | null |
2025-08-04 | mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera | Byeonggyu Park et.al. | 2508.02348 | null |
2025-08-04 | Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images | Philipp Wulff et.al. | 2508.02323 | null |
2025-08-04 | Test-Time Model Adaptation for Quantized Neural Networks | Zeshuai Deng et.al. | 2508.02180 | null |
2025-08-04 | Beyond RGB and Events: Enhancing Object Detection under Adverse Lighting with Monocular Normal Maps | Mingjie Liu et.al. | 2508.02127 | null |
2025-08-04 | Mapillary Vistas Validation for Fine-Grained Traffic Signs: A Benchmark Revealing Vision-Language Model Limitations | Sparsh Garg et.al. | 2508.02047 | null |
2025-08-20 | Bench2ADVLM: A Closed-Loop Benchmark for Vision-language Models in Autonomous Driving | Tianyuan Zhang et.al. | 2508.02028 | null |
2025-08-03 | Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving | Hunter Schofield et.al. | 2508.01922 | null |
2025-08-03 | StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding | Haolin Yang et.al. | 2508.01875 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-08-03 | LT-Gaussian: Long-Term Map Update Using 3D Gaussian Splatting for Autonomous Driving | Luqi Cheng et.al. | 2508.01704 | null |
2025-08-03 | Adverse Weather-Independent Framework Towards Autonomous Driving Perception through Temporal Correlation and Unfolded Regularization | Wei-Bin Kou et.al. | 2508.01583 | null |
2025-08-02 | A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding | Zhan Shi et.al. | 2508.01197 | null |
2025-08-01 | CP-FREEZER: Latency Attacks against Vehicular Cooperative Perception | Chenyi Wang et.al. | 2508.01062 | null |
2025-08-12 | Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance | Fengze Yang et.al. | 2508.01057 | null |
2025-07-31 | Service Discovery-Based Hybrid Network Middleware for Efficient Communication in Distributed Robotic Systems | Shiyao Sang et.al. | 2508.00947 | null |
2025-08-01 | Rethinking Backbone Design for Lightweight 3D Object Detection in LiDAR | Adwait Chandorkar et.al. | 2508.00744 | null |
2025-08-12 | Context-based Motion Retrieval using Open Vocabulary Methods for Autonomous Driving | Stefan Englmeier et.al. | 2508.00589 | null |
2025-08-01 | Uncertainty-Aware Likelihood Ratio Estimation for Pixel-Wise Out-of-Distribution Detection | Marc Hölle et.al. | 2508.00587 | null |
2025-08-01 | Pro2Guard: Proactive Runtime Enforcement of LLM Agent Safety via Probabilistic Model Checking | Haoyu Wang et.al. | 2508.00500 | null |
2025-08-01 | Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence | Danzhen Fu et.al. | 2508.00299 | null |
2025-07-21 | AoI-Aware Resource Allocation with Deep Reinforcement Learning for HAPS-V2X Networks | Ahmet Melih Ince et.al. | 2508.00011 | null |
2025-07-31 | I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation | Jialei Chen et.al. | 2507.23683 | null |
2025-07-31 | DA-Occ: Efficient 3D Voxel Occupancy Prediction via Directional 2D for Geometric Structure Preservation | Yuchen Zhou et.al. | 2507.23599 | null |
2025-08-09 | MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction | Zijian Dong et.al. | 2507.23597 | null |
2025-07-31 | A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving | Yi Zhang et.al. | 2507.23540 | null |
2025-07-31 | MagicRoad: Semantic-Aware 3D Road Surface Reconstruction via Obstacle Inpainting | Xingyue Peng et.al. | 2507.23340 | null |
2025-07-31 | Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision | Qiang Lu et.al. | 2507.23331 | null |
2025-07-31 | FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models | Yiming Yang et.al. | 2507.23325 | null |
2025-08-02 | FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning | Jiajun Cao et.al. | 2507.23318 | null |
2025-08-04 | PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving | Xuewei Tang et.al. | 2507.23309 | null |
2025-07-30 | Causal-Inspired Multi-Agent Decision-Making via Graph Reinforcement Learning | Jing Wang et.al. | 2507.23080 | null |
2025-08-05 | Vision-Language Fusion for Real-Time Autonomous Driving: Goal-Centered Cross-Attention of Camera, HD-Map, & Waypoints | Santosh Patapati et.al. | 2507.23064 | null |
2025-07-30 | Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation | Alexandru Buburuzan et.al. | 2507.23058 | null |
2025-08-07 | Bayesian Optimization applied for accelerated Virtual Validation of the Autonomous Driving Function | Satyesh Shanker Awasthi et.al. | 2507.22769 | null |
2025-07-30 | Social-Pose: Enhancing Trajectory Prediction with Human Body Pose | Yang Gao et.al. | 2507.22742 | null |
2025-07-30 | Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model | Daehee Park et.al. | 2507.22615 | null |
2025-07-30 | TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation | Jiuming Liu et.al. | 2507.22454 | null |
2025-07-30 | Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators | Kaustav Chakraborty et.al. | 2507.22389 | null |
2025-07-29 | Hierarchical Game-Based Multi-Agent Decision-Making for Autonomous Vehicles | Mushuang Liu et.al. | 2507.21941 | null |
2025-07-31 | MultiEditor: Controllable Multimodal Object Editing for Driving Scenarios Using 3D Gaussian Splatting Priors | Shouyi Lu et.al. | 2507.21872 | null |
2025-07-29 | SAMITE: Position Prompted SAM2 with Calibrated Memory for Visual Object Tracking | Qianxiong Xu et.al. | 2507.21732 | null |
2025-08-16 | Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition | Ruiyang Hao et.al. | 2507.21610 | null |
2025-07-29 | SafeDriveRAG: Towards Safe Autonomous Driving with Knowledge Graph-based Retrieval-Augmented Generation | Hao Ye et.al. | 2507.21585 | null |
2025-07-30 | No Redundancy, No Stall: Lightweight Streaming 3D Gaussian Splatting for Real-time Rendering | Linye Wei et.al. | 2507.21572 | null |
2025-07-29 | RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors | Tianhui Cai et.al. | 2507.21567 | null |
2025-07-29 | SLTarch: Towards Scalable Point-Based Neural Rendering by Taming Workload Imbalance and Memory Irregularity | Xingyang Li et.al. | 2507.21499 | null |
2025-07-29 | MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving | Thomas Monninger et.al. | 2507.21423 | null |
2025-08-03 | Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy | Jicheng Yuan et.al. | 2507.21358 | null |
2025-07-25 | Seeing Beyond Frames: Zero-Shot Pedestrian Intention Prediction with Raw Temporal Video and Multimodal Cues | Pallavi Zambare et.al. | 2507.21161 | null |
2025-07-28 | GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction | Tianhao Li et.al. | 2507.20963 | null |
2025-07-25 | Event-Based De-Snowing for Autonomous Driving | Manasi Muglikar et.al. | 2507.20901 | null |
2025-07-28 | DriveAgent-R1: Advancing VLM-based Autonomous Driving with Hybrid Thinking and Active Perception | Weicheng Zheng et.al. | 2507.20879 | null |
2025-07-27 | Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars | Mattia Piccinini et.al. | 2507.20427 | null |
2025-07-27 | VESPA: Towards un(Human)supervised Open-World Pointcloud Labeling for Autonomous Driving | Levente Tempfli et.al. | 2507.20397 | null |
2025-07-27 | Solving Scene Understanding for Autonomous Navigation in Unstructured Environments | Naveen Mathews Renji et.al. | 2507.20389 | null |
2025-07-27 | VLMPlanner: Integrating Visual Language Models with Motion Planning | Zhipeng Tang et.al. | 2507.20342 | null |
2025-07-27 | MambaMap: Online Vectorized HD Map Construction using State Space Model | Ruizi Yang et.al. | 2507.20224 | null |
2025-07-27 | LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks | Fei Kong et.al. | 2507.20174 | null |
2025-07-27 | Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning | Ziyi Liang et.al. | 2507.20089 | null |
2025-07-26 | Digital Twin Channel-Enabled Online Resource Allocation for 6G: Principle, Architecture and Application | Tongjie Li et.al. | 2507.19974 | null |
2025-08-12 | DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes | Rishav Kumar et.al. | 2507.19912 | null |
2025-07-26 | Bridging Simulation and Usability: A User-Friendly Framework for Scenario Generation in CARLA | Ahmed Abouelazm et.al. | 2507.19883 | null |
2025-07-26 | FedS2R: One-Shot Federated Domain Generalization for Synthetic-to-Real Semantic Segmentation in Autonomous Driving | Tao Lian et.al. | 2507.19881 | null |
2025-07-30 | RaGS: Unleashing 3D Gaussian Splatting from 4D Radar and Monocular Cues for 3D Object Detection | Xiaokai Bai et.al. | 2507.19856 | null |
2025-07-26 | A 4D Radar Camera Extrinsic Calibration Tool Based on 3D Uncertainty Perspective N Points | Chuan Cao et.al. | 2507.19829 | null |
2025-07-25 | PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction | Haichuan Li et.al. | 2507.19701 | null |
2025-07-25 | Co-Win: Joint Object Detection and Instance Segmentation in LiDAR Point Clouds via Collaborative Window Processing | Haichuan Li et.al. | 2507.19691 | null |
2025-08-02 | GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting | Baijun Ye et.al. | 2507.19451 | null |
2025-07-25 | An OpenSource CI/CD Pipeline for Variant-Rich Software-Defined Vehicles | Matthias Weiß et.al. | 2507.19446 | null |
2025-07-25 | SDVDiag: A Modular Platform for the Diagnosis of Connected Vehicle Functions | Matthias Weiß et.al. | 2507.19403 | null |
2025-07-25 | BEV-LLM: Leveraging Multimodal BEV Maps for Scene Captioning in Autonomous Driving | Felix Brandstaetter et.al. | 2507.19370 | null |
2025-07-25 | LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences | Yusuke Hirota et.al. | 2507.19362 | null |
2025-07-25 | SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence | Viktar Dubovik et.al. | 2507.19321 | null |
2025-07-25 | CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception | Jiaru Zhong et.al. | 2507.19239 | null |
2025-07-25 | VisHall3D: Monocular Semantic Scene Completion from Reconstructing the Visible Regions to Hallucinating the Invisible Regions | Haoang Lu et.al. | 2507.19188 | null |
2025-07-25 | Continual Learning-Based Unified Model for Unpaired Image Restoration Tasks | Kotha Kartheek et.al. | 2507.19184 | null |
2025-07-25 | Diverse and Adaptive Behavior Curriculum for Autonomous Driving: A Student-Teacher Framework with Multi-Agent RL | Ahmed Abouelazm et.al. | 2507.19146 | null |
2025-07-31 | PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction | Yanghong Liu et.al. | 2507.19119 | null |
2025-07-25 | Fine-Grained Traffic Inference from Road to Lane via Spatio-Temporal Graph Node Generation | Shuhao Li et.al. | 2507.19089 | null |
2025-07-25 | HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback | Elham Soltani Kazemi et.al. | 2507.18921 | null |
2025-07-24 | Diffusion-FS: Multimodal Free-Space Prediction via Diffusion for Autonomous Driving | Keshav Gupta et.al. | 2507.18763 | null |
2025-07-24 | Linear Memory SE(2) Invariant Attention | Ethan Pronovost et.al. | 2507.18597 | null |
2025-07-24 | GaussianFusionOcc: A Seamless Sensor Fusion Approach for 3D Occupancy Prediction Using 3D Gaussians | Tomislav Pavković et.al. | 2507.18522 | null |
2025-07-24 | Delving into Mapping Uncertainty for Mapless Trajectory Prediction | Zongzheng Zhang et.al. | 2507.18498 | null |
2025-07-24 | Reinforced Embodied Active Defense: Exploiting Adaptive Interaction for Robust Visual Perception in Adversarial 3D Environments | Xiao Yang et.al. | 2507.18484 | null |
2025-07-24 | CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting | Haoran Xu et.al. | 2507.18473 | null |
2025-07-24 | LONG3R: Long Sequence Streaming 3D Reconstruction | Zhuoguang Chen et.al. | 2507.18255 | null |
2025-07-24 | GenAI for Automotive Software Development: From Requirements to Wheels | Nenad Petrovic et.al. | 2507.18223 | null |
2025-07-24 | Goal-based Trajectory Prediction for improved Cross-Dataset Generalization | Daniel Grimm et.al. | 2507.18196 | null |
2025-07-24 | Policy Disruption in Reinforcement Learning: Adversarial Attack with Large Language Models and Critical State Identification | Junyong Jiang et.al. | 2507.18113 | null |
2025-07-23 | BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems | Malsha Ashani Mahawatta Dona et.al. | 2507.17722 | null |
2025-07-23 | Reusing Attention for One-stage Lane Topology Understanding | Yang Li et.al. | 2507.17617 | null |
2025-07-23 | InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling | Xiaoxue Chen et.al. | 2507.17613 | null |
2025-07-24 | PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving | Maciej K. Wozniak et.al. | 2507.17596 | null |
2025-07-23 | SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving | Chuang Chen et.al. | 2507.17479 | null |
2025-07-23 | VLM-Guided Visual Place Recognition for Planet-Scale Geo-Localization | Sania Waheed et.al. | 2507.17455 | null |
2025-07-23 | Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning | Joobin Jin et.al. | 2507.17418 | null |
2025-08-06 | DeMo++: Motion Decoupling for Autonomous Driving | Bozhou Zhang et.al. | 2507.17342 | null |
2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
2025-07-23 | HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study | Mandar Pitale et.al. | 2507.17118 | null |
2025-07-22 | SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction | Zaipeng Duan et.al. | 2507.17083 | null |
2025-07-22 | Few-Shot Learning in Video and 3D Object Detection: A Survey | Md Meftahul Ferdaus et.al. | 2507.17079 | null |
2025-07-22 | Advancing Robustness in Deep Reinforcement Learning with an Ensemble Defense Approach | Adithya Mohan et.al. | 2507.17070 | null |
2025-07-22 | Denoising-While-Completing Network (DWCNet): Robust Point Cloud Completion Under Corruption | Keneni W. Tesema et.al. | 2507.16743 | null |
2025-07-22 | Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control | Zongzheng Zhang et.al. | 2507.16645 | null |
2025-07-22 | A Target-based Multi-LiDAR Multi-Camera Extrinsic Calibration System | Lorenzo Gentilini et.al. | 2507.16621 | null |
2025-07-22 | VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences | Kai Deng et.al. | 2507.16443 | null |
2025-07-22 | A Sparsity-Aware Autonomous Path Planning Accelerator with HW/SW Co-Design and Multi-Level Dataflow Optimization | Yifan Zhang et.al. | 2507.16177 | null |
2025-07-21 | Optimal Batch-Size Control for Low-Latency Federated Learning with Device Heterogeneity | Huiling Yang et.al. | 2507.15601 | null |
2025-07-21 | Robots for Kiwifruit Harvesting and Pollination | Jamie Bell et.al. | 2507.15484 | null |
2025-07-21 | VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving | Haichao Liu et.al. | 2507.15266 | null |
2025-07-20 | CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning | Pan Hu et.al. | 2507.14903 | null |
2025-07-23 | GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving | Chi Wan et.al. | 2507.14456 | null |
2025-07-18 | Preference-based Multi-Objective Reinforcement Learning | Ni Mu et.al. | 2507.14066 | null |
2025-07-18 | Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors | Jochen Wulf et.al. | 2507.14034 | null |
2025-07-18 | Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection | Yujian Mo et.al. | 2507.13899 | null |
2025-07-18 | Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation | Max van den Hoven et.al. | 2507.13857 | null |
2025-07-18 | One Step Closer: Creating the Future to Boost Monocular Semantic Scene Completion | Haoang Lu et.al. | 2507.13801 | null |
2025-07-18 | AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework | Yu Yao et.al. | 2507.13729 | null |
2025-07-17 | CaSTFormer: Causal Spatio-Temporal Transformer for Driving Intention Prediction | Sirui Wang et.al. | 2507.13425 | null |
2025-07-16 | From Binary to Semantic: Utilizing Large-Scale Binary Occupancy Data for 3D Semantic Occupancy Prediction | Chihiro Noguchi et.al. | 2507.13387 | null |
2025-07-17 | Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models | Arian Mousakhan et.al. | 2507.13162 | null |
2025-07-17 | Channel-wise Motion Features for Efficient Motion Segmentation | Riku Inoue et.al. | 2507.13082 | null |
2025-07-23 | LaViPlan: Language-Guided Visual Path Planning with RLVR | Hayeon Oh et.al. | 2507.12911 | null |
2025-07-17 | World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving | Yanchen Guan et.al. | 2507.12762 | null |
2025-07-17 | Domain-Enhanced Dual-Branch Model for Efficient and Interpretable Accident Anticipation | Yanchen Guan et.al. | 2507.12755 | null |
2025-07-16 | ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving | Yuhang Lu et.al. | 2507.12499 | null |
2025-07-16 | MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding | Renjie Li et.al. | 2507.12463 | null |
2025-07-16 | AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models | Santosh Vasa et.al. | 2507.12414 | null |
2025-07-21 | AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving | Jiawei Xu et.al. | 2507.12137 | null |
2025-07-16 | LidarPainter: One-Step Away From Any Lidar View To Novel Guidance | Yuzhou Ji et.al. | 2507.12114 | null |
2025-07-16 | Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics | Muleilan Pei et.al. | 2507.12083 | null |
2025-07-16 | IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving | Kanghyun Ryu et.al. | 2507.11940 | null |
2025-07-16 | Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers | Mohammed Hassanin et.al. | 2507.11852 | null |
2025-07-15 | Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation | Zhen Xu et.al. | 2507.11540 | null |
2025-07-15 | A Survey on Interpretability in Visual Recognition | Qiyang Wan et.al. | 2507.11099 | null |
2025-07-14 | RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding | Benjamin Stoler et.al. | 2507.10749 | null |
2025-07-14 | Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance | Kyungtae Han et.al. | 2507.10500 | null |
Traffic Simulation
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-08-28 | HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning | Zhi Su et.al. | 2508.21043 | null |
2025-08-28 | Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees | Yaniv Hassidof et.al. | 2508.21001 | null |
2025-08-28 | Deep Fuzzy Optimization for Batch-Size and Nearest Neighbors in Optimal Robot Motion Planning | Liding Zhang et.al. | 2508.20884 | null |
2025-08-28 | Uncertainty Aware-Predictive Control Barrier Functions: Safer Human Robot Interaction through Probabilistic Motion Forecasting | Lorenzo Busellato et.al. | 2508.20812 | null |
2025-08-28 | CardioMorphNet: Cardiac Motion Prediction Using a Shape-Guided Bayesian Recurrent Deep Network | Reza Akbari Movahed et.al. | 2508.20734 | null |
2025-08-27 | Regulation-Aware Game-Theoretic Motion Planning for Autonomous Racing | Francesco Prignoli et.al. | 2508.20203 | null |
2025-08-27 | Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning | Jinhao Liang et.al. | 2508.20095 | null |
2025-08-27 | APT*: Asymptotically Optimal Motion Planning via Adaptively Prolated Elliptical R-Nearest Neighbors | Liding Zhang et.al. | 2508.19790 | null |
2025-08-27 | Tree-Based Grafting Approach for Bidirectional Motion Planning with Local Subsets Optimization | Liding Zhang et.al. | 2508.19776 | null |
2025-08-27 | Elliptical K-Nearest Neighbors – Path Optimization via Coulomb’s Law and Invalid Vertices in C-space Obstacles | Liding Zhang et.al. | 2508.19771 | null |
2025-08-27 | Autonomous Aerial Manipulation at Arbitrary Pose in SE(3) with Robust Control and Whole-body Planning | Dongjae Lee et.al. | 2508.19608 | null |
2025-08-25 | Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning | Antonio Guillen-Perez et.al. | 2508.18397 | null |
2025-08-26 | FlowVLA: Thinking in Motion with a Visual Chain of Thought | Zhide Zhong et.al. | 2508.18269 | null |
2025-08-25 | Adaptive Output Steps: FlexiSteps Network for Dynamic Trajectory Prediction | Yunxiang Liu et.al. | 2508.17797 | null |
2025-08-23 | LLM-based Human-like Traffic Simulation for Self-driving Tests | Wendi Li et.al. | 2508.16962 | null |
2025-08-23 | Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model | Fan Ding et.al. | 2508.16947 | null |
2025-08-21 | Lang2Lift: A Framework for Language-Guided Pallet Detection and Pose Estimation Integrated in Autonomous Outdoor Forklift Operation | Huy Hoang Nguyen et.al. | 2508.15427 | null |
2025-08-20 | TRUST-Planner: Topology-guided Robust Trajectory Planner for AAVs with Uncertain Obstacle Spatial-temporal Avoidance | Junzhi Li et.al. | 2508.14610 | null |
2025-08-20 | FiReFly: Fair Distributed Receding Horizon Planning for Multiple UAVs | Nicole Fronda et.al. | 2508.14381 | null |
2025-08-16 | Task and Motion Planning for Humanoid Loco-manipulation | Michal Ciebielski et.al. | 2508.14099 | null |
2025-08-20 | Accelerating Signal-Temporal-Logic-Based Task and Motion Planning of Bipedal Navigation using Benders Decomposition | Jiming Ren et.al. | 2508.13407 | null |
2025-08-18 | BOW: Bayesian Optimization over Windows for Motion Planning in Complex Environments | Sourav Raxit et.al. | 2508.13052 | null |
2025-08-28 | On the complexity of constrained reconfiguration and motion planning | Nicolas Bousquet et.al. | 2508.13032 | null |
2025-08-26 | SocialTrack: Multi-Object Tracking in Complex Urban Traffic Scenes Inspired by Social Behavior | Wenguang Tao et.al. | 2508.12777 | null |
2025-08-17 | Autonomous Oil Spill Response Through Liquid Neural Trajectory Modeling and Coordinated Marine Robotics | Hadas C. Kuzmenko et.al. | 2508.12456 | null |
2025-08-17 | EgoLoc: A Generalizable Solution for Temporal Interaction Localization in Egocentric Videos | Junyi Ma et.al. | 2508.12349 | null |
2025-08-15 | A Comparative Study of Floating-Base Space Parameterizations for Agile Whole-Body Motion Planning | Evangelos Tsiatsianas et.al. | 2508.11520 | null |
2025-08-15 | Relative Position Matters: Trajectory Prediction and Planning with Polar Representation | Bozhou Zhang et.al. | 2508.11492 | null |
2025-08-15 | EvoPSF: Online Evolution of Autonomous Driving Models via Planning-State Feedback | Jiayue Jin et.al. | 2508.11453 | null |
2025-08-15 | ReachVox: Clutter-free Reachability Visualization for Robot Motion Planning in Virtual Reality | Steffen Hauck et.al. | 2508.11426 | null |
2025-08-15 | Learning Differentiable Reachability Maps for Optimization-based Humanoid Motion Generation | Masaki Murooka et.al. | 2508.11275 | null |
2025-08-15 | A CLIP-based Uncertainty Modal Modeling (UMM) Framework for Pedestrian Re-Identification in Autonomous Driving | Jialin Li et.al. | 2508.11218 | null |
2025-08-20 | 3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation | Nikolaos Gkanatsios et.al. | 2508.11002 | null |
2025-08-14 | SpaRC-AD: A Baseline for Radar-Camera Fusion in End-to-End Autonomous Driving | Philipp Wolters et.al. | 2508.10567 | null |
2025-08-14 | STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes | Keishi Ishihara et.al. | 2508.10427 | null |
2025-08-12 | CLF-RL: Control Lyapunov Function Guided Reinforcement Learning | Kejun Li et.al. | 2508.09354 | null |
2025-08-10 | Whole-Body Coordination for Dynamic Object Grasping with Legged Manipulators | Qiwei Liang et.al. | 2508.08328 | null |
2025-08-11 | Learning an Implicit Physics Model for Image-based Fluid Simulation | Emily Yue-Ting Jia et.al. | 2508.08254 | null |
2025-08-10 | A Learning-Based Framework for Collision-Free Motion Planning | Mateus Salomão et.al. | 2508.07502 | null |
2025-08-10 | Noise-Aware Generative Microscopic Traffic Simulation | Vindula Jayawardana et.al. | 2508.07453 | null |
2025-08-10 | Bio-Inspired Topological Autonomous Navigation with Active Inference in Robotics | Daria de Tinguy et.al. | 2508.07267 | null |
2025-08-12 | Understanding Dynamic Scenes in Ego Centric 4D Point Clouds | Junsheng Huang et.al. | 2508.07251 | null |
2025-08-10 | CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion | Xiaotong Lin et.al. | 2508.07162 | null |
2025-08-10 | Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction | Yu Liu et.al. | 2508.07146 | null |
2025-08-09 | ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting | Sandro Papais et.al. | 2508.07089 | null |
2025-08-09 | Model Predictive Control for Crowd Navigation via Learning-Based Trajectory Prediction | Mohamed Parvez Aslam et.al. | 2508.07079 | null |
2025-08-05 | Historical Prediction Attention Mechanism based Trajectory Forecasting for Proactive Work Zone Safety in a Digital Twin Environment | Minhaj Uddin Ahmad et.al. | 2508.06544 | null |
2025-08-04 | Symbolic Learning of Interpretable Reduced-Order Models for Jumping Quadruped Robots | Gioele Buriani et.al. | 2508.06538 | null |
2025-08-08 | V*: An Efficient Motion Planning Algorithm for Autonomous Vehicles | Abdullah Zareh Andaryan et.al. | 2508.06404 | null |
2025-08-08 | Incremental Language Understanding for Online Motion Planning of Robot Manipulators | Mitchell Abrams et.al. | 2508.06095 | null |
2025-08-08 | Dynamical Trajectory Planning of Disturbance Consciousness for Air-Land Bimodal Unmanned Aerial Vehicles | Shaoting Liu et.al. | 2508.05972 | null |
2025-08-07 | TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven Evolution | Zhikai Zhao et.al. | 2508.05616 | null |
2025-08-07 | Benchmarking Shortcutting Techniques for Multi-Robot-Arm Motion Planning | Philip Huang et.al. | 2508.05027 | null |
2025-08-06 | LuKAN: A Kolmogorov-Arnold Network Framework for 3D Human Motion Prediction | Md Zahidul Hasan et.al. | 2508.04847 | null |
2025-08-06 | BEVCon: Advancing Bird’s Eye View Perception with Contrastive Learning | Ziyang Leng et.al. | 2508.04702 | null |
2025-08-06 | Incorporating Stochastic Models of Controller Behavior into Kinodynamic Efficiently Adaptive State Lattices for Mobile Robot Motion Planning in Off-Road Environments | Eric R. Damm et.al. | 2508.04384 | null |
2025-08-06 | Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction | Yu Liu et.al. | 2508.04229 | null |
2025-08-11 | Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems | Luai Abuelsamen et.al. | 2508.04146 | null |
2025-08-05 | Constraint-Preserving Data Generation for Visuomotor Policy Learning | Kevin Lin et.al. | 2508.03944 | null |
2025-08-05 | Vision-based Perception System for Automated Delivery Robot-Pedestrians Interactions | Ergi Tushe et.al. | 2508.03541 | null |
2025-08-04 | X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio | Chenxu Zhang et.al. | 2508.02944 | null |
2025-08-04 | MIDAR: Mimicking LiDAR Detection for Traffic Applications with a Lightweight Plug-and-Play Model | Tianheng Zhu et.al. | 2508.02858 | null |
2025-08-04 | Text2Lip: Progressive Lip-Synced Talking Face Generation from Text via Viseme-Guided Rendering | Xu Wang et.al. | 2508.02362 | null |
2025-08-19 | Adaptive Lattice-based Motion Planning | Abhishek Dhar et.al. | 2508.02350 | null |
2025-08-04 | Framework for Robust Motion Planning of Tethered Multi-Robot Systems in Marine Environments | Markus Buchholz et.al. | 2508.02287 | null |
2025-08-04 | AID4AD: Aerial Image Data for Automated Driving Perception | Daniel Lengerer et.al. | 2508.02140 | null |
2025-08-03 | Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving | Hunter Schofield et.al. | 2508.01922 | null |
2025-08-03 | DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion | Zhigang Sun et.al. | 2508.01778 | null |
2025-08-03 | A Spatio-temporal Continuous Network for Stochastic 3D Human Motion Prediction | Hua Yu et.al. | 2508.01585 | null |
2025-07-29 | A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles | Jiayuan Wang et.al. | 2508.00917 | null |
2025-08-01 | On Learning Closed-Loop Probabilistic Multi-Agent Simulator | Juanwu Lu et.al. | 2508.00384 | null |
2025-08-01 | TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps | Zehui Xu et.al. | 2508.00303 | null |
2025-07-31 | Data-Driven Motion Planning for Uncertain Nonlinear Systems | Babak Esmaeili et.al. | 2508.00154 | null |
2025-07-31 | OmniTraj: Pre-Training on Heterogeneous Data for Adaptive and Zero-Shot Human Trajectory Prediction | Yang Gao et.al. | 2507.23657 | null |
2025-07-31 | A Framework for Ethical Decision-Making in Automated Vehicles through Human Reasons-based Supervision | Lucas Elbert Suryana et.al. | 2507.23308 | null |
2025-07-31 | Simulation-based planning of Motion Sequences for Automated Procedure Optimization in Multi-Robot Assembly Cells | Loris Schneider et.al. | 2507.23270 | null |
2025-08-01 | Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future | Guoping Xu et.al. | 2507.22792 | null |
2025-07-30 | Social-Pose: Enhancing Trajectory Prediction with Human Body Pose | Yang Gao et.al. | 2507.22742 | null |
2025-07-30 | Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model | Daehee Park et.al. | 2507.22615 | null |
2025-07-30 | Safety Evaluation of Motion Plans Using Trajectory Predictors as Forward Reachable Set Estimators | Kaustav Chakraborty et.al. | 2507.22389 | null |
2025-07-27 | Model-Structured Neural Networks to Control the Steering Dynamics of Autonomous Race Cars | Mattia Piccinini et.al. | 2507.20427 | null |
2025-07-27 | VLMPlanner: Integrating Visual Language Models with Motion Planning | Zhipeng Tang et.al. | 2507.20342 | null |
2025-07-27 | PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks | Clinton Ansun Mo et.al. | 2507.20170 | null |
2025-07-25 | PhysVarMix: Physics-Informed Variational Mixture Model for Multi-Modal Trajectory Prediction | Haichuan Li et.al. | 2507.19701 | null |
2025-07-25 | RAKOMO: Reachability-Aware K-Order Markov Path Optimization for Quadrupedal Loco-Manipulation | Mattia Risiglione et.al. | 2507.19652 | null |
2025-07-25 | High-Fidelity RF Mapping: Assessing Environmental Modeling in 6G Network Digital Twins | Lorenzo Cazzella et.al. | 2507.19173 | null |
2025-07-31 | PatchTraj: Unified Time-Frequency Representation Learning via Dynamic Patches for Trajectory Prediction | Yanghong Liu et.al. | 2507.19119 | null |
2025-07-24 | Probabilistic Collision Risk Estimation through Gauss-Legendre Cubature and Non-Homogeneous Poisson Processes | Trent Weiss et.al. | 2507.18819 | null |
2025-07-24 | Delving into Mapping Uncertainty for Mapless Trajectory Prediction | Zongzheng Zhang et.al. | 2507.18498 | null |
2025-07-24 | Goal-based Trajectory Prediction for improved Cross-Dataset Generalization | Daniel Grimm et.al. | 2507.18196 | null |
2025-07-24 | DanceGraph: A Complementary Architecture for Synchronous Dancing Online | David Sinclair et.al. | 2507.18052 | null |
2025-07-23 | Safety Assurance for Quadrotor Kinodynamic Motion Planning | Theodoros Tavoulareas et.al. | 2507.17679 | null |
2025-07-23 | IndoorBEV: Joint Detection and Footprint Completion of Objects via Mask-based Prediction in Indoor Scenarios for Bird’s-Eye View Perception | Haichuan Li et.al. | 2507.17445 | null |
2025-08-06 | DeMo++: Motion Decoupling for Autonomous Driving | Bozhou Zhang et.al. | 2507.17342 | null |
2025-07-23 | JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction | Fangze Lin et.al. | 2507.17152 | null |
2025-07-23 | Falconry-like palm landing by a flapping-wing drone based on the human gesture interaction and distance-aware flight planning | Kazuki Numazato et.al. | 2507.17144 | null |
2025-07-22 | RAPTAR: Radar Radiation Pattern Acquisition through Automated Collaborative Robotics | Maaz Qureshi et.al. | 2507.16988 | null |
2025-07-21 | Resilience Evaluation of Kubernetes in Cloud-Edge Environments via Failure Injection | Zihao Chen et.al. | 2507.16109 | null |
2025-07-21 | Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction | Shiyang Li et.al. | 2507.15832 | null |
2025-07-21 | Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs | Ruochu Yang et.al. | 2507.15782 | null |
2025-07-21 | Selective Densification for Rapid Motion Planning in High Dimensions with Narrow Passages | Lu Huang et.al. | 2507.15710 | null |
2025-07-21 | A Universal Vehicle-Trailer Navigation System with Neural Kinematics and Online Residual Learning | Yanbo Chen et.al. | 2507.15607 | null |
2025-07-21 | VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving | Haichao Liu et.al. | 2507.15266 | null |
2025-07-20 | Search-Based Autonomous Vehicle Motion Planning Using Game Theory | Pouya Panahandeh et.al. | 2507.15088 | null |
2025-07-20 | CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning | Pan Hu et.al. | 2507.14903 | null |
2025-07-18 | Context-Aware Behavior Learning with Heuristic Motion Memory for Underwater Manipulation | Markus Buchholz et.al. | 2507.14099 | null |
2025-07-18 | NeHMO: Neural Hamilton-Jacobi Reachability Learning for Decentralized Safe Multi-Agent Motion Planning | Qingyi Chen et.al. | 2507.13940 | null |
2025-07-18 | Conformal Contraction for Robust Nonlinear Control with Distribution-Free Uncertainty Quantification | Sihang Wei et.al. | 2507.13613 | null |
2025-08-08 | Trustworthy Pedestrian Trajectory Prediction via Pattern-Aware Interaction Modeling | Kaiyuan Zhai et.al. | 2507.13397 | null |
2025-07-25 | Signal Temporal Logic Compliant Co-design of Planning and Control | Manas Sashank Juvvi et.al. | 2507.13225 | null |
2025-07-22 | Predictability-Aware Motion Prediction for Edge XR via High-Order Error-State Kalman Filtering | Ziyu Zhong et.al. | 2507.13179 | null |
2025-07-17 | Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning | Giwon Lee et.al. | 2507.12977 | null |
2025-07-17 | FFI-VTR: Lightweight and Robust Visual Teach and Repeat Navigation based on Feature Flow Indicator and Probabilistic Motion Planning | Jikai Wang et.al. | 2507.12800 | null |
2025-07-16 | MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding | Renjie Li et.al. | 2507.12463 | null |
2025-07-16 | Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios | Van-Hoang-Anh Phan et.al. | 2507.12449 | null |
2025-07-16 | Regrasp Maps for Sequential Manipulation Planning | Svetlana Levit et.al. | 2507.12407 | null |
2025-07-16 | Foresight in Motion: Reinforcing Trajectory Prediction with Reward Heuristics | Muleilan Pei et.al. | 2507.12083 | null |
2025-07-16 | IANN-MPPI: Interaction-Aware Neural Network-Enhanced Model Predictive Path Integral Approach for Autonomous Driving | Kanghyun Ryu et.al. | 2507.11940 | null |
2025-07-16 | A Fast Method for Planning All Optimal Homotopic Configurations for Tethered Robots and Its Extended Applications | Jinyuan Liu et.al. | 2507.11880 | null |
2025-07-15 | MPC-based Coarse-to-Fine Motion Planning for Robotic Object Transportation in Cluttered Environments | Chen Cai et.al. | 2507.11211 | null |
2025-07-15 | Enhancing Autonomous Manipulator Control with Human-in-loop for Uncertain Assembly Environments | Ashutosh Mishra et.al. | 2507.11006 | null |
2025-07-15 | OffsetCrust: Variable-Radius Offset Approximation with Power Diagrams | Zihan Zhao et.al. | 2507.10924 | null |
2025-07-15 | Mixed Discrete and Continuous Planning using Shortest Walks in Graphs of Convex Sets | Savva Morozov et.al. | 2507.10878 | null |
2025-07-14 | A Generalizable Physics-Enhanced State Space Model for Long-Term Dynamics Forecasting in Complex Environments | Yuchen Wang et.al. | 2507.10792 | null |
2025-07-23 | Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis | Yue Ding et.al. | 2507.10382 | null |
2025-07-16 | TOP: Trajectory Optimization via Parallel Optimization towards Constant Time Complexity | Jiajun Yu et.al. | 2507.10290 | null |
2025-07-14 | MP-RBFN: Learning-based Vehicle Motion Primitives using Radial Basis Function Networks | Marc Kaufeld et.al. | 2507.10047 | null |
2025-07-22 | Active Probing with Multimodal Predictions for Motion Planning | Darshan Gadginmath et.al. | 2507.09822 | null |
2025-07-13 | Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions | Yuanhong Zheng et.al. | 2507.09446 | null |
2025-07-12 | Real-Time Adaptive Motion Planning via Point Cloud-Guided, Energy-Based Diffusion and Potential Fields | Wondmgezahu Teshome et.al. | 2507.09383 | null |
2025-07-19 | Informed Hybrid Zonotope-based Motion Planning Algorithm | Peng Xie et.al. | 2507.09309 | null |
2025-07-12 | Integrating Planning and Predictive Control Using the Path Feasibility Governor | Shu Zhang et.al. | 2507.09134 | null |
2025-07-09 | Next-Generation Travel Demand Modeling with a Generative Framework for Household Activity Coordination | Xishun Liao et.al. | 2507.08871 | null |
2025-07-14 | STRAP: Spatial-Temporal Risk-Attentive Vehicle Trajectory Prediction for Autonomous Driving | Xinyi Ning et.al. | 2507.08563 | null |
2025-07-11 | Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer | Francesco De Cristofaro et.al. | 2507.08365 | null |
2025-07-11 | Neural Parameter-varying Data-enabled Predictive Control of Cold Atmospheric Pressure Plasma Jets | Pegah GhafGhanbari et.al. | 2507.08259 | null |
2025-07-10 | GGMotion: Group Graph Dynamics-Kinematics Networks for Human Motion Prediction | Shuaijin Wan et.al. | 2507.07515 | null |
2025-07-10 | Towards Safe Autonomous Driving: A Real-Time Safeguarding Concept for Motion Planning Algorithms | Korbinian Moller et.al. | 2507.07444 | null |
2025-07-09 | When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior | Chengyuan Zhang et.al. | 2507.07012 | null |
2025-07-09 | Robust signal decompositions on the circle | Aral Kose et.al. | 2507.07007 | null |
2025-07-09 | ILNet: Trajectory Prediction with Inverse Learning Attention for Enhancing Intention Capture | Mingjin Zeng et.al. | 2507.06531 | null |
2025-07-08 | AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization | Deepak Raina et.al. | 2507.05979 | null |
2025-07-08 | DRO-EDL-MPC: Evidential Deep Learning-Based Distributionally Robust Model Predictive Control for Safe Autonomous Driving | Hyeongchan Ham et.al. | 2507.05710 | null |
2025-07-07 | From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving | Fabian Konstantinidis et.al. | 2507.05254 | null |
2025-07-07 | Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance | Tobias Demmler et.al. | 2507.05098 | null |
2025-07-07 | Unifying Robot Optimization: Monte Carlo Tree Search with Tensor Factorization | Teng Xue et.al. | 2507.04949 | null |
2025-07-25 | Interaction-Merged Motion Planning: Effectively Leveraging Diverse Motion Datasets for Robust Planning | Giwon Lee et.al. | 2507.04790 | null |
2025-07-07 | LTMSformer: A Local Trend-Aware Attention and Motion State Encoding Transformer for Multi-Agent Trajectory Prediction | Yixin Yan et.al. | 2507.04634 | null |
2025-07-06 | Free-Space Optical Communication-Driven NMPC Framework for Multi-Rotor Aerial Vehicles in Structured Inspection Scenarios | Giuseppe Silano et.al. | 2507.04443 | null |
2025-07-05 | Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic | Jianwei Tang et.al. | 2507.04062 | null |
2025-07-05 | Temporal Continual Learning with Prior Compensation for Human Motion Prediction | Jianwei Tang et.al. | 2507.04060 | null |
2025-07-05 | DK-RRT: Deep Koopman RRT for Collision-Aware Motion Planning of Space Manipulators in Dynamic Debris Environments | Qi Chen et.al. | 2507.03878 | null |
2025-07-05 | Enhanced accuracy through ensembling of randomly initialized auto-regressive models for time-dependent PDEs | Ishan Khurjekar et.al. | 2507.03863 | null |
2025-07-04 | Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues | Hanfang Liang et.al. | 2507.03365 | null |
2025-07-03 | Trajectory Optimization for Differential Drive Mobile Manipulators via Topological Paths Search and Arc Length-Yaw Parameterization | Long Xu et.al. | 2507.02761 | null |
2025-07-03 | Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization | Caio Azevedo et.al. | 2507.02406 | null |
2025-07-03 | Path Planning using a One-shot-sampling Skeleton Map | Gabriel O. Flores-Aquino et.al. | 2507.02328 | null |
2025-07-02 | GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters | Wanjia Zhao et.al. | 2507.02085 | null |
2025-07-09 | Test-Time Scaling with Reflective Generative Model | Zixiao Wang et.al. | 2507.01951 | null |
2025-07-06 | AMD: Adaptive Momentum and Decoupled Contrastive Learning Framework for Robust Long-Tail Trajectory Prediction | Bin Rao et.al. | 2507.01801 | null |
2025-07-02 | Efficient Collision Detection for Long and Slender Robotic Links in Euclidean Distance Fields: Application to a Forestry Crane | Marc-Philip Ecker et.al. | 2507.01705 | null |
2025-07-02 | LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction | Muhammad Atta ur Rahman et.al. | 2507.01308 | null |
2025-07-01 | Search-Based Robot Motion Planning With Distance-Based Adaptive Motion Primitives | Benjamin Kraljusic et.al. | 2507.01198 | null |
2025-07-01 | ARIG: Autoregressive Interactive Head Generation for Real-time Conversations | Ying Guo et.al. | 2507.00472 | null |
2025-06-30 | Rethink 3D Object Detection from Physical World | Satoshi Tanaka et.al. | 2507.00190 | null |
2025-06-30 | Epona: Autoregressive Diffusion World Model for Autonomous Driving | Kaiwen Zhang et.al. | 2506.24113 | null |
2025-06-30 | STCLocker: Deadlock Avoidance Testing for Autonomous Driving Systems | Mingfei Cheng et.al. | 2506.23995 | null |
2025-06-29 | InfGen: Scenario Generation as Next Token Group Prediction | Zhenghao Peng et.al. | 2506.23316 | null |
2025-06-29 | Mode Collapse Happens: Evaluating Critical Interactions in Joint Trajectory Prediction Models | Maarten Hugenholtz et.al. | 2506.23164 | null |
2025-06-28 | Safe Reinforcement Learning with a Predictive Safety Filter for Motion Planning and Control: A Drifting Vehicle Example | Bei Zhou et.al. | 2506.22894 | null |
2025-06-27 | Pedestrian Intention and Trajectory Prediction in Unstructured Traffic Using IDD-PeD | Ruthvik Bokkasam et.al. | 2506.22111 | null |
2025-06-27 | A MILP-Based Solution to Multi-Agent Motion Planning and Collision Avoidance in Constrained Environments | Akshay Jaitly et.al. | 2506.21982 | null |
2025-06-27 | SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model | Shuhan Tan et.al. | 2506.21976 | null |
2025-07-14 | Ark: An Open-source Python-based Framework for Robot Learning | Magnus Dierking et.al. | 2506.21628 | null |
2025-06-26 | GoIRL: Graph-Oriented Inverse Reinforcement Learning for Multimodal Trajectory Prediction | Muleilan Pei et.al. | 2506.21121 | null |
2025-06-25 | Near Time-Optimal Hybrid Motion Planning for Timber Cranes | Marc-Philip Ecker et.al. | 2506.20314 | null |
2025-06-24 | Trajectory Prediction in Dynamic Object Tracking: A Critical Study | Zhongping Dong et.al. | 2506.19341 | null |
2025-06-25 | AnchorDP3: 3D Affordance Guided Sparse Diffusion Policy for Robotic Manipulation | Ziyan Zhao et.al. | 2506.19269 | null |
2025-08-04 | Faster Motion Planning via Restarts | Nancy Amato et.al. | 2506.19016 | null |
2025-06-23 | SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives | Yizhou Chen et.al. | 2506.18825 | null |
2025-06-23 | Design, fabrication and control of a cable-driven parallel robot | Dhruv Sorathiya et.al. | 2506.18526 | null |
2025-06-23 | Integrating Maneuverable Planning and Adaptive Control for Robot Cart-Pushing under Disturbances | Zhe Zhang et.al. | 2506.18410 | null |
2025-06-23 | Selective Social-Interaction via Individual Importance for Fast Human Trajectory Prediction | Yota Urano et.al. | 2506.18291 | null |
2025-06-23 | Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement Learning | Yue Li et.al. | 2506.18234 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213 | null |
2025-06-20 | Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control | Albert H. Li et.al. | 2506.17184 | null |
2025-07-11 | Experimental Setup and Software Pipeline to Evaluate Optimization based Autonomous Multi-Robot Search Algorithms | Aditya Bhatt et.al. | 2506.16710 | null |