2025-10-07 |
Fine-grained Defocus Blur Control for Generative Image Models |
Ayush Shrivastava et.al. |
2510.06215 |
null |
2025-10-07 |
Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models |
Jiahao Wang et.al. |
2510.06209 |
null |
2025-10-07 |
On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond |
Chenxiao Yang et.al. |
2510.06190 |
null |
2025-10-07 |
Thermodynamic Performance Limits for Score-Based Diffusion Models |
Nathan X. Kodama et.al. |
2510.06174 |
null |
2025-10-07 |
Bimanual 3D Hand Motion and Articulation Forecasting in Everyday Images |
Aditya Prakash et.al. |
2510.06145 |
null |
2025-10-07 |
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits |
Kangyu Wang et.al. |
2510.06133 |
null |
2025-10-07 |
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation |
Jiawei Mao et.al. |
2510.06131 |
null |
2025-10-07 |
Phase-induced switching of ferromagnetic insulators in Josephson spin valves |
A. A. Mazanik et.al. |
2510.06109 |
null |
2025-10-07 |
Complete Synchronization and Pattern Selection through Amplitude Dynamics and Diffusion in Heterogeneous Oscillatory Media |
Nicolas Thomé et.al. |
2510.06083 |
null |
2025-10-07 |
Mechanistic-statistical inference of mosquito dynamics from mark-release-recapture data |
Nga Nguyen et.al. |
2510.06080 |
null |
2025-10-07 |
Controllable Audio-Visual Viewpoint Generation from 360° Spatial Information |
Christian Marinoni et.al. |
2510.06060 |
null |
2025-10-07 |
Edit-Based Flow Matching for Temporal Point Processes |
David Lüdke et.al. |
2510.06050 |
null |
2025-10-07 |
The gamma-ray emission from Radio Galaxies and their contribution to the Isotropic Gamma-Ray Background |
A. Circiello et.al. |
2510.06047 |
null |
2025-10-07 |
Emergent Directedness in Social Contagion |
Fabian Tschofenig et.al. |
2510.06012 |
null |
2025-10-07 |
ECTSpeech: Enhancing Efficient Speech Synthesis via Easy Consistency Tuning |
Tao Zhu et.al. |
2510.05984 |
null |
2025-10-07 |
Diffusion-Based Image Editing for Breaking Robust Watermarks |
Yunyi Ni et.al. |
2510.05978 |
null |
2025-10-07 |
Diffusion Models for Low-Light Image Enhancement: A Multi-Perspective Taxonomy and Performance Analysis |
Eashan Adhikarla et.al. |
2510.05976 |
null |
2025-10-07 |
Quantum Lattice Boltzmann Method for Multiple Time Steps Without Reinitialization for Linear Advection-Diffusion Problems |
Aaron Nagel et.al. |
2510.05965 |
null |
2025-10-07 |
$\bf{D^3}$ QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection |
Yanran Zhang et.al. |
2510.05891 |
null |
2025-10-07 |
Dynamics of Choline Chloride based Deep Eutectic Solvents: Neutron Scattering Study |
Rinesh T. et.al. |
2510.05882 |
null |
2025-10-07 |
The Safety Challenge of World Models for Embodied AI Agents: A Review |
Lorenzo Baraldi et.al. |
2510.05865 |
null |
2025-10-07 |
FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders |
Riccardo Fosco Gramaccioni et.al. |
2510.05829 |
null |
2025-10-07 |
StereoSync: Spatially-Aware Stereo Audio Generation from Video |
Christian Marinoni et.al. |
2510.05828 |
null |
2025-10-07 |
First experimental measurements of biophotons from Astrocytes and Glioblastoma cell cultures |
L. De Paolis et.al. |
2510.05792 |
null |
2025-10-07 |
Models of topological barriers and molecular motors of bacterial DNA |
Marc Joyeux et.al. |
2510.05790 |
null |
2025-10-07 |
New Insights into Involutory and Orthogonal MDS Matrices |
Yogesh Kumar et.al. |
2510.05766 |
null |
2025-10-07 |
RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases |
Lang Qin et.al. |
2510.05764 |
null |
2025-10-07 |
Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis |
Sedat Dogan et.al. |
2510.05761 |
null |
2025-10-07 |
Vipera: Blending Visual and LLM-Driven Guidance for Systematic Auditing of Text-to-Image Generative AI |
Yanwei Huang et.al. |
2510.05742 |
null |
2025-10-07 |
Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies |
Chunsan Hong et.al. |
2510.05725 |
null |
2025-10-07 |
Data Factory with Minimal Human Effort Using VLMs |
Jiaojiao Ye et.al. |
2510.05722 |
null |
2025-10-07 |
DiffSDA: Unsupervised Diffusion Sequential Disentanglement Across Modalities |
Hedi Zisling et.al. |
2510.05717 |
null |
2025-10-07 |
AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models |
Shihao Zhu et.al. |
2510.05715 |
null |
2025-10-07 |
Hedging of exotic options in Hawkes jump-diffusion models by Malliavin calculus |
Ayub Ahmadi et.al. |
2510.05689 |
null |
2025-10-07 |
When and How to Cut Classical Concerts? A Multimodal Automated Video Editing Approach |
Daniel Gonzálbez-Biosca et.al. |
2510.05661 |
null |
2025-10-07 |
Teleportraits: Training-Free People Insertion into Any Scene |
Jialu Gao et.al. |
2510.05660 |
null |
2025-10-07 |
Beyond Spectral Peaks: Interpreting the Cues Behind Synthetic Image Detection |
Sara Mandelli et.al. |
2510.05633 |
null |
2025-10-07 |
Generative AI-Driven Hierarchical Multi-Agent Framework for Zero-Touch Optical Networks |
Yao Zhang et.al. |
2510.05625 |
null |
2025-10-07 |
PointNSP: Autoregressive 3D Point Cloud Generation with Next-Scale Level-of-Detail Prediction |
Ziqiao Meng et.al. |
2510.05613 |
null |
2025-10-07 |
Efficient Conditional Generation on Scale-based Visual Autoregressive Models |
Jiaqi Liu et.al. |
2510.05610 |
null |
2025-10-07 |
Improving Chain-of-Thought Efficiency for Autoregressive Image Generation |
Zeqi Gu et.al. |
2510.05593 |
null |
2025-10-07 |
Probing orbital currents through inverse orbital Hall and Rashba effects |
E. Santos et.al. |
2510.05543 |
null |
2025-10-07 |
Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation |
Sam Sartor et.al. |
2510.05532 |
null |
2025-10-07 |
Be Tangential to Manifold: Discovering Riemannian Metric for Diffusion Models |
Shinnosuke Saito et.al. |
2510.05509 |
null |
2025-10-07 |
High-Fidelity Synthetic ECG Generation via Mel-Spectrogram Informed Diffusion Training |
Zhuoyi Huang et.al. |
2510.05492 |
null |
2025-10-06 |
Surface Excess Energy Governs the Non-Monotonic Behavior of Active Diffusivity with Activity |
A. Arango-Restrepo et.al. |
2510.05435 |
null |
2025-10-06 |
See the past: Time-Reversed Scene Reconstruction from Thermal Traces Using Visual Language Models |
Kebin Contreras et.al. |
2510.05408 |
null |
2025-10-06 |
LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation |
Yang Xiao et.al. |
2510.05367 |
null |
2025-10-06 |
Mitigating Diffusion Model Hallucinations with Dynamic Guidance |
Kostas Triaridis et.al. |
2510.05356 |
null |
2025-10-06 |
Domain Decomposition-Based Coupling of High-Fidelity Finite Element and Reduced Order Operator Inference Models Using the Schwarz Alternating Method |
Ian Moore et.al. |
2510.05350 |
null |
2025-10-06 |
A System Level Approach to LQR Control of the Diffusion Equation |
Addie McCurdy et.al. |
2510.05345 |
null |
2025-10-06 |
Learning the detector in optical tomography |
Zijian Wang et.al. |
2510.05341 |
null |
2025-10-06 |
Machine Learning Interatomic Potentials Enable Molecular Dynamics Simulations of Doped MoS2 |
Abrar Faiyad et.al. |
2510.05339 |
null |
2025-10-06 |
Resonance with quasinormal modes in long-range kinks’ collisions |
J. G. F. Campos et.al. |
2510.05311 |
null |
2025-10-06 |
Scalarized Hot Neutron Stars Containing Hyperons and $Δ$ -Resonances in Different Evolution Regimes |
Fahimeh Rahimi et.al. |
2510.05302 |
null |
2025-10-06 |
A Data-Driven Prism: Multi-View Source Separation with Diffusion Model Priors |
Sebastian Wagner-Carena et.al. |
2510.05205 |
null |
2025-10-06 |
Paper2Video: Automatic Video Generation from Scientific Papers |
Zeyu Zhu et.al. |
2510.05096 |
null |
2025-10-06 |
VChain: Chain-of-Visual-Thought for Reasoning in Video Generation |
Ziqi Huang et.al. |
2510.05094 |
null |
2025-10-06 |
Character Mixing for Video Generation |
Tingting Liao et.al. |
2510.05093 |
null |
2025-10-06 |
Factuality Matters: When Image Generation and Editing Meet Structured Visuals |
Le Zhuo et.al. |
2510.05091 |
null |
2025-10-06 |
Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models |
Runchu Tian et.al. |
2510.05090 |
null |
2025-10-06 |
SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder |
Ronen Kamenetsky et.al. |
2510.05081 |
null |
2025-10-06 |
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs |
Dachuan Shi et.al. |
2510.05069 |
null |
2025-10-06 |
Spectral Properties of Anomalous Microwave Emission in 144 Galactic Clouds |
Roke Cepeda-Arroita et.al. |
2510.05067 |
null |
2025-10-06 |
StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation |
Mingyu Liu et.al. |
2510.05057 |
null |
2025-10-06 |
No-reference Quality Assessment of Contrast-distorted Images using Contrast-enhanced Pseudo Reference |
Mohammad-Ali Mahmoudpour et.al. |
2510.05053 |
null |
2025-10-06 |
Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts |
Jihoon Lee et.al. |
2510.05040 |
null |
2025-10-06 |
Graph-Aware Diffusion for Signal Generation |
Sergio Rozada et.al. |
2510.05036 |
null |
2025-10-06 |
Comparing fine-tuning strategies of MACE machine learning force field for modeling Li-ion diffusion in LiF for batteries |
Nada Alghamdi et.al. |
2510.05020 |
null |
2025-10-06 |
Bridging Text and Video Generation: A Survey |
Nilay Kumar et.al. |
2510.04999 |
null |
2025-10-06 |
SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization |
Théophane Vallaeys et.al. |
2510.04961 |
null |
2025-10-06 |
Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion |
Xin Li et.al. |
2510.04947 |
null |
2025-10-06 |
Steady-State Spread Bounds for Graph Diffusion via Laplacian Regularisation |
Ardavan Rahimian et.al. |
2510.04924 |
null |
2025-10-06 |
Effect of ice nucleating proteins on the structure-property relationships of ice: A molecular dynamics study |
A. K. Shargh et.al. |
2510.04892 |
null |
2025-10-06 |
Flow-Matching Based Refiner for Molecular Conformer Generation |
Xiangyang Xu et.al. |
2510.04878 |
null |
2025-10-06 |
Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails |
Siwei Han et.al. |
2510.04860 |
null |
2025-10-06 |
Efficient structure-preserving scheme for chemotaxis PDEs with singular sensitivity in crime and epidemic modeling |
Rui Wang et.al. |
2510.04826 |
null |
2025-10-06 |
Did you just see that? Arbitrary view synthesis for egocentric replay of operating room workflows from ambient sensors |
Han Zhang et.al. |
2510.04802 |
null |
2025-10-06 |
A behavioral reinvestigation of the effect of long ties on social contagions |
Luca Lazzaro et.al. |
2510.04785 |
null |
2025-10-06 |
ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs |
Wonjun Kang et.al. |
2510.04767 |
null |
2025-10-06 |
Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba |
Baher Mohammad et.al. |
2510.04738 |
null |
2025-10-06 |
Sub-Gaussian heat kernel estimates for reflected diffusion on inner uniform domains |
Riku Anttila et.al. |
2510.04725 |
null |
2025-10-06 |
BGRem: A background noise remover for astronomical images based on a diffusion model |
R. Nicolaas et.al. |
2510.04718 |
null |
2025-10-06 |
ReactDiff: Fundamental Multiple Appropriate Facial Reaction Diffusion Model |
Luo Cheng et.al. |
2510.04712 |
null |
2025-10-06 |
ID-Consistent, Precise Expression Generation with Blendshape-Guided Diffusion |
Foivos Paraperas Papantoniou et.al. |
2510.04706 |
null |
2025-10-06 |
ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement |
Habin Lim et.al. |
2510.04668 |
null |
2025-10-06 |
Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents |
Zeyi Zhang et.al. |
2510.04637 |
null |
2025-10-06 |
The Role of Acoustic Instability in Cosmic-Ray Self-Confinement |
Antonio Capanema et.al. |
2510.04635 |
null |
2025-10-06 |
Exploring the Power of Diffusion Large Language Models for Software Engineering: An Empirical Investigation |
Jingyao Zhang et.al. |
2510.04605 |
null |
2025-10-06 |
Investigating into mechanisms of high temperature strength of refractory high-entropy alloys |
Sai Anandhi Seetharaman et.al. |
2510.04589 |
null |
2025-10-06 |
Improved probabilistic regression using diffusion models |
Carlo Kneissl et.al. |
2510.04583 |
null |
2025-10-07 |
Constrained Dikin-Langevin diffusion for polyhedra |
James Chok et.al. |
2510.04582 |
null |
2025-10-06 |
Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers |
Juncheng Wang et.al. |
2510.04577 |
null |
2025-10-06 |
SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator |
Yuhta Takida et.al. |
2510.04576 |
null |
2025-10-07 |
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning |
Haoqiang Kang et.al. |
2510.04573 |
null |
2025-10-06 |
3Dify: a Framework for Procedural 3D-CG Generation Assisted by LLMs Using MCP and RAG |
Shun-ichiro Hayashi et.al. |
2510.04536 |
null |
2025-10-06 |
TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling |
Hyunmin Cho et.al. |
2510.04533 |
null |
2025-10-06 |
Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion |
Satoshi Hayakawa et.al. |
2510.04525 |
null |
2025-10-06 |
Toward a Unified Geometry Understanding: Riemannian Diffusion Framework for Graph Generation and Prediction |
Yisen Gao et.al. |
2510.04522 |
null |
2025-10-06 |
Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation |
Zijing Hu et.al. |
2510.04504 |
null |
2025-10-06 |
Non-Monotone Traveling Waves of the Weak Competition Lotka-Volterra System |
Chiun-Chuan Chen et.al. |
2510.04501 |
null |
2025-10-06 |
Identifying non-equilibrium fluctuations in Intracellular Motion Using Recurrent Neural Networks |
Tomas Basile et.al. |
2510.04485 |
null |
2025-10-06 |
TBStar-Edit: From Image Editing Pattern Shifting to Consistency Enhancement |
Hao Fang et.al. |
2510.04483 |
null |
2025-10-06 |
A Diffusion-based Generative Machine Learning Paradigm for Contingency Screening |
Quan Tran et.al. |
2510.04470 |
null |
2025-10-06 |
REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization |
Qiyuan He et.al. |
2510.04450 |
null |
2025-10-06 |
Fractional Heat Kernel for Semi-Supervised Graph Learning with Small Training Sample Size |
Farid Bozorgnia et.al. |
2510.04440 |
null |
2025-10-06 |
spd-metrics-id: A Python Package for SPD-Aware Distance Metrics in Connectome Fingerprinting and Beyond |
Kaosar Uddin et.al. |
2510.04438 |
null |
2025-10-06 |
PAD-TRO: Projection-Augmented Diffusion for Direct Trajectory Optimization |
Jushan Chen et.al. |
2510.04436 |
null |
2025-10-05 |
On the Origin of Carrier Loss in Mg-Doped N-Polar GaN |
Masahiro Kamiyama et.al. |
2510.04381 |
null |
2025-10-05 |
Diffusion^2: Dual Diffusion Model with Uncertainty-Aware Adaptive Noise for Momentary Trajectory Prediction |
Yuhao Luo et.al. |
2510.04365 |
null |
2025-10-05 |
Score-based generative emulation of impact-relevant Earth system model outputs |
Shahine Bouabid et.al. |
2510.04358 |
null |
2025-10-05 |
Reliable and Scalable Robot Policy Evaluation with Imperfect Simulators |
Apurva Badithela et.al. |
2510.04354 |
null |
2025-10-05 |
On strong solution of a multidimensional SDE: extension of Yamada – Watanabe’s theorem |
A. A. Lyappieva et.al. |
2510.04329 |
null |
2025-10-05 |
FoilDiff: A Hybrid Transformer Backbone for Diffusion-based Modelling of 2D Airfoil Flow Fields |
Kenechukwu Ogbuagu et.al. |
2510.04325 |
null |
2025-10-05 |
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation |
Jay Zhangjie Wu et.al. |
2510.04290 |
null |
2025-10-05 |
The best performance in the CARE 2025 – Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation |
Jincan Lou et.al. |
2510.04243 |
null |
2025-10-05 |
Diffusion-Assisted Distillation for Self-Supervised Graph Representation Learning with MLPs |
Seong Jin Ahn et.al. |
2510.04241 |
null |
2025-10-05 |
Flexible Locomotion Learning with Diffusion Model Predictive Control |
Runhan Huang et.al. |
2510.04234 |
null |
2025-10-05 |
MASC: Boosting Autoregressive Image Generation with a Manifold-Aligned Semantic Clustering |
Lixuan He et.al. |
2510.04220 |
null |
2025-10-05 |
World-To-Image: Grounding Text-to-Image Generation with Agent-Driven World Knowledge |
Moo Hyun Son et.al. |
2510.04201 |
null |
2025-10-05 |
Let Features Decide Their Own Solvers: Hybrid Feature Caching for Diffusion Transformers |
Shikang Zheng et.al. |
2510.04188 |
null |
2025-10-05 |
Relief of EGFR/FOS-downregulated miR-103a by loganin alleviates NF-kappaB-triggered inflammation and gut barrier disruption in colitis |
Yan Li et.al. |
2510.04176 |
null |
2025-10-05 |
Drax: Speech Recognition with Discrete Flow Matching |
Aviv Navon et.al. |
2510.04162 |
null |
2025-10-05 |
GDiffuSE: Diffusion-based speech enhancement with noise model guidance |
Efrayim Yanir et.al. |
2510.04157 |
null |
2025-10-05 |
ObCLIP: Oblivious CLoud-Device Hybrid Image Generation with Privacy Preservation |
Haoqi Wu et.al. |
2510.04153 |
null |
2025-10-05 |
Self Speculative Decoding for Diffusion Large Language Models |
Yifeng Gao et.al. |
2510.04147 |
null |
2025-10-05 |
Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models |
Minseo Kim et.al. |
2510.04146 |
null |
2025-10-05 |
Joint Learning of Pose Regression and Denoising Diffusion with Score Scaling Sampling for Category-level 6D Pose Estimation |
Seunghyun Lee et.al. |
2510.04125 |
null |
2025-10-07 |
Harnessing LLM for Noise-Robust Cognitive Diagnosis in Web-Based Intelligent Education Systems |
Guixian Zhang et.al. |
2510.04093 |
null |
2025-10-05 |
What Makes Diffusion Language Models Super Data Learners? |
Zitian Gao et.al. |
2510.04071 |
null |
2025-10-05 |
Diffusion Low Rank Hybrid Reconstruction for Sparse View Medical Imaging |
Zongyin Deng et.al. |
2510.04069 |
null |
2025-10-05 |
Approaching the scaling limit of transport through lattices with dephasing |
Subhajit Sarkar et.al. |
2510.04062 |
null |
2025-10-05 |
Variational Diffusion Unlearning: A Variational Inference Framework for Unlearning in Diffusion Models under Data Constraints |
Subhodip Panda et.al. |
2510.04058 |
null |
2025-10-05 |
Prompt-to-Prompt: Text-Based Image Editing Via Cross-Attention Mechanisms – The Research of Hyperparameters and Novel Mechanisms to Enhance Existing Frameworks |
Linn Bieske et.al. |
2510.04034 |
null |
2025-10-05 |
Principled and Tractable RL for Reasoning with Diffusion Language Models |
Anthony Zhan et.al. |
2510.04019 |
null |
2025-10-05 |
Dual Pruning and Sorting-Free Overestimation for Average-Utility Sequential Pattern Mining |
Kai Cao et.al. |
2510.04014 |
null |
2025-10-05 |
Optimal estimation of a factorizable density using diffusion models with ReLU neural networks |
Jianqing Fan et.al. |
2510.03994 |
null |
2025-10-05 |
Long time evolution of a pair of 2D viscous point vortices |
Ping Zhang et.al. |
2510.03991 |
null |
2025-10-04 |
A discrete data assimilation algorithm for the reconstruction of Gray–Scott dynamics |
Tsiry Avisoa Randrianasolo et.al. |
2510.03972 |
null |
2025-10-04 |
Global weak martingale solutions to a stochastic two-sidedly degenerate aggregation-diffusion equation issued from biology |
Mostafa Bendahmane et.al. |
2510.03947 |
null |
2025-10-04 |
Super-resolution image projection over an extended depth of field using a diffractive decoder |
Hanlong Chen et.al. |
2510.03938 |
null |
2025-10-04 |
Self-Speculative Masked Diffusions |
Andrew Campbell et.al. |
2510.03929 |
null |
2025-10-04 |
High-order, Compact, and Symmetric Finite Difference Methods for a $d$ -Dimensional Hypercube |
Qiwei Feng et.al. |
2510.03927 |
null |
2025-10-04 |
Generating Human Motion Videos using a Cascaded Text-to-Video Framework |
Hyelin Nam et.al. |
2510.03909 |
null |
2025-10-04 |
Rare Text Semantics Were Always There in Your Diffusion Transformer |
Seil Kang et.al. |
2510.03886 |
null |
2025-10-04 |
Adversarial Agent Collaboration for C to Rust Translation |
Tianyu Li et.al. |
2510.03879 |
null |
2025-10-04 |
PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis |
Saja Al-Dabet et.al. |
2510.03873 |
null |
2025-10-04 |
SDAKD: Student Discriminator Assisted Knowledge Distillation for Super-Resolution Generative Adversarial Networks |
Nikolaos Kaparinos et.al. |
2510.03870 |
null |
2025-10-04 |
Mirage: Unveiling Hidden Artifacts in Synthetic Images with Large Vision-Language Models |
Pranav Sharma et.al. |
2510.03840 |
null |
2025-10-04 |
Proximal Diffusion Neural Sampler |
Wei Guo et.al. |
2510.03824 |
null |
2025-10-04 |
Contrastive-SDE: Guiding Stochastic Differential Equations with Contrastive Learning for Unpaired Image-to-Image Translation |
Venkata Narendra Kotyada et.al. |
2510.03821 |
null |
2025-10-04 |
Diverse Text-to-Image Generation via Contrastive Noise Optimization |
Byungjun Kim et.al. |
2510.03813 |
null |
2025-10-04 |
A Variational Method for Conformable Fractional Equations Using Rank-One Updates |
Maatank Parashar et.al. |
2510.03778 |
null |
2025-10-04 |
Bridging the Gap Between Multimodal Foundation Models and World Models |
Xuehai He et.al. |
2510.03727 |
null |
2025-10-04 |
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models |
Leander Girrbach et.al. |
2510.03721 |
null |
2025-10-04 |
Non-negative diffusion bridge of the McKean-Vlasov type: analysis of singular diffusion and application to fish migration |
Hidekazu Yoshioka et.al. |
2510.03692 |
null |
2025-10-03 |
Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner |
Cai Zhou et.al. |
2510.03206 |
null |
2025-10-03 |
Memory Forcing: Spatio-Temporal Memory for Consistent Scene Generation on Minecraft |
Junchao Huang et.al. |
2510.03198 |
null |
2025-10-03 |
Product-Quantised Image Representation for High-Quality Image Synthesis |
Denis Zavadski et.al. |
2510.03191 |
null |
2025-10-03 |
HESS J1831 $-$ 098 – Exploring a pulsar halo scenario with H.E.S.S. data |
Karim Sabri et.al. |
2510.03183 |
null |
2025-10-03 |
UniShield: An Adaptive Multi-Agent Framework for Unified Forgery Image Detection and Localization |
Qing Huang et.al. |
2510.03161 |
null |
2025-10-03 |
Mask2IV: Interaction-Centric Video Generation via Mask Trajectories |
Gen Li et.al. |
2510.03135 |
null |
2025-10-03 |
HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion |
Shiyi Zhang et.al. |
2510.03122 |
null |
2025-10-03 |
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction |
Kaisi Guan et.al. |
2510.03117 |
null |
2025-10-03 |
GeoComplete: Geometry-Aware Diffusion for Reference-Driven Image Completion |
Beibei Lin et.al. |
2510.03110 |
null |
2025-10-03 |
Deciphering the radio-star formation correlation on kpc scales. IV. Radio halos of highly-inclined Virgo cluster spiral galaxies |
B. Vollmer et.al. |
2510.03098 |
null |
2025-10-03 |
Distilled Protein Backbone Generation |
Liyang Xie et.al. |
2510.03095 |
null |
2025-10-03 |
Latent Diffusion Unlearning: Protecting Against Unauthorized Personalization Through Trajectory Shifted Perturbations |
Naresh Kumar Devulapally et.al. |
2510.03089 |
null |
2025-10-03 |
What Drives Compositional Generalization in Visual Generative Models? |
Karim Farid et.al. |
2510.03075 |
null |
2025-10-03 |
Self-consistent model of cosmic ray penetration into molecular clouds: Effect of energy losses |
D. O. Chernyshov et.al. |
2510.03073 |
null |
2025-10-03 |
Rogue waves in extended Gross-Pitaevskii Models with a Lee-Huang-Yang correction |
Sathyanarayanan Chandramouli et.al. |
2510.03063 |
null |
2025-10-03 |
When and Where do Events Switch in Multi-Event Video Generation? |
Ruotong Liao et.al. |
2510.03049 |
null |
2025-10-03 |
Physics-Constrained Inc-GAN for Tunnel Propagation Modeling from Sparse Line Measurements |
Yang Zhou et.al. |
2510.03019 |
null |
2025-10-03 |
Learning Robust Diffusion Models from Imprecise Supervision |
Dong-Dong Wu et.al. |
2510.03016 |
null |
2025-10-03 |
3D-CovDiffusion: 3D-Aware Diffusion Policy for Coverage Path Planning |
Chenyuan Chen et.al. |
2510.03011 |
null |
2025-10-03 |
TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency |
Juntong Wang et.al. |
2510.02987 |
null |
2025-10-03 |
Multi-faceted light pollution modelling and its application to the decline of artificial illuminance in France |
Rolf Buhler et.al. |
2510.02977 |
null |
2025-10-03 |
Long-Time Analysis of Stochastic Heavy Ball Dynamics for Convex Optimization and Monotone Equations |
Radu Ioan Bot et.al. |
2510.02951 |
null |
2025-10-03 |
Stationarity preserving nodal Finite Element methods for multi-dimensional linear hyperbolic balance laws via a Global Flux quadrature formulation |
Wasilij Barsukow et.al. |
2510.02928 |
null |
2025-10-03 |
Probing a theoretical framework for a Photonic Extreme Learning Machine |
Vicente Rocha et.al. |
2510.02918 |
null |
2025-10-03 |
SALSA-V: Shortcut-Augmented Long-form Synchronized Audio from Videos |
Amir Dellali et.al. |
2510.02916 |
null |
2025-10-03 |
DMark: Order-Agnostic Watermarking for Diffusion Large Language Models |
Linyu Wu et.al. |
2510.02902 |
null |
2025-10-03 |
Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models |
Tianren Ma et.al. |
2510.02880 |
null |
2025-10-03 |
Dust scattering halo of 4U 1630-47: High resolution X-ray and mm observations constrain source and molecular cloud distances |
E. Kalemci et.al. |
2510.02879 |
null |
2025-10-03 |
Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech |
Hieu-Nghia Huynh-Nguyen et.al. |
2510.02848 |
null |
2025-10-03 |
TridentServe: A Stage-level Serving System for Diffusion Pipelines |
Yifei Xia et.al. |
2510.02838 |
null |
2025-10-03 |
Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise |
Steve Hong et.al. |
2510.02826 |
null |
2025-10-03 |
PromptMap: Supporting Exploratory Text-to-Image Generation |
Yuhan Guo et.al. |
2510.02814 |
null |
2025-10-03 |
TeV Emission from PSR B1055-52 with HESS: Evidence for a Pulsar Halo |
Tina Wach et.al. |
2510.02802 |
null |
2025-10-03 |
SongFormer: Scaling Music Structure Analysis with Heterogeneous Supervision |
Chunbo Hao et.al. |
2510.02797 |
null |
2025-10-03 |
Periodic Event-Triggered Prescribed Time Control of Euler-Lagrange Systems under State and Input Constraints |
Chidre Shravista Kashyap et.al. |
2510.02769 |
null |
2025-10-03 |
Neural Jump ODEs as Generative Models |
Robert A. Crowell et.al. |
2510.02757 |
null |
2025-10-03 |
Wide-field GMRT imaging of X-shaped Radio-Galaxies: Spectral properties of 4C32.25 and 4C61.23 |
E. Retana-Montenegro et.al. |
2510.02753 |
null |
2025-10-03 |
Denoising and Augmentation: A Dual Use of Diffusion Model for Enhanced CSI Recovery |
Yupeng Li et.al. |
2510.02744 |
null |
2025-10-03 |
Dale meets Langevin: A Multiplicative Denoising Diffusion Model |
Nishanth Shetty et.al. |
2510.02730 |
null |
2025-10-03 |
Flow Matching for Measure Transport and Feedback Stabilization of Control-Affine Systems |
Karthik Elamvazhuthi et.al. |
2510.02706 |
null |
2025-10-03 |
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization |
Kai Fukazawa et.al. |
2510.02695 |
null |
2025-10-03 |
Fine-Tuning Diffusion Models via Intermediate Distribution Shaping |
Gautham Govind Anil et.al. |
2510.02692 |
null |
2025-10-03 |
Ohta-Kawasaki Model Reveals Patterns on Multicomponent Vesicles |
Wangbo Luo et.al. |
2510.02688 |
null |
2025-10-03 |
Smart-GRPO: Smartly Sampling Noise for Efficient RL of Flow-Matching Models |
Benjamin Yu et.al. |
2510.02654 |
null |
2025-10-03 |
Dispersion Relations and Pole-Skipping in a Holographic Charmonium Model with Rotating Plasma |
Luiz F. Ferreira et.al. |
2510.02647 |
null |
2025-10-03 |
Deep Generative Continual Learning using Functional LoRA: FunLoRA |
Victor Enescu et.al. |
2510.02631 |
null |
2025-10-02 |
Input-Aware Sparse Attention for Real-Time Co-Speech Video Generation |
Beijia Lu et.al. |
2510.02617 |
null |
2025-10-02 |
UMI-on-Air: Embodiment-Aware Guidance for Embodiment-Agnostic Visuomotor Policies |
Harsh Gupta et.al. |
2510.02614 |
null |
2025-10-02 |
PEO: Training-Free Aesthetic Quality Enhancement in Pre-Trained Text-to-Image Diffusion Models with Prompt Embedding Optimization |
Hovhannes Margaryan et.al. |
2510.02599 |
null |
2025-10-02 |
Surface Wave Solutions in 1D and 2D for the Broer-Kaup-Boussinesq-Kupershmidt (BKBK) System |
Darryl D. Holm et.al. |
2510.02577 |
null |
2025-10-02 |
How Confident are Video Models? Empowering Video Models to Express their Uncertainty |
Zhiting Mei et.al. |
2510.02571 |
null |
2025-10-02 |
Learning Microswimmer Collision Dynamics and Predicting Diffusivities using a Neural-Network-Assisted Boltzmann Approach |
Haruki Hayano et.al. |
2510.02559 |
null |
2025-10-02 |
Stable determination of the nonlinear parameter in the non-diffusive Westervelt equation from the Dirichlet-to-Neumann map |
Mike Wendels et.al. |
2510.02553 |
null |
2025-10-02 |
Active-Learning Inspired Ab Initio Theory-Experiment Loop Approach for Management of Material Defects: Application to Superconducting Qubits |
Sarvesh Chaudhari et.al. |
2510.02544 |
null |
2025-10-02 |
Self-supervised diffusion model fine-tuning for costate initialization using Markov chain Monte Carlo |
Jannik Graebner et.al. |
2510.02527 |
null |
2025-10-02 |
Graph Generation with Spectral Geodesic Flow Matching |
Xikun Huang et.al. |
2510.02520 |
null |
2025-10-02 |
Learning a distance measure from the information-estimation geometry of data |
Guy Ohayon et.al. |
2510.02514 |
null |
2025-10-02 |
Beyond Linear Diffusions: Improved Representations for Rare Conditional Generative Modeling |
Kulunu Dharmakeerthi et.al. |
2510.02499 |
null |
2025-10-02 |
The Entangled Feedback Impacts of Supernovae in Coarse- versus High-Resolution Galaxy Simulations |
Eric Zhang et.al. |
2510.02432 |
null |
2025-10-02 |
Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity |
Eric Tillmann Bill et.al. |
2510.02315 |
null |
2025-10-02 |
Inferring Dynamic Physical Properties from Video Foundation Models |
Guanqi Zhan et.al. |
2510.02311 |
null |
2025-10-02 |
NoiseShift: Resolution-Aware Noise Recalibration for Better Low-Resolution Image Generation |
Ruozhen He et.al. |
2510.02307 |
null |
2025-10-02 |
Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive |
Tyler Farghly et.al. |
2510.02305 |
null |
2025-10-02 |
Knowledge Distillation Detection for Open-weights Models |
Qin Shi et.al. |
2510.02302 |
null |
2025-10-02 |
Equilibrium Matching: Generative Modeling with Implicit Energy-Based Models |
Runqian Wang et.al. |
2510.02300 |
null |
2025-10-02 |
Continual Personalization for Diffusion Models |
Yu-Chien Liao et.al. |
2510.02296 |
null |
2025-10-02 |
Test-Time Anchoring for Discrete Diffusion Posterior Sampling |
Litu Rout et.al. |
2510.02291 |
null |
2025-10-02 |
MultiModal Action Conditioned Video Generation |
Yichen Li et.al. |
2510.02287 |
null |
2025-10-02 |
Learning to Generate Object Interactions with Physics-Guided Video Diffusion |
David Romero et.al. |
2510.02284 |
null |
2025-10-02 |
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation |
Justin Cui et.al. |
2510.02283 |
null |
2025-10-02 |
Diffusion^2: Turning 3D Environments into Radio Frequency Heatmaps |
Kyoungjun Park et.al. |
2510.02274 |
null |
2025-10-02 |
Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning |
Tianchong Jiang et.al. |
2510.02268 |
null |
2025-10-02 |
NeuroSwift: A Lightweight Cross-Subject Framework for fMRI Visual Reconstruction of Complex Scenes |
Shiyi Zhang et.al. |
2510.02266 |
null |
2025-10-02 |
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing |
Zihan Zhou et.al. |
2510.02253 |
null |
2025-10-02 |
TempoControl: Temporal Attention Guidance for Text-to-Video Models |
Shira Schiber et.al. |
2510.02226 |
null |
2025-10-02 |
Diffusion Transformers for Imputation: Statistical Efficiency and Uncertainty Quantification |
Zeqi Ye et.al. |
2510.02216 |
null |
2025-10-02 |
DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning |
Hanyang Zhao et.al. |
2510.02212 |
null |
2025-10-02 |
Measurement-Guided Consistency Model Sampling for Inverse Problems |
Amirreza Tanevardi et.al. |
2510.02208 |
null |
2025-10-02 |
Chaotic many-body quantum dynamics, spectral correlations, and energy diffusion |
J. T. Chalker et.al. |
2510.02198 |
null |
2025-10-02 |
Uncovering Semantic Selectivity of Latent Groups in Higher Visual Cortex with Mutual Information-Guided Diffusion |
Yule Wang et.al. |
2510.02182 |
null |
2025-10-02 |
Policy Gradient Guidance Enables Test Time Control |
Jianing Qi et.al. |
2510.02148 |
null |
2025-10-02 |
FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models |
Karan Dua et.al. |
2510.02133 |
null |
2025-10-02 |
SoundReactor: Frame-level Online Video-to-Audio Generation |
Koichi Saito et.al. |
2510.02110 |
null |
2025-10-02 |
Quantum Effects or Theoretical Artifacts? A Computational Reanalysis of Hydrogen at High-Pressure |
Stefano Racioppi et.al. |
2510.02098 |
null |
2025-10-02 |
VGDM: Vision-Guided Diffusion Model for Brain Tumor Detection and Segmentation |
Arman Behnam et.al. |
2510.02086 |
null |
2025-10-02 |
Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions |
Zhaoyi Li et.al. |
2510.02081 |
null |
2025-10-02 |
Spec-Gloss Surfels and Normal-Diffuse Priors for Relightable Glossy Objects |
Georgios Kouros et.al. |
2510.02069 |
null |
2025-10-02 |
MSRepaint: Multiple Sclerosis Repaint with Conditional Denoising Diffusion Implicit Model for Bidirectional Lesion Filling and Synthesis |
Jinwei Zhang et.al. |
2510.02063 |
null |
2025-10-02 |
Zero-shot Human Pose Estimation using Diffusion-based Inverse solvers |
Sahil Bhandary Karnoor et.al. |
2510.02043 |
null |
2025-10-02 |
RAD@home discovery of extragalactic radio rings and odd radio circles: clues to their origins |
Ananda Hota et.al. |
2510.01999 |
null |
2025-10-02 |
$\text{G}^2$ RPO: Granular GRPO for Precise Reward in Flow Models |
Yujie Zhou et.al. |
2510.01982 |
null |
2025-10-02 |
ZK-WAGON: Imperceptible Watermark for Image Generation Models using ZK-SNARKs |
Aadarsh Anantha Ramakrishnan et.al. |
2510.01967 |
null |
2025-10-02 |
StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold |
Zhizhong Li et.al. |
2510.01938 |
null |
2025-10-02 |
Dark characterization of Ti/Al LEKIDs for the search of axions in the W-band |
Victor Rollano et.al. |
2510.01913 |
null |
2025-10-02 |
A probabilistic representation for the gradient in a linear parabolic PDE with Neumann boundary condition |
Abdelatif Benchérif Madani et.al. |
2510.01898 |
null |
2025-10-02 |
Multi-marginal temporal Schrödinger Bridge Matching for video generation from unpaired data |
Thomas Gravier et.al. |
2510.01894 |
null |
2025-10-02 |
Fisher information and trajectorial interpretation to the Itô–Langevin relative entropy dissipation |
Jiaming Chen et.al. |
2510.01870 |
null |
2025-10-04 |
NGGAN: Noise Generation GAN Based on the Practical Measurement Dataset for Narrowband Powerline Communications |
Ying-Ren Chien et.al. |
2510.01850 |
null |
2025-10-02 |
Non-Gaussian Rotational Diffusion and Swing Motion of Dumbbell Probes in Two Dimensional Colloids |
Jeongmin Kim et.al. |
2510.01847 |
null |
2025-10-02 |
Leveraging Prior Knowledge of Diffusion Model for Person Search |
Giyeol Kim et.al. |
2510.01841 |
null |
2025-10-02 |
Representation and Integration by Parts Formulas for Affine Processes |
Arturo Kohatsu-Higa et.al. |
2510.01839 |
null |
2025-10-02 |
Intermediate diffusive-ballistic electron conduction around mesoscopic defects in graphene |
Toni Markovic et.al. |
2510.01821 |
null |
2025-10-02 |
Mean-field theory of the Santa Fe model revisited: a systematic derivation from an exact BBGKY hierarchy for the zero-intelligence limit-order book model |
Taiki Wakatsuki et.al. |
2510.01814 |
null |
2025-10-02 |
Efficient manifold evolution algorithm using adaptive B-Spline interpolation |
Muhammad Ammad et.al. |
2510.01790 |
null |
2025-10-03 |
Pack and Force Your Memory: Long-form and Consistent Video Generation |
Xiaofei Wu et.al. |
2510.01784 |
null |
2025-10-02 |
Unsupervised Dynamic Feature Selection for Robust Latent Spaces in Vision Tasks |
Bruno Corcuera et.al. |
2510.01758 |
null |
2025-10-02 |
Towards Photonic Band Diagram Generation with Transformer-Latent Diffusion Models |
Valentin Delchevalerie et.al. |
2510.01749 |
null |
2025-10-02 |
Edge GPU Aware Multiple AI Model Pipeline for Accelerated MRI Reconstruction and Analysis |
Ashiyana Abdul Majeed et.al. |
2510.01730 |
null |
2025-10-02 |
First passage times to T cell activation |
Tony Wong et.al. |
2510.01694 |
null |
2025-10-03 |
UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction |
Jin Cao et.al. |
2510.01669 |
null |
2025-10-02 |
FideDiff: Efficient Diffusion Model for High-Fidelity Image Motion Deblurring |
Xiaoyang Liu et.al. |
2510.01641 |
null |
2025-10-02 |
Finite isoresidual covers in strata of $k$ -differentials |
Dawei Chen et.al. |
2510.01630 |
null |
2025-10-02 |
Local linearization for estimating the diffusion parameter of nonlinear stochastic wave equations with spatially correlated noise |
Guoping Liu et.al. |
2510.01627 |
null |
2025-10-02 |
NPN: Non-Linear Projections of the Null-Space for Imaging Inverse Problems |
Roman Jacome et.al. |
2510.01608 |
null |
2025-10-02 |
Securing generative artificial intelligence with parallel magnetic tunnel junction true randomness |
Youwei Bao et.al. |
2510.01598 |
null |
2025-10-02 |
TetriServe: Efficient DiT Serving for Heterogeneous Image Generation |
Runyu Lu et.al. |
2510.01565 |
null |
2025-10-02 |
MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models |
Kevin Zhai et.al. |
2510.01549 |
null |
2025-10-02 |
Growing Visual Generative Capacity for Pre-Trained MLLMs |
Hanyu Wang et.al. |
2510.01546 |
null |
2025-10-02 |
Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models |
Shaoan Xie et.al. |
2510.01544 |
null |
2025-10-02 |
Towards Better Optimization For Listwise Preference in Diffusion Models |
Jiamu Bai et.al. |
2510.01540 |
null |
2025-10-01 |
Correlation estimates for Brownian particles with singular interactions |
Mitia Duerinckx et.al. |
2510.01507 |
null |
2025-10-01 |
AortaDiff: A Unified Multitask Diffusion Framework For Contrast-Free AAA Imaging |
Yuxuan Ou et.al. |
2510.01498 |
null |
2025-10-01 |
Purrception: Variational Flow Matching for Vector-Quantized Image Generation |
Răzvan-Andrei Matişan et.al. |
2510.01478 |
null |
2025-10-03 |
SCOPED: Score-Curvature Out-of-distribution Proximity Evaluator for Diffusion |
Brett Barkley et.al. |
2510.01456 |
null |
2025-10-01 |
Diffusion Modeling of the Three-Dimensional Magnetic Field in the Sun’s Corona |
Daniel E. da Silva et.al. |
2510.01441 |
null |
2025-10-01 |
DiffKnock: Diffusion-based Knockoff Statistics for Neural Networks Inference |
Heng Ge et.al. |
2510.01418 |
null |
2025-10-01 |
How Well do Diffusion Policies Learn Kinematic Constraint Manifolds? |
Lexi Foland et.al. |
2510.01404 |
null |
2025-10-01 |
Localized Pattern Formation and Oscillatory Instabilities in a Three-component Gierer Meinhardt Model |
Chunyi Gai et.al. |
2510.01401 |
null |
2025-10-01 |
DisCo: Reinforcement with Diversity Constraints for Multi-Human Generation |
Shubhankar Borse et.al. |
2510.01399 |
null |
2025-10-01 |
VENTURA: Adapting Image Diffusion Models for Unified Task Conditioned Navigation |
Arthur Zhang et.al. |
2510.01388 |
null |
2025-10-01 |
Fine-Tuning Masked Diffusion for Provable Self-Correction |
Jaeyeon Kim et.al. |
2510.01384 |
null |
2025-10-01 |
Selective Underfitting in Diffusion Models |
Kiwhan Song et.al. |
2510.01378 |
null |
2025-10-01 |
Microquasars as the major contributors to Galactic cosmic rays around the “knee” |
Samy Kaci et.al. |
2510.01369 |
null |
2025-10-01 |
Image Generation Based on Image Style Extraction |
Shuochen Chang et.al. |
2510.01347 |
null |
2025-10-01 |
Discovery of diffuse gamma-ray emission in the vicinity of G172.8+1.5: An old supernova remnant with different turbulence properties |
Yuan Li et.al. |
2510.01340 |
null |
2025-10-01 |
LVTINO: LAtent Video consisTency INverse sOlver for High Definition Video Restoration |
Alessio Spagnoletti et.al. |
2510.01339 |
null |
2025-10-01 |
Dynamical Excitation as a probe of planetary origins |
Brad M. S. Hansen et.al. |
2510.01332 |
null |
2025-10-01 |
Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling |
Huangjie Zheng et.al. |
2510.01329 |
null |
2025-10-01 |
Combining complex Langevin dynamics with score-based and energy-based diffusion models |
Gert Aarts et.al. |
2510.01328 |
null |
2025-10-01 |
IMAGEdit: Let Any Subject Transform |
Fei Shen et.al. |
2510.01186 |
null |
2025-10-01 |
Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models |
Yanbo Xu et.al. |
2510.01184 |
null |
2025-10-01 |
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory |
Jiahao Wang et.al. |
2510.01183 |
null |
2025-10-01 |
Vanishing Acts: Quantifying Black Hole Formation with the DSNB Signal |
Tim Charissé et.al. |
2510.01177 |
null |
2025-10-01 |
Audio Driven Real-Time Facial Animation for Social Telepresence |
Jiye Lee et.al. |
2510.01176 |
null |
2025-10-01 |
Code2Video: A Code-centric Paradigm for Educational Video Generation |
Yanzhe Chen et.al. |
2510.01174 |
null |
2025-10-01 |
Multi-Marginal Flow Matching with Adversarially Learnt Interpolants |
Oskar Kviman et.al. |
2510.01159 |
null |
2025-10-01 |
Superpositions of Quantum Gaussian Processes |
Lorenzo Braccini et.al. |
2510.01156 |
null |
2025-10-01 |
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition |
Jiahang Cao et.al. |
2510.01068 |
null |
2025-10-01 |
ReSWD: ReSTIR’d, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction |
Mark Boss et.al. |
2510.01061 |
null |
2025-10-01 |
Authentic Discrete Diffusion Model |
Xiao Li et.al. |
2510.01047 |
null |
2025-10-01 |
Gated X-TFC: Soft Domain Decomposition for Forward and Inverse Problems in Sharp-Gradient PDEs |
Vikas Dwivedi et.al. |
2510.01039 |
null |
2025-10-01 |
Secure and reversible face anonymization with diffusion models |
Pol Labarbarie et.al. |
2510.01031 |
null |
2025-10-01 |
Syntax-Guided Diffusion Language Models with User-Integrated Personalization |
Ruqian Zhang et.al. |
2510.01028 |
null |
2025-10-01 |
Equivariant Geometric Scattering Networks via Vector Diffusion Wavelets |
David R. Johnson et.al. |
2510.01022 |
null |
2025-10-01 |
Molecular Mobility of Extraterrestrial Ices: Surface Diffusion in Astrochemistry and Planetary Science |
N. F. W. Ligterink et.al. |
2510.01018 |
null |
2025-10-01 |
ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning |
Yuxiang Guo et.al. |
2510.01010 |
null |
2025-10-02 |
SoftCFG: Uncertainty-guided Stable Guidance for Visual Autoregressive Model |
Dongli Xu et.al. |
2510.00996 |
null |
2025-10-01 |
Riemannian Consistency Model |
Chaoran Cheng et.al. |
2510.00983 |
null |
2025-10-01 |
JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation |
Siheng Wan et.al. |
2510.00974 |
null |
2025-09-30 |
Stitch: Training-Free Position Control in Multimodal Diffusion Transformers |
Jessica Bader et.al. |
2509.26644 |
null |
2025-09-30 |
Query-Kontext: An Unified Multimodal Model for Image Generation and Editing |
Yuxin Song et.al. |
2509.26641 |
null |
2025-09-30 |
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training |
Junlin Han et.al. |
2509.26625 |
null |
2025-09-30 |
DiffCamera: Arbitrary Refocusing on Images |
Yiyang Wang et.al. |
2509.26599 |
null |
2025-09-30 |
Stable Cinemetrics : Structured Taxonomy and Evaluation for Professional Video Generation |
Agneet Chatterjee et.al. |
2509.26555 |
null |
2025-09-30 |
Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents |
Zhen Yang et.al. |
2509.26539 |
null |
2025-09-30 |
HilbertA: Hilbert Attention for Image Generation with Diffusion Models |
Shaoyi Zheng et.al. |
2509.26538 |
null |
2025-09-30 |
Stab-QRAM: An All-Clifford Quantum Random Access Memory for Special Data |
Guangyi Li et.al. |
2509.26494 |
null |
2025-09-30 |
Contrastive Diffusion Guidance for Spatial Inverse Problems |
Sattwik Basu et.al. |
2509.26489 |
null |
2025-09-30 |
dParallel: Learnable Parallel Decoding for dLLMs |
Zigeng Chen et.al. |
2509.26488 |
null |
2025-09-30 |
Closures of moment expansion of anisotropic active Brownian particles |
Timothée Gautry et.al. |
2509.26453 |
null |
2025-09-30 |
Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models |
Donghoon Kim et.al. |
2509.26436 |
null |
2025-10-01 |
AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size |
Guanxi Lu et.al. |
2509.26432 |
null |
2025-09-30 |
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation |
Chenhui Zhu et.al. |
2509.26391 |
null |
2025-09-30 |
The Effective Reactivity for Capturing Brownian Motion by Partially Reactive Patches on a Spherical Surface |
Denis S. Grebenkov et.al. |
2509.26381 |
null |
2025-09-30 |
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation |
Harold Haodong Chen et.al. |
2509.26376 |
null |
2025-09-30 |
Competition of small targets in planar domains: from Dirichlet to Robin and Steklov boundary condition |
Denis S. Grebenkov et.al. |
2509.26367 |
null |
2025-09-30 |
Data-to-Energy Stochastic Dynamics |
Kirill Tamogashev et.al. |
2509.26364 |
null |
2025-09-30 |
Universal critical dynamics near the chiral phase transition and the QCD critical point |
Yunxin Ye et.al. |
2509.26355 |
null |
2025-09-30 |
Fast-dLLM v2: Efficient Block-Diffusion LLM |
Chengyue Wu et.al. |
2509.26328 |
null |
2025-09-30 |
Anomaly detection for generic failure monitoring in robotic assembly, screwing and manipulation |
Niklas Grambow et.al. |
2509.26308 |
null |
2025-09-30 |
Two-component diffuse Galactic gamma-ray emission revealed with Fermi-LAT |
Qi-Ling Chen et.al. |
2509.26290 |
null |
2025-09-30 |
3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation |
Balamurugan Thambiraja et.al. |
2509.26233 |
null |
2025-09-30 |
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance |
Jiayi Guo et.al. |
2509.26231 |
null |
2025-09-30 |
Basic Cycle Ratio: Cost-Effective Ranking of Influential Spreaders from Local and Global Perspectives |
Wenxin Zheng et.al. |
2509.26220 |
null |
2025-09-30 |
Exact rate of convergence for the empirical measure of a subordinated process in $p$ -Wasserstein distance |
René L. Schilling et.al. |
2509.26188 |
null |
2025-09-30 |
BABY 1L: First Tritium Breeding Campaign Results |
Rémi Delaporte-Mathurin et.al. |
2509.26174 |
null |
2025-09-30 |
Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models |
Yuansen Liu et.al. |
2509.26165 |
null |
2025-09-30 |
Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case Synthesis |
Kyeongryeol Go et.al. |
2509.26158 |
null |
2025-09-30 |
EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model |
Ruixiao Dong et.al. |
2509.26127 |
null |
2025-10-01 |
Tracer diffusion coefficients in a sheared granular gas. Exact results |
David González Méndez et.al. |
2509.26115 |
null |
2025-09-30 |
EVODiff: Entropy-aware Variance Optimized Diffusion Inference |
Shigui Li et.al. |
2509.26096 |
null |
2025-09-30 |
The diffusion-driven orthorhombic to tetragonal transition in YBa $_2$Cu$_3$O$_7$ derived with a machine learning interatomic potential |
Davide Gambino et.al. |
2509.26095 |
null |
2025-09-30 |
Fading to Grow: Growing Preference Ratios via Preference Fading Discrete Diffusion for Recommendation |
Guoqing Hu et.al. |
2509.26063 |
null |
2025-09-30 |
Initial traces and solvability of the fast diffusion equation with power-type nonlinearity |
Kazuhiro Ishige et.al. |
2509.26054 |
null |
2025-09-30 |
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution |
Shian Du et.al. |
2509.26025 |
null |
2025-09-30 |
New Fourth-Order Grayscale Indicator-Based Telegraph Diffusion Model for Image Despeckling |
Rajendra K. Ray et.al. |
2509.26010 |
null |
2025-10-02 |
VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing |
Abdelilah Aitrouga et.al. |
2509.25998 |
null |
2025-09-30 |
Exact Solutions to the Quantum Schrödinger Bridge Problem |
Mykola Bordyuh et.al. |
2509.25980 |
null |
2025-09-30 |
Weak-strong uniqueness for general cross-diffusion systems with volume filling |
Maria Heitzinger et.al. |
2509.25978 |
null |
2025-09-30 |
Data-Free Continual Learning of Server Models in Model-Heterogeneous Federated learning |
Xiao Zhang et.al. |
2509.25977 |
null |
2025-09-30 |
CO3: Contrasting Concepts Compose Better |
Debottam Dutta et.al. |
2509.25940 |
null |
2025-09-30 |
Bringing Emerging Architectures to Sequence Labeling in NLP |
Ana Ezquerro et.al. |
2509.25918 |
null |
2025-10-01 |
LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models |
Guolei Huang et.al. |
2509.25896 |
null |
2025-10-01 |
A Multimodal LLM Approach for Visual Question Answering on Multiparametric 3D Brain MRI |
Arvind Murari Vepa et.al. |
2509.25889 |
null |
2025-09-30 |
Kinetics of the photochromic effect in oxygen-containing rare-earth hydrides |
Dmitrii Moldarev et.al. |
2509.25887 |
null |
2025-09-30 |
Training-Free Reward-Guided Image Editing via Trajectory Optimal Control |
Jinho Chang et.al. |
2509.25845 |
null |
2025-09-30 |
HiStyle: Hierarchical Style Embedding Predictor for Text-Prompt-Guided Controllable Speech Synthesis |
Ziyu Zhang et.al. |
2509.25842 |
null |
2025-10-01 |
Act to See, See to Act: Diffusion-Driven Perception-Action Interplay for Adaptive Policies |
Jing Wang et.al. |
2509.25822 |
null |
2025-09-30 |
Pre-equilibrium charm quark dynamics and their impact on D-Meson observables |
Manu Kurian et.al. |
2509.25806 |
null |
2025-09-30 |
Numerical approximations to invariant measures of hybrid stochastic differential equations with superlinear coefficients via the backward Euler-Maruyama method |
Wei Liu et.al. |
2509.25799 |
null |
2025-09-30 |
PUREVQ-GAN: Defending Data Poisoning Attacks through Vector-Quantized Bottlenecks |
Alexander Branch et.al. |
2509.25792 |
null |
2025-09-30 |
Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation |
Mingyu Kang et.al. |
2509.25776 |
null |
2025-09-30 |
PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models |
Jeongjae Lee et.al. |
2509.25774 |
null |
2025-09-30 |
Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs |
Jia Jun Cheng Xian et.al. |
2509.25771 |
null |
2025-09-30 |
Quasi-Monte Carlo methods for uncertainty quantification of tumor growth modeled by a parametric semi-linear parabolic reaction-diffusion equation |
Alexander D. Gilbert et.al. |
2509.25753 |
null |
2025-09-30 |
ART-VITON: Measurement-Guided Latent Diffusion for Artifact-Free Virtual Try-On |
Junseo Park et.al. |
2509.25749 |
null |
2025-09-30 |
LieHMR: Autoregressive Human Mesh Recovery with $SO(3)$ Diffusion |
Donghwan Kim et.al. |
2509.25739 |
null |
2025-09-30 |
LaTo: Landmark-tokenized Diffusion Transformer for Fine-grained Human Face Editing |
Zhenghao Zhang et.al. |
2509.25731 |
null |
2025-09-30 |
Controlled Generation for Private Synthetic Text |
Zihao Zhao et.al. |
2509.25729 |
null |
2025-09-30 |
How Diffusion Models Memorize |
Juyeop Kim et.al. |
2509.25705 |
null |
2025-09-30 |
Radiative hydrodynamic simulations of FIP fractionation in solar flares |
Jeffrey W. Reep et.al. |
2509.25695 |
null |
2025-09-30 |
Hierarchical Diffusion Motion Planning with Task-Conditioned Uncertainty-Aware Priors |
Amelie Minji Kim et.al. |
2509.25685 |
null |
2025-09-30 |
dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought |
Junjie Wen et.al. |
2509.25681 |
null |
2025-09-30 |
Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting |
Jason Stock et.al. |
2509.25631 |
null |
2025-09-30 |
Mean Field Type Control Problems Driven by Jump-diffusions |
Alain Bensoussan et.al. |
2509.25614 |
null |
2025-09-29 |
RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance |
Tianlang Chen et.al. |
2509.25604 |
null |
2025-09-29 |
MoReFlow: Motion Retargeting Learning through Unsupervised Flow Matching |
Wontaek Kim et.al. |
2509.25600 |
null |
2025-09-29 |
Machine Learning Algorithms for Improving Black Box Optimization Solvers |
Morteza Kimiaei et.al. |
2509.25592 |
null |
2025-09-29 |
IRIS: Intrinsic Reward Image Synthesis |
Yihang Chen et.al. |
2509.25562 |
null |
2025-09-29 |
Spatiotemporal Forecasting of Incidents and Congestion with Implications for Sustainable Traffic Control |
Tony Kinchen et.al. |
2509.25515 |
null |
2025-09-29 |
Non-Gaussian statistics of concentration fluctuations in free liquid diffusion |
Marco Bussoletti et.al. |
2509.25511 |
null |
2025-09-29 |
Analysis of a Cahn–Hilliard model for viscoelastoplastic two-phase flows |
Fan Cheng et.al. |
2509.25508 |
null |
2025-09-29 |
Kinetic Monte Carlo prediction of the morphology of pentaerythritol tetranitrate |
Jacob Jeffries et.al. |
2509.25490 |
null |
2025-09-29 |
Noise estimation of SDE from a single data trajectory |
Munawar Ali et.al. |
2509.25484 |
null |
2025-09-29 |
Translation from Wearable PPG to 12-Lead ECG |
Hui Ji et.al. |
2509.25480 |
null |
2025-09-29 |
Exponential Hedging for the Ornstein-Uhlenbeck Process in the Presence of Linear Price Impact |
Yan Dolinsky et.al. |
2509.25472 |
null |
2025-09-29 |
Generating Differentially Private Networks with a Modified Erdős-Rényi Model |
Huaiyuan Rao et.al. |
2509.25431 |
null |
2025-09-29 |
Stochastic dynamics on evolving geometric graphs |
Alexei Daletskii et.al. |
2509.25427 |
null |
2025-09-29 |
Electropolishing-Induced Topographic Defects in Niobium: Insights and Implications for Superconducting Radio Frequency Applications |
Oleksandr Hryhorenko et.al. |
2509.25423 |
null |
2025-09-29 |
Emotion-Aligned Generation in Diffusion Text to Speech Models via Preference-Guided Optimization |
Jiacheng Shi et.al. |
2509.25416 |
null |
2025-09-29 |
FlashOmni: A Unified Sparse Attention Engine for Diffusion Transformers |
Liang Qiao et.al. |
2509.25401 |
null |
2025-09-29 |
Let Physics Guide Your Protein Flows: Topology-aware Unfolding and Generation |
Yogesh Verma et.al. |
2509.25379 |
null |
2025-09-29 |
Safe and Stable Control via Lyapunov-Guided Diffusion Models |
Xiaoyuan Cheng et.al. |
2509.25375 |
null |
2025-09-29 |
Diffusion with doubly stochastic resetting |
Maxence Arutkin et.al. |
2509.25365 |
null |
2025-09-29 |
The spatially-resolved effect of mergers on the stellar mass assembly of MaNGA galaxies |
Eirini Angeloudi et.al. |
2509.25340 |
null |
2025-09-29 |
LUMA: Low-Dimension Unified Motion Alignment with Dual-Path Anchoring for Text-to-Motion Diffusion Model |
Haozhe Jia et.al. |
2509.25304 |
null |
2025-09-29 |
Learning to Parallel: Accelerating Diffusion Large Language Models via Adaptive Parallel Decoding |
Wenrui Bao et.al. |
2509.25188 |
null |
2025-09-29 |
FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation |
Yunyang Ge et.al. |
2509.25187 |
null |
2025-09-29 |
Guided Diffusion for the Discovery of New Superconductors |
Pawan Prakash et.al. |
2509.25186 |
null |
2025-09-29 |
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder |
Junyu Chen et.al. |
2509.25182 |
null |
2025-09-29 |
A bound-preserving multinumerics scheme for steady-state convection-diffusion equations |
Maurice S. Fabien et.al. |
2509.25181 |
null |
2025-10-01 |
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space |
Wenkun He et.al. |
2509.25180 |
null |
2025-09-29 |
GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs |
Aryan Yazdan Parast et.al. |
2509.25178 |
null |
2025-09-29 |
Personalized Vision via Visual In-Context Learning |
Yuxin Jiang et.al. |
2509.25172 |
null |
2025-09-29 |
TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion |
Sophia Tang et.al. |
2509.25171 |
null |
2025-09-29 |
GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models |
Peter Holderrieth et.al. |
2509.25170 |
null |
2025-09-29 |
Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models |
Bowei Chen et.al. |
2509.25162 |
null |
2025-09-29 |
Rolling Forcing: Autoregressive Long Video Diffusion in Real Time |
Kunhao Liu et.al. |
2509.25161 |
null |
2025-09-29 |
GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts |
Fan Yuan et.al. |
2509.25160 |
null |
2025-09-29 |
LayerD: Decomposing Raster Graphic Designs into Layers |
Tomoyuki Suzuki et.al. |
2509.25134 |
null |
2025-09-29 |
Score Distillation of Flow Matching Models |
Mingyuan Zhou et.al. |
2509.25127 |
null |
2025-09-29 |
Diffuse Domain Methods with Dirichlet Boundary Conditions |
Luke Benfield et.al. |
2509.25115 |
null |
2025-09-29 |
MANI-Pure: Magnitude-Adaptive Noise Injection for Adversarial Purification |
Xiaoyi Huang et.al. |
2509.25082 |
null |
2025-09-29 |
Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI |
Bogdan Raonić et.al. |
2509.25080 |
null |
2025-09-29 |
UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation |
Guanjun Wu et.al. |
2509.25079 |
null |
2025-09-29 |
Interstellar Dust-Catalyzed Molecular Hydrogen Formation Enabled by Nuclear Quantum Effects |
Xiaolong Yang et.al. |
2509.25070 |
null |
2025-09-29 |
Collective transport efficiency of microswimmer swarms optimized by tactic run-tumble dynamics |
Maggie Liu et.al. |
2509.25068 |
null |
2025-09-29 |
CharGen: Fast and Fluent Portrait Modification |
Jan-Niklas Dihlmann et.al. |
2509.25058 |
null |
2025-09-29 |
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models |
Shuchen Xue et.al. |
2509.25050 |
null |
2025-09-29 |
Scaling Synthetic Task Generation for Agents via Exploration |
Ram Ramrakhya et.al. |
2509.25047 |
null |
2025-09-29 |
Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct |
Haoyang Zheng et.al. |
2509.25035 |
null |
2025-09-29 |
Lagrangian description and quantification of scalar mixing in fluid flows from particle tracks |
Anna Klünker et.al. |
2509.25030 |
null |
2025-09-29 |
STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation |
Xiaoxiao Ma et.al. |
2509.25027 |
null |
2025-09-29 |
Score-based Membership Inference on Diffusion Models |
Mingxing Rao et.al. |
2509.25003 |
null |
2025-09-29 |
PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion |
Yuyang Yin et.al. |
2509.24997 |
null |
2025-09-29 |
Path Diffuser: Diffusion Model for Data-Driven Traffic Simulator |
Da Saem Lee et.al. |
2509.24995 |
null |
2025-09-29 |
SDPose: Exploiting Diffusion Priors for Out-of-Domain and Robust Pose Estimation |
Shuang Liang et.al. |
2509.24980 |
null |
2025-09-30 |
Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel |
Haotian Dong et.al. |
2509.24979 |
null |
2025-09-29 |
DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern |
Lekang Yang et.al. |
2509.24975 |
null |
2025-09-29 |
Double Descent as a Lens for Sample Efficiency in Autoregressive vs. Discrete Diffusion Models |
Ahmad Fraij et.al. |
2509.24974 |
null |
2025-09-29 |
VIVALDy: A Hybrid Generative Reduced-Order Model for Turbulent Flows, Applied to Vortex-Induced Vibrations |
Niccolò Tonioni et.al. |
2509.24965 |
null |
2025-09-29 |
Sharp behavior of semilinear damped wave equations driven by mixed local-nonlocal operators |
Wenhui Chen et.al. |
2509.24940 |
null |
2025-09-29 |
Scalable GANs with Transformers |
Sangeek Hyun et.al. |
2509.24935 |
null |
2025-09-29 |
Precision calculation of $^3$He$(α,γ)^7$ Be for solar physics |
Ratna Khadka et.al. |
2509.24931 |
null |
2025-09-29 |
SAGA-SR: Semantically and Acoustically Guided Audio Super-Resolution |
Jaekwon Im et.al. |
2509.24924 |
null |
2025-09-29 |
From Code to Action: Hierarchical Learning of Diffusion-VLM Policies |
Markus Peschl et.al. |
2509.24917 |
null |
2025-09-29 |
Segmentor-Guided Counterfactual Fine-Tuning for Image Synthesis |
Tian Xia et.al. |
2509.24913 |
null |
2025-09-29 |
When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis |
Xiang Li et.al. |
2509.24912 |
null |
2025-09-29 |
DRCP: Diffusion on Reinforced Cooperative Perception for Perceiving Beyond Limits |
Lantao Li et.al. |
2509.24903 |
null |
2025-09-29 |
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing |
Zhihong Chen et.al. |
2509.24900 |
null |
2025-09-29 |
Attention Surgery: An Efficient Recipe to Linearize Your Video Diffusion Transformer |
Mohsen Ghafoorian et.al. |
2509.24899 |
null |
2025-09-29 |
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark |
Yang Shi et.al. |
2509.24897 |
null |
2025-09-29 |
VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines |
Mostafa Mohaimen Akand Faisal et.al. |
2509.24891 |
null |
2025-09-29 |
MMRQA: Signal-Enhanced Multimodal Large Language Models for MRI Quality Assessment |
Fankai Jia et.al. |
2509.24888 |
null |
2025-09-29 |
Response to dynamic shape changes in suspensions of hard rectangles |
Denis Dertli et.al. |
2509.24885 |
null |
2025-09-29 |
ThermalGen: Style-Disentangled Flow-Based Generative Models for RGB-to-Thermal Image Translation |
Jiuhong Xiao et.al. |
2509.24878 |
null |
2025-09-29 |
Environment-Aware Satellite Image Generation with Diffusion Models |
Nikos Kostagiolas et.al. |
2509.24875 |
null |
2025-09-29 |
Causal-Adapter: Taming Text-to-Image Diffusion for Faithful Counterfactual Generation |
Lei Tong et.al. |
2509.24798 |
null |
2025-09-29 |
Fidelity-Aware Data Composition for Robust Robot Generalization |
Zizhao Tong et.al. |
2509.24797 |
null |
2025-09-29 |
Collision types and times in interacting particle systems |
Sergio Andraus et.al. |
2509.24790 |
null |
2025-09-29 |
FESTIM v2.0: Upgraded framework for multi-species hydrogen transport and enhanced performance |
James Dark et.al. |
2509.24760 |
null |
2025-09-29 |
ExGS: Extreme 3D Gaussian Compression with Diffusion Priors |
Jiaqi Chen et.al. |
2509.24758 |
null |
2025-09-29 |
Fabrication of hydrogen-bonded metal inorganic-organic complex glasses by ligand-tuning approach |
Tianzhao Xu et.al. |
2509.24755 |
null |
2025-09-29 |
Geometric structure of stationary problem for spatial 1D self-diffusion equation with logistic growth |
Yu ICHIDA et.al. |
2509.24752 |
null |
2025-09-29 |
Direct numerical simulation of two-phase flows with surfactant-induced surface viscous effects |
Debashis Panda et.al. |
2509.24722 |
null |
2025-09-29 |
MAD: Manifold Attracted Diffusion |
Dennis Elbrächter et.al. |
2509.24710 |
null |
2025-09-29 |
Enhancing Physical Plausibility in Video Generation by Reasoning the Implausibility |
Yutong Hao et.al. |
2509.24702 |
null |
2025-09-29 |
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer |
Junsong Chen et.al. |
2509.24695 |
null |
2025-09-29 |
The influence of solute induced memory on interface migration |
Chad W. Sinclair et.al. |
2509.24668 |
null |
2025-09-29 |
Learning Object-Centric Representations Based on Slots in Real World Scenarios |
Adil Kaan Akan et.al. |
2509.24652 |
null |
2025-09-29 |
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning |
Yixuan Zhou et.al. |
2509.24650 |
null |
2025-09-30 |
RIFLE: Removal of Image Flicker-Banding via Latent Diffusion Enhancement |
Libo Zhu et.al. |
2509.24644 |
null |
2025-09-29 |
PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control |
Haozhuo Zhang et.al. |
2509.24591 |
null |
2025-09-29 |
SAIP: A Plug-and-Play Scale-adaptive Module in Diffusion-based Inverse Problems |
Lingyu Wang et.al. |
2509.24580 |
null |
2025-09-29 |
U-DiT Policy: U-shaped Diffusion Transformers for Robotic Manipulation |
Linzhi Wu et.al. |
2509.24579 |
null |
2025-09-29 |
SCOPE: Semantic Conditioning for Sim2Real Category-Level Object Pose Estimation in Robotics |
Peter Hönig et.al. |
2509.24572 |
null |
2025-09-29 |
Training-Free Multimodal Guidance for Video to Audio Generation |
Eleonora Grassucci et.al. |
2509.24550 |
null |
2025-09-29 |
Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis |
Kaizhen Zhu et.al. |
2509.24531 |
null |
2025-09-29 |
CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models |
Zheyuan Hu et.al. |
2509.24526 |
null |
2025-09-29 |
The role of viral dynamics and infectivity in models of oncolytic virotherapy for tumours with different motility |
David Morselli et.al. |
2509.24522 |
null |
2025-09-29 |
Flow Crossover and Parallel Outflow during Collisionless Magnetic Reconnection |
Theerasarn Pianpanit et.al. |
2509.24513 |
null |
2025-09-29 |
A Novel Preprocessing Unit for Effective Deep Learning based Classification and Grading of Diabetic Retinopathy |
Pranoti Nage et.al. |
2509.24497 |
null |
2025-09-29 |
LaMoGen: Laban Movement-Guided Diffusion for Text-to-Motion Generation |
Heechang Kim et.al. |
2509.24469 |
null |
2025-09-29 |
An Agent-Based Framework for Automated Higher-Voice Harmony Generation |
Nia D’Souza Ganapathy et.al. |
2509.24463 |
null |
2025-09-29 |
Alternatives To Next Token Prediction In Text Generation – A Survey |
Charlie Wyatt et.al. |
2509.24435 |
null |
2025-09-29 |
UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark |
Ailing Zhang et.al. |
2509.24427 |
null |
2025-09-29 |
CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers |
Kai Liu et.al. |
2509.24416 |
null |
2025-09-29 |
Unsupervised Single-Channel Speech Separation with a Diffusion Prior under Speaker-Embedding Guidance |
Runwu Shi et.al. |
2509.24395 |
null |
2025-09-29 |
LLaDA-MoE: A Sparse MoE Diffusion Language Model |
Fengqi Zhu et.al. |
2509.24389 |
null |
2025-09-29 |
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning |
Xin Qiu et.al. |
2509.24372 |
null |
2025-09-29 |
From Satellite to Street: A Hybrid Framework Integrating Stable Diffusion and PanoGAN for Consistent Cross-View Synthesis |
Khawlah Bajbaa et.al. |
2509.24369 |
null |
2025-09-29 |
Watermarking Diffusion Language Models |
Thibaud Gloaguen et.al. |
2509.24368 |
null |
2025-09-29 |
Uni-X: Mitigating Modality Conflict with a Two-End-Separated Architecture for Unified Multimodal Models |
Jitai Hao et.al. |
2509.24365 |
null |
2025-09-29 |
DRIFT: Divergent Response in Filtered Transformations for Robust Adversarial Defense |
Amira Guesmi et.al. |
2509.24359 |
null |
2025-09-29 |
NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis |
Yixuan Ren et.al. |
2509.24353 |
null |
2025-09-29 |
Hyperspherical Latents Improve Continuous-Token Autoregressive Generation |
Guolin Ke et.al. |
2509.24335 |
null |
2025-09-29 |
Wavelet-Assisted Mamba for Satellite-Derived Sea Surface Temperature Super-Resolution |
Wankun Chen et.al. |
2509.24334 |
null |
2025-09-29 |
3D Structure of Jet-induced Diffusion Wake |
Zhong Yang et.al. |
2509.24315 |
null |
2025-09-29 |
A study of Universal ODE approaches to predicting soil organic carbon |
Satyanarayana Raju G. V. V et.al. |
2509.24306 |
null |
2025-09-29 |
High-Precision Temperature Estimation Based on Magnetic Nanoparticles Dominated by Brownian Relaxation under Combined AC and DC Magnetic Fields |
Zhongzhou Du et.al. |
2509.24301 |
null |
2025-09-29 |
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models |
Zherui Li et.al. |
2509.24296 |
null |
2025-09-29 |
ASIA: Adaptive 3D Segmentation using Few Image Annotations |
Sai Raj Kishore Perla et.al. |
2509.24288 |
null |
2025-09-29 |
Collisional Baryon-Dominated Dwarf Galaxies: A New Probe of Bursty Feedback and Dark Matter Physics |
Yi-Ying Wang et.al. |
2509.24270 |
null |
2025-09-29 |
Cycle Diffusion Model for Counterfactual Image Generation |
Fangrui Huang et.al. |
2509.24267 |
null |
2025-09-29 |
FreeAction: Training-Free Techniques for Enhanced Fidelity of Trajectory-to-Video Generation |
Seungwook Kim et.al. |
2509.24241 |
null |
2025-09-29 |
Geometry-induced criticality in $p$ -adic scaling limits of random walks |
Rahul Rajkumar et.al. |
2509.24234 |
null |
2025-09-29 |
Non-Invasive Detection of PROState Cancer with Novel Time-Dependent Diffusion MRI and AI-Enhanced Quantitative Radiological Interpretation: PROS-TD-AI |
Baltasar Ramos et.al. |
2509.24227 |
null |
2025-09-29 |
Semantic Editing with Coupled Stochastic Differential Equations |
Jianxin Zhang et.al. |
2509.24223 |
null |
2025-09-29 |
The role of the solid-melt interface in accelerating the self-catalyzed growth kinetics of III-V semiconductors |
Zhucong Xi et.al. |
2509.24206 |
null |
2025-09-30 |
UniVid: The Open-Source Unified Video Model |
Jiabin Luo et.al. |
2509.24200 |
null |
2025-09-29 |
An Efficient 3D Latent Diffusion Model for T1-contrast Enhanced MRI Generation |
Zach Eidex et.al. |
2509.24194 |
null |
2025-09-29 |
Simulating Post-Neoadjuvant Chemotherapy Breast Cancer MRI via Diffusion Model with Prompt Tuning |
Jonghun Kim et.al. |
2509.24185 |
null |
2025-09-29 |
Tumor Synthesis conditioned on Radiomics |
Jonghun Kim et.al. |
2509.24182 |
null |
2025-09-29 |
LatXGen: Towards Radiation-Free and Accurate Quantitative Analysis of Sagittal Spinal Alignment Via Cross-Modal Radiographic View Synthesis |
Moxin Zhao et.al. |
2509.24165 |
null |
2025-09-29 |
Asymmetric VAE for One-Step Video Super-Resolution Acceleration |
Jianze Li et.al. |
2509.24142 |
null |
2025-09-28 |
GANji: A Framework for Introductory AI Image Generation |
Chandon Hamel et.al. |
2509.24128 |
null |
2025-09-28 |
Progressive Layer Stripping Analysis for HVSR Interpretation |
Mersad Fathizadeh et.al. |
2509.24121 |
null |
2025-09-28 |
GeoFunFlow: Geometric Function Flow Matching for Inverse Operator Learning over Complex Geometries |
Sifan Wang et.al. |
2509.24117 |
null |
2025-09-28 |
BTC-SAM: Leveraging LLMs for Generation of Bias Test Cases for Sentiment Analysis Models |
Zsolt T. Kardkovács et.al. |
2509.24101 |
null |
2025-09-26 |
Pixel Motion Diffusion is What We Need for Robot Control |
E-Ro Nguyen et.al. |
2509.22652 |
null |
2025-09-26 |
RefAM: Attention Magnets for Zero-Shot Referral Segmentation |
Anna Kukleva et.al. |
2509.22650 |
null |
2025-09-26 |
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs |
Xingyu Fu et.al. |
2509.22646 |
null |
2025-09-26 |
Language Models Can Learn from Verbal Feedback Without Scalar Rewards |
Renjie Luo et.al. |
2509.22638 |
null |
2025-09-26 |
Scale-Wise VAR is Secretly Discrete Diffusion |
Amandeep Kumar et.al. |
2509.22636 |
null |
2025-09-26 |
Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance |
Luc Boudier et.al. |
2509.22635 |
null |
2025-09-26 |
LongLive: Real-time Interactive Long Video Generation |
Shuai Yang et.al. |
2509.22622 |
null |
2025-09-26 |
Exact solutions of open quantum Brownian motions on the real line for two-level systems |
Manuel D. de la Iglesia et.al. |
2509.22604 |
null |
2025-09-26 |
Transport Based Mean Flows for Generative Modeling |
Elaheh Akbari et.al. |
2509.22592 |
null |
2025-09-26 |
EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation |
Yuan Xu et.al. |
2509.22578 |
null |
2025-09-26 |
UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration |
Qi Mao et.al. |
2509.22570 |
null |
2025-09-26 |
ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Generative Models |
Xiaocheng Zou et.al. |
2509.22551 |
null |
2025-09-26 |
EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model |
Andrii Litvynchuk et.al. |
2509.22527 |
null |
2025-09-26 |
JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation |
Guillem Capellera et.al. |
2509.22522 |
null |
2025-09-26 |
A phenotype-structured reaction-diffusion model of avascular glioma growth |
Francesca Ballatore et.al. |
2509.22519 |
null |
2025-09-26 |
Group Critical-token Policy Optimization for Autoregressive Image Generation |
Guohui Zhang et.al. |
2509.22485 |
null |
2025-09-26 |
Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation |
Chen Li et.al. |
2509.22476 |
null |
2025-09-26 |
Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs) |
Nikita Kornilov et.al. |
2509.22459 |
null |
2025-09-26 |
Overclocking Electrostatic Generative Models |
Daniil Shlenskii et.al. |
2509.22454 |
null |
2025-09-26 |
LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer |
Song Fei et.al. |
2509.22414 |
null |
2025-09-26 |
EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer |
Zhehao Dong et.al. |
2509.22407 |
null |
2025-09-26 |
Closing the Safety Gap: Surgical Concept Erasure in Visual Autoregressive Models |
Xinhao Zhong et.al. |
2509.22400 |
null |
2025-09-26 |
Gradient-based multi-focus image fusion with focus-aware saliency enhancement |
Haoyu Li et.al. |
2509.22392 |
null |
2025-09-26 |
SurvDiff: A Diffusion Model for Generating Synthetic Data in Survival Analysis |
Marie Brockschmidt et.al. |
2509.22352 |
null |
2025-09-26 |
Decoding quantum low density parity check codes with diffusion |
Zejun Liu et.al. |
2509.22347 |
null |
2025-09-26 |
RAPID^3: Tri-Level Reinforced Acceleration Policies for Diffusion Transformer |
Wangbo Zhao et.al. |
2509.22323 |
null |
2025-09-26 |
NIFTY: a Non-Local Image Flow Matching for Texture Synthesis |
Pierrick Chatillon et.al. |
2509.22318 |
null |
2025-09-26 |
Self-organization mechanism in Bridgman-grown MnBi2Te4/(Bi2Te3)n: influence on layer sequence and magnetic properties |
Paweł Skupiński et.al. |
2509.22303 |
null |
2025-09-26 |
HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models |
Seyedmorteza Sadat et.al. |
2509.22300 |
null |
2025-09-26 |
Jailbreaking on Text-to-Video Models via Scene Splitting Strategy |
Wonjun Lee et.al. |
2509.22292 |
null |
2025-09-26 |
Wavelength-scale noise-resistant on-chip spectrometer |
Jianbo Yu et.al. |
2509.22286 |
null |
2025-09-26 |
Conditional Denoising Diffusion Autoencoders for Wireless Semantic Communications |
Mehdi Letafati et.al. |
2509.22282 |
null |
2025-09-26 |
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing |
Junyi Wu et.al. |
2509.22244 |
null |
2025-09-26 |
The moving patch model with fractional diffusion |
Sebastián Flores-Sepúlveda et.al. |
2509.22234 |
null |
2025-09-26 |
Question-Driven Analysis and Synthesis: Building Interpretable Thematic Trees with LLMs for Text Clustering and Controllable Generation |
Tiago Fernandes Tavares et.al. |
2509.22211 |
null |
2025-09-26 |
MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training |
Haoyun Li et.al. |
2509.22199 |
null |
2025-09-26 |
DragGANSpace: Latent Space Exploration and Control for GANs |
Kirsten Odendaal et.al. |
2509.22169 |
null |
2025-09-26 |
REFINE-CONTROL: A Semi-supervised Distillation Method For Conditional Image Generation |
Yicheng Jiang et.al. |
2509.22139 |
null |
2025-09-26 |
Guidance Watermarking for Diffusion Models |
Enoal Gesny et.al. |
2509.22126 |
null |
2025-09-26 |
Countering adversarial evasion in regression analysis |
David Benfield et.al. |
2509.22113 |
null |
2025-09-26 |
Large Material Gaussian Model for Relightable 3D Generation |
Jingrui Ye et.al. |
2509.22112 |
null |
2025-09-26 |
50 mm $\times$ 50 mm Cesium Atomic Vapor Cell for Terahertz Imaging: Implementation and Application |
Bin Zhang et.al. |
2509.22098 |
null |
2025-09-26 |
Factor-Based Conditional Diffusion Model for Portfolio Optimization |
Xuefeng Gao et.al. |
2509.22088 |
null |
2025-09-26 |
SpecXNet: A Dual-Domain Convolutional Network for Robust Deepfake Detection |
Inzamamul Alam et.al. |
2509.22070 |
null |
2025-09-26 |
High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling |
Chao Huang et.al. |
2509.22063 |
null |
2025-09-26 |
Comparative Analysis of GAN and Diffusion for MRI-to-CT translation |
Emily Honey et.al. |
2509.22049 |
null |
2025-09-26 |
Latent Diffusion : Multi-Dimension Stable Diffusion Latent Space Explorer |
Zhihua Zhong et.al. |
2509.22038 |
null |
2025-09-26 |
Stage-wise Dynamics of Classifier-Free Guidance in Diffusion Models |
Cheng Jin et.al. |
2509.22007 |
null |
2025-09-26 |
Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors |
Youxu Shi et.al. |
2509.21997 |
null |
2025-09-26 |
FailureAtlas:Mapping the Failure Landscape of T2I Models via Active Exploration |
Muxi Chen et.al. |
2509.21995 |
null |
2025-09-26 |
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation |
Abdelrahman Eldesokey et.al. |
2509.21989 |
null |
2025-09-26 |
Hybrid Diffusion for Simultaneous Symbolic and Continuous Planning |
Sigmund Hennum Høeg et.al. |
2509.21983 |
null |
2025-09-26 |
Electric-field effect on spin diffusion length in solids: An \textit{ab initio} study beyond the drift-diffusion model |
Junqing Xu et.al. |
2509.21962 |
null |
2025-09-26 |
MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning |
Tao Wu et.al. |
2509.21953 |
null |
2025-09-26 |
Modeling the Equilibrium Vacancy Concentration in Multi-Principal Element Alloys from First-Principles |
Damien K. J. Lee et.al. |
2509.21944 |
null |
2025-09-26 |
Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning |
Xianghua Zeng et.al. |
2509.21942 |
null |
2025-09-26 |
SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet |
Woosung Joung et.al. |
2509.21938 |
null |
2025-09-26 |
EqDiff-CT: Equivariant Conditional Diffusion model for CT Image Synthesis from CBCT |
Alzahra Altalib et.al. |
2509.21913 |
null |
2025-09-26 |
Discrete Guidance Matching: Exact Guidance for Discrete Flow Matching |
Zhengyan Wan et.al. |
2509.21912 |
null |
2025-09-26 |
Logarithmic evolutions in solutions to the convection-diffusion equation of Burgers type |
Masakazu Yamamoto et.al. |
2509.21909 |
null |
2025-09-26 |
Error Analysis of Discrete Flow with Generator Matching |
Zhengyan Wan et.al. |
2509.21906 |
null |
2025-09-26 |
TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation |
Qihang Wang et.al. |
2509.21905 |
null |
2025-09-26 |
Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers |
Jibin Song et.al. |
2509.21893 |
null |
2025-09-26 |
Drag4D: Align Your Motion with Text-Driven 3D Scene Generation |
Minjun Kang et.al. |
2509.21888 |
null |
2025-09-26 |
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing |
Liyang Chen et.al. |
2509.21887 |
null |
2025-09-26 |
Abductive Logical Rule Induction by Bridging Inductive Logic Programming and Multimodal Large Language Models |
Yifei Peng et.al. |
2509.21874 |
null |
2025-09-26 |
Deepfakes: we need to re-think the concept of “real” images |
Janis Keuper et.al. |
2509.21864 |
null |
2025-09-26 |
DiTraj: training-free trajectory control for video diffusion transformer |
Cheng Lei et.al. |
2509.21839 |
null |
2025-09-26 |
On the Complexity Theory of Masked Discrete Diffusion: From $\mathrm{poly}(1/ε)$ to Nearly $ε$ -Free |
Xunpeng Huang et.al. |
2509.21835 |
null |
2025-09-26 |
MoWM: Mixture-of-World-Models for Embodied Planning via Latent-to-Pixel Feature Modulation |
Yu Shang et.al. |
2509.21797 |
null |
2025-09-26 |
LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE |
Yu Shang et.al. |
2509.21790 |
null |
2025-09-26 |
DeHate: A Stable Diffusion-based Multimodal Approach to Mitigate Hate Speech in Images |
Dwip Dalal et.al. |
2509.21787 |
null |
2025-09-26 |
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models |
Lan Chen et.al. |
2509.21760 |
null |
2025-09-26 |
Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription |
Michael Yeung et.al. |
2509.21739 |
null |
2025-09-26 |
MESA Isochrones and Stellar Tracks (MIST) III. The White Dwarf Cooling Sequence |
Evan B. Bauer et.al. |
2509.21717 |
null |
2025-09-26 |
MusicWeaver: Coherent Long-Range and Editable Music Generation from a Beat-Aligned Structural Plan |
Xuanchen Wang et.al. |
2509.21714 |
null |
2025-09-25 |
Snapshot Synthetic Aperture Imaging with Boiling Speckle |
Janith B. Senanayaka et.al. |
2509.21682 |
null |
2025-09-25 |
Generating Stable Placements via Physics-guided Diffusion Models |
Philippe Nadeau et.al. |
2509.21664 |
null |
2025-09-25 |
RED-DiffEq: Regularization by denoising diffusion models for solving inverse PDE problems with application to full waveform inversion |
Siming Shan et.al. |
2509.21659 |
null |
2025-09-25 |
FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction |
Yixiang Dai et.al. |
2509.21657 |
null |
2025-09-25 |
DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models |
Yinuo Ren et.al. |
2509.21655 |
null |
2025-09-25 |
A comprehensive equivalent circuit model for high overtone bulk acoustic resonators (HBARs) |
Vikrant J. Gokhale et.al. |
2509.21640 |
null |
2025-09-25 |
Guiding Audio Editing with Audio Language Model |
Zitong Lan et.al. |
2509.21625 |
null |
2025-09-25 |
Message passing for epidemiological interventions on networks with loops |
Erik Weis et.al. |
2509.21596 |
null |
2025-09-25 |
Transabdominal Fetal Oximetry via Diffuse Optics: Principled Analysis and Demonstration in Pregnant Ovine Models |
Weitai Qian et.al. |
2509.21594 |
null |
2025-09-25 |
What Happens Next? Anticipating Future Motion by Generating Point Trajectories |
Gabrijel Boduljak et.al. |
2509.21592 |
null |
2025-09-25 |
X-Streamer: Unified Human World Modeling with Audiovisual Interaction |
You Xie et.al. |
2509.21574 |
null |
2025-09-25 |
No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models |
Junno Yun et.al. |
2509.21565 |
null |
2025-09-25 |
ControlHair: Physically-based Video Diffusion for Controllable Dynamic Hair Rendering |
Weikai Lin et.al. |
2509.21541 |
null |
2025-09-25 |
Patch-Based Diffusion for Data-Efficient, Radiologist-Preferred MRI Reconstruction |
Rohan Sanda et.al. |
2509.21531 |
null |
2025-09-25 |
Shortcut Flow Matching for Speech Enhancement: Step-Invariant flows via single stage training |
Naisong Zhou et.al. |
2509.21522 |
null |
2025-09-25 |
DistillKac: Few-Step Image Generation via Damped Wave Equations |
Weiqiao Han et.al. |
2509.21513 |
null |
2025-09-25 |
Quantum algorithms for solving a drift-diffusion equation: analysing circuit depths |
Ellen Devereux et.al. |
2509.21509 |
null |
2025-09-25 |
SlimDiff: Training-Free, Activation-Guided Hands-free Slimming of Diffusion Models |
Arani Roy et.al. |
2509.21498 |
null |
2025-09-25 |
d2: Improved Techniques for Training Reasoning Diffusion Language Models |
Guanghan Wang et.al. |
2509.21474 |
null |
2025-09-25 |
Are Hallucinations Bad Estimations? |
Hude Liu et.al. |
2509.21473 |
null |
2025-09-25 |
Score-based Idempotent Distillation of Diffusion Models |
Shehtab Zaman et.al. |
2509.21470 |
null |
2025-09-25 |
Gender Stereotypes in Professional Roles Among Saudis: An Analytical Study of AI-Generated Images Using Language Models |
Khaloud S. AlKhalifah et.al. |
2509.21466 |
null |
2025-09-25 |
Viscous Growth Law in Bubble Coarsening: A Molecular Dynamics Perspective |
Parameshwaran A et.al. |
2509.21457 |
null |
2025-09-25 |
SD3.5-Flash: Distribution-Guided Distillation of Generative Flows |
Hmrishav Bandyopadhyay et.al. |
2509.21318 |
null |
2025-09-25 |
Two ADI compact difference methods for variable-exponent diffusion wave equations |
Hao Zhang et.al. |
2509.21316 |
null |
2025-09-25 |
NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics |
Yu Yuan et.al. |
2509.21309 |
null |
2025-09-25 |
Einstein@Home Searches for Gamma-ray Pulsars in the Inner Galaxy |
C. J. Clark et.al. |
2509.21307 |
null |
2025-09-26 |
Outflow-cloud interaction as the possible origin of the peculiar radio emission in the tidal disruption event AT2018cqh |
Lei Yang et.al. |
2509.21299 |
null |
2025-09-25 |
Does FLUX Already Know How to Perform Physically Plausible Image Composition? |
Shilin Lu et.al. |
2509.21278 |
null |
2025-09-25 |
Dense Semantic Matching with VGGT Prior |
Songlin Yang et.al. |
2509.21263 |
null |
2025-09-25 |
Un-Doubling Diffusion: LLM-guided Disambiguation of Homonym Duplication |
Evgeny Kaskov et.al. |
2509.21262 |
null |
2025-09-25 |
Hallucination as an Upper Bound: A New Perspective on Text-to-Image Evaluation |
Seyed Amir Kasaei et.al. |
2509.21257 |
null |
2025-09-25 |
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets |
Team Hunyuan3D et.al. |
2509.21245 |
null |
2025-09-25 |
Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation |
Seyed Amir Kasaei et.al. |
2509.21227 |
null |
2025-09-25 |
A Unified Framework for Diffusion Model Unlearning with f-Divergence |
Nicola Novello et.al. |
2509.21167 |
null |
2025-09-25 |
DAGDiff: Guiding Dual-Arm Grasp Diffusion to Stable and Collision-Free Grasps |
Md Faizal Karim et.al. |
2509.21145 |
null |
2025-09-25 |
The Unwinnable Arms Race of AI Image Detection |
Till Aczel et.al. |
2509.21135 |
null |
2025-09-25 |
MotionFlow:Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation |
Guojun Lei et.al. |
2509.21119 |
null |
2025-09-25 |
Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks? |
Rostislav Makarov et.al. |
2509.21087 |
null |
2025-09-25 |
UniTransfer: Video Concept Transfer via Progressive Spatial and Timestep Decomposition |
Guojun Lei et.al. |
2509.21086 |
null |
2025-09-25 |
Normalizing Flows are Capable Visuomotor Policy Learning Models |
Simon Kristoffersson Lind et.al. |
2509.21073 |
null |
2025-09-25 |
SPREAD: Sampling-based Pareto front Refinement via Efficient Adaptive Diffusion |
Sedjro Salomon Hotegni et.al. |
2509.21058 |
null |
2025-09-25 |
Actor-Critic without Actor |
Donghyeon Ki et.al. |
2509.21022 |
null |
2025-09-25 |
Graphical Willmore Problems with Low-Regularity Boundary and Dirichlet Data |
Boris Gulyak et.al. |
2509.21018 |
null |
2025-09-25 |
Unbiased Parameter Estimation of Partially Observed Diffusions using Diffusion Bridges |
Miguel Alvarez et.al. |
2509.21015 |
null |
2025-09-25 |
A Single Neuron Works: Precise Concept Erasure in Text-to-Image Diffusion Models |
Qinqin He et.al. |
2509.21008 |
null |
2025-09-26 |
TF-Restormer: Complex Spectral Prediction for Speech Restoration |
Ui-Hyeop Shin et.al. |
2509.21003 |
null |
2025-09-25 |
High energy gammas and neutrinos from the Sun, Jupiter and Earth |
Pablo de la Torre et.al. |
2509.20970 |
null |
2025-09-25 |
Flow Matching in the Low-Noise Regime: Pathologies and a Contrastive Remedy |
Weili Zeng et.al. |
2509.20952 |
null |
2025-09-25 |
SMC-X: A Distributed Scalable Monte Carlo Simulation Method for Chemically Complex Alloys |
Xianglin Liu et.al. |
2509.20949 |
null |
2025-09-25 |
Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting |
Yanfeng Yang et.al. |
2509.20928 |
null |
2025-09-25 |
SimDiff: Simulator-constrained Diffusion Model for Physically Plausible Motion Generation |
Akihisa Watanabe et.al. |
2509.20927 |
null |
2025-09-25 |
Deterministic Discrete Denoising |
Hideyuki Suzuki et.al. |
2509.20896 |
null |
2025-09-25 |
AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion |
Junyoung Koh et.al. |
2509.20891 |
null |
2025-09-25 |
FerretNet: Efficient Synthetic Image Detection via Local Pixel Dependencies |
Shuqiao Liang et.al. |
2509.20890 |
null |
2025-09-25 |
Holographic Brownian dynamics of a heavy particle in a boosted thermal plasma background |
Anirban Roy Chowdhury et.al. |
2509.20889 |
null |
2025-09-25 |
Nuclear Diffusion Models for Low-Rank Background Suppression in Videos |
Tristan S. W. Stevens et.al. |
2509.20886 |
null |
2025-09-25 |
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering |
Zhifei Li et.al. |
2509.20884 |
null |
2025-09-25 |
WeFT: Weighted Entropy-driven Fine-Tuning for dLLMs |
Guowei Xu et.al. |
2509.20863 |
null |
2025-09-25 |
Causal Time Series Generation via Diffusion Models |
Yutong Xia et.al. |
2509.20846 |
null |
2025-09-25 |
Topological Catenation-induced Pore Size in 2D Olympic Network |
Wenbo Zhao et.al. |
2509.20827 |
null |
2025-09-25 |
T2I-Diff: fMRI Signal Generation via Time-Frequency Image Transform and Classifier-Free Denoising Diffusion Models |
Hwa Hui Tew et.al. |
2509.20822 |
null |
2025-09-25 |
Diffusive Scaling limit of stochastic Box-Ball systems and PushTASEP |
David Keating et.al. |
2509.20779 |
null |
2025-09-25 |
CusEnhancer: A Zero-Shot Scene and Controllability Enhancement Method for Photo Customization via ResInversion |
Maoye Ren et.al. |
2509.20775 |
null |
2025-09-25 |
Measuring LLM Sensitivity in Transformer-based Tabular Data Synthesis |
Maria F. Davila R et.al. |
2509.20768 |
null |
2025-09-25 |
FreeInsert: Personalized Object Insertion with Geometric and Style Control |
Yuhong Zhang et.al. |
2509.20756 |
null |
2025-09-25 |
RAPTOR-GEN: RApid PosTeriOR GENerator for Bayesian Learning in Biomanufacturing |
Wandi Xu et.al. |
2509.20753 |
null |
2025-09-25 |
Parallel Thinking, Sequential Answering: Bridging NAR and AR for Efficient Reasoning |
Qihang Ai et.al. |
2509.20744 |
null |
2025-09-25 |
Quantum Algorithm for Subcellular Multiscale Reaction-Diffusion Systems |
Margot Lockwood et.al. |
2509.20668 |
null |
2025-09-25 |
Atomistic Insights into Cu/amorphous-Ta $_x$ N Interfacial Adhesion via Machine Learning Interatomic Potentials: Effects of Stoichiometry and Interface Construction |
Jeong Min Choi et.al. |
2509.20662 |
null |
2025-09-25 |
Scaling limit for Brownian motions on the $l$ -level Sierpinski gaskets: The fractal to Euclidean crossover |
David A. Croydon et.al. |
2509.20657 |
null |
2025-09-25 |
Stray light in 3D porous nanostructures of single crystalline copper film |
Yu-Seong Seo et.al. |
2509.20644 |
null |
2025-09-24 |
FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models |
Amin Karimi Monsefi et.al. |
2509.20624 |
null |
2025-09-24 |
MMG: Mutual Information Estimation via the MMSE Gap in Diffusion |
Longxuan Yu et.al. |
2509.20609 |
null |
2025-09-24 |
The X-ray Emission of NGC 5005: An Unobscured Low-Luminosity AGN with a Weakly Accreting Broad-Line Region |
Anna Trindade Falcão et.al. |
2509.20597 |
null |
2025-09-24 |
von Kármán–Howarth Similarity of Spatial Correlations and the Distribution of Correlation Lengths in Solar Photospheric Turbulence |
Rohit Chhiber et.al. |
2509.20590 |
null |
2025-09-24 |
Burning games on strong path products |
Sally Ambrose et.al. |
2509.20572 |
null |
2025-09-24 |
PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models |
Mingze Yuan et.al. |
2509.20570 |
null |
2025-09-24 |
A Hierarchical Adaptive Diffusion Model for Flexible Protein-Protein Docking |
Rujie Yin et.al. |
2509.20542 |
null |
2025-09-24 |
Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion |
Tianyong Yao et.al. |
2509.20538 |
null |
2025-09-24 |
InstructVTON: Optimal Auto-Masking and Natural-Language-Guided Interactive Style Control for Inpainting-Based Virtual Try-On |
Julien Han et.al. |
2509.20524 |
null |
2025-09-24 |
A Recovery Theory for Diffusion Priors: Deterministic Analysis of the Implicit Prior Algorithm |
Oscar Leong et.al. |
2509.20511 |
null |
2025-09-24 |
How two-dimensional are planet-disc interactions? II. Radiation hydrodynamics and suitable cooling prescriptions |
Alexandros Ziampras et.al. |
2509.20464 |
null |
2025-09-24 |
On the Hydrodynamic Approximation of Quantum Integrable Models – An Illustration via the repulsive Lieb-Liniger Model |
Friedrich Hübner et.al. |
2509.20445 |
null |
2025-09-24 |
pop-cosmos: Star formation over 12 Gyr from generative modelling of a deep infrared-selected galaxy catalogue |
Sinan Deger et.al. |
2509.20430 |
null |
2025-09-24 |
Seedream 4.0: Toward Next-generation Multimodal Image Generation |
Team Seedream et.al. |
2509.20427 |
null |
2025-09-24 |
Adversarial Defense in Cybersecurity: A Systematic Review of GANs for Threat Detection and Mitigation |
Tharcisse Ndayipfukamiye et.al. |
2509.20411 |
null |
2025-09-25 |
EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning |
Xuan Ju et.al. |
2509.20360 |
null |
2025-09-24 |
PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation |
Chen Wang et.al. |
2509.20358 |
null |
2025-09-26 |
mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies |
Remo Steiner et.al. |
2509.20297 |
null |
2025-09-26 |
FAST: Foreground-aware Diffusion with Accelerated Sampling Trajectory for Segmentation-oriented Anomaly Synthesis |
Xichen Xu et.al. |
2509.20295 |
null |
2025-09-24 |
Biologically Plausible Learning via Bidirectional Spike-Based Distillation |
Changze Lv et.al. |
2509.20284 |
null |
2025-09-24 |
On Brinkman flows with curvature-induced phase separation in binary mixtures |
Pierluigi Colli et.al. |
2509.20282 |
null |
2025-09-24 |
Turing instability and 2-D pattern formation in reaction-diffusion systems derived from kinetic theory |
Stefano Boccelli et.al. |
2509.20268 |
null |
2025-09-24 |
Radial Variations in Residence Time Distribution for Pipe Flows |
Etienne Boulais et.al. |
2509.20256 |
null |
2025-09-24 |
AnchDrive: Bootstrapping Diffusion Policies with Hybrid Trajectory Anchors for End-to-End Driving |
Jinhao Chai et.al. |
2509.20253 |
null |
2025-09-24 |
4D Driving Scene Generation With Stereo Forcing |
Hao Lu et.al. |
2509.20251 |
null |
2025-09-24 |
Universal Camouflage Attack on Vision-Language Models for Autonomous Driving |
Dehong Kong et.al. |
2509.20196 |
null |
2025-09-24 |
KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation |
Tianle Lyu et.al. |
2509.20128 |
null |
2025-09-24 |
Experiments on geostrophic convection: the role of the Prandtl number |
Hannah M. Clercx et.al. |
2509.20126 |
null |
2025-09-24 |
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving |
Pengxiang Li et.al. |
2509.20109 |
null |
2025-09-24 |
First-Extinction Law for Resampling Processes |
Matteo Benati et.al. |
2509.20101 |
null |
2025-09-24 |
Incomplete Data, Complete Dynamics: A Diffusion Approach |
Zihan Zhou et.al. |
2509.20098 |
null |
2025-09-24 |
Constrained Higher-Order Binary Optimization for Wireless Communications Systems Using Ising Machines |
Gan Zheng et.al. |
2509.20092 |
null |
2025-09-24 |
Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing |
Zizheng Yang et.al. |
2509.20091 |
null |
2025-09-24 |
Hierarchy of timescales in a disordered spin- $1/2$ XX ladder |
Kadir Çeven et.al. |
2509.20078 |
null |
2025-09-25 |
From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training |
Tianqiao Liu et.al. |
2509.20072 |
null |
2025-09-24 |
Resistive switching behaviors in vertically aligned MoS $_2$ films with Cu, Ag, and Au electrodes |
Shuei-De Huang et.al. |
2509.20061 |
null |
2025-09-24 |
Discrete Diffusion for Generative Modeling of Text-Aligned Speech Tokens |
Pin-Jui Ku et.al. |
2509.20060 |
null |
2025-09-25 |
Diffusion-Augmented Contrastive Learning: A Noise-Robust Encoder for Biosignal Representations |
Rami Zewail et.al. |
2509.20048 |
null |
2025-09-24 |
The role of photospheric magnetic flux diffusion in initiation of solar eruptions |
Xinkai Bian et.al. |
2509.20040 |
null |
2025-09-24 |
Development of a time calibration system for the KLM upgrade in the Belle II experiment |
Ziyu Liu et.al. |
2509.20029 |
null |
2025-09-24 |
Generative Adversarial Networks Applied for Privacy Preservation in Biometric-Based Authentication and Identification |
Lubos Mjachky et.al. |
2509.20024 |
null |
2025-09-24 |
CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion |
Chenhao Ji et.al. |
2509.19979 |
null |
2025-09-24 |
Learnable Sampler Distillation for Discrete Diffusion Models |
Feiyang Fu et.al. |
2509.19962 |
null |
2025-09-24 |
GS-RoadPatching: Inpainting Gaussians via 3D Searching and Placing for Driving Scenes |
Guo Chen et.al. |
2509.19937 |
null |
2025-09-25 |
GUIDE: A Diffusion-Based Autonomous Robot Exploration Framework Using Global Graph Inference |
Zijun Che et.al. |
2509.19916 |
null |
2025-09-24 |
Dynamically Optimal Unraveling Schemes for Simulating Lindblad Equations |
Yu Cao et.al. |
2509.19887 |
null |
2025-09-24 |
Adaptive User Interest Modeling via Conditioned Denoising Diffusion For Click-Through Rate Prediction |
Qihang Zhao et.al. |
2509.19876 |
null |
2025-09-24 |
FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models |
Xin Wang et.al. |
2509.19870 |
null |
2025-09-24 |
Parameter Estimation for Jump-Diffusion Stochastic Master Equations |
Weichao Liang et.al. |
2509.19862 |
null |
2025-09-24 |
Gauge invariance and hyperforce correlation theory for equilibrium fluid mixtures |
Joshua Matthes et.al. |
2509.19837 |
null |
2025-09-24 |
Boundary effect on asymptotic behaviour of solution to the hyperbolic-parabolic chemotaxis system |
Nangao Zhang et.al. |
2509.19828 |
null |
2025-09-24 |
An Efficient Conditional Score-based Filter for High Dimensional Nonlinear Filtering Problems |
Zhijun Zeng et.al. |
2509.19816 |
null |
2025-09-25 |
StrCGAN: A Generative Framework for Stellar Image Restoration |
Shantanusinh Parmar et.al. |
2509.19805 |
null |
2025-09-24 |
Colossal Effect of Nanopore Surface Ionic Charge on the Dynamics of Confined Water |
Armin Mozhdehei et.al. |
2509.19802 |
null |
2025-09-24 |
On The Cutoff Phenomenon For Dyson-Laguerre Processes |
Samuel Chan-Ashing et.al. |
2509.19798 |
null |
2025-09-24 |
Beyond Human Demonstrations: Diffusion-Based Reinforcement Learning to Generate Data for VLA Training |
Rushuai Yang et.al. |
2509.19752 |
null |
2025-09-24 |
Talking Head Generation via AU-Guided Landmark Prediction |
Shao-Yu Chang et.al. |
2509.19749 |
null |
2025-09-24 |
Controls on the ocean response to idealized Antarctic meltwater input |
Rory Basinski-Ferris et.al. |
2509.19730 |
null |
2025-09-24 |
PolGS: Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction |
Yufei Han et.al. |
2509.19726 |
null |
2025-09-24 |
TopoCut: Learning Multi-Step Cutting with Spectral Rewards and Discrete Diffusion Policies |
Liquan Wang et.al. |
2509.19712 |
null |
2025-09-24 |
Diffusion and Flow-based Copulas: Forgetting and Remembering Dependencies |
David Huk et.al. |
2509.19707 |
null |
2025-09-24 |
Diffusion-Based Impedance Learning for Contact-Rich Manipulation Tasks |
Noah Geiger et.al. |
2509.19696 |
null |
2025-09-24 |
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition |
Ling Lo et.al. |
2509.19690 |
null |
2025-09-24 |
Formal Safety Verification and Refinement for Generative Motion Planners via Certified Local Stabilization |
Devesh Nath et.al. |
2509.19688 |
null |
2025-09-24 |
Selective Classifier-free Guidance for Zero-shot Text-to-speech |
John Zheng et.al. |
2509.19668 |
null |
2025-09-24 |
Long-Range Dependence in Financial Markets: Empirical Evidence and Generative Modeling Challenges |
Yifan He et.al. |
2509.19663 |
null |
2025-09-24 |
Statistical Parameter Calibration with the Generalized Fluctuation Dissipation Theorem and Generative Modeling |
Ludovico T. Giorgini et.al. |
2509.19660 |
null |
2025-09-23 |
TIMED: Adversarial and Autoregressive Refinement of Diffusion-Based Time Series Generation |
MohammadReza EskandariNasab et.al. |
2509.19638 |
null |
2025-09-23 |
Connecting cosmologically decaying dark matter to neutrino physics |
Lea Fuß et.al. |
2509.19596 |
null |
2025-09-23 |
Synthesizing Artifact Dataset for Pixel-level Detection |
Dennis Menn et.al. |
2509.19589 |
null |
2025-09-23 |
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions |
Zongyue Li et.al. |
2509.19538 |
null |
2025-09-23 |
Real-Time Reinforcement Learning for Dynamic Tasks with a Parallel Soft Robot |
James Avtges et.al. |
2509.19525 |
null |
2025-09-23 |
Frame-based Equivariant Diffusion Models for 3D Molecular Generation |
Mohan Guo et.al. |
2509.19506 |
null |
2025-09-23 |
Hierarchical null controllability of a degenerate parabolic equation with nonlocal coefficient |
Juan Límaco et.al. |
2509.19505 |
null |
2025-09-23 |
Reaction/Diffusion Competition Drives Anomalous Relaxation of Vitrimers |
Makayla R. Branham-Ferrari et.al. |
2509.19496 |
null |
2025-09-23 |
ArtiFree: Detecting and Reducing Generative Artifacts in Diffusion-based Speech Enhancement |
Bhawana Chhaglani et.al. |
2509.19495 |
null |
2025-09-23 |
Anchored Langevin Algorithms |
Mert Gurbuzbalaban et.al. |
2509.19455 |
null |
2025-09-23 |
ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation |
Jason Chen et.al. |
2509.19454 |
null |
2025-09-23 |
Two-moment cosmic ray transport in RAMSES |
Joki Rosdahl et.al. |
2509.19447 |
null |
2025-09-23 |
CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching |
Chen Chen et.al. |
2509.19300 |
null |
2025-09-23 |
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation |
Sherwin Bahmani et.al. |
2509.19296 |
null |
2025-09-23 |
OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps |
Bingnan Li et.al. |
2509.19282 |
null |
2025-09-23 |
A Gradient Flow Approach to Solving Inverse Problems with Latent Diffusion Models |
Tim Y. J. Wang et.al. |
2509.19276 |
null |
2025-09-23 |
Reconstruction of a potential parameter in time-fractional diffusion problems via a Kohn–Vogelius type functional: Theoretical aspects |
Hamza Kahlaoui et.al. |
2509.19260 |
null |
2025-09-23 |
Adversarially-Refined VQ-GAN with Dense Motion Tokenization for Spatio-Temporal Heatmaps |
Gabriel Maldonado et.al. |
2509.19252 |
null |
2025-09-24 |
Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation |
Shufan Li et.al. |
2509.19244 |
null |
2025-09-23 |
Stability and Generalization of Adversarial Diffusion Training |
Hesam Hosseini et.al. |
2509.19234 |
null |
2025-09-23 |
Enabling Plant Phenotyping in Weedy Environments using Multi-Modal Imagery via Synthetic and Generated Training Data |
Earl Ranario et.al. |
2509.19208 |
null |
2025-09-23 |
Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions |
Ioanna Ntinou et.al. |
2509.19203 |
null |
2025-09-23 |
Detachment limited interlayer transport processes during SrTiO3 pulsed laser epitaxy |
Jeffrey G. Ulbrandt et.al. |
2509.19181 |
null |
2025-09-23 |
A noise-robust Monte Carlo method for electric field calculations in EMC3 |
William De Deyn et.al. |
2509.19178 |
null |
2025-09-23 |
2D implementation of Kinetic-diffusion Monte Carlo in Eiron |
Oskar Lappi et.al. |
2509.19140 |
null |
2025-09-23 |
FUNCanon: Learning Pose-Aware Action Primitives via Functional Object Canonicalization for Generalizable Robotic Manipulation |
Hongli Xu et.al. |
2509.19102 |
null |
2025-09-23 |
World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation |
Zhennan Jiang et.al. |
2509.19080 |
null |
2025-09-23 |
Diffusion Bridge Variational Inference for Deep Gaussian Processes |
Jian Xu et.al. |
2509.19078 |
null |
2025-09-23 |
WaveletGaussian: Wavelet-domain Diffusion for Sparse-view 3D Gaussian Object Reconstruction |
Hung Nguyen et.al. |
2509.19073 |
null |
2025-09-23 |
Dwarf Galaxies in the MATLAS Survey: Hubble Space Telescope Observations of Nuclear Star Clusters |
Mélina Poulain et.al. |
2509.19068 |
null |
2025-09-23 |
ManipForce: Force-Guided Policy Learning with Frequency-Aware Representation for Contact-Rich Manipulation |
Geonhyup Lee et.al. |
2509.19047 |
null |
2025-09-23 |
Latent Danger Zone: Distilling Unified Attention for Cross-Architecture Black-box Attacks |
Yang Li et.al. |
2509.19044 |
null |
2025-09-24 |
Improving Credit Card Fraud Detection through Transformer-Enhanced GAN Oversampling |
Kashaf Ul Emaan et.al. |
2509.19032 |
null |
2025-09-23 |
OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment |
Teng Xiao et.al. |
2509.19018 |
null |
2025-09-23 |
Pure Vision Language Action (VLA) Models: A Comprehensive Survey |
Dapeng Zhang et.al. |
2509.19012 |
null |
2025-09-23 |
Generative data augmentation for biliary tract detection on intraoperative images |
Cristina Iacono et.al. |
2509.18958 |
null |
2025-09-23 |
One-shot Embroidery Customization via Contrastive LoRA Modulation |
Jun Ma et.al. |
2509.18948 |
null |
2025-09-23 |
Soret and Dufour effects in hot and dense QCD matter |
Kamaljeet Singh et.al. |
2509.18946 |
null |
2025-09-23 |
1-bit RIS-aided Index Modulation with Quantum Annealing |
Ioannis Krikidis et.al. |
2509.18932 |
null |
2025-09-23 |
Direct Preference Optimization for Speech Autoregressive Diffusion Models |
Zhijun Liu et.al. |
2509.18928 |
null |
2025-09-23 |
Diffusive Stochastic Master Equation (SME) with dispersive qubit/cavity coupling |
Pierre Rouchon et.al. |
2509.18925 |
null |
2025-09-23 |
LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models |
Amirhesam Aghanouri et.al. |
2509.18917 |
null |
2025-09-23 |
RS3DBench: A Comprehensive Benchmark for 3D Spatial Perception in Remote Sensing |
Jiayu Wang et.al. |
2509.18897 |
null |
2025-09-23 |
How special are the dynamics of deep eutectic solvents? A Look at the Prototypical Case of Ethaline |
Mohammad Nadim Kamar et.al. |
2509.18896 |
null |
2025-09-23 |
Quantum-to-classical transition and H-theorem in surface diffusion |
E. E. Torres-Miyares et.al. |
2509.18844 |
null |
2025-09-23 |
Validation of a Reynolds-averaged numerical simulation environment to simulate high-pressure, auto-igniting hydrogen diffusion flames |
N. Diepstraten et.al. |
2509.18841 |
null |
2025-09-23 |
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters |
Pin-Yen Chiu et.al. |
2509.18831 |
null |
2025-09-23 |
Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation |
Yanzuo Lu et.al. |
2509.18824 |
null |
2025-09-23 |
Training-Free Data Assimilation with GenCast |
Thomas Savary et.al. |
2509.18811 |
null |
2025-09-23 |
Nonlocal degenerate parabolic hyperbolic equations on bounded domains. Part II: Existence |
Jørgen Endal et.al. |
2509.18797 |
null |
2025-09-23 |
Towards Application Aligned Synthetic Surgical Image Synthesis |
Danush Kumar Venkatesh et.al. |
2509.18796 |
null |
2025-09-23 |
FixingGS: Enhancing 3D Gaussian Splatting via Training-Free Score Distillation |
Zhaorui Wang et.al. |
2509.18759 |
null |
2025-09-23 |
Complexity of Activity Patterns in a Bio-Inspired Hopfield-Type Network in Different Topologies |
Marco Cafiso et.al. |
2509.18758 |
null |
2025-09-23 |
RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing Images |
Ke Li et.al. |
2509.18711 |
null |
2025-09-23 |
AGSwap: Overcoming Category Boundaries in Object Fusion via Adaptive Group Swapping |
Zedong Zhang et.al. |
2509.18699 |
null |
2025-09-23 |
FlowCrypt: Flow-Based Lightweight Encryption with Near-Lossless Recovery for Cloud Photo Privacy |
Xiaohui Yang et.al. |
2509.18696 |
null |
2025-09-23 |
Advances in Large Language Models for Medicine |
Zhiyu Kan et.al. |
2509.18690 |
null |
2025-09-23 |
Query-Centric Diffusion Policy for Generalizable Robotic Assembly |
Ziyi Xu et.al. |
2509.18686 |
null |
2025-09-23 |
3D Flow Diffusion Policy: Visuomotor Policy Learning via Generating Flow in 3D Space |
Sangjun Noh et.al. |
2509.18676 |
null |
2025-09-23 |
Global Existence of Solutions for A Class of Nonlocal Reaction-Diffusion Systems and Their Diffusive Limit |
Md Shah Alam et.al. |
2509.18645 |
null |
2025-09-23 |
Well-posedness of the Electron MHD with random diffusion |
Ruimeng Hu et.al. |
2509.18640 |
null |
2025-09-23 |
Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation |
Yuanhuiyi Lyu et.al. |
2509.18639 |
null |
2025-09-23 |
Prompt-Guided Dual Latent Steering for Inversion Problems |
Yichen Wu et.al. |
2509.18619 |
null |
2025-09-23 |
Flow marching for a generative PDE foundation model |
Zituo Chen et.al. |
2509.18611 |
null |
2025-09-23 |
SynSonic: Augmenting Sound Event Detection through Text-to-Audio Diffusion ControlNet and Effective Sample Filtering |
Jiarui Hai et.al. |
2509.18603 |
null |
2025-09-23 |
Training-Free Multi-Style Fusion Through Reference-Based Adaptive Modulation |
Xu Liu et.al. |
2509.18602 |
null |
2025-09-23 |
SSCM: A Spatial-Semantic Consistent Model for Multi-Contrast MRI Super-Resolution |
Xiaoman Wu et.al. |
2509.18593 |
null |
2025-09-23 |
Kernel Variational Inference Flow for Nonlinear Filtering Problem |
Weiye Gan et.al. |
2509.18589 |
null |
2025-09-23 |
DS-Diffusion: Data Style-Guided Diffusion Model for Time-Series Generation |
Mingchun Sun et.al. |
2509.18584 |
null |
2025-09-23 |
Active Ornstein-Uhlenbeck particle under stochastic resetting |
Uma Shankari et.al. |
2509.18515 |
null |
2025-09-23 |
Source-Free Domain Adaptive Semantic Segmentation of Remote Sensing Images with Diffusion-Guided Label Enrichment |
Wenjie Liu et.al. |
2509.18502 |
null |
2025-09-23 |
Differentiable Light Transport with Gaussian Surfels via Adapted Radiosity for Efficient Relighting and Geometry Reconstruction |
Kaiwen Jiang et.al. |
2509.18497 |
null |
2025-09-23 |
An Advection-Difusion Model Incorporating Investor Inertia for the Dynamics of Financial Asset Prices |
Diego et.al. |
2509.18488 |
null |
2025-09-22 |
Discrete-time diffusion-like models for speech synthesis |
Xiaozhou Tan et.al. |
2509.18470 |
null |
2025-09-22 |
Zero-Shot Visual Deepfake Detection: Can AI Predict and Prevent Fake Content Before It’s Created? |
Ayan Sar et.al. |
2509.18461 |
null |
2025-09-22 |
Learning Geometry-Aware Nonprehensile Pushing and Pulling with Dexterous Hands |
Yunshuang Li et.al. |
2509.18455 |
null |
2025-09-22 |
Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors |
Chang Liu et.al. |
2509.18433 |
null |
2025-09-22 |
Measurement Score-Based MRI Reconstruction with Automatic Coil Sensitivity Estimation |
Tingjun Liu et.al. |
2509.18402 |
null |
2025-09-22 |
Efficient Particle Acceleration in 2.5-Dimensional, Hybrid-Kinetic Simulations of Decaying, Supersonic, Plasma Turbulence |
Keyan Gootkin et.al. |
2509.18374 |
null |
2025-09-22 |
Galactic Center Gamma-Ray Emission in MHD Galaxy Formation Simulations with Full Cosmic Ray Spectra |
Isabel S. Sands et.al. |
2509.18351 |
null |
2025-09-22 |
Bootstrapping transport in the Drude-Kadanoff-Martin model |
Subham Dutta Chowdhury et.al. |
2509.18255 |
null |
2025-09-22 |
Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers |
Chaehyun Kim et.al. |
2509.18096 |
null |
2025-09-22 |
ComposeMe: Attribute-Specific Image Prompts for Controllable Human Image Generation |
Guocheng Gordon Qian et.al. |
2509.18092 |
null |
2025-09-22 |
RnGCam: High-speed video from rolling & global shutter measurements |
Kevin Tandi et.al. |
2509.18087 |
null |
2025-09-22 |
Spiffy: Multiplying Diffusion LLM Acceleration via Lossless Speculative Decoding |
Sudhanshu Agrawal et.al. |
2509.18085 |
null |
2025-09-22 |
RadarSFD: Single-Frame Diffusion with Pretrained Priors for Radar Point Clouds |
Bin Zhao et.al. |
2509.18068 |
null |
2025-09-22 |
Introduction to the relative Langlands program |
Raphaël Beuzart-Plessis et.al. |
2509.18062 |
null |
2025-09-22 |
Density convergence on Markov diffusion chaos via Stein’s method |
Thanh Dang et.al. |
2509.18045 |
null |
2025-09-22 |
Prepare Before You Act: Learning From Humans to Rearrange Initial States |
Yinlong Dai et.al. |
2509.18043 |
null |
2025-09-22 |
Microsecond-Pulsed Nanocalorimetry: A Scalable Approach for Ultrasensitive Heat Capacity Measurements |
Hugo Gómez-Torres et.al. |
2509.18019 |
null |
2025-09-23 |
StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models |
Haoxin Yang et.al. |
2509.17993 |
null |
2025-09-22 |
VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models |
Geonung Kim et.al. |
2509.17985 |
null |
2025-09-22 |
Cosmic inventory of the background fields of relativistic particles in the Universe |
Jonathan Biteau et.al. |
2509.17954 |
null |
2025-09-22 |
ComposableNav: Instruction-Following Navigation in Dynamic Environments via Composable Diffusion |
Zichao Hu et.al. |
2509.17941 |
null |
2025-09-22 |
MEF: A Systematic Evaluation Framework for Text-to-Image Models |
Xiaojing Dong et.al. |
2509.17907 |
null |
2025-09-23 |
Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark |
Siu Hang Ho et.al. |
2509.17894 |
null |
2025-09-22 |
Invariance of finite-dimensional realisations of Heath-Jarrow-Morton models under diffusion estimation |
Andreas Celary et.al. |
2509.17875 |
null |
2025-09-22 |
SocialTraj: Two-Stage Socially-Aware Trajectory Prediction for Autonomous Driving via Conditional Diffusion Model |
Xiao Zhou et.al. |
2509.17850 |
null |
2025-09-22 |
Semantic and Visual Crop-Guided Diffusion Models for Heterogeneous Tissue Synthesis in Histopathology |
Saghir Alfasly et.al. |
2509.17847 |
null |
2025-09-22 |
The origin of the intra-cluster light in The Three Hundred simulations |
A. Contreras-Santos et.al. |
2509.17831 |
null |
2025-09-22 |
Folding-unfolding transition of active polymer on the reconfiguration of bidirectional tangential active force |
Arindam Panda et.al. |
2509.17824 |
null |
2025-09-22 |
ContextFlow: Training-Free Video Object Editing via Adaptive Context Enrichment |
Yiyang Chen et.al. |
2509.17818 |
null |
2025-09-22 |
Solving time-fractional diffusion equations with Robin boundary conditions via fractional Hamiltonian boundary value methods |
Qian Luo et.al. |
2509.17793 |
null |
2025-09-22 |
Elucidating the Design Space of FP4 training |
Robert Hu et.al. |
2509.17791 |
null |
2025-09-22 |
Conditional Diffusion Models for CT Image Synthesis from CBCT: A Systematic Review |
Alzahra Altalib et.al. |
2509.17790 |
null |
2025-09-22 |
I2VWM: Robust Watermarking for Image to Video Generation |
Guanjie Wang et.al. |
2509.17773 |
null |
2025-09-22 |
Qwen3-Omni Technical Report |
Jin Xu et.al. |
2509.17765 |
null |
2025-09-22 |
Multi-Agent Amodal Completion: Direct Synthesis with Fine-Grained Semantic Guidance |
Hongxing Fan et.al. |
2509.17757 |
null |
2025-09-22 |
GAN-Based Multi-Microphone Spatial Target Speaker Extraction |
Shrishti Saha Shetu et.al. |
2509.17741 |
null |
2025-09-22 |
Non-equilibrium state during proton-deuteron exchange at a liquid-liquid interface |
Tillmann Buttersack et.al. |
2509.17724 |
null |
2025-09-22 |
DINOv3-Diffusion Policy: Self-Supervised Large Visual Model for Visuomotor Diffusion Policy Learning |
ThankGod Egbe et.al. |
2509.17684 |
null |
2025-09-23 |
Clothing agnostic Pre-inpainting Virtual Try-ON |
Sehyun Kim et.al. |
2509.17654 |
null |
2025-09-22 |
SISMA: Semantic Face Image Synthesis with Mamba |
Filippo Botti et.al. |
2509.17651 |
null |
2025-09-22 |
VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video |
Yu Liu et.al. |
2509.17647 |
null |
2025-09-22 |
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models |
Jinshu Chen et.al. |
2509.17627 |
null |
2025-09-22 |
Audio Super-Resolution with Latent Bridge Models |
Chang Li et.al. |
2509.17609 |
null |
2025-09-22 |
Measurements and scaling of X-ray total scattering from single crystals |
S. Gorfman et.al. |
2509.17605 |
null |
2025-09-22 |
Conditioning in Generative Quantum Denoising Diffusion Models |
Daniel Quinn et.al. |
2509.17569 |
null |
2025-09-22 |
Robust spectral preconditioning for high-Péclet number convection-diffusion |
Lukas Holbach et.al. |
2509.17531 |
null |
2025-09-22 |
Stable Video-Driven Portraits |
Mallikarjun B. R. et.al. |
2509.17476 |
null |
2025-09-22 |
CARINOX: Inference-time Scaling with Category-Aware Reward-based Initial Noise Optimization and Exploration |
Seyed Amir Kasaei et.al. |
2509.17458 |
null |
2025-09-22 |
Learning Dexterous Manipulation with Quantized Hand State |
Ying Feng et.al. |
2509.17450 |
null |
2025-09-22 |
Exploring Machine Learning Models for Physical Dose Calculation in Carbon Ion Therapy Using Heterogeneous Imaging Data - A Proof of Concept Study |
Miriam Schwarze et.al. |
2509.17433 |
null |
2025-09-22 |
Single-Image Depth from Defocus with Coded Aperture and Diffusion Posterior Sampling |
Hodaka Kawachi et.al. |
2509.17427 |
null |
2025-09-22 |
Diff-GNSS: Diffusion-based Pseudorange Error Estimation |
Jiaqi Zhu et.al. |
2509.17397 |
null |
2025-09-22 |
The Asymptotic Analysis of Some PDE and Steklov Eigenvalue Problems with Partially Reactive Patches in 3-D |
Denis S. Grebenkov et.al. |
2509.17394 |
null |
2025-09-22 |
Magnetically Enhanced Thermoelectric Effect Driven by Martensitic Transformation in the Weak Itinerant Ferromagnet Co $_2$ NbSn |
Takumi Kihara et.al. |
2509.17378 |
null |
2025-09-22 |
Volume Density Mapper: 3D Density Reconstruction Algorithm for Molecular Clouds |
Guang-Xing Li et.al. |
2509.17369 |
null |
2025-09-22 |
SeqUDA-Rec: Sequential User Behavior Enhanced Recommendation via Global Unsupervised Data Augmentation for Personalized Content Marketing |
Ruihan Luo et.al. |
2509.17361 |
null |
2025-09-22 |
DiffQ: Unified Parameter Initialization for Variational Quantum Algorithms via Diffusion Models |
Chi Zhang et.al. |
2509.17324 |
null |
2025-09-22 |
GraphWeave: Interpretable and Robust Graph Generation via Random Walk Trajectories |
Rahul Nandakumar et.al. |
2509.17291 |
null |
2025-09-21 |
Graph Signal Generative Diffusion Models |
Yigit Berkay Uslu et.al. |
2509.17250 |
null |
2025-09-21 |
Scalable Multi Agent Diffusion Policies for Coverage Control |
Frederic Vatnsdal et.al. |
2509.17244 |
null |
2025-09-21 |
DT-NeRF: A Diffusion and Transformer-Based Optimization Approach for Neural Radiance Fields in 3D Reconstruction |
Bo Liu et.al. |
2509.17232 |
null |
2025-09-21 |
Virtual Consistency for Audio Editing |
Matthieu Cervera et.al. |
2509.17219 |
null |
2025-09-21 |
Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation |
Gunner Stone et.al. |
2509.17206 |
null |
2025-09-21 |
Conditional Policy Generator for Dynamic Constraint Satisfaction and Optimization |
Wook Lee et.al. |
2509.17205 |
null |
2025-09-21 |
Echo-Path: Pathology-Conditioned Echo Video Generation |
Kabir Hamzah Muhammad et.al. |
2509.17190 |
null |
2025-09-21 |
Towards a unified turbulence model through multi-objective learning |
Zhuo-Ran Liu et.al. |
2509.17189 |
null |
2025-09-21 |
Ambiguous Medical Image Segmentation Using Diffusion Schrödinger Bridge |
Lalith Bharadwaj Baru et.al. |
2509.17187 |
null |
2025-09-21 |
SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction |
Djamel Eddine Boukhari et.al. |
2509.17172 |
null |
2025-09-21 |
Criticality of a stochastic modern Hopfield network model with exponential interaction function |
Marco Cafiso et.al. |
2509.17152 |
null |
2025-09-21 |
Stencil: Subject-Driven Generation with Context Guidance |
Gordon Chen et.al. |
2509.17120 |
null |
2025-09-21 |
ScenGAN: Attention-Intensive Generative Model for Uncertainty-Aware Renewable Scenario Forecasting |
Yifei Wu et.al. |
2509.17119 |
null |
2025-09-21 |
$\texttt{DiffSyn}$ : A Generative Diffusion Approach to Materials Synthesis Planning |
Elton Pan et.al. |
2509.17094 |
null |
2025-09-21 |
AlignedGen: Aligning Style Across Generated Images |
Jiexuan Zhang et.al. |
2509.17088 |
null |
2025-09-21 |
CoPlanner: An Interactive Motion Planner with Contingency-Aware Diffusion for Autonomous Driving |
Ruiguo Zhong et.al. |
2509.17080 |
null |
2025-09-21 |
Global classical solutions to a two-dimensional chemotaxis-fluid system involving signal-dependent degenerate diffusion |
Yansheng Ma et.al. |
2509.17073 |
null |
2025-09-21 |
Intention-aware Hierarchical Diffusion Model for Long-term Trajectory Anomaly Detection |
Chen Wang et.al. |
2509.17068 |
null |
2025-09-21 |
Geodesic Prototype Matching via Diffusion Maps for Interpretable Fine-Grained Recognition |
Junhao Jia et.al. |
2509.17050 |
null |
2025-09-21 |
Boundary Feller-Dynkin processes associated with Laguerre processes and Pickrell diffusions |
Alexander I. Bufetov et.al. |
2509.17045 |
null |
2025-09-21 |
When Color-Space Decoupling Meets Diffusion for Adverse-Weather Image Restoration |
Wenxuan Fang et.al. |
2509.17024 |
null |
2025-09-21 |
Multiscale solution decomposition of nonlocal-in-time problems with application in numerical computation |
Mengmeng Liu et.al. |
2509.17020 |
null |
2025-09-21 |
DocIQ: A Benchmark Dataset and Feature Fusion Network for Document Image Quality Assessment |
Zhichao Ma et.al. |
2509.17012 |
null |
2025-09-21 |
Generalized Momenta-Based Koopman Formalism for Robust Control of Euler-Lagrangian Systems |
Rajpal Singh et.al. |
2509.17010 |
null |
2025-09-21 |
Radiation Mediated Shock and Planar Shock Breakout in the Presence of Atomic Transition Lines |
Jonathan Morag et.al. |
2509.16996 |
null |
2025-09-21 |
VCE: Safe Autoregressive Image Generation via Visual Contrast Exploitation |
Feng Han et.al. |
2509.16986 |
null |
2025-09-21 |
Ledrappier-Young entropy formula for $C^1$ diffeomorphisms with dominated splitting Part 1: Unstable entropy formula and invariance principle |
Shaobo Gan et.al. |
2509.16981 |
null |
2025-09-21 |
Penalizing Boundary Activation for Object Completeness in Diffusion Models |
Haoyang Xu et.al. |
2509.16968 |
null |
2025-09-21 |
SemanticGarment: Semantic-Controlled Generation and Editing of 3D Gaussian Garments |
Ruiyan Wang et.al. |
2509.16960 |
null |
2025-09-21 |
VidCLearn: A Continual Learning Approach for Text-to-Video Generation |
Luca Zanchetta et.al. |
2509.16956 |
null |
2025-09-21 |
Machine learning meets Singular Optics II: Single-pixel Detection of Structured Light |
Purnesh Singh Badavath et.al. |
2509.16946 |
null |
2025-09-21 |
Discrete Heat Kernels on Simplicial Complexes and Its Application to Functional Brain Networks |
Sixtus Dakurah et.al. |
2509.16908 |
null |
2025-09-21 |
PRISM: Precision-Recall Informed Data-Free Knowledge Distillation via Generative Diffusion |
Xuewan He et.al. |
2509.16897 |
null |
2025-09-21 |
A Mutil-conditional Diffusion Transformer for Versatile Seismic Wave Generation |
Longfei Duan et.al. |
2509.16874 |
null |
2025-09-21 |
$\mathtt{M^3VIR}$ : A Large-Scale Multi-Modality Multi-View Synthesized Benchmark Dataset for Image Restoration and Content Creation |
Yuanzhi Li et.al. |
2509.16873 |
null |
2025-09-21 |
HOGraspFlow: Exploring Vision-based Generative Grasp Synthesis with Hand-Object Priors and Taxonomy Awareness |
Yitian Shi et.al. |
2509.16871 |
null |
2025-09-21 |
PhysHDR: When Lighting Meets Materials and Scene Geometry in HDR Reconstruction |
Hrishav Bakul Barua et.al. |
2509.16869 |
null |
2025-09-20 |
DoubleGen: Debiased Generative Modeling of Counterfactuals |
Alex Luedtke et.al. |
2509.16842 |
null |
2025-09-20 |
Factorizing Diffusion Policies for Observation Modality Prioritization |
Omkar Patil et.al. |
2509.16830 |
null |
2025-09-20 |
DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images |
Ozgur Kara et.al. |
2509.16767 |
null |
2025-09-20 |
Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees |
Yuchen Liang et.al. |
2509.16756 |
null |
2025-09-20 |
HyPlaneHead: Rethinking Tri-plane-like Representations in Full-Head Image Synthesis |
Heyuan Li et.al. |
2509.16748 |
null |
2025-09-20 |
Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment |
Xin Lei Lin et.al. |
2509.16727 |
null |
2025-09-20 |
Animalbooth: multimodal feature enhancement for animal subject personalization |
Chen Liu et.al. |
2509.16702 |
null |
2025-09-20 |
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention |
Qiang Xiang et.al. |
2509.16691 |
null |
2025-09-20 |
Follow-Your-Emoji-Faster: Towards Efficient, Fine-Controllable, and Expressive Freestyle Portrait Animation |
Yue Ma et.al. |
2509.16630 |
null |
2025-09-20 |
Investigation of the Axe-shaped Radio Galaxy J1051+5523 with uGMRT |
Sudheesh T. P. et.al. |
2509.16624 |
null |
2025-09-20 |
Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing |
Mengqi Wang et.al. |
2509.16622 |
null |
2025-09-20 |
An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation |
Maurício do V. M. da Costa et.al. |
2509.16603 |
null |
2025-09-20 |
FakeChain: Exposing Shallow Cues in Multi-Step Deepfake Detection |
Minji Heo et.al. |
2509.16602 |
null |
2025-09-19 |
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer |
Yanghao Li et.al. |
2509.16197 |
null |
2025-09-19 |
AcT2I: Evaluating and Improving Action Depiction in Text-to-Image Models |
Vatsal Malaviya et.al. |
2509.16141 |
null |
2025-09-19 |
Dynamic Classifier-Free Diffusion Guidance via Online Feedback |
Pinelopi Papalampidi et.al. |
2509.16131 |
null |
2025-09-19 |
DiffusionNFT: Online Diffusion Reinforcement with Forward Process |
Kaiwen Zheng et.al. |
2509.16117 |
null |
2025-09-19 |
KRED: Korea Research Economic Database for Macroeconomic Research |
Changryong Baek et.al. |
2509.16115 |
null |
2025-09-19 |
PRISM: Probabilistic and Robust Inverse Solver with Measurement-Conditioned Diffusion Prior for Blind Inverse Problems |
Yuanyun Hu et.al. |
2509.16106 |
null |
2025-09-19 |
Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising |
Shen Cheng et.al. |
2509.16091 |
null |
2025-09-19 |
Generating Detailed Character Motion from Blocking Poses |
Purvi Goel et.al. |
2509.16064 |
null |
2025-09-19 |
Latent Conditioned Loco-Manipulation Using Motion Priors |
Maciej Stępień et.al. |
2509.16061 |
null |
2025-09-19 |
Compose by Focus: Scene Graph-based Atomic Skills |
Han Qi et.al. |
2509.16053 |
null |
2025-09-19 |
A Note on the formulation of the Neumann boundary condition for a nonlocal problem |
Antonio Luiz Pereira et.al. |
2509.16041 |
null |
2025-09-19 |
SLaM-DiMM: Shared Latent Modeling for Diffusion Based Missing Modality Synthesis in MRI |
Bhavesh Sandbhor et.al. |
2509.16019 |
null |
2025-09-19 |
DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching |
Meng Yang et.al. |
2509.16017 |
null |
2025-09-19 |
Going with the Flow: Solving for Symmetry-Driven PDE dynamics with Physics-informed Neural Networks |
Michail Kavousanakis et.al. |
2509.15963 |
null |
2025-09-19 |
Structured Information for Improving Spatial Relationships in Text-to-Image Generation |
Sander Schildermans et.al. |
2509.15962 |
null |
2025-09-19 |
Optimal Experimental Design of a Moving Sensor for Linear Bayesian Inverse Problems |
Nicole Aretz et.al. |
2509.15961 |
null |
2025-09-19 |
Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement |
Gang Yang et.al. |
2509.15952 |
null |
2025-09-19 |
UniTac2Pose: A Unified Approach Learned in Simulation for Category-level Visuotactile In-hand Pose Estimation |
Mingdong Wu et.al. |
2509.15934 |
null |
2025-09-19 |
Bayesian Physics Informed Neural Networks for Reliable Transformer Prognostics |
Ibai Ramirez et.al. |
2509.15933 |
null |
2025-09-19 |
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search |
Zhiyu Mou et.al. |
2509.15927 |
null |
2025-09-19 |
An optimal-control framework for reaction diffusion systems with application to synthetic developmental biology |
Mohamed Amine Ouchdiri et.al. |
2509.15889 |
null |
2025-09-19 |
A Multidimensional Self-Adaptive Numerical Simulation Framework for Semiconductor Boltzmann Transport Equation |
Zeyu Zhang et.al. |
2509.15879 |
null |
2025-09-19 |
SAGE: Semantic-Aware Shared Sampling for Efficient Diffusion |
Haoran Zhao et.al. |
2509.15865 |
null |
2025-09-19 |
Observation of the Galactic Center in the Sub-MeV Gamma-Ray Band with an Electron-Tracking Compton Camera |
Tomonori Ikeda et.al. |
2509.15851 |
null |
2025-09-19 |
Turing Patterns in a Morphogenetic Model with Single Regulatory Function |
Mohamed Amine Ouchdiri et.al. |
2509.15829 |
null |
2025-09-19 |
QWD-GAN: Quality-aware Wavelet-driven GAN for Unsupervised Medical Microscopy Images Denoising |
Qijun Yang et.al. |
2509.15814 |
null |
2025-09-19 |
Polynomial approximation from diffused data: unisolvence and stability |
Ludovico Bruni Bruno et.al. |
2509.15813 |
null |
2025-09-19 |
CIDER: A Causal Cure for Brand-Obsessed Text-to-Image Models |
Fangjian Shen et.al. |
2509.15803 |
null |
2025-09-19 |
Monte Carlo Tree Diffusion with Multiple Experts for Protein Design |
Xuefeng Liu et.al. |
2509.15796 |
null |
2025-09-19 |
Absence of Radio Emission Reveals an Exceptionally Weak Explosion of the Putative Historical Supernova Pa 30 |
Yi-xuan Shao et.al. |
2509.15792 |
null |
2025-09-19 |
Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation |
Weimin Bai et.al. |
2509.15772 |
null |
2025-09-19 |
Learning to Optimize Capacity Planning in Semiconductor Manufacturing |
Philipp Andelfinger et.al. |
2509.15767 |
null |
2025-09-19 |
Utility-based Privacy Preserving Data Mining |
Qingfeng Zhou et.al. |
2509.15755 |
null |
2025-09-19 |
Discovering Top-k Periodic and High-Utility Patterns |
Qingfeng Zhou et.al. |
2509.15732 |
null |
2025-09-19 |
Search for cosmic-ray induced gamma-ray emission from local galaxy clusters using Fermi-LAT data |
Judit Pérez-Romero et.al. |
2509.15720 |
null |
2025-09-19 |
Imagination at Inference: Synthesizing In-Hand Views for Robust Visuomotor Policy Inference |
Haoran Ding et.al. |
2509.15717 |
null |
2025-09-19 |
Weak Error Estimates of Ergodic Approximations for Monotone Jump-diffusion SODEs |
Zhihui Liu et.al. |
2509.15698 |
null |
2025-09-19 |
Bose’s Probabilistic Interactions, Einstein’s Objections, and Their Legacy in Quantum Optics and Stochastic Mechanics |
Partha Ghose et.al. |
2509.15686 |
null |
2025-09-19 |
Spontaneous stochasticity in the Armstrong-Vicol passive scalar |
Wandrille Ruffenach et.al. |
2509.15683 |
null |
2025-09-19 |
Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model |
Sidra Hanif et.al. |
2509.15678 |
null |
2025-09-19 |
Diffusion of gravitactic chiral active Brownian particles in an asymmetric channel |
Narender Khatri et.al. |
2509.15630 |
null |
2025-09-19 |
MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection |
Jun-Wei Yeow et.al. |
2509.15599 |
null |
2025-09-19 |
Global Existence of Solutions of Nonlocal Geirer-Meinhardt Model and Effect of Nonlocal Operator in Pattern Formation |
Md Shah Alam et.al. |
2509.15598 |
null |
2025-09-19 |
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification |
Zinan Lin et.al. |
2509.15591 |
null |
2025-09-19 |
Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification |
Tian Lan et.al. |
2509.15553 |
null |
2025-09-19 |
PolyJuice Makes It Real: Black-Box, Universal Red Teaming for Synthetic Image Detectors |
Sepehr Dehdashtian et.al. |
2509.15551 |
null |
2025-09-19 |
Global Existence and Boundedness of Gray-Scott Model with Local and Nonlocal Diffusion |
Md Shah Alam et.al. |
2509.15535 |
null |
2025-09-19 |
Lynx: Towards High-Fidelity Personalized Video Generation |
Shen Sang et.al. |
2509.15496 |
null |
2025-09-18 |
Full Quantum Stack: Ket Platform |
Evandro Rosa et.al. |
2509.15484 |
null |
2025-09-18 |
OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data |
Björn Möller et.al. |
2509.15479 |
null |
2025-09-18 |
Efficient Multimodal Dataset Distillation via Generative Models |
Zhenghao Zhao et.al. |
2509.15472 |
null |
2025-09-18 |
$ν$ SpaceSim: A Comprehensive Simulation Package for Modeling the Measurement of Cosmic Neutrinos using the Earth as the Neutrino Target and Space-based Detectors |
Mary Hall Reno et.al. |
2509.15469 |
null |
2025-09-18 |
SERVAL: Surprisingly Effective Zero-Shot Visual Document Retrieval Powered by Large Vision and Language Models |
Thong Nguyen et.al. |
2509.15432 |
null |
2025-09-18 |
Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data |
Victor Chardès et.al. |
2509.15429 |
null |
2025-09-18 |
Thin-film boundary-layer diffusion of non-equilibrium flow to kinetically limited reactive surfaces via Damköhler thermochemistry tables |
Jeffrey D. Engerer et.al. |
2509.15427 |
null |
2025-09-18 |
Spectral Characterization of Wave Scattering at a Granular-Elastic Solid Interface: From Hyperbolic Wave Propagation to Near-Parabolic Diffusion |
Joshua R. Tempelman et.al. |
2509.15415 |
null |
2025-09-18 |
Causal Fingerprints of AI Generative Models |
Hui Xu et.al. |
2509.15406 |
null |
2025-09-18 |
Caught in the Cosmic Web: Evidence for Ram-Pressure Stripping of a Low-Mass Galaxy by the Cosmic Web |
Nicholas Luber et.al. |
2509.15405 |
null |
2025-09-18 |
RaceGAN: A Framework for Preserving Individuality while Converting Racial Information for Image-to-Image Translation |
Mst Tasnim Pervin et.al. |
2509.15391 |
null |
2025-09-18 |
MaskAttn-SDXL: Controllable Region-Level Text-To-Image Generation |
Yu Chang et.al. |
2509.15357 |
null |
2025-09-18 |
LowDiff: Efficient Diffusion Sampling with Low-Resolution Condition |
Jiuyi Xu et.al. |
2509.15342 |
null |
2025-09-18 |
WALLABY Pilot Survey: A gas-rich diffuse dwarf on the baryonic Tully Fisher relation |
Rebecca Dudley et.al. |
2509.15340 |
null |
2025-09-18 |
Kuramoto Orientation Diffusion Models |
Yue Song et.al. |
2509.15328 |
null |
2025-09-18 |
Anisotropic Cosmic Ray Transport resulting from Magnetic Mirroring and Resonant Curvature Scattering |
Jeremiah Lübke et.al. |
2509.15320 |
null |
2025-09-18 |
PRISM: Phase-enhanced Radial-based Image Signature Mapping framework for fingerprinting AI-generated images |
Emanuele Ricco et.al. |
2509.15270 |
null |
2025-09-18 |
Autoguided Online Data Curation for Diffusion Model Training |
Valeria Pais et.al. |
2509.15267 |
null |
2025-09-18 |
Lightweight and Accurate Multi-View Stereo with Confidence-Aware Diffusion Model |
Fangjinhua Wang et.al. |
2509.15220 |
null |
2025-09-18 |
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation |
Yuming Jiang et.al. |
2509.15212 |
null |
2025-09-18 |
Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning |
Yeongbin Seo et.al. |
2509.15188 |
null |
2025-09-18 |
Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation |
Xiaoyu Yue et.al. |
2509.15185 |
null |
2025-09-18 |
Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models |
Muhammad Ahmed Mohsin et.al. |
2509.15182 |
null |
2025-09-18 |
A Race Bias Free Face Aging Model for Reliable Kinship Verification |
Ali Nazari et.al. |
2509.15177 |
null |
2025-09-18 |
Unveiling TeV halos among unidentified extended TeV sources |
Michela Rigoselli et.al. |
2509.15168 |
null |
2025-09-18 |
AnoF-Diff: One-Step Diffusion-Based Anomaly Detection for Forceful Tool Use |
Yating Lin et.al. |
2509.15153 |
null |
2025-09-18 |
WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance |
Chenxi Song et.al. |
2509.15130 |
null |
2025-09-18 |
Learning Mechanistic Subtypes of Neurodegeneration with a Physics-Informed Variational Autoencoder Mixture Model |
Sanduni Pinnawala et.al. |
2509.15124 |
null |
2025-09-18 |
LOFAR 58 MHz Legacy Survey of the 3CRR Catalog |
J. M. Boxelaar et.al. |
2509.15115 |
null |
2025-09-18 |
Real-Time Streaming Mel Vocoding with Generative Flow Matching |
Simon Welker et.al. |
2509.15085 |
null |
2025-09-18 |
Forecasting and Visualizing Air Quality from Sky Images with Vision-Language Models |
Mohammad Saleh Vahdatpour et.al. |
2509.15076 |
null |
2025-09-19 |
Ask-to-Clarify: Resolving Instruction Ambiguity through Multi-turn Dialogue |
Xingyao Lin et.al. |
2509.15061 |
null |
2025-09-18 |
How long does it take an Elephant Random Walk to forget its training |
Zheng Fang et.al. |
2509.15049 |
null |
2025-09-18 |
AutoEdit: Automatic Hyperparameter Tuning for Image Editing |
Chau Pham et.al. |
2509.15031 |
null |
2025-09-19 |
Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation |
Vasiliki Ismiroglou et.al. |
2509.15011 |
null |
2025-09-19 |
SPATIALGEN: Layout-guided 3D Indoor Scene Generation |
Chuan Fang et.al. |
2509.14981 |
null |
2025-09-18 |
M4Diffuser: Multi-View Diffusion Policy with Manipulability-Aware Control for Robust Mobile Manipulation |
Ju Dong et.al. |
2509.14980 |
null |
2025-09-19 |
Stochastic Hamiltonian Type Jump Diffusion Systems with Countable Regimes: Strong Feller Property and Exponential Ergodicity |
Fubao Xi et.al. |
2509.14951 |
null |
2025-09-18 |
A Novel Task-Driven Diffusion-Based Policy with Affordance Learning for Generalizable Manipulation of Articulated Objects |
Hao Zhang et.al. |
2509.14939 |
null |
2025-09-18 |
Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance |
Francisco Messina et.al. |
2509.14934 |
null |
2025-09-18 |
Back to Ear: Perceptually Driven High Fidelity Music Reconstruction |
Kangdi Wang et.al. |
2509.14912 |
null |
2025-09-18 |
Finite Volumes for a dissipative free boundary problem |
Clément Cancès et.al. |
2509.14908 |
null |
2025-09-18 |
Constraining gamma-ray burst parameters with the first ultra-high energy neutrino event KM3-230213A |
KM3NeT Collaboration et.al. |
2509.14895 |
null |
2025-09-18 |
NeRF-based Visualization of 3D Cues Supporting Data-Driven Spacecraft Pose Estimation |
Antoine Legrand et.al. |
2509.14890 |
null |
2025-09-18 |
CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human |
Nan Sun et.al. |
2509.14889 |
null |
2025-09-18 |
Controllable Localized Face Anonymization Via Diffusion Inpainting |
Ali Salar et.al. |
2509.14866 |
null |
2025-09-19 |
MeanFlowSE: one-step generative speech enhancement via conditional mean flow |
Duojia Li et.al. |
2509.14858 |
null |
2025-09-18 |
A class of flexible and efficient partitioned Runge-Kutta-Chebyshev methods for some time-dependent partial differential equations |
Xiao Tang et.al. |
2509.14847 |
null |
2025-09-18 |
[Re] Improving Interpretation Faithfulness for Vision Transformers |
Izabela Kurek et.al. |
2509.14846 |
null |
2025-09-18 |
Diffusion-Based Scenario Tree Generation for Multivariate Time Series Prediction and Multistage Stochastic Optimization |
Stelios Zarifis et.al. |
2509.14832 |
null |
2025-09-18 |
Spectral survey of the diffuse gas toward BL Lac in the Q band |
Maryvonne Gerin et.al. |
2509.14822 |
null |
2025-09-18 |
Acoustic Simulation Framework for Multi-channel Replay Speech Detection |
Michael Neri et.al. |
2509.14789 |
null |
2025-09-18 |
MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis |
Keyu An et.al. |
2509.14784 |
null |
2025-09-18 |
Radiology Report Conditional 3D CT Generation with Multi Encoder Latent diffusion Model |
Sina Amirrajab et.al. |
2509.14780 |
null |
2025-09-18 |
Dataset Distillation for Super-Resolution without Class Labels and Pre-trained Models |
Sunwoo Cho et.al. |
2509.14777 |
null |
2025-09-18 |
Diffuse emission from stochastic sources |
Anton Stall et.al. |
2509.14776 |
null |
2025-09-18 |
UMind: A Unified Multitask Network for Zero-Shot M/EEG Visual Decoding |
Chengjian Xu et.al. |
2509.14772 |
null |
2025-09-18 |
Hydrodynamic Attraction and Hindered Diffusion Govern First-passage Times of Swimming Microorganisms |
Yanis Baouche et.al. |
2509.14765 |
null |
2025-09-18 |
Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks |
Ahmed Sheta et.al. |
2509.14755 |
null |
2025-09-18 |
Chain-of-Thought Re-ranking for Image Retrieval Tasks |
Shangrong Wu et.al. |
2509.14746 |
null |
2025-09-18 |
UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets |
Pengyu Wang et.al. |
2509.14738 |
null |
2025-09-18 |
Towards Pre-trained Graph Condensation via Optimal Transport |
Yeyu Yan et.al. |
2509.14722 |
null |
2025-09-18 |
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images |
Kazuma Nagata et.al. |
2509.14685 |
null |
2025-09-18 |
Enhancing Situational Awareness in Wearable Audio Devices Using a Lightweight Sound Event Localization and Detection System |
Jun-Wei Yeow et.al. |
2509.14650 |
null |
2025-09-18 |
On the algebraic stretching dynamics of variable-density mixing in shock-bubble interaction |
Xu Han et.al. |
2509.14607 |
null |
2025-09-18 |
DICE: Diffusion Consensus Equilibrium for Sparse-view CT Reconstruction |
Leon Suarez-Rodriguez et.al. |
2509.14566 |
null |
2025-09-18 |
DiffVL: Diffusion-Based Visual Localization on 2D Maps via BEV-Conditioned GPS Denoising |
Li Gao et.al. |
2509.14565 |
null |
2025-09-18 |
Adaptive and Iterative Point Cloud Denoising with Score-Based Diffusion Model |
Zhaonan Wang et.al. |
2509.14560 |
null |
2025-09-18 |
Radiolunadiff: Estimation of wireless network signal strength in lunar terrain |
Paolo Torrado et.al. |
2509.14559 |
null |
2025-09-18 |
Event-LAB: Towards Standardized Evaluation of Neuromorphic Localization Methods |
Adam D. Hines et.al. |
2509.14516 |
null |
2025-09-18 |
A Time-Inconsistent Stochastic Optimal Control Problem in an Infinite Time Horizon |
Qingmeng Wei et.al. |
2509.14495 |
null |
2025-09-17 |
Error analysis of a fully discrete structure-preserving finite element scheme for a diffuse-interface model of tumour growth |
Agus L. Soenjaya et.al. |
2509.14486 |
null |
2025-09-17 |
AToken: A Unified Tokenizer for Vision |
Jiasen Lu et.al. |
2509.14476 |
null |
2025-09-17 |
Keywords are not always the key: A metadata field analysis for natural language search on open data portals |
Lisa-Yao Gan et.al. |
2509.14457 |
null |
2025-09-17 |
On the equivalence and optimality of transformations of diffusive systems |
Davide Gabrielli et.al. |
2509.14450 |
null |
2025-09-17 |
Diffusion-Based Unsupervised Audio-Visual Speech Separation in Noisy Environments with Noise Prior |
Yochai Yemini et.al. |
2509.14379 |
null |
2025-09-17 |
Electricity in international comparison – Future technologies in power generation |
Axel Kleidon et.al. |
2509.14365 |
null |
2025-09-17 |
DreamControl: Human-Inspired Whole-Body Humanoid Control for Scene Interaction via Guided Diffusion |
Dvij Kalaria et.al. |
2509.14353 |
null |
2025-09-17 |
Enhanced Radio Emission Between a Galaxy Cluster Pair |
Andrea Botteon et.al. |
2509.14348 |
null |
2025-09-17 |
Dichotomy in Long-Lived Radio Emission from Tidal Disruption Events AT 2020zso and AT 2021sdu: Multi-Component Outflows vs. Host Contamination |
Collin T. Christy et.al. |
2509.14317 |
null |
2025-09-17 |
FlowDrive: Energy Flow Field for End-to-End Autonomous Driving |
Hao Jiang et.al. |
2509.14303 |
null |
2025-09-17 |
D4PM: A Dual-branch Driven Denoising Diffusion Probabilistic Model with Joint Posterior Diffusion Sampling for EEG Artifacts Removal |
Feixue Shao et.al. |
2509.14302 |
null |
2025-09-17 |
SpeechOp: Inference-Time Task Composition for Generative Speech Processing |
Justin Lovelace et.al. |
2509.14298 |
null |
2025-09-17 |
GenExam: A Multidisciplinary Text-to-Image Exam |
Zhaokai Wang et.al. |
2509.14232 |
null |
2025-09-17 |
Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamics |
Benjamin Sterling et.al. |
2509.14225 |
null |
2025-09-17 |
Looking into the faintEst WIth MUSE (LEWIS): Exploring the nature of ultra-diffuse galaxies in the Hydra-I cluster IV. A study of the Globular Cluster population in four UDGs |
Marco Mirabile et.al. |
2509.14206 |
null |
2025-09-17 |
Mass Transport, Turbulent Mixing, and Inflow in Black Hole Accretion |
George N. Wong et.al. |
2509.14202 |
null |
2025-09-16 |
\textsc{Gen2Real}: Towards Demo-Free Dexterous Manipulation by Harnessing Generated Video |
Kai Ye et.al. |
2509.14178 |
null |
2025-09-17 |
Reaction-diffusion models of invasive tree pest spread: quantifying the spread of oak processionary moth in the UK |
Jamie P. McKeown et.al. |
2509.14166 |
null |
2025-09-17 |
Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures |
Chi-Sheng Chen et.al. |
2509.14163 |
null |
2025-09-17 |
MIMIC-D: Multi-modal Imitation for MultI-agent Coordination with Decentralized Diffusion Policies |
Dayi Dong et.al. |
2509.14159 |
null |
2025-09-17 |
An Exploratory Study on Abstract Images and Visual Representations Learned from Them |
Haotian Li et.al. |
2509.14149 |
null |
2025-09-17 |
FlightDiffusion: Revolutionising Autonomous Drone Training with Diffusion Models Generating FPV Video |
Valerii Serpiva et.al. |
2509.14082 |
null |
2025-09-17 |
Dissipativity-Based Data-Driven Decentralized Control of Interconnected Systems |
Taiki Nakano et.al. |
2509.14047 |
null |
2025-09-17 |
Cross-diffusion limits in multispecies kinetic models |
Ansgar Jüngel et.al. |
2509.14046 |
null |
2025-09-17 |
A Pearl in the Shell: an ultra-compact dwarf within the tidal debris surrounding spiral galaxy NGC 7531 |
David Martínez-Delgado et.al. |
2509.14038 |
null |
2025-09-17 |
Improving cosmological reach of a gravitational wave observatory using Deep Loop Shaping |
Jonas Buchli et.al. |
2509.14016 |
null |
2025-09-17 |
RFM-Editing: Rectified Flow Matching for Text-guided Audio Editing |
Liting Gao et.al. |
2509.14003 |
null |
2025-09-17 |
Reconstruction of strong degeneracy region for parabolic equations and systems |
Piermarco Cannarsa et.al. |
2509.13962 |
null |
2025-09-17 |
Noise-Level Diffusion Guidance: Well Begun is Half Done |
Harvey Mannering et.al. |
2509.13936 |
null |
2025-09-17 |
Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification |
Wenkui Yang et.al. |
2509.13922 |
null |
2025-09-17 |
Recovering the Coupled Treatment of Redshift-Space Distortions and the Lightcone Effect after Diffuse Foreground Removal |
Jennifer Feron et.al. |
2509.13920 |
null |
2025-09-17 |
Inverse Design of Amorphous Materials with Targeted Properties |
Jonas A. Finkler et.al. |
2509.13916 |
null |
2025-09-17 |
Using Deep Learning Methods to Detect for Ultra-diffuse Galaxies in KiDS |
Hao Su et.al. |
2509.13910 |
null |
2025-09-17 |
A Tight Quantum Algorithm for Multiple Collision Search |
Xavier Bonnetain et.al. |
2509.13909 |
null |
2025-09-17 |
PhysicalAgent: Towards General Cognitive Robotics with Foundation World Models |
Artem Lykov et.al. |
2509.13903 |
null |
2025-09-17 |
Masked Diffusion Models as Energy Minimization |
Sitong Chen et.al. |
2509.13866 |
null |
2025-09-17 |
EDITS: Enhancing Dataset Distillation with Implicit Textual Semantics |
Qianxin Xia et.al. |
2509.13858 |
null |
2025-09-17 |
Surfing on chemical waves: a simple yet dynamically rich two-sphere responsive gel swimmer |
Joseph J. Webber et.al. |
2509.13850 |
null |
2025-09-17 |
SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation |
Jiayi Pan et.al. |
2509.13848 |
null |
2025-09-17 |
Polycyclic aromatic hydrocarbons destruction in star-forming regions across 42 nearby galaxies |
Oleg V. Egorov et.al. |
2509.13845 |
null |
2025-09-18 |
BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching |
Hanshuai Cui et.al. |
2509.13789 |
null |
2025-09-17 |
Generative Image Coding with Diffusion Prior |
Jianhui Chang et.al. |
2509.13768 |
null |
2025-09-17 |
Iterative Prompt Refinement for Safer Text-to-Image Generation |
Jinwoo Jeon et.al. |
2509.13760 |
null |
2025-09-17 |
Controllable-Continuous Color Editing in Diffusion Model via Color Mapping |
Yuqi Yang et.al. |
2509.13756 |
null |
2025-09-17 |
Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval |
Hao Yin et.al. |
2509.13754 |
null |
2025-09-17 |
Heavy Traffic Diffusion Limit for a Closed Queueing Network with Single-Server and Infinite-Server Stations |
Amir A. Alwan et.al. |
2509.13748 |
null |
2025-09-17 |
Ion-modulated structure, proton transfer, and capacitance in the Pt(111)/water electric double layer |
Xiaoyu Wang et.al. |
2509.13727 |
null |
2025-09-17 |
StyleProtect: Safeguarding Artistic Identity in Fine-tuned Diffusion Models |
Qiuyu Tang et.al. |
2509.13711 |
null |
2025-09-17 |
LLM-I: LLMs are Naturally Interleaved Multimodal Creators |
Zirun Guo et.al. |
2509.13642 |
null |
2025-09-17 |
Generative Consistency Models for Estimation of Kinetic Parametric Image Posteriors in Total-Body PET |
Yun Zhao et.al. |
2509.13614 |
null |
2025-09-16 |
Cross-Distribution Diffusion Priors-Driven Iterative Reconstruction for Sparse-View CT |
Haodong Li et.al. |
2509.13576 |
null |
2025-09-16 |
ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors |
Romain Hardy et.al. |
2509.13525 |
null |
2025-09-16 |
AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions |
Väinö Hatanpää et.al. |
2509.13523 |
null |
2025-09-16 |
DEFT-VTON: Efficient Virtual Try-On with Consistent Generalised H-Transform |
Xingzi Xu et.al. |
2509.13506 |
null |
2025-09-16 |
BiasMap: Leveraging Cross-Attentions to Discover and Mitigate Hidden Social Biases in Text-to-Image Generation |
Rajatsubhra Chakraborty et.al. |
2509.13496 |
null |
2025-09-16 |
The effect of parameter drift in the transport of magnetized plasma particles |
P. Haerter et.al. |
2509.13472 |
null |
2025-09-18 |
Unified Spatiotemporal Physics-Informed Learning (USPIL): A Framework for Modeling Complex Predator-Prey Dynamics |
Julian Evan Chrisnanto et.al. |
2509.13425 |
null |
2025-09-16 |
Modeling Cosmological Evolution of Jetted Seyfert Galaxies for z<10 |
Julianne Goddard et.al. |
2509.13418 |
null |
2025-09-16 |
SOFIA Polarization Spectrum of Three Star-Forming Clouds |
Erin G. Cox et.al. |
2509.13416 |
null |
2025-09-16 |
EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing |
Tianyu Chen et.al. |
2509.13399 |
null |
2025-09-16 |
Valuation of Exotic Options and Counterparty Games Based on Conditional Diffusion |
Helin Zhao et.al. |
2509.13374 |
null |
2025-09-16 |
Runaway electron interactions with whistler waves in tokamak plasmas: energy-dependent transport scaling |
Yashika Ghai et.al. |
2509.13271 |
null |
2025-09-16 |
Beyond Private or Public: Large Language Models as Quasi-Public Goods in the AI Economy |
Yukun Zhang et.al. |
2509.13265 |
null |
2025-09-16 |
Geometry, Energy and Sensitivity in Stochastic Proton Dynamics |
Veronika Chronholm et.al. |
2509.13223 |
null |
2025-09-17 |
The Gamma Expansion of the Level Two Large Deviation Rate Functional for Reversible Diffusion Processes |
Claudio Landim et.al. |
2509.13222 |
null |
2025-09-18 |
End4: End-to-end Denoising Diffusion for Diffusion-Based Inpainting Detection |
Fei Wang et.al. |
2509.13214 |
null |
2025-09-16 |
Global existence and decay of small solutions in a viscous half Klein-Gordon equation |
Louis Garénaux et.al. |
2509.13188 |
null |
2025-09-16 |
PDE-Based Bayesian Hierarchical Modeling for Event Spread, with Application to COVID-19 Infection |
Mengqi Cen et.al. |
2509.13174 |
null |
2025-09-17 |
TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving |
Jiawei Wang et.al. |
2509.13164 |
null |
2025-09-16 |
Enhancing Video Large Language Models with Structured Multi-Video Collaborative Reasoning (early version) |
Zhihao He et.al. |
2509.13161 |
null |
2025-09-16 |
MSDNet: Efficient 4D Radar Super-Resolution via Multi-Stage Distillation |
Minqing Huang et.al. |
2509.13149 |
null |
2025-09-16 |
Discovering Mathematical Equations with Diffusion Language Model |
Xiaoxu Han et.al. |
2509.13136 |
null |
2025-09-16 |
Quantifying CO2 Distribution at the Air-Water Interface – Spatiotemporally Resolved Measurements Using Tunable Diode Laser Spectroscopy |
Dongfang Zhao et.al. |
2509.13113 |
null |
2025-09-16 |
Quantitative 3D Morphology of Cellular H2/O2/N2 Flames on a Porous-Plug Burner: Spatially Resolved Measurements of Temperature and OH Radical |
Zeyu Yan et.al. |
2509.13106 |
null |
2025-09-16 |
MIA-EPT: Membership Inference Attack via Error Prediction for Tabular Data |
Eyal German et.al. |
2509.13046 |
null |
2025-09-16 |
ReTrack: Data Unlearning in Diffusion Models through Redirecting the Denoising Trajectory |
Qitan Shi et.al. |
2509.13007 |
null |
2025-09-16 |
Difference-Based Recovery for Modulo Sampling: Tightened Bounds and Robustness Guarantees |
Wenyi Yan et.al. |
2509.12971 |
null |
2025-09-16 |
Cosmic dust as a prerequisite for the formation of complex organic molecules in space? |
Alexey Potapov et.al. |
2509.12967 |
null |
2025-09-16 |
Mathematical Study of Reaction-Diffusion in Congested Crowd Motion |
Noureddine Igbida et.al. |
2509.12935 |
null |
2025-09-16 |
The Anatomy of Alignment: Decomposing Preference Optimization by Steering Sparse Features |
Jeremias Ferrao et.al. |
2509.12934 |
null |
2025-09-16 |
Non-parametric estimation of non-linear diffusion coefficient in parabolic SPDEs |
Martin Andersson et.al. |
2509.12921 |
null |
2025-09-16 |
Neural Network Localized Orthogonal Decomposition for Numerical Homogenization of Diffusion Operators with Random Coefficients |
Fabian Kröpfl et.al. |
2509.12896 |
null |
2025-09-16 |
Runge-Kutta Approximation and Decoupled Attention for Rectified Flow Inversion and Semantic Editing |
Weiming Chen et.al. |
2509.12888 |
null |
2025-09-16 |
Few to Big: Prototype Expansion Network via Diffusion Learner for Point Cloud Few-shot Semantic Segmentation |
Qianguang Zhao et.al. |
2509.12878 |
null |
2025-09-16 |
Bayesian Signal Separation via Plug-and-Play Diffusion-Within-Gibbs Sampling |
Yi Zhang et.al. |
2509.12857 |
null |
2025-09-16 |
Benchmarking thermostat algorithms in molecular dynamics simulations of a binary Lennard-Jones glass-former model |
Kumpei Shiraishi et.al. |
2509.12837 |
null |
2025-09-16 |
Pressure dependent structure of neat liquid methanol, CH3OH: molecular dynamics simulations with various united atom type potentials |
Imre Bakó et.al. |
2509.12834 |
null |
2025-09-16 |
A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis |
Javeria Amir et.al. |
2509.12831 |
null |
2025-09-17 |
DiffHash: Text-Guided Targeted Attack via Diffusion Models against Deep Hashing Image Retrieval |
Zechao Liu et.al. |
2509.12824 |
null |
2025-09-16 |
A Pressure-Based Diffusion Model for Influence Maximization on Social Networks |
Curt Stutsman et.al. |
2509.12822 |
null |
2025-09-16 |
A Statistical Benchmark for Diffusion Posterior Sampling Algorithms |
Martin Zach et.al. |
2509.12821 |
null |
2025-09-16 |
Double Helix Diffusion for Cross-Domain Anomaly Image Generation |
Linchun Wu et.al. |
2509.12787 |
null |
2025-09-18 |
A-TDOM: Active TDOM via On-the-Fly 3DGS |
Yiwei Xu et.al. |
2509.12759 |
null |
2025-09-16 |
What Makes a Good Generated Image? Investigating Human and Multimodal LLM Image Preference Alignment |
Rishab Parthasarathy et.al. |
2509.12750 |
null |
2025-09-16 |
$L^2$ -solutions to stochastic reaction-diffusion equations with superlinear drifts driven by space-time white noise^ |
Shijie Shang et.al. |
2509.12744 |
null |
2025-09-16 |
Generalizable Holographic Reconstruction via Amplitude-Only Diffusion Priors |
Jeongsol Kim et.al. |
2509.12728 |
null |
2025-09-16 |
SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation |
Jingdong Zhang et.al. |
2509.12721 |
null |
2025-09-16 |
Joint AoI and Handover Optimization in Space-Air-Ground Integrated Network |
Zifan Lang et.al. |
2509.12716 |
null |
2025-09-16 |
AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models |
Heng Zhang et.al. |
2509.12715 |
null |
2025-09-16 |
Morphological and Chemical Changes in Cd-free Colloidal QD-LEDs During Operation |
Ruiqi Zhang et.al. |
2509.12597 |
null |
2025-09-16 |
Anomalous statistics in the Langevin equation with fluctuating diffusivity: from Brownian yet non-Gaussian diffusion to anomalous diffusion and ergodicity breaking |
Takuma Akimoto et.al. |
2509.12571 |
null |
2025-09-16 |
Adaptive Sampling Scheduler |
Qi Wang et.al. |
2509.12569 |
null |
2025-09-16 |
Thermal Transport of GaN/Substrate Heterostructures under Non-Uniform Heat Source |
Ershuai Yin et.al. |
2509.12548 |
null |
2025-09-16 |
Topological Phononic Crystal on the Scale of Quasi-Ballistic Phonon Transport |
Keita Funayama et.al. |
2509.12528 |
null |
2025-09-15 |
Context-Aware Language Models for Forecasting Market Impact from Sequences of Financial News |
Ross Koval et.al. |
2509.12519 |
null |
2025-09-15 |
Image Tokenizer Needs Post-Training |
Kai Qiu et.al. |
2509.12474 |
null |
2025-09-15 |
Effects of temporal variations on wave speeds of bistable traveling waves for Lotka-Volterra competition systems |
Weiwei Ding et.al. |
2509.12472 |
null |
2025-09-15 |
PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization |
Dawei Xiang et.al. |
2509.12446 |
null |
2025-09-15 |
Diffusion-Based Generation and Imputation of Driving Scenarios from Limited Vehicle CAN Data |
Julian Ripper et.al. |
2509.12375 |
null |
2025-09-15 |
Brown Dwarf Formation Through Gravitational Collapse: Insights From 3D Numerical Simulations |
Adnan Ali Ahmad et.al. |
2509.12336 |
null |
2025-09-15 |
Radial Oscillations of Viscous Neutron Stars: Zero Diffusion Case |
Raissa F. P. Mendes et.al. |
2509.12330 |
null |
2025-09-15 |
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence |
Zixin Yin et.al. |
2509.12203 |
null |
2025-09-15 |
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling |
Yang Zhou et.al. |
2509.12201 |
null |
2025-09-15 |
Homogeneous soil moisture fields suppress Sahelian MCS frequency |
Ben Maybee et.al. |
2509.12118 |
null |
2025-09-15 |
Predicting Structural Relaxation in Supercooled Small Molecules via Molecular Dynamics Simulations and Microscopic Theory |
Anh D. Phan et.al. |
2509.12092 |
null |
2025-09-15 |
Progressive Flow-inspired Unfolding for Spectral Compressive Imaging |
Xiaodong Wang et.al. |
2509.12079 |
null |
2025-09-15 |
AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective |
Yuchen Deng et.al. |
2509.12052 |
null |
2025-09-15 |
Layout-Conditioned Autoregressive Text-to-Image Generation via Structured Masking |
Zirui Zheng et.al. |
2509.12046 |
null |
2025-09-15 |
Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness |
Zixuan Fu et.al. |
2509.12024 |
null |
2025-09-15 |
A shortcut through the macroscopic fluctuation theory: a generalised Fick law |
Théotim Berlioz et.al. |
2509.12017 |
null |
2025-09-15 |
Data-driven Smile Design: Personalized Dental Aesthetics Outcomes Using Deep Learning |
Marcus Lin et.al. |
2509.12001 |
null |
2025-09-15 |
Optimization for Massive 3D-RIS Deployment: A Generative Diffusion Model-Based Approach |
Kaining Wang et.al. |
2509.11969 |
null |
2025-09-15 |
Learning to Generate 4D LiDAR Sequences |
Ao Liang et.al. |
2509.11959 |
null |
2025-09-15 |
Adaptive least-squares space-time finite element methods for convection-diffusion problems |
Christian Köthe et.al. |
2509.11955 |
null |
2025-09-15 |
Sphere-GAN: a GAN-based Approach for Saliency Estimation in 360° Videos |
Mahmoud Z. A. Wahba et.al. |
2509.11948 |
null |
2025-09-15 |
The Filter Echo: A General Tool for Filter Visualisation |
Daniel Gaa et.al. |
2509.11932 |
null |
2025-09-15 |
VH-Diffuser: Variable Horizon Diffusion Planner for Time-Aware Goal-Conditioned Trajectory Planning |
Ruijia Liu et.al. |
2509.11930 |
null |
2025-09-15 |
A thermodynamically consistent model for bulk-surface viscous fluid mixtures: Model derivation and mathematical analysis |
Patrik Knopf et.al. |
2509.11925 |
null |
2025-09-15 |
A nonlinear model for long-range segregation |
Howen Chuah et.al. |
2509.11912 |
null |
2025-09-15 |
Enhanced Cosmic-Ray Cooling in AGN from Dark Matter Deep Inelastic Scattering |
Linjie Li et.al. |
2509.11906 |
null |
2025-09-15 |
Bayesian recalibration of flux scale factors in diffuse radio maps using low-resolution absolute radiometers |
Ainulnabilah Nasirudin et.al. |
2509.11894 |
null |
2025-09-15 |
Numerical analysis of fluid estimation for source terms in neutral particles simulation |
Zhirui Tang et.al. |
2509.11883 |
null |
2025-09-15 |
Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation |
Sofia Jamil et.al. |
2509.11878 |
null |
2025-09-15 |
Wasserstein error estimates between telegraph processes and Brownian motion |
Gerardo Barrera et.al. |
2509.11871 |
null |
2025-09-15 |
Tenma: Robust Cross-Embodiment Robot Manipulation with Diffusion Transformer |
Travis Davies et.al. |
2509.11865 |
null |
2025-09-15 |
Understanding variations of galactic energetic particles in the heliosphere: modelling and radiation hazard assessment |
Miguel Orcinha et.al. |
2509.11837 |
null |
2025-09-15 |
Rough stochastic filtering |
Fabio Bugini et.al. |
2509.11825 |
null |
2025-09-15 |
Stochastic restarting with multiple restart conditions |
Johannes Aspman et.al. |
2509.11809 |
null |
2025-09-15 |
Modes of Mechanical Guidance of Adhesion-Independent Cell Migration |
Hanna Luise Gertack et.al. |
2509.11801 |
null |
2025-09-15 |
Dense gas properties and star formation in M 82 |
Fei Li et.al. |
2509.11770 |
null |
2025-09-15 |
Igniting VLMs toward the Embodied Space |
Andy Zhai et.al. |
2509.11766 |
null |
2025-09-17 |
Removal Attack and Defense on AI-generated Content Latent-based Watermarking |
De Zhang Lee et.al. |
2509.11745 |
null |
2025-09-15 |
DRAG: Data Reconstruction Attack using Guided Diffusion |
Wa-Kin Lei et.al. |
2509.11724 |
null |
2025-09-15 |
Controlled growth of polar altermagnets via chemical vapor transport |
Hiraka Haruhiro et.al. |
2509.11716 |
null |
2025-09-15 |
Lie symmetry analysis and similarity reductions for the tempered-fractional Keller Segel system |
Ghorbanali Haghighatdoost et.al. |
2509.11690 |
null |
2025-09-15 |
DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition |
Lifei Hao et.al. |
2509.11661 |
null |
2025-09-15 |
IS-Diff: Improving Diffusion-Based Inpainting with Better Initial Seed |
Yongzhe Lyu et.al. |
2509.11638 |
null |
2025-09-15 |
SpeCa: Accelerating Diffusion Transformers with Speculative Feature Caching |
Jiacheng Liu et.al. |
2509.11628 |
null |
2025-09-15 |
Inference-stage Adaptation-projection Strategy Adapts Diffusion Policy to Cross-manipulators Scenarios |
Xiangtong Yao et.al. |
2509.11621 |
null |
2025-09-15 |
A Phase Field Formulation of Frictional Sliding Contact for 3D Fully Eulerian Fluid Structure Interactions |
Biswajeet Rath et.al. |
2509.11611 |
null |
2025-09-15 |
Scaling to Multimodal and Multichannel Heart Sound Classification: Fine-Tuning Wav2Vec 2.0 with Synthetic and Augmented Biosignals |
Milan Marocchi et.al. |
2509.11606 |
null |
2025-09-15 |
MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment |
Yanyun Pu et.al. |
2509.11589 |
null |
2025-09-15 |
Reconstructing High-fidelity Plasma Turbulence with Data-driven Tuning of Diffusion in Low Resolution Grids |
Kunpeng Li et.al. |
2509.11576 |
null |
2025-09-15 |
The Dynamics of the Profit Rate in an Extended Okishio Framework |
Jihyuan Liuh et.al. |
2509.11538 |
null |
2025-09-15 |
Learning Majority-to-Minority Transformations with MMD and Triplet Loss for Imbalanced Classification |
Suman Cha et.al. |
2509.11511 |
null |
2025-09-15 |
Collective Recourse for Generative Urban Visualizations |
Rashid Mushkani et.al. |
2509.11487 |
null |
2025-09-14 |
Improving LLMs’ Learning for Coreference Resolution |
Yujian Gan et.al. |
2509.11466 |
null |
2025-09-14 |
Diffusion of $^{210}\text{Pb}$ and $^{210}\text{Po}$ in Nylon |
P. Adhikari et.al. |
2509.11464 |
null |
2025-09-14 |
Fast Percolation Centrality Approximation with Importance Sampling |
Antonio Cruciani et.al. |
2509.11454 |
null |
2025-09-14 |
Mechanisms of isotope exchange between aqueous solutions and barite in low-temperature geochemical systems |
Chen Zhu et.al. |
2509.11428 |
null |
2025-09-14 |
IGA-LBM: Isogeometric lattice Boltzmann method |
Ye Ji et.al. |
2509.11427 |
null |
2025-09-14 |
Solving ill-conditioned polynomial equations using score-based priors with application to multi-target detection |
Rafi Beinhorn et.al. |
2509.11397 |
null |
2025-09-14 |
ActivePose: Active 6D Object Pose Estimation and Tracking for Robotic Manipulation |
Sheng Liu et.al. |
2509.11364 |
null |
2025-09-14 |
On the Escaping Efficiency of Distributed Adversarial Training Algorithms |
Ying Cao et.al. |
2509.11337 |
null |
2025-09-14 |
PINGS: Physics-Informed Neural Network for Fast Generative Sampling |
Achmad Ardani Prasha et.al. |
2509.11284 |
null |
2025-09-14 |
VideoAgent: Personalized Synthesis of Scientific Videos |
Xiao Liang et.al. |
2509.11253 |
null |
2025-09-14 |
Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation |
Chengze li et.al. |
2509.11252 |
null |
2025-09-14 |
Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation |
Yufei Tang et.al. |
2509.11213 |
null |
2025-09-14 |
StegOT: Trade-offs in Steganography via Optimal Transport |
Chengde Lin et.al. |
2509.11178 |
null |
2025-09-14 |
Cryptanalysis and design for a family of plaintext non-delayed chaotic ciphers |
Qianxue Wang et.al. |
2509.11158 |
null |
2025-09-14 |
Entropic active particle transport in pulsating 3D geometries |
Rahul Sinha et.al. |
2509.11147 |
null |
2025-09-14 |
Neural cellular automata: applications to biology and beyond classical AI |
Benedikt Hartl et.al. |
2509.11131 |
null |
2025-09-14 |
Filling the Gaps: A Multitask Hybrid Multiscale Generative Framework for Missing Modality in Remote Sensing Semantic Segmentation |
Nhi Kieu et.al. |
2509.11102 |
null |
2025-09-14 |
PanoLora: Bridging Perspective and Panoramic Video Generation with LoRA Adaptation |
Zeyu Dong et.al. |
2509.11092 |
null |
2025-09-14 |
An Advanced Convolutional Neural Network for Bearing Fault Diagnosis under Limited Data |
Shengke Sun et.al. |
2509.11053 |
null |
2025-09-14 |
Data-Efficient Ensemble Weather Forecasting with Diffusion Models |
Kevin Valencia et.al. |
2509.11047 |
null |
2025-09-13 |
General Decentralized Stochastic Optimal Control via Change of Measure: Applications to the Witsenhausen Counterexample |
Bhagyashri Telsang et.al. |
2509.11013 |
null |
2025-09-13 |
Approximation in an optimal design problem governed by the heat equation |
Kei Matsushima et.al. |
2509.11011 |
null |
2025-09-13 |
TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation |
Haoming Lu et.al. |
2509.10980 |
null |
2025-09-13 |
Development and Analysis of Chien-Physics-Informed Neural Networks for Singular Perturbation Problems |
Gautam Singh et.al. |
2509.10945 |
null |
2025-09-13 |
ToMA: Token Merge with Attention for Image Generation with Diffusion Models |
Wenbo Lu et.al. |
2509.10918 |
null |
2025-09-13 |
Robustifying Diffusion-Denoised Smoothing Against Covariate Shift |
Ali Hedayatnia et.al. |
2509.10913 |
null |
2025-09-13 |
Real-Time Super-Resolution Imaging System Based on Zero-Shot Learning for Infrared Non-Destructive Testing |
Pengfei Zhu et.al. |
2509.10902 |
null |
2025-09-13 |
Thermal diffusivity characterization of impacted composites using evaporative cryocooling excitation and inverse physics-informed neural networks |
Pengfei Zhu et.al. |
2509.10898 |
null |
2025-09-13 |
A novel IR-SRGAN assisted super-resolution evaluation of photothermal coherence tomography for impact damage in toughened thermoplastic CFRP laminates under room temperature and low temperature |
Pengfei Zhu et.al. |
2509.10894 |
null |
2025-09-13 |
Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production |
Liqian Feng et.al. |
2509.10845 |
null |
2025-09-13 |
Orbit-based structural decomposition and stellar population recovery for edge-on barred galaxies |
Yunpeng Jin et.al. |
2509.10832 |
null |
2025-09-13 |
Multi-Task Diffusion Approach For Prediction of Glioma Tumor Progression |
Aghiles Kebaili et.al. |
2509.10824 |
null |
2025-09-13 |
Hybrid Atomic Norm Sparse/Diffuse Channel Estimation |
Lei Lyu et.al. |
2509.10770 |
null |
2025-09-12 |
Using Drift Diffusion Model to Analyze Cars’ Lane Change Decisions behind Heavy Vehicles |
Nachuan Li et.al. |
2509.10733 |
null |
2025-09-12 |
The Rapid Arrival of Josiah Willard Gibbs’s Elementary Principles in Statistical Mechanics in European University Libraries |
Hector Giacomini et.al. |
2509.10732 |
null |
2025-09-12 |
Simultaneous determination of wave speed, diffusivity and nonlinearity in the Westervelt equation using complex time-periodic solutions |
Sebastian Acosta et.al. |
2509.10718 |
null |
2025-09-12 |
Maestro: Self-Improving Text-to-Image Generation via Agent Orchestration |
Xingchen Wan et.al. |
2509.10704 |
null |
2025-09-12 |
Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation |
Hao Zhang et.al. |
2509.10687 |
null |
2025-09-12 |
T2Bs: Text-to-Character Blendshapes via Video Generation |
Jiahao Luo et.al. |
2509.10678 |
null |
2025-09-12 |
Parallel and perpendicular diffusion of energetic particles in the near-Sun solar wind observed by Parker Solar Probe |
Nibuna Siranjeevi Madam Subashchandar et.al. |
2509.10648 |
null |
2025-09-12 |
Generalized Time-Reversal for Pulse Control in Diffusive Media |
Rohin E. McIntosh et.al. |
2509.10646 |
null |
2025-09-12 |
Radiation GRMHD Models of Accretion onto Stellar-Mass Black Holes: II. Super-Eddington Accretion |
Lizhong Zhang et.al. |
2509.10638 |
null |
2025-09-12 |
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis |
Tao Han et.al. |
2509.10441 |
null |
2025-09-12 |
Inpainting-Guided Policy Optimization for Diffusion Large Language Models |
Siyan Zhao et.al. |
2509.10396 |
null |
2025-09-12 |
Immunizing Images from Text to Image Editing via Adversarial Cross-Attention |
Matteo Trippodo et.al. |
2509.10359 |
null |
2025-09-12 |
GARD: Gamma-based Anatomical Restoration and Denoising for Retinal OCT |
Botond Fazekas et.al. |
2509.10341 |
null |
2025-09-12 |
Compute Only 16 Tokens in One Timestep: Accelerating Diffusion Transformers with Cluster-Driven Feature Caching |
Zhixin Zheng et.al. |
2509.10312 |
null |
2025-09-12 |
Morphogenetic mechanical metamaterials: Emerging tensor properties from self-organized structures |
Thomas Fromentèze et.al. |
2509.10277 |
null |
2025-09-12 |
MagicMirror: A Large-Scale Dataset and Benchmark for Fine-Grained Artifacts Assessment in Text-to-Image Generation |
Jia Wang et.al. |
2509.10260 |
null |
2025-09-12 |
Mask Consistency Regularization in Object Removal |
Hua Yuan et.al. |
2509.10259 |
null |
2025-09-12 |
Computational modeling of diffusive dynamics in a bouncer system with an irregular surface |
Luiz Antonio Barreiro et.al. |
2509.10253 |
null |
2025-09-12 |
Phase Transitions for Elephant Random Walks with Two memory Channels |
Krishanu Maulik et.al. |
2509.10225 |
null |
2025-09-12 |
Ionospheric Electron Heat Flow Modulates Planetary Ambipolar Electric Fields |
Liangliang Yuan et.al. |
2509.10218 |
null |
2025-09-12 |
Subordinators and time-space fractional diffusion equations |
Mohamed Majdoub et.al. |
2509.10203 |
null |
2025-09-12 |
P3D: Scalable Neural Surrogates for High-Resolution 3D Physics Simulations with Global Context |
Benjamin Holzschuh et.al. |
2509.10186 |
null |
2025-09-12 |
Convergence to equilibrium for fully discretizations of nonlocal Cahn-Hilliard equation |
Danni Zhang et.al. |
2509.10180 |
null |
2025-09-12 |
The unified gas kinetic wave-particle method for the neutron transport equation |
Guangwei Liu et.al. |
2509.10178 |
null |
2025-09-12 |
Scalable Training for Vector-Quantized Networks with 100% Codebook Utilization |
Yifan Chang et.al. |
2509.10140 |
null |
2025-09-12 |
Turing patterns on adaptive networks |
Marie Dorchain et.al. |
2509.10124 |
null |
2025-09-12 |
Realism Control One-step Diffusion for Real-World Image Super-Resolution |
Zongliang Wu et.al. |
2509.10122 |
null |
2025-09-12 |
Intrinsic disorder in the candidate quantum spin ice Pr $_2$Zr$_2$O$_7$ |
T. J. Hicken et.al. |
2509.10101 |
null |
2025-09-12 |
HHI-Assist: A Dataset and Benchmark of Human-Human Interaction in Physical Assistance Scenario |
Saeed Saadatnejad et.al. |
2509.10096 |
null |
2025-09-12 |
Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation |
Sung-Lin Tsai et.al. |
2509.10058 |
null |
2025-09-12 |
Approximate Graph Propagation Revisited: Dynamic Parameterized Queries, Tighter Bounds and Dynamic Updates |
Zhuowei Zhao et.al. |
2509.10036 |
null |
2025-09-12 |
Effects of harmonic magnetic field boundary conditions in mean-field solar dynamo |
V. V. Pipin et.al. |
2509.09985 |
null |
2025-09-12 |
Normalized solutions to a Choquard equation involving mixed local and nonlocal operators |
J. Giacomoni et.al. |
2509.09968 |
null |
2025-09-12 |
Limited Reference, Reliable Generation: A Two-Component Framework for Tabular Data Generation in Low-Data Regimes |
Mingxuan Jiang et.al. |
2509.09960 |
null |
2025-09-12 |
Chord: Chain of Rendering Decomposition for PBR Material Estimation from Generated Texture Images |
Zhi Ying et.al. |
2509.09952 |
null |
2025-09-12 |
Acoustic Scene Classification Using CNN-GRU Model Without Knowledge Distillation |
Ee-Leng Tan et.al. |
2509.09931 |
null |
2025-09-12 |
A streamline upwind/Petrov-Galerkin method for the magnetic advection-diffusion problem |
Haochen Li et.al. |
2509.09913 |
null |
2025-09-11 |
Accelerating 3D Photoacoustic Computed Tomography with End-to-End Physics-Aware Neural Operators |
Jiayun Wang et.al. |
2509.09894 |
null |
2025-09-11 |
PeV particle acceleration and non-thermal emission in the `minimalist’ model of the extended jets in W50/SS433 |
A. M. Bykov et.al. |
2509.09883 |
null |
2025-09-11 |
Automated Tuning for Diffusion Inverse Problem Solvers without Generative Prior Retraining |
Yaşar Utku Alçalar et.al. |
2509.09880 |
null |
2025-09-11 |
Privacy-Preserving Automated Rosacea Detection Based on Medically Inspired Region of Interest Selection |
Chengyu Yang et.al. |
2509.09844 |
null |
2025-09-11 |
A risk-sensitive ergodic singular stochastic control problem |
Justin Gwee et.al. |
2509.09835 |
null |
2025-09-11 |
DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration |
Yanru Huo et.al. |
2509.09748 |
null |
2025-09-11 |
FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark |
Rongyao Fang et.al. |
2509.09680 |
null |
2025-09-11 |
Locality in Image Diffusion Models Emerges from Data Statistics |
Artem Lukoianov et.al. |
2509.09672 |
null |
2025-09-11 |
Geometric Neural Distance Fields for Learning Human Motion Priors |
Zhengdi Yu et.al. |
2509.09667 |
null |
2025-09-12 |
DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech |
Ngoc-Son Nguyen et.al. |
2509.09631 |
null |
2025-09-11 |
I Know Who Clones Your Code: Interpretable Smart Contract Similarity Detection |
Zhenguang Liu et.al. |
2509.09630 |
null |
2025-09-11 |
Mechanistic Learning with Guided Diffusion Models to Predict Spatio-Temporal Brain Tumor Growth |
Daria Laslo et.al. |
2509.09610 |
null |
2025-09-11 |
Constraints on Ultra-heavy DM from TeV-PeV gamma-ray diffuse measurements |
Manuel Rocamora et.al. |
2509.09609 |
null |
2025-09-11 |
Iterative energy reduction Galerkin methods and variational adaptivity |
Pascal Heid et.al. |
2509.09600 |
null |
2025-09-11 |
Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis |
Yikang Ding et.al. |
2509.09595 |
null |
2025-09-11 |
Exactly Solvable Model of Random Walks with Stochastic Exchange |
José Julian Díaz-Pérez et.al. |
2509.09577 |
null |
2025-09-11 |
Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders |
Dohun Lee et.al. |
2509.09547 |
null |
2025-09-11 |
Generative Diffusion Contrastive Network for Multi-View Clustering |
Jian Zhu et.al. |
2509.09527 |
null |
2025-09-11 |
Mapping of discrete range modulated proton radiograph to water-equivalent path length using machine learning |
Atiq Ur Rahman et.al. |
2509.09514 |
null |
2025-09-11 |
Explainable AI for Accelerated Microstructure Imaging: A SHAP-Guided Protocol on the Connectome 2.0 scanner |
Quentin Uhl et.al. |
2509.09513 |
null |
2025-09-11 |
Mixture of Semantics Transmission for Generative AI-Enabled Semantic Communication Systems |
Junjie Ni et.al. |
2509.09499 |
null |
2025-09-11 |
SEDM: Scalable Self-Evolving Distributed Memory for Agents |
Haoran Xu et.al. |
2509.09498 |
null |
2025-09-11 |
Prompt Pirates Need a Map: Stealing Seeds helps Stealing Prompts |
Felix Mächtle et.al. |
2509.09488 |
null |
2025-09-11 |
Vorticity Packing Effects on Turbulent Transport in Decaying 2D Incompressible Navier-Stokes Fluids |
Snehanshu Maiti et.al. |
2509.09487 |
null |
2025-09-11 |
Comprehensive Mapping of Tracer Diffusivities Across Composition Space in Ternary NiAlTi and Quinary NiCoFeAlTi High-Entropy Alloy Using Diffusion Couple Experiments and Physics Informed Neural Network Inversion |
Ismail Kamil Worke et.al. |
2509.09486 |
null |
2025-09-11 |
Bath-induced stabilization of classical non-linear response in two dimensional infrared spectroscopy |
Rajesh Dutta et.al. |
2509.09476 |
null |
2025-09-11 |
Axion-Photon Conversion in FLRW with Primordial Magnetic Fields: Explaining the Radio Excess |
Setabuddin et.al. |
2509.09472 |
null |
2025-09-11 |
FlexiD-Fuse: Flexible number of inputs multi-modal medical image fusion based on diffusion model |
Yushen Xu et.al. |
2509.09456 |
null |
2025-09-11 |
Optimal Investment and Consumption in a Stochastic Factor Model |
Florian Gutekunst et.al. |
2509.09452 |
null |
2025-09-11 |
Composable Score-based Graph Diffusion Model for Multi-Conditional Molecular Generation |
Anjie Qiao et.al. |
2509.09451 |
null |
2025-09-11 |
Steady advection-diffusion in multiply-connected potential flows |
Kyle McKee et.al. |
2509.09444 |
null |
2025-09-11 |
Plug-and-play Diffusion Models for Image Compressive Sensing with Data Consistency Projection |
Xiaodong Wang et.al. |
2509.09365 |
null |
2025-09-11 |
Expressive Power of Deep Networks on Manifolds: Simultaneous Approximation |
Hanfei Zhou et.al. |
2509.09362 |
null |
2025-09-11 |
Turnpike properties for zero-sum stochastic linear quadratic differential games of Markovian regime switching system |
Xun Li et.al. |
2509.09358 |
null |
2025-09-11 |
Euler-type methods for Levy-driven McKean-Vlasov SDEs with super-linear coefficients: mean-square error analysis |
Jingtao Zhu et.al. |
2509.09302 |
null |
2025-09-11 |
A note on quantifying the contributions of incidence functions in spatio-temporal epidemic models |
Mohamed Mehdaoui et.al. |
2509.09301 |
null |
2025-09-11 |
Data Driven Discovery of Emergent Dynamics in Reaction Diffusion Systems from Sparse and Noisy Observations |
Saumitra Dwivedi et.al. |
2509.09278 |
null |
2025-09-11 |
Long time strong convergence analysis of one-step methods for McKean-Vlasov SDEs with superlinear growth coefficients |
Taiyuan Liu et.al. |
2509.09274 |
null |
2025-09-11 |
The role of communication delays in the optimal control of spatially invariant systems |
Luca Ballotta et.al. |
2509.09269 |
null |
2025-09-11 |
A novel method and dataset for depth-guided image deblurring from smartphone Lidar |
Antonio Montanaro et.al. |
2509.09241 |
null |
2025-09-11 |
MAPSS: Manifold-based Assessment of Perceptual Source Separation |
Amir Ivry et.al. |
2509.09212 |
null |
2025-09-11 |
ALL-PET: A Low-resource and Low-shot PET Foundation Model in the Projection Domain |
Bin Huang et.al. |
2509.09130 |
null |
2025-09-11 |
Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention |
Junhao Xing et.al. |
2509.09116 |
null |
2025-09-10 |
Integrating Anatomical Priors into a Causal Diffusion Model |
Binxu Li et.al. |
2509.09054 |
null |
2025-09-10 |
Noise-Activated Dopant Dynamics in Two-Dimensional Thermal Landscapes with Localized Cold Spots |
Mesfin Taye et.al. |
2509.09046 |
null |
2025-09-10 |
Cosmic Ray Spatial Distribution and the Galactic/Extragalactic Transition |
Paolo Lipari et.al. |
2509.09028 |
null |
2025-09-10 |
Complex dynamics and pattern formation in a diffusive epidemic model with an infection-dependent recovery rate |
Wael El Khateeb et.al. |
2509.09000 |
null |
2025-09-10 |
HARD: A Performance Portable Radiation Hydrodynamics Code based on FleCSI Framework |
Julien Loiseau et.al. |
2509.08971 |
null |
2025-09-10 |
Activity-driven clustering of jamming run-and-tumble particles: Exact three-body steady state by dynamical symmetry |
Leo Hahn et.al. |
2509.08945 |
null |
2025-09-10 |
Discovering Divergent Representations between Text-to-Image Models |
Lisa Dunlap et.al. |
2509.08940 |
null |
2025-09-10 |
Diffusion-Based Action Recognition Generalizes to Untrained Domains |
Rogerio Guimaraes et.al. |
2509.08908 |
null |
2025-09-10 |
Anomalously fast transport in non-integrable lattice gauge theories |
Devendra Singh Bhakuni et.al. |
2509.08889 |
null |
2025-09-10 |
RewardDance: Reward Scaling in Visual Generation |
Jie Wu et.al. |
2509.08826 |
null |
2025-09-10 |
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts |
Jenna Kang et.al. |
2509.08818 |
null |
2025-09-10 |
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles |
Eric Slyman et.al. |
2509.08777 |
null |
2025-09-11 |
Joint Model-based Model-free Diffusion for Planning with Constraints |
Wonsuhk Jung et.al. |
2509.08775 |
null |
2025-09-10 |
Sharp power concavity of two relevant free boundary problems of reaction-diffusion type |
Qingyou He et.al. |
2509.08768 |
null |
2025-09-10 |
Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction |
Vivek Oommen et.al. |
2509.08752 |
null |
2025-09-10 |
On the Lebesgue Constant of Extended-Domain Spectral Methods for Elliptic PDEs |
Po-Yi Wu et.al. |
2509.08745 |
null |
2025-09-10 |
Finite-temperature transport in the gapped spin-1/2 XXZ chain and one-dimensional lattice spinless fermion model |
J. M. P. Carmelo et.al. |
2509.08741 |
null |
2025-09-10 |
Data-driven generative simulation of SDEs using diffusion models |
Xuefeng Gao et.al. |
2509.08731 |
null |
2025-09-10 |
Accelerating Diffusion Transformer-Based Text-to-Speech with Transformer Layer Caching |
Siratish Sakpiboonchit et.al. |
2509.08696 |
null |
2025-09-10 |
The Small Magellanic Cloud through the lens of the James Webb Space Telescope : binaries and mass function within the galaxy outskirts |
M. V. Legnardi et.al. |
2509.08687 |
null |
2025-09-10 |
X-Part: high fidelity and structure coherent shape decomposition |
Xinhao Yan et.al. |
2509.08643 |
null |
2025-09-10 |
RoentMod: A Synthetic Chest X-Ray Modification Model to Identify and Correct Image Interpretation Model Shortcuts |
Lauren H. Cooke et.al. |
2509.08640 |
null |
2025-09-10 |
LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation |
Xuqin Wang et.al. |
2509.08628 |
null |
2025-09-10 |
Microstructural Control and Heat Transport Enhancement in Lanthanum Sulfate for Thermochemical Heat Storage |
Kunihiko Shizume et.al. |
2509.08585 |
null |
2025-09-10 |
EfficientIML: Efficient High-Resolution Image Manipulation Localization |
Jinhan Li et.al. |
2509.08583 |
null |
2025-09-10 |
Quenched and annealed heat kernel estimates for Brox’s diffusion |
Xin Chen et.al. |
2509.08559 |
null |
2025-09-10 |
PEHRT: A Common Pipeline for Harmonizing Electronic Health Record data for Translational Research |
Jessica Gronsbell et.al. |
2509.08553 |
null |
2025-09-10 |
System size and boundaries determine the patterning dynamics of attracting active particles |
Jan Rombouts et.al. |
2509.08533 |
null |
2025-09-10 |
RoboMatch: A Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Manipulation |
Hanyu Liu et.al. |
2509.08522 |
null |
2025-09-10 |
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning |
Liyang Chen et.al. |
2509.08519 |
null |
2025-09-10 |
Search for a photon peak from keV-scale dark matter annihilation with NuSTAR: Constraints on $\langle σv \rangle$ after 11 years of observations |
E. I. Zakharov et.al. |
2509.08506 |
null |
2025-09-10 |
Prompt-Driven Image Analysis with Multimodal Generative AI: Detection, Segmentation, Inpainting, and Interpretation |
Kaleem Ahmad et.al. |
2509.08489 |
null |
2025-09-10 |
Audio Deepfake Verification |
Li Wang et.al. |
2509.08476 |
null |
2025-09-10 |
Spherical Brownian Bridge Diffusion Models for Conditional Cortical Thickness Forecasting |
Ivan Stoyanov et.al. |
2509.08442 |
null |
2025-09-10 |
PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching |
Lei Ye et.al. |
2509.08435 |
null |
2025-09-10 |
One-dimensional particle clouds with elastic collisions |
Mikhail Menshikov et.al. |
2509.08430 |
null |
2025-09-10 |
LD-ViCE: Latent Diffusion Model for Video Counterfactual Explanations |
Payal Varshney et.al. |
2509.08422 |
null |
2025-09-10 |
The Critical 9365 Å Diffuse Interstellar Band and C $_{60}^{+}$ Association |
Daniel Majaess et.al. |
2509.08414 |
null |
2025-09-10 |
Protoplanetary disks around magnetized young stars with large-scale magnetic fields I: Steady-state solutions |
D. Steiner et.al. |
2509.08393 |
null |
2025-09-11 |
VRAE: Vertical Residual Autoencoder for License Plate Denoising and Deblurring |
Cuong Nguyen et.al. |
2509.08392 |
null |
2025-09-10 |
LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models |
Hirokazu Kameoka et.al. |
2509.08379 |
null |
2025-09-10 |
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video |
Xiao Li et.al. |
2509.08376 |
null |
2025-09-10 |
Stop using root-mean-square error as a precipitation target! |
Kieran M. R. Hunt et.al. |
2509.08369 |
null |
2025-09-10 |
Physics-Guided Rectified Flow for Low-light RAW Image Enhancement |
Juntai Zeng et.al. |
2509.08330 |
null |
2025-09-10 |
Trans-scale spin Seebeck effect in nanostructured bulk composites based on magnetic insulator |
Sang J. Park et.al. |
2509.08327 |
null |
2025-09-10 |
Controlling GaN nucleation via O $_2$ -plasma-perforated graphene masks on c-plane sapphire |
Su Young An et.al. |
2509.08275 |
null |
2025-09-10 |
Generative Quasi-Continuum Modeling of Confined Fluids at the Nanoscale |
Bugra Yalcin et.al. |
2509.08223 |
null |
2025-09-10 |
Moiré excitons in generalized Wigner crystals |
Jing-Yang You et.al. |
2509.08211 |
null |
2025-09-09 |
ArtifactGen: Benchmarking WGAN-GP vs Diffusion for Label-Aware EEG Artifact Synthesis |
Hritik Arasu et.al. |
2509.08188 |
null |
2025-09-09 |
Modeling of convective cells, turbulence, and transport induced by a radio-frequency antenna in the tokamak boundary plasma |
M. V. Umansky et.al. |
2509.08178 |
null |
2025-09-09 |
A Linear Pricing Mechanism for Load Management in Day-Ahead Retail Energy Markets |
Phillippe K. Phanivong et.al. |
2509.08166 |
null |
2025-09-09 |
Diffusion-Guided Multi-Arm Motion Planning |
Viraj Parimi et.al. |
2509.08160 |
null |
2025-09-09 |
Electronic Fluctuations and Ionic Dynamics in Molten Silver Iodide |
Harender S. Dhattarwal et.al. |
2509.08143 |
null |
2025-09-09 |
Joint calibration of the volatility surface and variance term structure |
Jiwook Yoo et.al. |
2509.08096 |
null |
2025-09-09 |
DDNet: A Unified Physics-Informed Deep Learning Framework for Semiconductor Device Modeling |
Roberto Riganti et.al. |
2509.08073 |
null |
2025-09-09 |
Discovery of a $z \sim 0.8$ Ultra Steep Spectrum Radio Halo in the MeerKAT-South Pole Telescope Survey |
Isaac S. Magolego et.al. |
2509.08062 |
null |
2025-09-09 |
Acceleration of Heavy Ions at Non-Relativistic Collisionless Shocks |
Damiano Caprioli et.al. |
2509.08061 |
null |
2025-09-09 |
Breaking Dark: Hunting Heavy Decaying Dark Matter with Tibet AS $_γ$ and LHAASO-KM2A |
Abhishek Dubey et.al. |
2509.08039 |
null |
2025-09-09 |
PyPAS – Python package for Positron Annihilation Spectroscopy Doppler Broadening Analysis |
Achiya Yosef Amrusi et.al. |
2509.08023 |
null |
2025-09-08 |
CardioComposer: Flexible and Compositional Anatomical Structure Generation with Disentangled Geometric Guidance |
Karim Kadry et.al. |
2509.08015 |
null |
2025-09-08 |
Validation of a CT-brain analysis tool for measuring global cortical atrophy in older patient cohorts |
Sukhdeep Bal et.al. |
2509.08012 |
null |
2025-09-09 |
LHAASO Galactic Plane $γ$ -rays Strongly Constrain Heavy Dark Matter |
Celine Boehm et.al. |
2509.07982 |
null |
2025-09-09 |
Edwards-Wilkinson limit for a stochastic advection-diffusion PDE |
Sotirios Kotitsas et.al. |
2509.07956 |
null |
2025-09-09 |
Feature Space Analysis by Guided Diffusion Model |
Kimiaki Shirahama et.al. |
2509.07936 |
null |
2025-09-09 |
ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion |
Ao Li et.al. |
2509.07920 |
null |
2025-09-09 |
Measurement of ion acceleration and diffusion in a laser-driven magnetized plasma |
J. T. Y. Chu et.al. |
2509.07880 |
null |
2025-09-09 |
Duality estimates for subdiffusion problems including time-fractional porous medium type equations |
Arlúcio Viana et.al. |
2509.07862 |
null |
2025-09-09 |
Convergence analysis for the Barrett-Garcke-Nurnberg method of transport type |
Genming Bai et.al. |
2509.07834 |
null |
2025-09-09 |
A Note on the failure of temporal regularity for stochastic PDEs |
Antonio Agresti et.al. |
2509.07803 |
null |
2025-09-09 |
Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey |
Minghan Li et.al. |
2509.07794 |
null |
2025-09-09 |
SN 2022xlp: The second-known well-observed, intermediate-luminosity Iax supernova |
D. Bánhidi et.al. |
2509.07717 |
null |
2025-09-09 |
A Generalisable Generative Model for Multi-Detector Calorimeter Simulation |
Piyush Raikwar et.al. |
2509.07700 |
null |
2025-09-09 |
Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity |
Sung Ju Lee et.al. |
2509.07647 |
null |
2025-09-09 |
An all-sky 3D dust map Based on Gaia and LAMOST |
Tao Wang et.al. |
2509.07640 |
null |
2025-09-10 |
LSMTCR: A Scalable Multi-Architecture Model for Epitope-Specific T Cell Receptor de novo Design |
Ruihao Zhang et.al. |
2509.07627 |
null |
2025-09-09 |
AgentX: Towards Orchestrating Robust Agentic Workflow Patterns with FaaS-hosted MCP Services |
Shiva Sai Krishna Anand Tokal et.al. |
2509.07595 |
null |
2025-09-09 |
Sorting of binary active-passive mixtures in designed microchannels |
Horacio Serna et.al. |
2509.07582 |
null |
2025-09-09 |
Atomic Layer Etching of Aluminum Nitride: Mechanistic Insights from First-Principles Studies of Chlorine Chemistry |
Sanjay Nayak et.al. |
2509.07554 |
null |
2025-09-09 |
PanoLAM: Large Avatar Model for Gaussian Full-Head Synthesis from One-shot Unposed Image |
Peng Li et.al. |
2509.07552 |
null |
2025-09-09 |
Two-dimensional fractional Brownian motion: Analysis in time and frequency domains |
Michał Balcerek et.al. |
2509.07537 |
null |
2025-09-09 |
Universal Few-Shot Spatial Control for Diffusion Models |
Kiet T. Nguyen et.al. |
2509.07530 |
null |
2025-09-09 |
Emergence of continuously varying critical exponents in coupled map lattice as an effect of quenched disorder |
Priyanka D. Bhoyar et.al. |
2509.07529 |
null |
2025-09-09 |
Target matching based generative model for speech enhancement |
Taihui Wang et.al. |
2509.07521 |
null |
2025-09-09 |
Magnetic Resonance Imaging Virtual Liver Biopsy Using Radiomics Analysis for the Assessment of Chronic Liver Disease |
Jiqing Huang et.al. |
2509.07516 |
null |
2025-09-09 |
LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors |
Wenshuo Gao et.al. |
2509.07484 |
null |
2025-09-09 |
Uncertainty in Hadronic Diffuse $γ$ -Ray Emission from the Temporal Stochasticity of Cosmic-Ray Sources |
Xing-Jian Lv et.al. |
2509.07481 |
null |
2025-09-09 |
ANYPORTAL: Zero-Shot Consistent Video Background Replacement |
Wenshuo Gao et.al. |
2509.07472 |
null |
2025-09-09 |
DepthVision: Robust Vision-Language Understanding through GAN-Based LiDAR-to-RGB Synthesis |
Sven Kirchner et.al. |
2509.07463 |
null |
2025-09-09 |
Unveiling Biological Models Through Turing Patterns |
Yuhan Li et.al. |
2509.07458 |
null |
2025-09-09 |
Node Position Estimation in Diffusion-Based Molecular Communications Using Multi-Layer Perceptron |
Sangjun Hwang et.al. |
2509.07441 |
null |
2025-09-09 |
GRASPion: an Open-Source, Programmable Brainbot for Active Matter Research |
F. Novkoski et.al. |
2509.07437 |
null |
2025-09-09 |
DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation |
Ze-Xin Yin et.al. |
2509.07435 |
null |
2025-09-11 |
Blow-up for a Nonlocal Diffusion Equation with Time Regularly Varying Nonlinearity and Forcing |
Rihab Ben Belgacem et.al. |
2509.07405 |
null |
2025-09-09 |
Time evolution of averaged limit shapes of random multiple Young diagrams |
Akihito Hora et.al. |
2509.07393 |
null |
2025-09-09 |
On the exponential convergence to equilibrium for ultrafast diffusion equations |
Yi C. Huang et.al. |
2509.07382 |
null |
2025-09-09 |
Knowledge Distillation Driven Semantic NOMA for Image Transmission with Diffusion Model |
Qifei Wang et.al. |
2509.07363 |
null |
2025-09-09 |
Distributed Frequency Control for Multi-Area Power Systems Considering Transient Frequency Safety |
Xiemin Mo et.al. |
2509.07345 |
null |
2025-09-09 |
SpecifyUI: Supporting Iterative UI Design Intent Expression through Structured Specifications and Generative AI |
Yunnong Chen et.al. |
2509.07334 |
null |
2025-09-09 |
Data-knowledge fusion driven frequency security assessment: A robust framework for renewable-dominated power grids |
Yurun Zhang et.al. |
2509.07320 |
null |
2025-09-08 |
Reconstruction Alignment Improves Unified Multimodal Models |
Ji Xie et.al. |
2509.07295 |
null |
2025-09-08 |
Breast Cancer Detection in Thermographic Images via Diffusion-Based Augmentation and Nonlinear Feature Fusion |
Sepehr Salem et.al. |
2509.07277 |
null |
2025-09-08 |
Hybrid Galam–Bass Model for Technology Innovation |
Giulia Rotundo et.al. |
2509.07275 |
null |
2025-09-08 |
Thermodynamic Irreversibility in Underdamped Brownian Motion with Spatial Temperature Gradients |
Mesfin Taye et.al. |
2509.07272 |
null |
2025-09-08 |
Extended Version: Market-Driven Equilibria for Distributed Solar Panel Investment |
Mehdi Davoudi et.al. |
2509.07203 |
null |
2025-09-08 |
Realism to Deception: Investigating Deepfake Detectors Against Face Enhancement |
Muhammad Saad Saeed et.al. |
2509.07178 |
null |
2025-09-08 |
Ultrathin oxide freestanding membranes with large-scale continuity and structural perfection |
Yuhao Hong et.al. |
2509.07176 |
null |
2025-09-08 |
Unveiling the Impact of Cosmic Rays on the Disc Sizes and Outflows from Dwarf Scales to Galaxy Groups |
Rebekka Bieri et.al. |
2509.07124 |
null |
2025-09-08 |
Indirect detection of boosted light scalar dark matter |
Arindam Basu et.al. |
2509.07110 |
null |
2025-09-08 |
Constraining Baryon Fractions in Galaxy Groups and Clusters with the First CHIME/FRB Outrigger |
Adam E. Lanman et.al. |
2509.07097 |
null |
2025-09-08 |
Automated Evaluation of Gender Bias Across 13 Large Multimodal Models |
Juan Manuel Contreras et.al. |
2509.07050 |
null |
2025-09-07 |
The Impact of Artificial Intelligence on Traditional Art Forms: A Disruption or Enhancement |
Viswa Chaitanya Marella et.al. |
2509.07029 |
null |
2025-09-10 |
Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models |
Jisung Hwang et.al. |
2509.07027 |
null |
2025-09-08 |
Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data |
Nithin Gopalakrishnan Nair et.al. |
2509.06950 |
null |
2025-09-08 |
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models |
Yinjie Wang et.al. |
2509.06949 |
null |
2025-09-09 |
Interleaving Reasoning for Better Text-to-Image Generation |
Wenxuan Huang et.al. |
2509.06945 |
null |
2025-09-09 |
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference |
Xiangwei Shen et.al. |
2509.06942 |
null |
2025-09-10 |
LLaDA-VLA: Vision Language Diffusion Action Models |
Yuqing Wen et.al. |
2509.06932 |
null |
2025-09-08 |
BIR-Adapter: A Low-Complexity Diffusion Model Adapter for Blind Image Restoration |
Cem Eteke et.al. |
2509.06904 |
null |
2025-09-08 |
Nanobot Algorithms for Treatment of Diffuse Cancer |
Noble Harasha et.al. |
2509.06893 |
null |
2025-09-08 |
Homogenisation of a Passive Scalar Transported by Locally Supported White Noise |
Federico Butori et.al. |
2509.06878 |
null |
2025-09-08 |
Infinite Interacting Brownian Motions and EVI Gradient Flows |
Kohei Suzuki et.al. |
2509.06869 |
null |
2025-09-08 |
A New Hybrid Model of Generative Adversarial Network and You Only Look Once Algorithm for Automatic License-Plate Recognition |
Behnoud Shafiezadeh et.al. |
2509.06868 |
null |
2025-09-08 |
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL |
Bhavya Agrawalla et.al. |
2509.06863 |
null |
2025-09-08 |
Stochastic modelling of cosmic-ray sources for Galactic diffuse emissions |
Anton Stall et.al. |
2509.06857 |
null |
2025-09-08 |
CRISP – Compliant ROS2 Controllers for Learning-Based Manipulation Policies and Teleoperation |
Daniel San José Pro et.al. |
2509.06819 |
null |
2025-09-08 |
UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward |
Yufeng Cheng et.al. |
2509.06818 |
null |
2025-09-08 |
Large eddy simulations in astrophysics |
Wolfram Schmidt-Brückner et.al. |
2509.06801 |
null |
2025-09-08 |
Image Encryption Scheme Based on Hyper-Chaotic Map and Self-Adaptive Diffusion |
Yiqi Tang et.al. |
2509.06754 |
null |
2025-09-08 |
Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training |
Ruicheng Zhang et.al. |
2509.06723 |
null |
2025-09-08 |
STAGE: Segmentation-oriented Industrial Anomaly Synthesis via Graded Diffusion with Explicit Mask Alignment |
Xichen Xu et.al. |
2509.06693 |
null |
2025-09-08 |
A Parallel Solver with Multiphysics Finite Element Method for Poroelasticity Coupled with Elasticity Model |
Zhihao Ge et.al. |
2509.06673 |
null |
2025-09-08 |
The complementary of CTAO, direct detection and collider searches for dark matter in Effective Field Theories and Simplified models |
Igor Reis et.al. |
2509.06628 |
null |
2025-09-08 |
Fisher entropic Fokker-Planck model of monatomic rarefied gases |
Veronica Montanaro et.al. |
2509.06610 |
null |
2025-09-08 |
Contrastive Anatomy-Contrast Disentanglement: A Domain-General MRI Harmonization Method |
Daniel Scholz et.al. |
2509.06592 |
null |
2025-09-08 |
CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis |
Xin Kong et.al. |
2509.06579 |
null |
2025-09-08 |
From Rigging to Waving: 3D-Guided Diffusion for Natural Animation of Hand-Drawn Characters |
Jie Zhou et.al. |
2509.06573 |
null |
2025-09-08 |
Interlayer Coupling and Exciton Dynamics in 2D Hybrid Structures based on an InGaN Quantum Well coupled to a MoSe2 Monolayer |
D. Chen et.al. |
2509.06547 |
null |
2025-09-08 |
A multiscale theory for network advection-reaction-diffusion |
Hadrien Oliveri et.al. |
2509.06546 |
null |
2025-09-08 |
Thermalization dynamics of finite-size quantum critical systems |
Li Li et.al. |
2509.06523 |
null |
2025-09-08 |
On optimal solutions of classical and sliced Wasserstein GANs with non-Gaussian data |
Yu-Jui Huang et.al. |
2509.06505 |
null |
2025-09-08 |
TIDE: Achieving Balanced Subject-Driven Image Generation via Target-Instructed Diffusion Enhancement |
Jibai Lin et.al. |
2509.06499 |
null |
2025-09-08 |
Phyllotaxis in a Keller-Segel model |
Michael F. Staddon et.al. |
2509.06498 |
null |
2025-09-08 |
Discovery of giant bubbles in the hot gaseous halo of the massive disk galaxy NGC 6286 |
Lin He et.al. |
2509.06470 |
null |
2025-09-08 |
VQualA 2025 Challenge on Image Super-Resolution Generated Content Quality Assessment: Methods and Results |
Yixiao Li et.al. |
2509.06413 |
null |
2025-09-08 |
Diffusion-Shock PDEs for Deep Learning on Position-Orientation Space |
Finn M. Sherry et.al. |
2509.06405 |
null |
2025-09-08 |
Non-Destructive Rail Monitoring for Defect Identification |
Elissa Akiki et.al. |
2509.06394 |
null |
2025-09-08 |
Hydrogen-induced fast fracture in a 1.5 GPa dual-phase steel |
Rama Srinivas Varanasi et.al. |
2509.06323 |
null |
2025-09-08 |
McKean-Vlasov limits of scaling-critical reaction-diffusion equations with random initial data |
Bryan Castillo et.al. |
2509.06260 |
null |
2025-09-07 |
Multi-Scale Modeling and Predictive Control of Active Brownian Particles |
Sadra Saremi et.al. |
2509.06217 |
null |
2025-09-07 |
Grasp-MPC: Closed-Loop Visual Grasping via Value-Guided Model Predictive Control |
Jun Yamada et.al. |
2509.06201 |
null |
2025-09-07 |
Forward and inverse problems of a semilinear transport equation |
Kui Ren et.al. |
2509.06183 |
null |
2025-09-07 |
The role of the initial distribution in population survival within a bounded habitat |
Rafael de la Rosa et.al. |
2509.06179 |
null |
2025-09-07 |
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts |
Duomin Wang et.al. |
2509.06155 |
null |
2025-09-07 |
If generative AI is the answer, what is the question? |
Ambuj Tewari et.al. |
2509.06120 |
null |
2025-09-10 |
The Thermodynamic Limit of Extreme First-Passage Times |
Talia Baravi et.al. |
2509.06098 |
null |
2025-09-07 |
Home-made Diffusion Model from Scratch to Hatch |
Shih-Ying Yeh et.al. |
2509.06068 |
null |
2025-09-10 |
BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models |
Yuming Li et.al. |
2509.06040 |
null |
2025-09-07 |
DreamAudio: Customized Text-to-Audio Generation with Diffusion Models |
Yi Yuan et.al. |
2509.06027 |
null |
2025-09-07 |
The Gross-Pitaewsky equation with time and space dependent coefficients |
Federico Lai et.al. |
2509.06001 |
null |
2025-09-07 |
Multi-Strategy Guided Diffusion via Sparse Masking Temporal Reweighting Distribution Correction |
Zekun Zhou et.al. |
2509.05992 |
null |
2025-09-07 |
Simulation of Solar Surface Flux Transport Constrained by Magnetic Power Spectra. I. Flux Transport Parameter |
Yukun Luo et.al. |
2509.05989 |
null |
2025-09-07 |
Imagining Alternatives: Towards High-Resolution 3D Counterfactual Medical Image Generation via Language Guidance |
Mohamed Mohamed et.al. |
2509.05978 |
null |
2025-09-09 |
Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching |
Feng Wang et.al. |
2509.05952 |
null |
2025-09-06 |
Transformer-based Topology Optimization |
Aaron Lutheran et.al. |
2509.05800 |
null |
2025-09-06 |
Hybrid Fourier Neural Operator-Plasma Fluid Model for Fast and Accurate Multiscale Simulations of High Power Microwave Breakdown |
Kalp Pandya et.al. |
2509.05799 |
null |
2025-09-06 |
Discrete-Time Quantum Random Walk for Epidemiological Modeling |
Sayan Manna et.al. |
2509.05795 |
null |
2025-09-06 |
Depth Profiling of Oxygen Migration in Ta/HfO2 Stacks During Ionic Liquid Gating |
Beatrice Bednarz et.al. |
2509.05748 |
null |
2025-09-06 |
InterAct: A Large-Scale Dataset of Dynamic, Expressive and Interactive Activities between Two People in Daily Scenarios |
Leo Ho et.al. |
2509.05747 |
null |
2025-09-06 |
High-friction limit for bipolar Euler-Riesz systems |
Nuno J. Alves et.al. |
2509.05742 |
null |
2025-09-06 |
Polarization memory effect in a multimode fiber |
Gauri Arora et.al. |
2509.05665 |
null |
2025-09-06 |
EditIDv2: Editable ID Customization with Data-Lubricated ID Feature Integration for Text-to-Image Generation |
Guandong Li et.al. |
2509.05659 |
null |
2025-09-06 |
Well-posedness and regularity theory for the fractional diffusion-wave equation in Lebesgue spaces |
Bruno de Andrade et.al. |
2509.05654 |
null |
2025-09-06 |
SuMa: A Subspace Mapping Approach for Robust and Effective Concept Erasure in Text-to-Image Diffusion Models |
Kien Nguyen et.al. |
2509.05625 |
null |
2025-09-06 |
Large and moderate deviation principles for stochastic partial differential equation on graph |
Jianbo Cui et.al. |
2509.05622 |
null |
2025-09-05 |
Perpendicular ion heating in turbulence and reconnection: magnetic moment breaking by coherent fluctuations |
Alfred Mallet et.al. |
2509.05518 |
null |
2025-09-05 |
Chemotaxis Models with Nonlinear/Porous Medium Diffusion, Consumption, and Logistic source on $\mathbb{R}^N$ : I. Global Solvability and Boundedness |
Zulaihat Hassan et.al. |
2509.05494 |
null |
2025-09-05 |
From Image Generation to Infrastructure Design: a Multi-agent Pipeline for Street Design Generation |
Chenguang Wang et.al. |
2509.05469 |
null |
2025-09-05 |
Newton to Einstein: Axiom-Based Discovery via Game Design |
Pingchuan Ma et.al. |
2509.05448 |
null |
2025-09-05 |
The MeerKAT Galaxy Cluster Legacy Survey – II. Catalogue of the diffuse radio emission in MeerKAT-GCLS clusters |
Konstantinos Kolokythas et.al. |
2509.05442 |
null |
2025-09-05 |
Diffusioosmosis of electrolyte solutions in uniformly charged channels |
Evgeny S. Asmolov et.al. |
2509.05387 |
null |
2025-09-05 |
Spin-transport characteristics in a Si-based spin metal-oxide-semiconductor field-effect transistor (spin MOSFET): Bias dependence of the spin polarization in Si and magnetoresistance in spin-valve signals |
Shoichi Sato et.al. |
2509.05384 |
null |
2025-09-05 |
Extreme Negative Polarisation of New Interstellar Comet 3I/ATLAS |
Zuri Gray et.al. |
2509.05181 |
null |
2025-09-05 |
Cheaper access to universal fluctuations in integrable spin chains from boundary effects |
Sylvain Prolhac et.al. |
2509.05176 |
null |
2025-09-05 |
Latest results from the searches for ultra-high-energy photons at the Pierre Auger Observatory |
Pierpaolo Savina et.al. |
2509.05113 |
null |
2025-09-05 |
Painting the market: generative diffusion models for financial limit order book simulation and forecasting |
Alfred Backhouse et.al. |
2509.05107 |
null |
2025-09-05 |
Physical interactions enable energy-efficient Turing patterns |
Cathelijne ter Burg et.al. |
2509.05093 |
null |
2025-09-05 |
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading |
Yang Chen et.al. |
2509.05080 |
null |
2025-09-05 |
Masked Diffusion Language Models with Frequency-Informed Training |
Despoina Kosmopoulou et.al. |
2509.05056 |
null |
2025-09-05 |
Active thermodynamics of inertial chiral active gases: equation of state and edge currents |
Lorenzo Caprini et.al. |
2509.05053 |
null |
2025-09-05 |
QCA-MolGAN: Quantum Circuit Associative Molecular GAN with Multi-Agent Reinforcement Learning |
Aaron Mark Thomas et.al. |
2509.05051 |
null |
2025-09-05 |
LUIVITON: Learned Universal Interoperable VIrtual Try-ON |
Cong Cao et.al. |
2509.05030 |
null |
2025-09-05 |
Synthetic Acceleration Preconditioners for Parametric Radiative Transfer Equations based on Trajectory-Aware Reduced Order Models |
Ning Tang et.al. |
2509.05001 |
null |
2025-09-05 |
FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies |
Moritz Reuss et.al. |
2509.04996 |
null |
2025-09-05 |
Improving Spatial Resolution of Background Oriented Schlieren Based on Directional Rays |
Xiang Li et.al. |
2509.04992 |
null |
2025-09-05 |
Magnetorotational and convective instabilities in a thin layer of electrically conductive nanofluid under an external helical magnetic field |
M. I. Kopp et.al. |
2509.04968 |
null |
2025-09-05 |
Efficient estimation of jump parameters for stochastic differential equations driven by L{é}vy processes |
Elise Bayraktar et.al. |
2509.04920 |
null |
2025-09-05 |
Survey of Profile Parameters of the $6196 Å$ Diffuse Interstellar Band. From Uniform Profiles to Doppler Splitting and Blueshifts |
M. Piecka et.al. |
2509.04915 |
null |
2025-09-05 |
Off-lattice Microscopic Monte Carlo Modeling of Molecular Hydrogen Formation on Carbonaceous Dust Grains |
N. A. Satonkin et.al. |
2509.04913 |
null |
2025-09-05 |
Spectrum of slip dynamics, scaling & statistical laws emerge from simplified model of fault and damage zone architecture |
M. Almakari et.al. |
2509.04909 |
null |
2025-09-05 |
Plug-and-Play Latent Diffusion for Electromagnetic Inverse Scattering with Application to Brain Imaging |
Rui Guo et.al. |
2509.04860 |
null |
2025-09-05 |
A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing |
Chengkai Xu et.al. |
2509.04853 |
null |
2025-09-05 |
Stable and unstable spatially-periodic canards created in singular subcritical Turing bifurcations in the Brusselator system |
Robert Jencks et.al. |
2509.04835 |
null |
2025-09-05 |
SemSteDiff: Generative Diffusion Model-based Coverless Semantic Steganography Communication |
Song Gao et.al. |
2509.04803 |
null |
2025-09-05 |
Stability and Self-Organized Patterns in Coupled Ecohydrological–Fire Dynamics: A Model of Vegetation–Rainfall–Bushfire Interactions |
Serena Dipierro et.al. |
2509.04766 |
null |
2025-09-05 |
STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs |
Han Liang et.al. |
2509.04719 |
null |
2025-09-04 |
Transforming Fashion with AI: A Comparative Study of Large Language Models in Apparel Design |
Nusrat Jahan Lamia et.al. |
2509.04705 |
null |
2025-09-04 |
On convergence of upwinding Petrov-Galerkin methods for convection-diffusion |
Constantin Bacuta et.al. |
2509.04703 |
null |
2025-09-04 |
DarkStream: real-time speech anonymization with low latency |
Waris Quamer et.al. |
2509.04667 |
null |
2025-09-04 |
Mo Atom Rearrangement Drives Layer-Dependent Reactivity in Two-Dimensional MoS2 |
Zifan Wang et.al. |
2509.04648 |
null |
2025-09-04 |
Technical Developments of DA on $\mathbb{T}^3$ |
Hangyue Zhang et.al. |
2509.04634 |
null |
2025-09-04 |
$\mathcal{L}_1$ -DRAC: Distributionally Robust Adaptive Control |
Aditya Gahlawat et.al. |
2509.04619 |
null |
2025-09-04 |
DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models |
Jin Ma et.al. |
2509.04597 |
null |
2025-09-04 |
An S-matrix Formalism for the Nonclassical Optical Response of Plasmonic Sphere Aggregates |
Xin Zheng et.al. |
2509.04589 |
null |
2025-09-04 |
Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model |
Hongyang Wei et.al. |
2509.04548 |
null |
2025-09-04 |
Spatial Patterning and Selection: How the Environment Shapes Molecular Complexity |
Alexandre Champagne-Ruel et.al. |
2509.04547 |
null |
2025-09-04 |
PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting |
Linqing Wang et.al. |
2509.04545 |
null |
2025-09-04 |
In-Context Policy Adaptation via Cross-Domain Skill Diffusion |
Minjong Yoo et.al. |
2509.04535 |
null |
2025-09-04 |
Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image – Technical Preview |
Jun-Kun Chen et.al. |
2509.04450 |
null |
2025-09-04 |
Plot’n Polish: Zero-shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models |
Kiymet Akdemir et.al. |
2509.04446 |
null |
2025-09-04 |
Durian: Dual Reference-guided Portrait Animation with Attribute Transfer |
Hyunsoo Cha et.al. |
2509.04434 |
null |
2025-09-04 |
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation |
Zanwei Zhou et.al. |
2509.04406 |
null |
2025-09-04 |
Transition Models: Rethinking the Generative Learning Objective |
Zidong Wang et.al. |
2509.04394 |
null |
2025-09-04 |
SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer |
Jimin Xu et.al. |
2509.04379 |
null |
2025-09-04 |
Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology |
Yuchen Jiao et.al. |
2509.04372 |
null |
2025-09-04 |
Sensitivities of time-dependent temperature profile predictions for NSTX with the Multi-Mode Model |
J. B. Lestz et.al. |
2509.04360 |
null |
2025-09-04 |
From Editor to Dense Geometry Estimator |
JiYuan Wang et.al. |
2509.04338 |
null |
2025-09-04 |
The limiting law of the Discrete Gaussian level-lines |
Joseph Chen et.al. |
2509.04333 |
null |
2025-09-04 |
Noisy Label Refinement with Semantically Reliable Synthetic Images |
Yingxuan Li et.al. |
2509.04298 |
null |
2025-09-04 |
TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models |
Yuxin Gong et.al. |
2509.04269 |
null |
2025-09-04 |
Thermal diffusivity measurement based on evaporative cryocooling excitation: Theory and experiments |
Pengfei Zhu et.al. |
2509.04263 |
null |
2025-09-04 |
Error analysis for learning the time-stepping operator of evolutionary PDEs |
Ke Chen et.al. |
2509.04256 |
null |
2025-09-04 |
Synthetic Survival Data Generation for Heart Failure Prognosis Using Deep Generative Models |
Chanon Puttanawarut et.al. |
2509.04245 |
null |
2025-09-04 |
Axion-Photon Conversion In Magnetized Universe: Impact On The Global 21-cm Signal |
Pravin Kumar Natwariya et.al. |
2509.04237 |
null |
2025-09-04 |
Cosmic-Ray Boosted Diffuse Supernova Neutrinos |
Alexander Sandrock et.al. |
2509.04229 |
null |
2025-09-04 |
Making neural networks understand internal heat transfer using Fourier-transformed thermal diffusion wave fields |
Pengfei Zhu et.al. |
2509.04223 |
null |
2025-09-04 |
Two-dimensional magnetic tunnel p-n junctions for low-power electronics |
Wenkai Zhu et.al. |
2509.04206 |
null |
2025-09-04 |
Laplacian Flows in Complex-valued Directed Networks: Analysis, Design, and Consensus |
Aditi Saxena et.al. |
2509.04196 |
null |
2025-09-04 |
DUDE: Diffusion-Based Unsupervised Cross-Domain Image Retrieval |
Ruohong Yang et.al. |
2509.04193 |
null |
2025-09-04 |
Set Block Decoding is a Language Model Inference Accelerator |
Itai Gat et.al. |
2509.04185 |
null |
2025-09-04 |
On Riordan groups involving formal semi-Laurent series and their Lie group structure |
Dariusz Bugajewski et.al. |
2509.04160 |
null |
2025-09-04 |
Hyper Diffusion Avatars: Dynamic Human Avatar Generation using Network Weight Space Diffusion |
Dongliang Cao et.al. |
2509.04145 |
null |
2025-09-04 |
MEPG:Multi-Expert Planning and Generation for Compositionally-Rich Image Generation |
Yuan Zhao et.al. |
2509.04126 |
null |
2025-09-04 |
A unified stabilized virtual element method for the generalized Oseen equation: stability and robustness |
Sudheer Mishra et.al. |
2509.04113 |
null |
2025-09-04 |
Depletion-Induced Interactions Modulate Nanoscale Protein Diffusion in Polymeric Crowder Solutions |
Michelle Dargasz et.al. |
2509.04087 |
null |
2025-09-04 |
Keypoint-based Diffusion for Robotic Motion Planning on the NICOL Robot |
Lennart Clasmeier et.al. |
2509.04076 |
null |
2025-09-04 |
SMooGPT: Stylized Motion Generation using Large Language Models |
Lei Zhong et.al. |
2509.04058 |
null |
2025-09-04 |
CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning |
Zeyu Gan et.al. |
2509.04027 |
null |
2025-09-04 |
Electromechanical human heart modeling for predicting endocardial heart motion |
Milad Hasani et.al. |
2509.04024 |
null |
2025-09-04 |
Divergence-Kernel method for linear responses and diffusion models |
Angxiu Ni et.al. |
2509.03992 |
null |
2025-09-04 |
NeuroBreak: Unveil Internal Jailbreak Mechanisms in Large Language Models |
Chuhan Zhang et.al. |
2509.03985 |
null |
2025-09-05 |
Improving Vessel Segmentation with Multi-Task Learning and Auxiliary Data Available Only During Model Training |
Daniel Sobotka et.al. |
2509.03975 |
null |
2025-09-04 |
ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection |
Zhu Wenjie et.al. |
2509.03951 |
null |
2025-09-04 |
Fluid boundary conditions in kinetic-diffusion Monte Carlo |
Thijs Steel et.al. |
2509.03942 |
null |
2025-09-04 |
Thickness-dependent magnon spin transport in antiferromagnetic insulators: Crossover from quasi-three-dimensional to quasi-two-dimensional regimes |
Mathias Åsan Myhre et.al. |
2509.03941 |
null |
2025-09-04 |
SwinSRGAN: Swin Transformer-based Generative Adversarial Network for High-Fidelity Speech Super-Resolution |
Jiajun Yuan et.al. |
2509.03913 |
null |
2025-09-04 |
A Generative Foundation Model for Chest Radiography |
Yuanfeng Ji et.al. |
2509.03903 |
null |
2025-09-04 |
Diffusion Generative Models Meet Compressed Sensing, with Applications to Image Data and Financial Time Series |
Zhengyi Guo et.al. |
2509.03898 |
null |
2025-09-04 |
Human Motion Video Generation: A Survey |
Haiwei Xue et.al. |
2509.03883 |
null |
2025-09-04 |
Demonstrating a family of X-ray dark-field retrieval approaches on a common set of samples |
Samantha J. Alloo et.al. |
2509.03866 |
null |
2025-09-04 |
A minimization principle behind the diffusion bridge of diurnal fish migration |
H. Yoshioka et.al. |
2509.03824 |
null |
2025-09-04 |
Machine Learning for LiDAR-Based Indoor Surface Classification in Intelligent Wireless Environments |
Parth Ashokbhai Shiroya et.al. |
2509.03813 |
null |
2025-09-04 |
Causality-guided Prompt Learning for Vision-language Models via Visual Granulation |
Mengyu Gao et.al. |
2509.03803 |
null |
2025-09-04 |
Universal Structure of Turbulent Radiative Mixing Layers |
Prateek Sharma et.al. |
2509.03802 |
null |
2025-09-04 |
A high-lying isomer in ^{92}Zr with lifetime modulated by the atomic charge states: a proposed approach for a nuclear gamma-ray laser |
C. X. Jia et.al. |
2509.03797 |
null |
2025-09-04 |
Fitting Image Diffusion Models on Video Datasets |
Juhun Lee et.al. |
2509.03794 |
null |
2025-09-03 |
Learning functions through Diffusion Maps |
Alvaro Almeida Gomez et.al. |
2509.03758 |
null |
2025-09-03 |
Effects of Bethe-Heitler pair production in ultraluminous X-ray sources |
Gustavo Esteban Romero et.al. |
2509.03735 |
null |
2025-09-03 |
LuxDiT: Lighting Estimation with Video Diffusion Transformer |
Ruofan Liang et.al. |
2509.03680 |
null |
2025-09-03 |
Applying a Gaussian networking theory to model motor-driven transport along cytoskeletal filaments |
Nadine du Toit et.al. |
2509.03671 |
null |
2025-09-06 |
Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning |
Antonio Guillen-Perez et.al. |
2509.03658 |
null |
2025-09-05 |
Noise is All You Need: rethinking the value of noise on seismic denoising via diffusion models |
Donglin Zhu et.al. |
2509.03629 |
null |
2025-09-03 |
Statistical Analysis of PAHs as a Tracer of Anomalous Microwave Emission Using DIRBE Data |
Danielle Sponseller et.al. |
2509.03611 |
null |
2025-09-03 |
Breaking Down the $\textsf{CosmoGEMS}$ : Toward Modeling and Understanding Globular Cluster Stellar Streams in a Fully Cosmological Context |
Nondh Panithanpaisal et.al. |
2509.03599 |
null |
2025-09-02 |
Diffusion-RL Based Air Traffic Conflict Detection and Resolution Method |
Tonghe Li et.al. |
2509.03550 |
null |
2025-09-03 |
Dynamically Controlled Transport of GeV Cosmic Rays in Diverse Galactic Environments |
Ronan Hix et.al. |
2509.03519 |
null |
2025-09-03 |
Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? |
Ouxiang Li et.al. |
2509.03516 |
null |
2025-09-03 |
OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation |
Han Li et.al. |
2509.03498 |
null |
2025-09-03 |
From Image Denoisers to Regularizing Imaging Inverse Problems: An Overview |
Hong Ye Tan et.al. |
2509.03475 |
null |
2025-09-03 |
Joint Training of Image Generator and Detector for Road Defect Detection |
Kuan-Chuan Peng et.al. |
2509.03465 |
null |
2025-09-03 |
Nitrogen chemistry of hycean worlds on the example of K2-18b |
Maja W. Radecka et.al. |
2509.03455 |
null |
2025-09-03 |
ANNIE: Be Careful of Your Robots |
Yiyang Huang et.al. |
2509.03383 |
null |
2025-09-03 |
Dynamics of Infection Spread and Hotspot Growth in Bi-Pathogen Networks |
Alyssa Yu et.al. |
2509.03374 |
null |
2025-09-03 |
Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner |
Yewen Li et.al. |
2509.03348 |
null |
2025-09-03 |
On the MIA Vulnerability Gap Between Private GANs and Diffusion Models |
Ilana Sebag et.al. |
2509.03341 |
null |
2025-09-03 |
Dynamical interface above a hard wall and reflected SPDE on the half-line |
Pierre Faugère et.al. |
2509.03328 |
null |
2025-09-03 |
Numerical Modeling of Galactic Cosmic Ray Modulation in the Heliosphere |
D. A. Shestakov et.al. |
2509.03326 |
null |
2025-09-03 |
InfraDiffusion: zero-shot depth map restoration with diffusion models and prompted segmentation from sparse infrastructure point clouds |
Yixiong Jing et.al. |
2509.03324 |
null |
2025-09-03 |
Noise resilience of two-dimensional Floquet topological phases |
Balaganchi A. Bhargava et.al. |
2509.03296 |
null |
2025-09-03 |
SynBT: High-quality Tumor Synthesis for Breast Tumor Segmentation by 3D Diffusion Model |
Hongxu Yang et.al. |
2509.03267 |
null |
2025-09-03 |
Estudio de la eficiencia en la escalabilidad de GPUs para el entrenamiento de Inteligencia Artificial |
David Cortes et.al. |
2509.03263 |
null |
2025-09-03 |
Evaluation of Stress Detection as Time Series Events – A Novel Window-Based F1-Metric |
Harald Vilhelm Skat-Rørdam et.al. |
2509.03240 |
null |
2025-09-03 |
Deep Learning for High Speed Optical Coherence Elastography with a Fiber Scanning Endoscope |
Maximilian Neidhardt et.al. |
2509.03193 |
null |
2025-09-03 |
Dissecting the Diffuse Emission of the Galaxy with the HAWC Observatory |
Georg Schwefer et.al. |
2509.03189 |
null |
2025-09-03 |
The slow evolution of dark matter halos from cusp to core naturally produces extended stellar core-like distributions |
Jorge Sanchez Almeida et.al. |
2509.03167 |
null |
2025-09-03 |
Temporally-Aware Diffusion Model for Brain Progression Modelling with Bidirectional Temporal Regularisation |
Mattia Litrico et.al. |
2509.03141 |
null |
2025-09-03 |
RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation |
Sashuai Zhou et.al. |
2509.03131 |
null |
2025-09-03 |
On the Smart Coordination of Flexibility Scheduling in Multi-carrier Integrated Energy Systems |
Christian Doh Dinga et.al. |
2509.03126 |
null |
2025-09-03 |
Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge |
Miao Xu et.al. |
2509.03114 |
null |
2025-09-03 |
Bounded imaginary powers of generalized diffusion operators |
Alexandre Thorel et.al. |
2509.03105 |
null |
2025-09-03 |
Collision operator for electron runaway in cold weakly-ionized plasmas |
Yeongsun Lee et.al. |
2509.03092 |
null |
2025-09-03 |
Diffusive shock acceleration: non-classical model of cosmic ray transport |
A. A. Lagutin et.al. |
2509.03091 |
null |
2025-09-03 |
High Cursive Complex Character Recognition using GAN External Classifier |
S M Rafiuddin et.al. |
2509.03062 |
null |
2025-09-03 |
DCDB: Dynamic Conditional Dual Diffusion Bridge for Ill-posed Multi-Tasks |
Chengjie Huang et.al. |
2509.03044 |
null |
2025-09-03 |
Boundary layer effects induced by the fluid in a chemotaxis-Navier-Stokes system |
Qianqian Hou et.al. |
2509.03028 |
null |
2025-09-03 |
Enhancing Robustness in Post-Processing Watermarking: An Ensemble Attack Network Using CNNs and Transformers |
Tzuhsuan Huang et.al. |
2509.03006 |
null |
2025-09-03 |
DUViN: Diffusion-Based Underwater Visual Navigation via Knowledge-Transferred Depth Features |
Jinghe Yang et.al. |
2509.02983 |
null |
2025-09-03 |
InstaDA: Augmenting Instance Segmentation Data with Dual-Agent System |
Xianbao Hou et.al. |
2509.02973 |
null |
2025-09-03 |
Non-Linear and Meta-Stable Dynamics in Financial Markets: Evidence from High Frequency Crypto Currency Market Makers |
Igor Halperin et.al. |
2509.02941 |
null |
2025-09-03 |
The Role of Far-side Magnetic Structures in Modeling 2024 Solar Eclipse |
Guanglu Shi et.al. |
2509.02911 |
null |
2025-09-02 |
The Space Coronagraph Optical Bench (SCoOB): 8. end-to-end numerical modeling of the testbed to estimate the contrast limits |
Ramya M Anche et.al. |
2509.02887 |
null |
2025-09-02 |
Fluid Model of Schrodinger equation and derivation of the quantum potential |
Lachezar Simeonov et.al. |
2509.02868 |
null |
2025-09-02 |
Predicting Movie Success with Multi-Task Learning: A Hybrid Framework Combining GPT-Based Sentiment Analysis and SIR Propagation |
Wenlan Xie et.al. |
2509.02809 |
null |
2025-09-02 |
DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off |
Jusheng Zhang et.al. |
2509.02785 |
null |
2025-09-02 |
Synthetic generation of online social networks through homophily |
Alejandro Buitrago López et.al. |
2509.02762 |
null |
2025-09-02 |
Spacetime Wavelet Method for Linear Boundary-Value Problems in Sylvester Matrix Equation Form |
Cody D. Cochran et.al. |
2509.02720 |
null |
2025-09-02 |
Ultrafast anisotropic exciton transport in phosphorene |
Kai-Wei Chang et.al. |
2509.02682 |
null |
2025-09-02 |
Explosive Dispersal Outflows as a New Class of Fermi Gamma-Ray Sources: The Case of DR21 |
Paarmita Pandey et.al. |
2509.02679 |
null |
2025-09-02 |
Double-faced white dwarfs and the magnetic inhibition of convection |
Sivan Ginzburg et.al. |
2509.02671 |
null |
2025-09-02 |
Is RL fine-tuning harder than regression? A PDE learning approach for diffusion models |
Wenlong Mou et.al. |
2509.02528 |
null |
2025-09-02 |
Unifi3D: A Study on 3D Representations for Generation and Reconstruction in a Common Framework |
Nina Wiedemann et.al. |
2509.02474 |
null |
2025-09-02 |
TeRA: Rethinking Text-guided Realistic 3D Avatar Generation |
Yanwen Wang et.al. |
2509.02466 |
null |
2025-09-02 |
Fractional differential equations: non-constant coefficients, simulation and model reduction |
Ruben Aylwin et.al. |
2509.02465 |
null |
2025-09-02 |
GenCompositor: Generative Video Compositing with Diffusion Transformer |
Shuzhou Yang et.al. |
2509.02460 |
null |
2025-09-02 |
Quantitative positivity of transition densities for random perturbations of Hamiltonian systems |
Shimaa Elesaely et.al. |
2509.02448 |
null |
2025-09-02 |
Kelvin-Helmholtz instability in binary fluids with miscibility gap |
Anubhav Dubey et.al. |
2509.02400 |
null |
2025-09-02 |
Revisiting the diffusion equation derivation in Persson’s theory of contact |
Yang Xu et.al. |
2509.02397 |
null |
2025-09-02 |
Widely non-degenerate nonlinear frequency conversion in cryogenic titanium in-diffused lithium niobate waveguides |
Nina Amelie Lange et.al. |
2509.02392 |
null |
2025-09-02 |
Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion |
Zeren Xiong et.al. |
2509.02357 |
null |
2025-09-02 |
A recursive formula for the $n^\text{th}$ survival function and the $n^\text{th}$ first passage time distribution for jump and diffusion processes. Applications to the pricing of $n^\text{th}$ -to-default CDS |
Alessio Lapolla et.al. |
2509.02347 |
null |
2025-09-02 |
Multi-stage PDE-based image processing techniques for noisy MRI scans |
Ksenia Slepova et.al. |
2509.02342 |
null |
2025-09-02 |
RDIT: Residual-based Diffusion Implicit Models for Probabilistic Time Series Forecasting |
Chih-Yu Lai et.al. |
2509.02341 |
null |
2025-09-02 |
Distribution estimation via Flow Matching with Lipschitz guarantees |
Lea Kunkel et.al. |
2509.02337 |
null |
2025-09-02 |
Exploring Diffusion Models for Generative Forecasting of Financial Charts |
Taegyeong Lee et.al. |
2509.02308 |
null |
2025-09-02 |
Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image Generation |
Sapir Esther Yiflach et.al. |
2509.02295 |
null |
2025-09-03 |
Sem-RaDiff: Diffusion-Based 3D Radar Semantic Perception in Cluttered Agricultural Environments |
Ruibin Zhang et.al. |
2509.02283 |
null |
2025-09-02 |
Think2Sing: Orchestrating Structured Motion Subtitles for Singing-Driven 3D Head Animation |
Zikai Huang et.al. |
2509.02278 |
null |
2025-09-02 |
Ergodicity of conditional McKean-Vlasov jump diffusions |
Jianhai Bao et.al. |
2509.02249 |
null |
2025-09-02 |
Spectrogram Patch Codec: A 2D Block-Quantized VQ-VAE and HiFi-GAN for Neural Speech Coding |
Luis Felipe Chary et.al. |
2509.02244 |
null |
2025-09-02 |
Improving atomic force microscopy structure discovery via style-translation |
Jie Huang et.al. |
2509.02240 |
null |
2025-09-02 |
Mechanical performance of hybrid polymer-lipid vesicles with leaflet asymmetry engineered using microfluidics |
Yuting Huang et.al. |
2509.02194 |
null |
2025-09-02 |
Enhancing Zero-Shot Pedestrian Attribute Recognition with Synthetic Data Generation: A Comparative Study with Image-To-Image Diffusion Models |
Pablo Ayuso-Albizu et.al. |
2509.02161 |
null |
2025-09-02 |
Nuclear fusion plasma fuelling with ice pellets using a neuromorphic controller |
L. L. T. C. Jansen et.al. |
2509.02147 |
null |
2025-09-02 |
Differentiable Expectation-Maximisation and Applications to Gaussian Mixture Model Optimal Transport |
Samuel Boïté et.al. |
2509.02109 |
null |
2025-09-02 |
GeoLayer: Towards Low-Latency and Cost-Efficient Geo-Distributed Graph Stores with Layered Graph |
Feng Yao et.al. |
2509.02106 |
null |
2025-09-02 |
A Data-Centric Approach to Pedestrian Attribute Recognition: Synthetic Augmentation via Prompt-driven Diffusion Models |
Alejandro Alonso et.al. |
2509.02099 |
null |
2025-09-02 |
Environment-Aware Channel Measurement and Modeling for Terahertz Monostatic Sensing |
Yejian Lyu et.al. |
2509.02088 |
null |
2025-09-02 |
Superexponential dissipation enhancement on $\mathbb{T}^d$ |
Keefer Rowan et.al. |
2509.02081 |
null |
2025-09-02 |
Data-Dependent Smoothing for Protein Discovery with Walk-Jump Sampling |
Srinivas Anumasa et.al. |
2509.02069 |
null |
2025-09-02 |
Measuring metal sulfides in interstellar dust with PRIMA |
Izaskun Jiménez-Serra et.al. |
2509.02067 |
null |
2025-09-02 |
Enhanced Raman scattering by fast GaN phonon-polaritons |
Mayssoune Mina et.al. |
2509.02057 |
null |
2025-09-02 |
Palette Aligned Image Diffusion |
Elad Aharoni et.al. |
2509.02000 |
null |
2025-09-02 |
Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought Imagination |
Ziyun Zeng et.al. |
2509.01986 |
null |
2025-09-03 |
Discrete Noise Inversion for Next-scale Autoregressive Text-based Image Editing |
Quan Dao et.al. |
2509.01984 |
null |
2025-09-02 |
Nonmonotonic change with energy of the mean logarithmic mass of cosmic rays in the knee region: the mechanism of formation of this feature and sources of particles |
A. A. Lagutin et.al. |
2509.01974 |
null |
2025-09-02 |
Efficient Bayesian Sampling with Langevin Birth-Death Dynamics |
Alex Leviyev et.al. |
2509.01942 |
null |
2025-09-02 |
A Diffusion-Based Framework for Configurable and Realistic Multi-Storage Trace Generation |
Seohyun Kim et.al. |
2509.01919 |
null |
2025-09-02 |
DroneSR: Rethinking Few-shot Thermal Image Super-Resolution from Drone-based Perspective |
Zhipeng Weng et.al. |
2509.01898 |
null |
2025-09-02 |
Far-infrared probing with PRIMA into particle acceleration associated with relativistic jets from active galactic nuclei |
Naoki Isobe et.al. |
2509.01876 |
null |
2025-09-04 |
RadioDiff-Loc: Diffusion Model Enhanced Scattering Congnition for NLoS Localization with Sparse Radio Map Estimation |
Xiucheng Wang et.al. |
2509.01875 |
null |
2025-09-02 |
Latent Gene Diffusion for Spatial Transcriptomics Completion |
Paula Cárdenas et.al. |
2509.01864 |
null |
2025-09-02 |
Does the high-energy AMS-02 positron flux originate from the dark matter density spikes around nearby black holes? |
Man Ho Chan et.al. |
2509.01860 |
null |
2025-09-01 |
PractiLight: Practical Light Control Using Foundational Diffusion Models |
Yotam Erel et.al. |
2509.01837 |
null |
2025-09-01 |
ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training |
Ge Yan et.al. |
2509.01819 |
null |
2025-09-03 |
Intermittent localization and fast spatial learning by non-Markov random walks with decaying memory |
Paulina R. Martín-Cornejo et.al. |
2509.01806 |
null |
2025-09-01 |
Mapping Magnetic Fields from Clouds to Cores with PRIMAger |
Kate Pattle et.al. |
2509.01796 |
null |
2025-09-01 |
High-Performance Trajectory Tracking MPC for Quadcopters with Coupled Time-Varying Constraints and Stability Proofs |
Maedeh Izadi et.al. |
2509.01767 |
null |
2025-09-01 |
Clinical Metadata Guided Limited-Angle CT Image Reconstruction |
Yu Shi et.al. |
2509.01752 |
null |
2025-09-01 |
Controllable Generation of Implied Volatility Surfaces with Variational Autoencoders |
Jing Wang et.al. |
2509.01743 |
null |
2025-09-01 |
Quadratic Growth Model with Discontinuity: A Link between Monostable and Bistable Traveling Waves |
Wonhyung Choi et.al. |
2509.01715 |
null |
2025-09-01 |
The PRIMA promise of deciphering interstellar dust evolution with observations of the nearby Universe |
Frédéric Galliano et.al. |
2509.01692 |
null |
2025-09-01 |
The Impact of Baryonic Effects on the Dynamical Masses Inferred Using Satellite Kinematics |
Josephine F. W. Baggen et.al. |
2509.01690 |
null |
2025-09-01 |
Preconditioned Regularized Wasserstein Proximal Sampling |
Hong Ye Tan et.al. |
2509.01685 |
null |
2025-09-01 |
Efficient Transformer-Inspired Variants of Physics-Informed Deep Operator Networks |
Zhi-Feng Wei et.al. |
2509.01679 |
null |
2025-09-01 |
Investigating the role of magnetic fields in the formation and evolution of striations in interstellar clouds with PRIMA |
Raphael Skalidis et.al. |
2509.01678 |
null |
2025-08-29 |
Achieving Hilbert-Schmidt Independence Under Rényi Differential Privacy for Fair and Private Data Generation |
Tobias Hyrup et.al. |
2508.21815 |
null |
2025-08-29 |
Tree-Guided Diffusion Planner |
Hyeonseong Jeon et.al. |
2508.21800 |
null |
2025-08-29 |
OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization |
Jiazheng Xing et.al. |
2508.21727 |
null |
2025-08-29 |
FLORA: Efficient Synthetic Data Generation for Object Detection in Low-Data Regimes via finetuning Flux LoRA |
Alvaro Patricio et.al. |
2508.21712 |
null |
2025-09-01 |
Infinite-Dimensional Stochastic Differential Equations and Diffusion Dynamics of Coulomb Random Point Fields |
Hirofumi Osada et.al. |
2508.21658 |
null |
2025-08-29 |
Deciphering the gamma-ray emission in the Cygnus region |
L. Haerer et.al. |
2508.21644 |
null |
2025-08-29 |
Conforming and discontinuous discretizations of non-isothermal Darcy-Forchheimer flows |
Stefano Bonetti et.al. |
2508.21630 |
null |
2025-09-02 |
Approximate calculation of multidimensional first passage times |
James F. Lutsko et.al. |
2508.21607 |
null |
2025-08-29 |
Condense to Conduct and Conduct to Condense |
Tomasz Kazana et.al. |
2508.21602 |
null |
2025-08-29 |
Fluid dynamics of charm quarks from heavy to light-ion collisions |
Federica Capellino et.al. |
2508.21600 |
null |
2025-08-29 |
OASIS: Harnessing Diffusion Adversarial Network for Ocean Salinity Imputation using Sparse Drifter Trajectories |
Bo Li et.al. |
2508.21570 |
null |
2025-08-29 |
ECHO: Ego-Centric modeling of Human-Object interactions |
Ilya A. Petrov et.al. |
2508.21556 |
null |
2025-08-29 |
Complete Gaussian Splats from a Single Image with Denoising Diffusion Models |
Ziwei Liao et.al. |
2508.21542 |
null |
2025-08-29 |
Molecular Beam Epitaxy of 2H-TaS $_2$ few-layers on GaN(0001) |
Constantin Hilbrunner et.al. |
2508.21537 |
null |
2025-08-29 |
Adaptive generative moment matching networks for improved learning of dependence structures |
Marius Hofert et.al. |
2508.21531 |
null |
2025-08-29 |
Few-Shot Neuro-Symbolic Imitation Learning for Long-Horizon Planning and Acting |
Pierrick Lorang et.al. |
2508.21501 |
null |
2025-08-29 |
Controllable 3D Molecular Generation for Structure-Based Drug Design Through Bayesian Flow Networks and Gradient Integration |
Seungyeon Choi et.al. |
2508.21468 |
null |
2025-08-29 |
Diffusion-based Multi-modal Synergy Interest Network for Click-through Rate Prediction |
Xiaoxi Cui et.al. |
2508.21460 |
null |
2025-09-01 |
Contrarian Motives in Social Learning: Information Cascades with Nonconformist Preferences |
Georgy Lukyanov et.al. |
2508.21446 |
null |
2025-08-29 |
Quantum enhanced ensemble GANs for anomaly detection in continuous biomanufacturing |
Rajiv Kailasanathan et.al. |
2508.21438 |
null |
2025-08-29 |
MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation |
Francisco Caetano et.al. |
2508.21435 |
null |
2025-08-29 |
Global Hot Gas Excess in (U)LIRGs: Replicating Galactic Nuclei Scaling Relations between Diffuse X-ray Emission and Star Formation on Galaxy-Wide Scales |
Chunyi Zhang et.al. |
2508.21401 |
null |
2025-08-29 |
Dynamics-Compliant Trajectory Diffusion for Super-Nominal Payload Manipulation |
Anuj Pasricha et.al. |
2508.21375 |
null |
2025-08-29 |
Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image |
Qingran Miao et.al. |
2508.21371 |
null |
2025-08-29 |
Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning |
Yuquan Bi et.al. |
2508.21363 |
null |
2025-08-29 |
QUAV: Quantum-Assisted Path Planning and Optimization for UAV Navigation with Obstacle Avoidance |
Nouhaila Innan et.al. |
2508.21361 |
null |
2025-08-29 |
DLGAN : Time Series Synthesis Based on Dual-Layer Generative Adversarial Networks |
Xuan Hou et.al. |
2508.21340 |
null |
2025-08-29 |
Quantum Monte Carlo Benchmarking of Molecular Adsorption on Graphene-Supported Single Pt Atom |
Jeonghwan Ahn et.al. |
2508.21339 |
null |
2025-08-29 |
Stage-Diff: Stage-wise Long-Term Time Series Generation Based on Diffusion Models |
Xuan Hou et.al. |
2508.21330 |
null |
2025-08-28 |
PHD: Personalized 3D Human Body Fitting with Point Diffusion |
Hsuan-I Ho et.al. |
2508.21257 |
null |
2025-08-28 |
Weighted Support Points from Random Measures: An Interpretable Alternative for Generative Modeling |
Peiqi Zhao et.al. |
2508.21255 |
null |
2025-08-28 |
Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation |
Yidong Zhao et.al. |
2508.21254 |
null |
2025-08-28 |
Mutual Information Rate – Linear Noise Approximation and Exact Computation |
Manuel Reinhardt et.al. |
2508.21220 |
null |
2025-08-28 |
WaveLLDM: Design and Development of a Lightweight Latent Diffusion Model for Speech Enhancement and Restoration |
Kevin Putra Santoso et.al. |
2508.21153 |
null |
2025-08-28 |
Propagation in the Fisher-KPP equation with Mixed Operator |
Begoña Barrios et.al. |
2508.21151 |
null |
2025-08-28 |
The COLIBRE project: cosmological hydrodynamical simulations of galaxy formation and evolution |
Joop Schaye et.al. |
2508.21126 |
null |
2025-08-28 |
Safe-Control: A Safety Patch for Mitigating Unsafe Content in Text-to-Image Generation Models |
Xiangtao Meng et.al. |
2508.21099 |
null |
2025-08-28 |
TrInk: Ink Generation with Transformer Network |
Zezhong Jin et.al. |
2508.21098 |
null |
2025-08-28 |
First-Place Solution to NeurIPS 2024 Invisible Watermark Removal Challenge |
Fahad Shamshad et.al. |
2508.21072 |
null |
2025-08-28 |
Dress&Dance: Dress up and Dance as You Like It - Technical Preview |
Jun-Kun Chen et.al. |
2508.21070 |
null |
2025-08-28 |
OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning |
Yuan Gong et.al. |
2508.21066 |
null |
2025-08-28 |
Mixture of Contexts for Long Video Generation |
Shengqu Cai et.al. |
2508.21058 |
null |
2025-08-28 |
HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning |
Zhi Su et.al. |
2508.21043 |
null |
2025-08-28 |
FW-GAN: Frequency-Driven Handwriting Synthesis with Wave-Modulated MLP Generator |
Huynh Tong Dang Khoa et.al. |
2508.21040 |
null |
2025-08-28 |
Reusing Computation in Text-to-Image Diffusion for Efficient Generation of Image Sets |
Dale Decatur et.al. |
2508.21032 |
null |
2025-08-28 |
System size and event shape dependence of particle-identified balance functions in proton-proton collisions at $\sqrt{s}=13$ TeV |
Subash Chandra Behera et.al. |
2508.21030 |
null |
2025-08-28 |
POSE: Phased One-Step Adversarial Equilibrium for Video Diffusion Models |
Jiaxiang Cheng et.al. |
2508.21019 |
null |
2025-08-28 |
Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance |
Luozhijie Jin et.al. |
2508.21016 |
null |
2025-08-28 |
Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees |
Yaniv Hassidof et.al. |
2508.21001 |
null |
2025-08-28 |
RANGAN: GAN-empowered Anomaly Detection in 5G Cloud RAN |
Douglas Liao et.al. |
2508.20985 |
null |
2025-08-28 |
Random attractors and nonergodic attractors for diffusions with degeneracies |
Yuri Bakhtin et.al. |
2508.20968 |
null |
2025-08-28 |
Very high-energy gamma-ray and neutrino emission from hadronic interaction in compact binary millisecond pulsars |
Vittoria Vecchiotti et.al. |
2508.20952 |
null |
2025-08-28 |
Lattice Random Walk Discretisations of Stochastic Differential Equations |
Samuel Duffield et.al. |
2508.20883 |
null |
2025-08-28 |
Understanding and evaluating computer vision models through the lens of counterfactuals |
Pushkar Shukla et.al. |
2508.20881 |
null |
2025-08-28 |
Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement |
Shrishti Saha Shetu et.al. |
2508.20859 |
null |
2025-08-28 |
Uniform error analysis of a rectangular Morley finite element method on a Shishkin mesh for a 4th-order singularly perturbed boundary value problem |
Xiangyun Meng et.al. |
2508.20857 |
null |
2025-08-28 |
Learning Primitive Embodied World Models: Towards Scalable Robotic Learning |
Qiao Sun et.al. |
2508.20840 |
null |
2025-08-28 |
High-Resolution Atomic Magnetometer-Based Imaging of Integrated Circuits and Batteries |
Dominic Hunter et.al. |
2508.20834 |
null |
2025-08-28 |
Distinct Spatiotemporal Dynamics of Thermoelectric Transport Across Superconducting Transition |
Rajae Malek et.al. |
2508.20792 |
null |
2025-08-28 |
Prediction of sulphate hazes in the lower Venus atmosphere |
Peter Woitke et.al. |
2508.20790 |
null |
2025-08-28 |
Evaluating Compositional Generalisation in VLMs and Diffusion Models |
Beth Pearson et.al. |
2508.20783 |
null |
2025-08-28 |
Unleashing Uncertainty: Efficient Machine Unlearning for Generative AI |
Christoforos N. Spartalis et.al. |
2508.20773 |
null |
2025-08-28 |
Anomalous diffusion and run-and-tumble motion of a chemotactic particle in low dimensions |
Jacopo Romano et.al. |
2508.20756 |
null |
2025-08-28 |
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning |
Yibin Wang et.al. |
2508.20751 |
null |
2025-08-29 |
A two-state generalisation of the strong collision model |
Ola Kenji Forslund et.al. |
2508.20727 |
null |
2025-08-28 |
EEGDM: Learning EEG Representation with Latent Diffusion Model |
Shaocong Wang et.al. |
2508.20705 |
null |
2025-08-28 |
Agent-based model of information diffusion in the limit order book trading |
Mateusz Wilinski et.al. |
2508.20672 |
null |
2025-08-28 |
“Humor, Art, or Misinformation?”: A Multimodal Dataset for Intent-Aware Synthetic Image Detection |
Anastasios Skoularikis et.al. |
2508.20670 |
null |
2025-08-28 |
Amadeus: Autoregressive Model with Bidirectional Attribute Modelling for Symbolic Music |
Hongju Su et.al. |
2508.20665 |
null |
2025-08-28 |
VarDiU: A Variational Diffusive Upper Bound for One-Step Diffusion Distillation |
Leyang Wang et.al. |
2508.20646 |
null |
2025-08-28 |
CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models |
Ayan Banerjee et.al. |
2508.20640 |
null |
2025-08-28 |
EmoCAST: Emotional Talking Portrait via Emotive Text Description |
Yiguo Jiang et.al. |
2508.20615 |
null |
2025-08-28 |
Revisiting the Privacy Risks of Split Inference: A GAN-Based Data Reconstruction Attack via Progressive Feature Optimization |
Yixiang Qiu et.al. |
2508.20613 |
null |
2025-08-28 |
Physics Informed Generative Models for Magnetic Field Images |
Aye Phyu Phyu Aung et.al. |
2508.20612 |
null |
2025-08-28 |
GENRE-CMR: Generalizable Deep Learning for Diverse Multi-Domain Cardiac MRI Reconstruction |
Kian Anvari Hamedani et.al. |
2508.20600 |
null |
2025-08-28 |
Disruptive Attacks on Face Swapping via Low-Frequency Perceptual Perturbations |
Mengxiao Huang et.al. |
2508.20595 |
null |
2025-08-28 |
FastFit: Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models |
Zheng Chong et.al. |
2508.20586 |
null |
2025-08-28 |
Persode: Personalized Visual Journaling with Episodic Memory-Aware AI Agent |
Seokho Jin et.al. |
2508.20585 |
null |
2025-08-28 |
SimShear: Sim-to-Real Shear-based Tactile Servoing |
Kipp McAdam Freud et.al. |
2508.20561 |
null |
2025-08-28 |
Equilibria of aggregation-diffusion models with nonlinear potentials |
Francesco Bozzola et.al. |
2508.20523 |
null |
2025-08-28 |
Describe, Don’t Dictate: Semantic Image Editing with Natural Language Intent |
En Ci et.al. |
2508.20505 |
null |
2025-08-28 |
Run-and-tumble particle with diffusion: boundary local times and the zero-diffusion limit |
Paul C Bressloff et.al. |
2508.20473 |
null |
2025-08-28 |
Realistic and Controllable 3D Gaussian-Guided Object Editing for Driving Video Generation |
Jiusi Li et.al. |
2508.20471 |
null |
2025-08-28 |
Breaking Diffusion with Cache: Exploiting Approximate Caches in Diffusion Models |
Desen Sun et.al. |
2508.20424 |
null |
2025-09-01 |
AWorld: Orchestrating the Training Recipe for Agentic AI |
Chengyue Yu et.al. |
2508.20404 |
null |
2025-08-28 |
Mean Field Game with Reflected Jump Diffusion Dynamics: A Linear Programming Approach |
Zongxia Liang et.al. |
2508.20388 |
null |
2025-08-28 |
Do triangles matter? Replicating hypergraph disease dynamics with lower-order interactions |
Eugene Tan et.al. |
2508.20380 |
null |
2025-08-28 |
Audio-Guided Visual Editing with Complex Multi-Modal Prompts |
Hyeonyu Kim et.al. |
2508.20379 |
null |
2025-08-28 |
Numerical Method for Space-Time Fractional Diffusion: A Stochastic Approach |
Tengteng Cui et.al. |
2508.20361 |
null |
2025-08-28 |
Artificial neural network solver for Fokker-Planck and Koopman eigenfunctions |
Max Kreider et.al. |
2508.20339 |
null |
2025-08-27 |
Score-Based Diffusion Models in Infinite Dimensions: A Malliavin Calculus Perspective |
Ehsan Mirafzali et.al. |
2508.20316 |
null |
2025-08-27 |
Efficient ion re-acceleration in laboratory-produced interpenetrating collisionless shocks |
W. Yao et.al. |
2508.20303 |
null |
2025-08-27 |
Out-of-time-order correlators bridge classical transport and quantum dynamics |
Sophia N. Fricke et.al. |
2508.20235 |
null |
2025-08-27 |
Velocity Spectrum Imaging using velocity encoding preparation pulses |
Luis Hernandez-Garcia et.al. |
2508.20218 |
null |
2025-08-27 |
InfinityHuman: Towards Long-Term Audio-Driven Human |
Xiaodi Li et.al. |
2508.20210 |
null |
2025-08-27 |
The structure of the giant radio fossil in the Ophiuchus galaxy cluster |
Simona Giacintucci et.al. |
2508.20190 |
null |
2025-08-27 |
SDiFL: Stable Diffusion-Driven Framework for Image Forgery Localization |
Yang Su et.al. |
2508.20182 |
null |
2025-08-27 |
Nonlinear diffusion in relativistic kinetic theory |
Simone Calogero et.al. |
2508.20147 |
null |
2025-08-27 |
MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation |
Kang-Hyun Lee et.al. |
2508.20138 |
null |
2025-08-27 |
Discrete-Guided Diffusion for Scalable and Safe Multi-Robot Motion Planning |
Jinhao Liang et.al. |
2508.20095 |
null |
2025-08-27 |
AudioStory: Generating Long-Form Narrative Audio with Large Language Models |
Yuxin Guo et.al. |
2508.20088 |
null |
2025-08-27 |
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies |
Zhixuan Liang et.al. |
2508.20072 |
null |
2025-08-27 |
A unique solution to overcome the barriers to planetesimal formation at low dust-to-gas ratio |
H. Meheut et.al. |
2508.20070 |
null |
2025-08-27 |
Neural Conditional Simulation for Complex Spatial Processes |
Julia Walchessen et.al. |
2508.20067 |
null |
2025-08-27 |
Joint Analysis of HI Absorption Zeeman Measurements and the Morphology of Filamentary HI Emission |
Marta Nowotka et.al. |
2508.20065 |
null |
2025-08-27 |
Wave coarsening drives time crystallization in active solids |
Jonas Veenstra et.al. |
2508.20052 |
null |
2025-08-27 |
GS: Generative Segmentation via Label Diffusion |
Yuhao Chen et.al. |
2508.20020 |
null |
2025-08-27 |
Diffusion Language Models Know the Answer Before Decoding |
Pengxiang Li et.al. |
2508.19982 |
null |
2025-08-27 |
The Information Dynamics of Generative Diffusion |
Luca Ambrogioni et.al. |
2508.19897 |
null |
2025-08-27 |
Quantum latent distributions in deep generative models |
Omar Bacarreza et.al. |
2508.19857 |
null |
2025-08-28 |
Ego-centric Predictive Model Conditioned on Hand Trajectories |
Binjie Zhang et.al. |
2508.19852 |
null |
2025-08-27 |
Physics-Informed DeepONet Coupled with FEM for Convective Transport in Porous Media with Sharp Gaussian Sources |
Erdi Kara et.al. |
2508.19847 |
null |
2025-08-27 |
Exotic rheology of materials with active rearrangements |
Aondoyima Ioratim-Uba et.al. |
2508.19844 |
null |
2025-08-27 |
Not Every Gift Comes in Gold Paper or with a Red Ribbon: Exploring Color Perception in Text-to-Image Models |
Shay Shomer Chai et.al. |
2508.19791 |
null |
2025-08-27 |
StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation |
Xiuchao Wu et.al. |
2508.19789 |
null |
2025-08-27 |
Fast 3D Diffusion for Scalable Granular Media Synthesis |
Muhammad Moeeze Hassan et.al. |
2508.19752 |
null |
2025-08-27 |
Fractal Flow: Hierarchical and Interpretable Normalizing Flow via Topic Modeling and Recursive Strategy |
Binhui Zhang et.al. |
2508.19750 |
null |
2025-08-27 |
MC for Gastroretentive Drug Delivery |
Sebastian Lotter et.al. |
2508.19739 |
null |
2025-08-27 |
Synthetic Image Detection via Spectral Gaps of QC-RBIM Nishimori Bethe-Hessian Operators |
V. S. Usatyuk et.al. |
2508.19698 |
null |
2025-08-27 |
MnBr $_2$ on the graphene on Ir(110) substrate: growth, structure, and super-moiré |
Affan Safeer et.al. |
2508.19694 |
null |
2025-08-27 |
Atomistic insights into hydrogen migration in IGZO from machine-learning interatomic potential: linking atomic diffusion to device performance |
Hyunsung Cho et.al. |
2508.19674 |
null |
2025-08-27 |
Multi-value Probabilistic Computing with current-controlled Skyrmion Diffusion |
Thomas B. Winkler et.al. |
2508.19623 |
null |
2025-08-27 |
IELDG: Suppressing Domain-Specific Noise with Inverse Evolution Layers for Domain Generalized Semantic Segmentation |
Qizhe Fan et.al. |
2508.19604 |
null |
2025-08-27 |
Guiding Noisy Label Conditional Diffusion Models with Score-based Discriminator Correction |
Dat Nguyen Cong et.al. |
2508.19581 |
null |
2025-08-28 |
Interact-Custom: Customized Human Object Interaction Image Generation |
Zhu Xu et.al. |
2508.19575 |
null |
2025-08-27 |
Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era |
Dawei Li et.al. |
2508.19570 |
null |
2025-08-27 |
MonoRelief V2: Leveraging Real Data for High-Fidelity Monocular Relief Recovery |
Yu-Wei Zhang et.al. |
2508.19555 |
null |
2025-08-27 |
Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding |
Bowen Sun et.al. |
2508.19529 |
null |
2025-08-27 |
MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment |
Zhiting Gao et.al. |
2508.19527 |
null |
2025-08-27 |
Functionally-graded drug delivery systems with binding reactions: analytical and stochastic approaches for the fraction of drug released |
Obi A. Carwood et.al. |
2508.19510 |
null |
2025-08-27 |
DATR: Diffusion-based 3D Apple Tree Reconstruction Framework with Sparse-View |
Tian Qiu et.al. |
2508.19508 |
null |
2025-08-27 |
Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite Imagery |
Xiangxu Wang et.al. |
2508.19499 |
null |
2025-08-27 |
Towards 6G Intelligence: The Role of Generative AI in Future Wireless Networks |
Muhammad Ahmed Mohsin et.al. |
2508.19495 |
null |
2025-08-26 |
MRExtrap: Longitudinal Aging of Brain MRIs using Linear Modeling in Latent Space |
Jaivardhan Kapoor et.al. |
2508.19482 |
null |
2025-08-26 |
Shining light on degeneracies and uncertainties in quantifying both exchange and restriction with time-dependent diffusion MRI using Bayesian inference |
Maëliss Jallais et.al. |
2508.19478 |
null |
2025-08-26 |
Hydrodynamic Limit of the Symmetric Zero-Range Process with Slow Boundary |
Oslenne Araújo et.al. |
2508.19447 |
null |
2025-08-26 |
On Surjectivity of Neural Networks: Can you elicit any behavior from your model? |
Haozhe Jiang et.al. |
2508.19445 |
null |
2025-08-26 |
Efficiently Generating Multidimensional Calorimeter Data with Tensor Decomposition Parameterization |
Paimon Goulart et.al. |
2508.19443 |
null |
2025-08-26 |
Quantification of mobile ions in perovskite solar cells with thermally activated ion current measurements |
Moritz C. Schmidt et.al. |
2508.19403 |
null |
2025-08-26 |
DETNO: A Diffusion-Enhanced Transformer Neural Operator for Long-Term Traffic Forecasting |
Owais Ahmad et.al. |
2508.19389 |
null |
2025-08-26 |
Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in multimodal LLMs |
Supratik Sarkar et.al. |
2508.19366 |
null |
2025-08-28 |
MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation |
Ming Chen et.al. |
2508.19320 |
null |
2025-08-26 |
Disorder-induced proximate quantum spin ice phase in Pr $_2$Sn$_2$O$_7$ |
Yi Luo et.al. |
2508.19248 |
null |
2025-08-26 |
Articulate3D: Zero-Shot Text-Driven 3D Object Posing |
Oishi Deb et.al. |
2508.19244 |
null |
2025-08-26 |
MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation |
Hao Shi et.al. |
2508.19236 |
null |
2025-08-26 |
VibeVoice Technical Report |
Zhiliang Peng et.al. |
2508.19205 |
null |
2025-08-26 |
LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding |
Julian Ost et.al. |
2508.19204 |
null |
2025-08-26 |
Planning-Query-Guided Model Generation for Model-Based Deformable Object Manipulation |
Alex LaGrassa et.al. |
2508.19199 |
null |
2025-08-26 |
All-in-One Slider for Attribute Manipulation in Diffusion Models |
Weixin Ye et.al. |
2508.19195 |
null |
2025-08-26 |
MDD: a Mask Diffusion Detector to Protect Speaker Verification Systems from Adversarial Perturbations |
Yibo Bai et.al. |
2508.19180 |
null |
2025-08-26 |
Stoch-IDENT: New Method and Mathematical Analysis for Identifying SPDEs from Data |
Jianbo Cui et.al. |
2508.19177 |
null |
2025-08-26 |
RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration |
Yan Chen et.al. |
2508.19154 |
null |
2025-08-26 |
Saddle Hierarchy in Dense Associative Memory |
Robin Thériault et.al. |
2508.19151 |
null |
2025-08-26 |
Alloyed cementite (Fe-Ni-Cr) $_3$ C: structure and hyperfine field from DFT calculations and experimental comparison |
Lyudmila V. Dobysheva et.al. |
2508.19148 |
null |
2025-08-26 |
Lattice vacancy migration barriers in Fe-Ni alloys, and why Ni atoms diffuse slowly: An ab initio study |
Adam M. Fisher et.al. |
2508.19124 |
null |
2025-08-26 |
Composition and Alignment of Diffusion Models using Constrained Learning |
Shervin Khalafi et.al. |
2508.19104 |
null |
2025-08-26 |
Evaluation of in vitro antibacterial activity and phytochemical profile of aqueous leaf extract of Asystasia variabilis |
R Wijerathna et.al. |
2508.19049 |
null |
2025-08-26 |
In-vitro Anti-bacterial Activity of Methanol and Aqueous Crude Extracts of Horsfieldia iryaghedhi |
RMHKK Rajapaksha et.al. |
2508.19025 |
null |
2025-08-28 |
STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems |
Gary Simethy et.al. |
2508.19011 |
null |
2025-08-26 |
Detection of Diffuse Radio Emission inside the Supernova Remnant G338.3-0.0 associated with the Gamma-ray Source HESS J1640-465 |
Moaz Abdelmaguid et.al. |
2508.18999 |
null |
2025-08-26 |
Krylov-Veretennikov desomposition for measure-valued processes induced by SDEs with interaction on Riemannian manifolds |
Andrey Dorogovtsev et.al. |
2508.18995 |
null |
2025-08-26 |
Junctional-Fluctuation-Mediated Fluidisation of Multi-Phase Field Epithelial Monolayers |
James N. Graham et.al. |
2508.18987 |
null |
2025-08-26 |
Vanishing Angular Viscosity Limit For Micropolar Fluid Model In $\mathbb{R}_+^2$ : Boundary Layer And Optimal Convergence Rate |
Yinghui Wang et.al. |
2508.18980 |
null |
2025-08-26 |
Linear approximations of large deviations: Cubic diffusion test |
Pelerine Tsobgni Nyawo et.al. |
2508.18977 |
null |
2025-08-26 |
Generative AI in Map-Making: A Technical Exploration and Its Implications for Cartographers |
Claudio Affolter et.al. |
2508.18959 |
null |
2025-08-26 |
Energy-Based Flow Matching for Generating 3D Molecular Structure |
Wenyin Zhou et.al. |
2508.18949 |
null |
2025-08-26 |
Stochastic Forces Enhance Tracer Diffusion in Non-motile Active Matter |
Henry Alston et.al. |
2508.18882 |
null |
2025-08-26 |
Experimental investigation of turbulence and turbulent thermal diffusion in strongly inhomogeneous and anisotropic forced convection |
E. Zarbib et.al. |
2508.18865 |
null |
2025-08-26 |
Super and Weak Poincaré Inequalities for Sticky-Reflected Diffusion Processes |
Feng-Yu Wang et.al. |
2508.18846 |
null |
2025-08-26 |
Single-Photon Detection in Few-Layer NbSe $_2$ Superconducting Nanowires |
Lucio Zugliani et.al. |
2508.18843 |
null |
2025-08-26 |
Quantum-Circuit-Based Visual Fractal Image Generation in Qiskit and Analytics |
Hillol Biswas et.al. |
2508.18835 |
null |
2025-08-26 |
On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation |
Adrian Meise et.al. |
2508.18833 |
null |
2025-08-26 |
Asymptotic limit of a vector-valued Allen-Cahn equation for phase transition dynamics |
Huan Dong et.al. |
2508.18754 |
null |
2025-08-26 |
Joint Time-Position Statistics and Fisher Information in Drift-Diffusion Molecular Channels |
Yun-Feng Lo et.al. |
2508.18680 |
null |
2025-08-26 |
ROSE: Remove Objects with Side Effects in Videos |
Chenxuan Miao et.al. |
2508.18633 |
null |
2025-08-26 |
Wan-S2V: Audio-Driven Cinematic Video Generation |
Xin Gao et.al. |
2508.18621 |
null |
2025-08-26 |
SemLayoutDiff: Semantic Layout Generation with Diffusion Model for Indoor Scene Synthesis |
Xiaohao Sun et.al. |
2508.18597 |
null |
2025-08-26 |
Search for the radiative decay of the cosmic neutrino background through spectral measurements of the cosmic infrared background using PRIMA |
Yuji Takeuchi et.al. |
2508.18590 |
null |
2025-08-25 |
Controllable Single-shot Animation Blending with Temporal Conditioning |
Eleni Tselepi et.al. |
2508.18525 |
null |
2025-08-25 |
VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results |
Sizhuo Ma et.al. |
2508.18445 |
null |
2025-08-25 |
Phase-Field Model of Freeze Casting |
Kaihua Ji et.al. |
2508.18416 |
null |
2025-08-25 |
Hillas meets Eddington: the case for blazars as ultra-high-energy neutrino sources |
Xavier Rodrigues et.al. |
2508.18345 |
null |
2025-08-25 |
ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models |
Haitang Feng et.al. |
2508.18271 |
null |
2025-08-25 |
SafeBimanual: Diffusion-based Trajectory Optimization for Safe Bimanual Manipulation |
Haoyuan Deng et.al. |
2508.18268 |
null |
2025-08-25 |
Diffusiophoretic corner flows |
Dobromir Nowak et.al. |
2508.18233 |
null |
2025-08-25 |
Follow My Hold: Hand-Object Interaction Reconstruction through Geometric Guidance |
Ayce Idil Aytekin et.al. |
2508.18213 |
null |
2025-08-25 |
New shell-model calculations of the $δ_C$ correction to superallowed $0^+\rightarrow0^+$ nuclear $β$ decay and standard-model implications |
L. Xayavong et.al. |
2508.18189 |
null |
2025-08-25 |
SpotEdit: Evaluating Visually-Guided Image Editing Methods |
Sara Ghazanfari et.al. |
2508.18159 |
null |
2025-08-25 |
Learning from Few Samples: A Novel Approach for High-Quality Malcode Generation |
Haijian Ma et.al. |
2508.18148 |
null |
2025-08-25 |
Incorporating Pre-trained Diffusion Models in Solving the Schrödinger Bridge Problem |
Zhicong Tang et.al. |
2508.18095 |
null |
2025-08-26 |
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation |
Yaqi Li et.al. |
2508.18032 |
null |
2025-08-25 |
HD 28471: a near-resonant compact multiplanet system with a possible cold giant planet |
A. T. Stevenson et.al. |
2508.18000 |
null |
2025-08-26 |
Solute dispersion in axially strained tube flows: Large-time asymptotics and Ornstein-Uhlenbeck Gaussian profiles |
Prabakaran Rajamanickam et.al. |
2508.17982 |
null |
2025-08-25 |
Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech |
Dimme de Groot et.al. |
2508.17980 |
null |
2025-08-26 |
Generative Feature Imputing – A Technique for Error-resilient Semantic Communication |
Jianhao Huang et.al. |
2508.17957 |
null |
2025-08-25 |
Nodal error behind discrepancies between coupled cluster and diffusion Monte Carlo: AcOH dimer case study |
S. Lambie et.al. |
2508.17937 |
null |
2025-08-25 |
Parallel Nodal Interior-Penalty Discontinuous Galerkin Methods for the Subsonic Compressible Navier-Stokes Equations: Applications to Vortical Flows and VIV Problems |
Spiros Zafeiris et.al. |
2508.17917 |
null |
2025-08-25 |
Quasi-likelihood inference for SDE with mixed-effects observed at high frequency |
Maud Delattre et.al. |
2508.17910 |
null |
2025-08-25 |
Local Well-Posedness of the Cahn-Hilliard-Biot System |
Helmut Abels et.al. |
2508.17893 |
null |
2025-08-27 |
Vocoder-Projected Feature Discriminator |
Takuhiro Kaneko et.al. |
2508.17874 |
null |
2025-08-25 |
FasterVoiceGrad: Faster One-step Diffusion-Based Voice Conversion with Adversarial Diffusion Conversion Distillation |
Takuhiro Kaneko et.al. |
2508.17868 |
null |
2025-08-25 |
Diffusion-Based Data Augmentation for Medical Image Segmentation |
Maham Nazir et.al. |
2508.17844 |
null |
2025-08-25 |
Threshold Diffusions |
Lina Ji et.al. |
2508.17812 |
null |
2025-08-25 |
CEIDM: A Controlled Entity and Interaction Diffusion Model for Enhanced Text-to-Image Generation |
Mingyue Yang et.al. |
2508.17760 |
null |
2025-08-25 |
SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling |
Fanjiang Ye et.al. |
2508.17756 |
null |
2025-08-25 |
DiffusionGS: Generative Search with Query Conditioned Diffusion in Kuaishou |
Qinyao Li et.al. |
2508.17754 |
null |
2025-08-25 |
Few-shot Human Action Anomaly Detection via a Unified Contrastive Learning Framework |
Koichiro Kamide et.al. |
2508.17726 |
null |
2025-08-25 |
Instant Preference Alignment for Text-to-Image Diffusion Models |
Yang Li et.al. |
2508.17718 |
null |
2025-08-25 |
CATformer: Contrastive Adversarial Transformer for Image Super-Resolution |
Qinyi Tian et.al. |
2508.17708 |
null |
2025-08-25 |
On the Edge of Memorization in Diffusion Models |
Sam Buchanan et.al. |
2508.17689 |
null |
2025-08-25 |
Calculating the power spectrum in stochastic inflation by Monte Carlo simulation and least squares curve fitting |
Koichi Miyamoto et.al. |
2508.17654 |
null |
2025-08-27 |
ControlEchoSynth: Boosting Ejection Fraction Estimation Models via Controlled Video Diffusion |
Nima Kondori et.al. |
2508.17631 |
null |
2025-08-25 |
Effects of Near-Field Hydrodynamic Interactions on Bacterial Dynamics Near a Solid Surface |
Baopi Liu et.al. |
2508.17626 |
null |
2025-08-25 |
Steering When Necessary: Flexible Steering Large Language Models with Backtracking |
Jinwei Gan et.al. |
2508.17621 |
null |
2025-08-25 |
Preference Trajectory Modeling via Flow Matching for Sequential Recommendation |
Li Li et.al. |
2508.17618 |
null |
2025-08-25 |
JCo-MVTON: Jointly Controllable Multi-Modal Diffusion Transformer for Mask-Free Virtual Try-on |
Aowen Wang et.al. |
2508.17614 |
null |
2025-08-25 |
HotSpotter - Patterned Species Instance Recognition |
Jonathan P. Crall et.al. |
2508.17605 |
null |
2025-08-25 |
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation |
Guanxing Lu et.al. |
2508.17600 |
null |
2025-08-25 |
HERO: Hierarchical Extrapolation and Refresh for Efficient World Models |
Quanjian Song et.al. |
2508.17588 |
null |
2025-08-24 |
Controllability of a system of non-autonomous degenerate coupled parabolic equations |
Alfredo S. Gamboa et.al. |
2508.17546 |
null |
2025-08-24 |
Universal scaling of higher-order cumulants in quantum isotropic spin chains |
Shixian Jiang et.al. |
2508.17535 |
null |
2025-08-24 |
Learning Reaction-Diffusion Kinetics from Mechanical Information |
Royal C. Ihuaenyi et.al. |
2508.17523 |
null |
2025-08-24 |
Variational Shape Inference for Grasp Diffusion on SE(3) |
S. Talha Bukhari et.al. |
2508.17482 |
null |
2025-08-24 |
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation |
Kaiyue Sun et.al. |
2508.17472 |
null |
2025-08-24 |
A Synthetic Dataset for Manometry Recognition in Robotic Applications |
Pedro Antonio Rabelo Saraiva et.al. |
2508.17468 |
null |
2025-08-24 |
Bias Amplification in Stable Diffusion’s Representation of Stigma Through Skin Tones and Their Homogeneity |
Kyra Wilson et.al. |
2508.17465 |
null |
2025-08-24 |
Disentangled Geometry and Appearance for Efficient Multi-View Surface Reconstruction and Rendering |
Qitong Zhang et.al. |
2508.17436 |
null |
2025-08-24 |
An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing |
Zihan Liang et.al. |
2508.17435 |
null |
2025-08-24 |
TinySR: Pruning Diffusion for Real-World Image Super-Resolution |
Linwei Dong et.al. |
2508.17434 |
null |
2025-08-24 |
Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling |
Haochen You et.al. |
2508.17426 |
null |
2025-08-24 |
Asteroid Rotation Periods: Statistical Analysis in the Diameter-Spin Distribution |
Maryam Nastaran et.al. |
2508.17415 |
null |
2025-08-24 |
MoCo: Motion-Consistent Human Video Generation via Structure-Appearance Decoupling |
Haoyu Wang et.al. |
2508.17404 |
null |
2025-08-24 |
Stability and uniqueness of bounded weak solutions to triangular degenerate cross-diffusion systems |
Xiuqing Chen et.al. |
2508.17379 |
null |
2025-08-24 |
ShaLa: Multimodal Shared Latent Space Modelling |
Jiali Cui et.al. |
2508.17376 |
null |
2025-08-24 |
Condition Weaving Meets Expert Modulation: Towards Universal and Controllable Image Generation |
Guoqing Zhang et.al. |
2508.17364 |
null |
2025-08-24 |
DiCache: Let Diffusion Model Determine Its Own Cache |
Jiazi Bu et.al. |
2508.17356 |
null |
2025-08-24 |
ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation |
Yuxuan Song et.al. |
2508.17345 |
null |
2025-08-24 |
Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing |
Tristan S. W. Stevens et.al. |
2508.17326 |
null |
2025-08-24 |
An improved nonlocal electron heat transport model for magnetized plasmas |
Z. H. Chen et.al. |
2508.17309 |
null |
2025-08-24 |
PosBridge: Multi-View Positional Embedding Transplant for Identity-Aware Image Editing |
Peilin Xiong et.al. |
2508.17302 |
null |
2025-08-24 |
FoundDiff: Foundational Diffusion Model for Generalizable Low-Dose CT Denoising |
Zhihao Chen et.al. |
2508.17299 |
null |
2025-08-24 |
4D Visual Pre-training for Robot Learning |
Chengkai Hou et.al. |
2508.17230 |
null |
2025-08-24 |
Multi-Metric Preference Alignment for Generative Speech Restoration |
Junan Zhang et.al. |
2508.17229 |
null |
2025-08-24 |
Effects of Geometric configuration in relativistic isobaric collisions at $\sqrt{s_{NN}}=200$ GeV |
Akash Das et.al. |
2508.17227 |
null |
2025-08-24 |
MMCIG: Multimodal Cover Image Generation for Text-only Documents and Its Dataset Construction via Pseudo-labeling |
Hyeyeon Kim et.al. |
2508.17199 |
null |
2025-08-23 |
Generative AI for Multimedia Communication: Recent Advances, An Information-Theoretic Framework, and Future Opportunities |
Yili Jin et.al. |
2508.17163 |
null |
2025-08-23 |
SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks |
Zhenliang Gan et.al. |
2508.17121 |
null |
2025-08-23 |
CP4SBI: Local Conformal Calibration of Credible Sets in Simulation-Based Inference |
Luben M. C. Cabezas et.al. |
2508.17077 |
null |
2025-08-23 |
LaGarNet: Goal-Conditioned Recurrent State-Space Models for Pick-and-Place Garment Flattening |
Halid Abdulrahim Kadi et.al. |
2508.17070 |
null |
2025-08-23 |
SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation |
Peng Hu et.al. |
2508.17062 |
null |
2025-08-23 |
PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models |
Xianjing Cheng et.al. |
2508.17050 |
null |
2025-08-23 |
Styleclone: Face Stylization with Diffusion Based Data Augmentation |
Neeraj Matiyali et.al. |
2508.17045 |
null |
2025-08-23 |
A Novel Local Focusing Mechanism for Deepfake Detection Generalization |
Mingliang Li et.al. |
2508.17029 |
null |
2025-08-23 |
Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation |
Konstantina Nikolaidou et.al. |
2508.17017 |
null |
2025-08-23 |
An improved lattice Boltzmann method with a novel conservative boundary scheme for viscoelastic fluid flows |
Yuan Yu et.al. |
2508.16997 |
null |
2025-08-23 |
Score Matching on Large Geometric Graphs for Cosmology Generation |
Diana-Alexandra Onutu et.al. |
2508.16990 |
null |
2025-08-23 |
HiCache: Training-free Acceleration of Diffusion Models via Hermite Polynomial-based Feature Caching |
Liang Feng et.al. |
2508.16984 |
null |
2025-08-23 |
Shape optimization problems with random coefficients via the penalty method |
Xiaowei Pang et.al. |
2508.16961 |
null |
2025-08-23 |
RPD-Diff: Region-Adaptive Physics-Guided Diffusion Model for Visibility Enhancement under Dense and Non-Uniform Haze |
Ruicheng Zhang et.al. |
2508.16956 |
null |
2025-08-23 |
Drive As You Like: Strategy-Level Motion Planning Based on A Multi-Head Diffusion Model |
Fan Ding et.al. |
2508.16947 |
null |
2025-08-23 |
Sig-DEG for Distillation: Making Diffusion Models Faster and Lighter |
Lei Jiang et.al. |
2508.16939 |
null |
2025-08-23 |
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation |
Sizhe Shan et.al. |
2508.16930 |
null |
2025-08-23 |
Structural Energy-Guided Sampling for View-Consistent Text-to-3D |
Qing Zhang et.al. |
2508.16917 |
null |
2025-08-23 |
Remarks on the three-dimensional Navier-Stokes equations with Lions’ exponent forced by space-time white noise |
Kazuo Yamazaki et.al. |
2508.16906 |
null |
2025-08-23 |
Enhanced shape recovery in advection–diffusion problems via a novel ADMM-based CCBM optimization |
Elmehdi Cherrat et.al. |
2508.16898 |
null |
2025-08-23 |
Generating Synthetic Contrast-Enhanced Chest CT Images from Non-Contrast Scans Using Slice-Consistent Brownian Bridge Diffusion Network |
Pouya Shiri et.al. |
2508.16897 |
null |
2025-08-23 |
Delta-SVD: Efficient Compression for Personalized Text-to-Image Models |
Tangyuan Zhang et.al. |
2508.16863 |
null |
2025-08-23 |
Subtleties of UV-crosslinking in microfluidic particle fabrication: UV dosage and intensity matter |
Sabrina Marnoto et.al. |
2508.16862 |
null |
2025-08-23 |
Intelligent Shanghai Typhoon Model (ISTM): A generative probabilistic emulator for typhoon hybrid modeling |
Zeyi Niu et.al. |
2508.16851 |
null |
2025-08-23 |
NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows |
Denis Tarasov et.al. |
2508.16845 |
null |
2025-08-22 |
A Fluctuating Hydrodynamics Model for Nanoscale Surfactant-laden Interfaces |
John B. Bell et.al. |
2508.16820 |
null |
2025-08-22 |
Two-Step Bose-Einstein Condensation of an ideal Magnetized Charged Bosonic gas under neutron star-like conditions |
Amanda Castillo Ayon et.al. |
2508.16799 |
null |
2025-08-22 |
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling |
Yuancheng Wang et.al. |
2508.16790 |
null |
2025-08-22 |
Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data |
Stefania L. Moroianu et.al. |
2508.16783 |
null |
2025-08-26 |
Characterising the short-orbital period X-ray transient Swift J1910.2-0546 |
J. M. Corral-Santana et.al. |
2508.16775 |
null |
2025-08-22 |
Spontaneous spiral patterns etched on Germanium |
Yilin Wong et.al. |
2508.16764 |
null |
2025-08-22 |
A Framework for Benchmarking Fairness-Utility Trade-offs in Text-to-Image Models via Pareto Frontiers |
Marco N. Bochernitsan et.al. |
2508.16752 |
null |
2025-08-22 |
Hamiltonian Simulation for Advection-Diffusion Equation with arbitrary transport field |
Niladri Gomes et.al. |
2508.16728 |
null |
2025-08-22 |
MV-RAG: Retrieval Augmented Multiview Diffusion |
Yosef Dayani et.al. |
2508.16577 |
null |
2025-08-22 |
Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution |
Tainyi Zhang et.al. |
2508.16557 |
null |
2025-08-22 |
Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning |
Xuan Zhang et.al. |
2508.16524 |
null |
2025-08-22 |
Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation |
Zhijian Zhou et.al. |
2508.16521 |
null |
2025-08-22 |
ARSP: Automated Repair of Verilog Designs via Semantic Partitioning |
Bingkun Yao et.al. |
2508.16517 |
null |
2025-08-22 |
Seeing Clearly, Forgetting Deeply: Revisiting Fine-Tuned Video Generators for Driving Simulation |
Chun-Peng Chang et.al. |
2508.16512 |
null |
2025-08-22 |
Underdamped Langevin MCMC with third order convergence |
Maximilian Scott et.al. |
2508.16485 |
null |
2025-08-22 |
Large-scale concentration and relaxation for mean-field Langevin particle systems |
Songbo Wang et.al. |
2508.16428 |
null |
2025-08-22 |
Multiscale Growth Kinetics of Model Biomolecular Condensates Under Passive and Active Conditions |
Tamizhmalar Sundararajan et.al. |
2508.16398 |
null |
2025-08-22 |
Parrondo paradox in quantum image encryption |
Łukasz Pawela et.al. |
2508.16382 |
null |
2025-08-22 |
Observation of negative orbital torque from Vanadium |
Nikhil Vijayan et.al. |
2508.16339 |
null |
2025-08-22 |
A Sharp KL-Convergence Analysis for Diffusion Models under Minimal Assumptions |
Nishant Jain et.al. |
2508.16306 |
null |
2025-08-22 |
Towards Diagnostic Quality Flat-Panel Detector CT Imaging Using Diffusion Models |
Hélène Corbaz et.al. |
2508.16252 |
null |
2025-08-22 |
Numerical solution of the time fractional nonlinear Fisher-KPP diffusion-reaction equation using the local domain boundary element method |
Theodore V. Gortsas et.al. |
2508.16241 |
null |
2025-08-22 |
UniEM-3M: A Universal Electron Micrograph Dataset for Microstructural Segmentation and Generation |
Nan wang et.al. |
2508.16239 |
null |
2025-08-22 |
PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting |
Hohyun Na et.al. |
2508.16217 |
null |
2025-08-22 |
OmniCache: A Trajectory-Oriented Global Perspective on Training-Free Cache Reuse for Diffusion Transformer Models |
Huanpeng Chu et.al. |
2508.16212 |
null |
2025-08-22 |
Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers |
Shikang Zheng et.al. |
2508.16211 |
null |
2025-08-22 |
Competition and Attraction Improve Model Fusion |
João Abrantes et.al. |
2508.16204 |
null |
2025-08-22 |
FuXi-TC: A generative framework integrating deep learning and physics-based models for improved tropical cyclone forecasts |
Shan Guo et.al. |
2508.16168 |
null |
2025-08-22 |
Transport Properties of QGP within a Bayesian Holographic QCD Model |
Bing Chen et.al. |
2508.16167 |
null |
2025-08-22 |
RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution |
Haodong He et.al. |
2508.16158 |
null |
2025-08-22 |
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models |
Yi Zhang et.al. |
2508.16154 |
null |
2025-08-22 |
Machine Learning for Medicine Must Be Interpretable, Shareable, Reproducible and Accountable by Design |
Ayyüce Begüm Bektaş et.al. |
2508.16097 |
null |
2025-08-22 |
Two-flow Feedback Multi-scale Progressive Generative Adversarial Network |
Sun Weikai et.al. |
2508.16089 |
null |
2025-08-22 |
A Unified Voxel Diffusion Module for Point Cloud 3D Object Detection |
Qifeng Liu et.al. |
2508.16069 |
null |
2025-08-21 |
Clinically-Informed Preprocessing Improves Stroke Segmentation in Low-Resource Settings |
Juampablo E. Heras Rivera et.al. |
2508.16004 |
null |
2025-08-21 |
Multiscale Analysis of a Kinetic Model of Confined Suspensions of Self-Propelled Rods |
Leonid Berlyand et.al. |
2508.16003 |
null |
2025-08-21 |
Universal Fluctuations in the Tail Probability for d=2 Random Walks in Space-Time Random Environments |
Franscesca Ark et.al. |
2508.15999 |
null |
2025-08-21 |
Diverse Signer Avatars with Manual and Non-Manual Feature Modelling for Sign Language Production |
Mohamed Ilyes Lakhal et.al. |
2508.15988 |
null |
2025-08-21 |
UnPose: Uncertainty-Guided Diffusion Priors for Zero-Shot Pose Estimation |
Zhaodong Jiang et.al. |
2508.15972 |
null |
2025-08-21 |
Physical blowups via buffered time change in a mean-field neural network |
Nikolaos Papadopoulos et.al. |
2508.15961 |
null |
2025-08-21 |
Structure-Preserving Medical Image Generation from a Latent Graph Representation |
Kevin Arias et.al. |
2508.15920 |
null |
2025-08-21 |
Text-Driven 3D Hand Motion Generation from Sign Language Data |
Léore Bensabath et.al. |
2508.15902 |
null |
2025-08-21 |
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning |
Yijun Liu et.al. |
2508.15874 |
null |
2025-08-21 |
CineScale: Free Lunch in High-Resolution Cinematic Visual Generation |
Haonan Qiu et.al. |
2508.15774 |
null |
2025-08-21 |
Scaling Group Inference for Diverse and High-Quality Generation |
Gaurav Parmar et.al. |
2508.15773 |
null |
2025-08-21 |
Visual Autoregressive Modeling for Instruction-Guided Image Editing |
Qingyang Mao et.al. |
2508.15772 |
null |
2025-08-21 |
Waver: Wave Your Way to Lifelike Video Generation |
Yifu Zhang et.al. |
2508.15761 |
null |
2025-08-21 |
Skyrmion Lattice Order Controlled by Confinement Geometry |
Raphael Gruber et.al. |
2508.15758 |
null |
2025-08-21 |
Spatial Super-Infection and Co-Infection Dynamics in Networks |
Alyssa Yu et.al. |
2508.15740 |
null |
2025-08-21 |
Probability Density from Latent Diffusion Models for Out-of-Distribution Detection |
Joonas Järve et.al. |
2508.15737 |
null |
2025-08-21 |
The Status of the Astrophysical Parameters of Upper Main Sequence Stars |
Lukas Kueß et.al. |
2508.15722 |
null |
2025-08-21 |
WorldWeaver: Generating Long-Horizon Video Worlds via Rich Perception |
Zhiheng Liu et.al. |
2508.15720 |
null |
2025-08-21 |
Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation |
Nikita Kachaev et.al. |
2508.15663 |
null |
2025-08-21 |
When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding |
Pengcheng Fang et.al. |
2508.15641 |
null |
2025-08-21 |
Are Virtual DES Images a Valid Alternative to the Real Ones? |
Ana C. Perre et.al. |
2508.15594 |
null |
2025-08-21 |
Lattice distortions and non-sluggish diffusion in BCC refractory high entropy alloys |
Jingfeng Zhang et.al. |
2508.15558 |
null |
2025-08-21 |
Dream 7B: Diffusion Large Language Models |
Jiacheng Ye et.al. |
2508.15487 |
null |
2025-08-21 |
Reevaluating Anomalous Electric Fields at the Air-Water Interface: A Surface-Specific Spectroscopic Survey |
Joseph C. Shirley et.al. |
2508.15422 |
null |
2025-08-21 |
Speckle suppression in digital in-line holographic microscopy through liquid crystal dynamic scattering |
Emilia Wdowiak et.al. |
2508.15419 |
null |
2025-08-21 |
Numerical Analysis of Unsupervised Learning Approaches for Parameter Identification in PDEs |
Siyu Cen et.al. |
2508.15381 |
null |
2025-08-21 |
Diffusion-driven pattern formation in an opinion dynamical network model |
Tim Mauch et.al. |
2508.15377 |
null |
2025-08-21 |
Performance Analysis of RIS-Aided High-Mobility Wireless Systems |
Hanwen Hu et.al. |
2508.15375 |
null |
2025-08-22 |
Analytical Theory of Chiral Active Particle Transport in a Fluctuating Density Field |
Jayam Joshi et.al. |
2508.15366 |
null |
2025-08-21 |
The effect of multi-occupancy traps on the diffusion and retention of multiple hydrogen isotopes in irradiated tungsten and vanadium |
Sanjeet Kaur et.al. |
2508.15341 |
null |
2025-08-21 |
Discovering correlations between metal foam thermal characteristics and non-Fourier behavior |
Anna Fehér et.al. |
2508.15340 |
null |
2025-08-21 |
Interface fluctuations for $1$ D stochastic Allen-Cahn equation – singular regime |
Weijun Xu et.al. |
2508.15319 |
null |
2025-08-21 |
VideoEraser: Concept Erasure in Text-to-Video Diffusion Models |
Naen Xu et.al. |
2508.15314 |
null |
2025-08-21 |
HIP: Model-Agnostic Hypergraph Influence Prediction via Distance-Centrality Fusion and Neural ODEs |
Su-Su Zhang et.al. |
2508.15312 |
null |
2025-08-21 |
Modeling Long-term User Behaviors with Diffusion-driven Multi-interest Network for CTR Prediction |
Weijiang Lai et.al. |
2508.15311 |
null |
2025-08-21 |
Contribution of Globular Clusters to Diffuse Gamma-ray Emission from Galactic Plane |
Jiayin He et.al. |
2508.15295 |
null |
2025-08-21 |
Optimizing Compilation for Distributed Quantum Computing via Clustering and Annealing |
Ruilin Zhou et.al. |
2508.15267 |
null |
2025-08-21 |
Pathology-Informed Latent Diffusion Model for Anomaly Detection in Lymph Node Metastasis |
Jiamu Wang et.al. |
2508.15236 |
null |
2025-08-21 |
Pretrained Diffusion Models Are Inherently Skipped-Step Samplers |
Wenju Xu et.al. |
2508.15233 |
null |
2025-08-21 |
Collaborative Multi-Modal Coding for High-Quality 3D Generation |
Ziang Cao et.al. |
2508.15228 |
null |
2025-08-21 |
GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design |
Wen-Fan Wang et.al. |
2508.15227 |
null |
2025-08-21 |
A rutile-based homologous series Na(PtO $2$)${2\it{n}+1}$ discovered by computationally assisted high-pressure synthesis |
Yasuhito Kobayashi et.al. |
2508.15223 |
null |
2025-08-21 |
See it. Say it. Sorted: Agentic System for Compositional Diagram Generation |
Hantao Zhang et.al. |
2508.15222 |
null |
2025-08-21 |
Obstacle-tuned transition from chaotic to coherent vortex flows and odd diffusion in chiral active fluids |
Joscha Mecke et.al. |
2508.15210 |
null |
2025-08-21 |
Quantum Differential Equation Solvers with Low State Preparation Cost: Eliminating the Time Dependence in Dissipative Equations |
Gengzhi Yang et.al. |
2508.15170 |
null |
2025-08-21 |
MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion |
Xuyang Chen et.al. |
2508.15169 |
null |
2025-08-21 |
Zero-shot Volumetric CT Super-Resolution using 3D Gaussian Splatting with Upsampled 2D X-ray Projection Priors |
Jeonghyun Noh et.al. |
2508.15151 |
null |
2025-08-21 |
Electron-Ion Equilibration in the Merging Galaxy Cluster Abell 665 |
Christian Norseth et.al. |
2508.15138 |
null |
2025-08-24 |
Side Effects of Erasing Concepts from Diffusion Models |
Shaswati Saha et.al. |
2508.15124 |
null |
2025-08-20 |
Microstructural and preliminary optical and microwave characterization of erbium doped CaMoO $_4$ thin films |
Ignas Masiulionis et.al. |
2508.15122 |
null |
2025-08-24 |
CurveFlow: Curvature-Guided Flow Matching for Image Generation |
Yan Luo et.al. |
2508.15093 |
null |
2025-08-20 |
Sampling by averaging: A multiscale approach to score estimation |
Paula Cordero-Encinar et.al. |
2508.15069 |
null |
2025-08-20 |
Asymptotic analysis on narrow tubes: narrow escape problems and diffusion processes |
Wen-Tai Hsu et.al. |
2508.15060 |
null |
2025-08-20 |
Correlating Particle Acceleration Rates with Plasma Conditions in Colliding Wind Binaries |
Gislaine B Cordeiro et.al. |
2508.15059 |
null |
2025-08-20 |
An MRI Atlas of the Human Fetal Brain: Reference and Segmentation Tools for Fetal Brain MRI Analysis |
Mahdi Bagheri et.al. |
2508.15034 |
null |
2025-08-20 |
Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement |
Chunming He et.al. |
2508.15027 |
null |
2025-08-20 |
TAIGen: Training-Free Adversarial Image Generation via Diffusion Models |
Susim Roy et.al. |
2508.15020 |
null |
2025-08-20 |
Probing Magnetic Properties of RuO $_{2}$ Heterostructures Through the Ferromagnetic Layer |
Frank M. Abel et.al. |
2508.15004 |
null |
2025-08-20 |
LyLA-Therm: Lyapunov-based Langevin Adaptive Thermodynamic Neural Network Controller |
Saiedeh Akbari et.al. |
2508.14989 |
null |
2025-08-20 |
Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System |
Joydeep Chandra et.al. |
2508.14976 |
null |
2025-08-20 |
Potential and challenges of generative adversarial networks for super-resolution in 4D Flow MRI |
Oliver Welin Odeback et.al. |
2508.14950 |
null |
2025-08-19 |
Inference Time Debiasing Concepts in Diffusion Models |
Lucas S. Kupssinskü et.al. |
2508.14933 |
null |
2025-08-19 |
TOM: An Open-Source Tongue Segmentation Method with Multi-Teacher Distillation and Task-Specific Data Augmentation |
Jiacheng Xie et.al. |
2508.14932 |
null |
2025-08-20 |
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs |
Haokun Lin et.al. |
2508.14896 |
null |
2025-08-20 |
Virtual Community: An Open World for Humans, Robots, and Society |
Qinhong Zhou et.al. |
2508.14893 |
null |
2025-08-20 |
Squeezed Diffusion Models |
Jyotirmai Singh et.al. |
2508.14871 |
null |
2025-08-20 |
Critical trajectories in kinetic geometry |
Helge Dietert et.al. |
2508.14868 |
null |
2025-08-20 |
Universal winding properties of chiral active motion |
Ion Santra et.al. |
2508.14862 |
null |
2025-08-20 |
Physics-Informed ML Exploration of Structure-Transport Relationships in Hard Carbon |
Nikhil Rampal et.al. |
2508.14849 |
null |
2025-08-20 |
TransLight: Image-Guided Customized Lighting Control with Generative Decoupling |
Zongming Li et.al. |
2508.14814 |
null |
2025-08-20 |
Tinker: Diffusion’s Gift to 3D–Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization |
Canyu Zhao et.al. |
2508.14811 |
null |
2025-08-20 |
Cross-Modality Controlled Molecule Generation with Diffusion Language Model |
Yunzhe Zhang et.al. |
2508.14748 |
null |
2025-08-20 |
Modeling the impact of temperature and bird migration on the spread of West Nile virus |
Pride Duve et.al. |
2508.14740 |
null |
2025-08-20 |
GSFix3D: Diffusion-Guided Repair of Novel Views in Gaussian Splatting |
Jiaxin Wei et.al. |
2508.14717 |
null |
2025-08-20 |
The heating and cooling of 2D electrons at low temperatures |
A. K. Jain et.al. |
2508.14694 |
null |
2025-08-20 |
Virtual Multiplex Staining for Histological Images using a Marker-wise Conditioned Diffusion Model |
Hyun-Jic Oh et.al. |
2508.14681 |
null |
2025-08-21 |
Phase space transport, quasilinear diffusion and locality in phase velocity |
Didier Bénisti et.al. |
2508.14657 |
null |
2025-08-20 |
AnchorSync: Global Consistency Optimization for Long Video Editing |
Zichi Liu et.al. |
2508.14609 |
null |
2025-08-20 |
Call Option Price using Pearson Diffusion Processes |
Tapan Kar et.al. |
2508.14577 |
null |
2025-08-20 |
Minimizing Task-Oriented Age of Information for Remote Monitoring with Pre-Identification |
Shuying Gan et.al. |
2508.14575 |
null |
2025-08-20 |
EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement |
Bin Wen et.al. |
2508.14525 |
null |
2025-08-20 |
SATURN: Autoregressive Image Generation Guided by Scene Graphs |
Thanh-Nhan Vo et.al. |
2508.14502 |
null |
2025-08-20 |
Multimode Fiber Imaging Based on Hydrogel Fiber |
Lele He et.al. |
2508.14501 |
null |
2025-08-20 |
DGenCTR: Towards a Universal Generative Paradigm for Click-Through Rate Prediction via Discrete Diffusion |
Moyu Zhang et.al. |
2508.14500 |
null |
2025-08-20 |
Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration |
Haoran Bai et.al. |
2508.14483 |
null |
2025-08-20 |
DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing |
Weitao Wang et.al. |
2508.14465 |
null |
2025-08-20 |
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering |
Shanlin Sun et.al. |
2508.14461 |
null |
2025-08-20 |
Early Evolution of the Cavity and Core of a Coronal Mass Ejection in the Inner Corona |
Shuting Li et.al. |
2508.14455 |
null |
2025-08-20 |
FBI: Learning Dexterous In-hand Manipulation with Dynamic Visuotactile Shortcut Policy |
Yijin Chen et.al. |
2508.14441 |
null |
2025-08-20 |
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion |
Fei Peng et.al. |
2508.14440 |
null |
2025-08-20 |
Weakly-Convex Regularization for Magnetic Resonance Image Denoising |
Akash Prabakar et.al. |
2508.14438 |
null |
2025-08-20 |
FOCUS: Frequency-Optimized Conditioning of DiffUSion Models for mitigating catastrophic forgetting during Test-Time Adaptation |
Gabriel Tjio et.al. |
2508.14437 |
null |
2025-08-20 |
HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation |
Bing Han et.al. |
2508.14431 |
null |
2025-08-20 |
Disentanglement in T-space for Faster and Distributed Training of Diffusion Models with Fewer Latent-states |
Samarth Gupta et.al. |
2508.14413 |
null |
2025-08-20 |
A Real-world Display Inverse Rendering Dataset |
Seokjun Choi et.al. |
2508.14411 |
null |
2025-08-20 |
CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities |
Yue Gong et.al. |
2508.14405 |
null |
2025-08-20 |
Img2ST-Net: Efficient High-Resolution Spatial Omics Prediction from Whole Slide Histology Images via Fully Convolutional Image-to-Image Learning |
Junchao Zhu et.al. |
2508.14393 |
null |
2025-08-20 |
Physics-Constrained Diffusion Reconstruction with Posterior Correction for Quantitative and Fast PET Imaging |
Yucun Hou et.al. |
2508.14364 |
null |
2025-08-20 |
Organ-Agents: Virtual Human Physiology Simulator via LLMs |
Rihao Chang et.al. |
2508.14357 |
null |
2025-08-20 |
SBGD: Improving Graph Diffusion Generative Model via Stochastic Block Diffusion |
Junwei Su et.al. |
2508.14352 |
null |
2025-08-20 |
A Non-Asymptotic Convergent Analysis for Scored-Based Graph Generative Model via a System of Stochastic Differential Equations |
Junwei Su et.al. |
2508.14351 |
null |
2025-08-20 |
Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation |
Lingkai Kong et.al. |
2508.14342 |
null |
2025-08-20 |
Modeling oxygen-void interactions in uranium nitride |
Mohamed AbdulHameed et.al. |
2508.14329 |
null |
2025-08-20 |
MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation |
Guile Wu et.al. |
2508.14327 |
null |
2025-08-20 |
Modeling of silver transport in cubic SiC: Integrating molecular dynamics, bounds averaging, and uncertainty quantification |
Mohamed AbdulHameed et.al. |
2508.14325 |
null |
2025-08-19 |
Tooth-Diffusion: Guided 3D CBCT Synthesis with Fine-Grained Tooth Conditioning |
Said Djafar Said et.al. |
2508.14276 |
null |
2025-08-19 |
Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation |
Minyi Huang et.al. |
2508.14236 |
null |
2025-08-19 |
CO Adsorption Sites on Interstellar Water Ices Explored with Machine Learning Potentials. Binding energy distributions and snowline |
Giulia M. Bovolenta et.al. |
2508.14219 |
null |
2025-08-19 |
A well-balanced gas-kinetic scheme with adaptive mesh refinement for shallow water equations |
Gaocheng Liu et.al. |
2508.14216 |
null |
2025-08-19 |
Nonadiabatic force matching for alchemical free-energy estimation |
Jorge L. Rosa-Raíces et.al. |
2508.14179 |
null |
2025-08-19 |
DPad: Efficient Diffusion Language Models with Suffix Dropout |
Xinhua Chen et.al. |
2508.14148 |
null |
2025-08-18 |
3D Cardiac Anatomy Generation Using Mesh Latent Diffusion Models |
Jolanta Mozyrska et.al. |
2508.14122 |
null |
2025-08-19 |
InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing |
Shaoshu Yang et.al. |
2508.14033 |
null |
2025-08-19 |
Electrochemical response of biological membranes to localized currents and external electric fields |
Joshua B. Fernandes et.al. |
2508.14001 |
null |
2025-08-19 |
Physics-Based 3D Simulation for Synthetic Data Generation and Failure Analysis in Packaging Stability Assessment |
Samuel Seligardi et.al. |
2508.13989 |
null |
2025-08-20 |
Towards a general diffusion-based information quality assessment model |
Anthony Lopes Temporao et.al. |
2508.13927 |
null |
2025-08-19 |
Learning to See Through Flare |
Xiaopeng Peng et.al. |
2508.13907 |
null |
2025-08-19 |
Revisiting Diffusion Q-Learning: From Iterative Denoising to One-Step Action Generation |
Thanh Nguyen et.al. |
2508.13904 |
null |
2025-08-19 |
Diffusion-Driven High-Dimensional Variable Selection |
Minjie Wang et.al. |
2508.13890 |
null |
2025-08-19 |
Toward Deployable Multi-Robot Collaboration via a Symbolically-Guided Decision Transformer |
Rathnam Vidushika Rasanji et.al. |
2508.13877 |
null |
2025-08-19 |
SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation |
Paul Grimal et.al. |
2508.13866 |
null |
2025-08-19 |
Stochastic synaptic dynamics under learning |
Jakob Stubenrauch et.al. |
2508.13846 |
null |
2025-08-19 |
UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion |
Zihan Liang et.al. |
2508.13843 |
null |
2025-08-20 |
Latent Interpolation Learning Using Diffusion Models for Cardiac Volume Reconstruction |
Niklas Bubeck et.al. |
2508.13826 |
null |
2025-08-19 |
COCO: Cognitive Operating System with Continuous Oversight for Multi-Agent Workflow Reliability |
Churong Liang et.al. |
2508.13815 |
null |
2025-08-19 |
Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs |
Juncheng Xie et.al. |
2508.13805 |
null |
2025-08-19 |
Elementary Monte Carlo model of the anisotropic recrystallization and antiripening under intensive stirring and high supersaturations |
Serhii Abakumov et.al. |
2508.13799 |
null |
2025-08-19 |
Sketch3DVE: Sketch-based 3D-Aware Scene Video Editing |
Feng-Lin Liu et.al. |
2508.13797 |
null |
2025-08-19 |
DegDiT: Controllable Audio Generation with Dynamic Event Graph Guided Diffusion Transformer |
Yisu Liu et.al. |
2508.13786 |
null |
2025-08-19 |
Comparing Conditional Diffusion Models for Synthesizing Contrast-Enhanced Breast MRI from Pre-Contrast Images |
Sebastian Ibarra et.al. |
2508.13776 |
null |
2025-08-19 |
Eliminating Rasterization: Direct Vector Floor Plan Generation with DiffPlanner |
Shidong Wang et.al. |
2508.13738 |
null |
2025-08-19 |
Simulation of Impact-induced seismic shaking on asteroid (25143) Itokawa to address its resurfacing process |
Sunho Jin et.al. |
2508.13727 |
null |
2025-08-19 |
Unravelling disorder in kagome Yb $_{0.5}$Co$_3$Ge$_3$ |
A. Korshunov et.al. |
2508.13719 |
null |
2025-08-19 |
Diffuse-Layer Capacitance at the Potential of Zero Charge in Binary Mixtures |
Yuki Uematsu et.al. |
2508.13691 |
null |
2025-08-19 |
PHECT: A lightweight computation tool for pulsar halo emission |
Kun Fang et.al. |
2508.13667 |
null |
2025-08-19 |
Calibrated Semantic Diffusion: A p-Laplacian Synthesis with Learnable Dissipation, Quantified Constants, and Graph-Aware Calibration |
Faruk Alpay et.al. |
2508.13658 |
null |
2025-08-19 |
Personalized Subgraph Federated Learning with Sheaf Collaboration |
Wenfei Liang et.al. |
2508.13642 |
null |
2025-08-19 |
V2P: From Background Suppression to Center Peaking for Robust GUI Grounding Task |
Jikai Chen et.al. |
2508.13634 |
null |
2025-08-19 |
Text2Weight: Bridging Natural Language and Neural Network Weight Spaces |
Bowen Tian et.al. |
2508.13633 |
null |
2025-08-20 |
DiffIER: Optimizing Diffusion Models with Iterative Error Reduction |
Ao Chen et.al. |
2508.13628 |
null |
2025-08-19 |
Bridging Clear and Adverse Driving Conditions |
Yoel Shapiro et.al. |
2508.13592 |
null |
2025-08-19 |
Temporal-Conditional Referring Video Object Segmentation with Noise-Free Text-to-Video Diffusion Model |
Ruixin Zhang et.al. |
2508.13584 |
null |
2025-08-19 |
Overcoming Quantum Resistivity Scaling in Nanoscale Interconnects Using Delafossite PdCoO2 |
Seoung-Hun Kang et.al. |
2508.13573 |
null |
2025-08-19 |
A stability-enhanced nonstandard finite difference framework for solving one and two-dimensional nonlocal differential equations |
Shweta Kumari et.al. |
2508.13542 |
null |
2025-08-20 |
2D Gaussians Meet Visual Tokenizer |
Yiang Shi et.al. |
2508.13515 |
null |
2025-08-19 |
A Monte Carlo simulation on the scattering coefficients of solar radio wave propagation |
Jiazhen Gan et.al. |
2508.13494 |
null |
2025-08-19 |
The Lévy flight foraging hypothesis: comparison between stationary distributions and anomalous diffusion |
Serena Dipierro et.al. |
2508.13487 |
null |
2025-08-19 |
EventTSF: Event-Aware Non-Stationary Time Series Forecasting |
Yunfeng Ge et.al. |
2508.13434 |
null |
2025-08-19 |
Hyperactive Magnetar Eruptions: Giant Flares, Baryon Ejections, and FRBs |
Ashley Bransgrove et.al. |
2508.13419 |
null |
2025-08-18 |
Counterfactual Probabilistic Diffusion with Expert Models |
Wenhao Mu et.al. |
2508.13355 |
null |
2025-08-18 |
Susceptibility Distortion Correction of Diffusion MRI with a single Phase-Encoding Direction |
Sedigheh Dargahi et.al. |
2508.13340 |
null |
2025-08-18 |
Resistive diffusion and radiative cooling effects in magnetized oblique shocks |
R. Datta et.al. |
2508.13310 |
null |
2025-08-18 |
GaitCrafter: Diffusion Model for Biometric Preserving Gait Synthesis |
Sirshapan Mitra et.al. |
2508.13300 |
null |
2025-08-18 |
Field-level Reconstruction from Foreground-Contaminated 21-cm Maps |
Shu-Fan Chen et.al. |
2508.13265 |
null |
2025-08-18 |
4DNeX: Feed-Forward 4D Generative Modeling Made Easy |
Zhaoxi Chen et.al. |
2508.13154 |
null |
2025-08-18 |
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models |
Haoyu He et.al. |
2508.13148 |
null |
2025-08-18 |
Some semi-decoupled algorithms with optimal convergence for a four-field linear thermo-poroelastic model |
Ziliang Li et.al. |
2508.13109 |
null |
2025-08-18 |
Precise Action-to-Video Generation Through Visual Action Prompts |
Yuang Wang et.al. |
2508.13104 |
null |
2025-08-18 |
Denoising diffusion models for inverse design of inflatable structures with programmable deformations |
Sara Karimi et.al. |
2508.13097 |
null |
2025-08-18 |
DMS:Diffusion-Based Multi-Baseline Stereo Generation for Improving Self-Supervised Depth Estimation |
Zihua Liu et.al. |
2508.13091 |
null |
2025-08-18 |
ID-Card Synthetic Generation: Toward a Simulated Bona fide Dataset |
Qingwen Zeng et.al. |
2508.13078 |
null |
2025-08-18 |
From Transthoracic to Transesophageal: Cross-Modality Generation using LoRA Diffusion |
Emmanuel Oladokun et.al. |
2508.13077 |
null |
2025-08-18 |
Reinforced Context Order Recovery for Adaptive Reasoning and Planning |
Long Ma et.al. |
2508.13070 |
null |
2025-08-18 |
Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping |
Siddharth Khandelwal et.al. |
2508.13065 |
null |
2025-08-19 |
PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models |
Pengcheng Huang et.al. |
2508.13021 |
null |
2025-08-18 |
EgoTwin: Dreaming Body and View in First Person |
Jingqiao Xiu et.al. |
2508.13013 |
null |
2025-08-18 |
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model |
Xianglong He et.al. |
2508.13009 |
null |
2025-08-18 |
Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs |
Jose L. Bonilla et.al. |
2508.12987 |
null |
2025-08-18 |
The Leibenson process |
Viorel Barbu et.al. |
2508.12979 |
null |
2025-08-18 |
Compact Attention: Exploiting Structured Spatio-Temporal Sparsity for Fast Video Generation |
Qirui Li et.al. |
2508.12969 |
null |
2025-08-18 |
Self-Consistent Heating of the Magnetically Closed Solar Corona: Generation of Nanoflares, Thermodynamic Response of the Plasma and Observational Signatures |
Craig D. Johnston et.al. |
2508.12952 |
null |
2025-08-18 |
Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models |
Jianshu Zeng et.al. |
2508.12945 |
null |
2025-08-19 |
Fully Automated Segmentation of Fiber Bundles in Anatomic Tracing Data |
Kyriaki-Margarita Bintsi et.al. |
2508.12942 |
null |
2025-08-18 |
7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models |
Elena Izzo et.al. |
2508.12919 |
null |
2025-08-18 |
FoleySpace: Vision-Aligned Binaural Spatial Audio Generation |
Lei Zhao et.al. |
2508.12918 |
null |
2025-08-18 |
S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models |
Chubin Chen et.al. |
2508.12880 |
null |
2025-08-18 |
E3RG: Building Explicit Emotion-driven Empathetic Response Generation System with Multimodal Large Language Model |
Ronghao Lin et.al. |
2508.12854 |
null |
2025-08-18 |
Strongly correlated stochastic systems |
Marco Biroli et.al. |
2508.12818 |
null |
2025-08-18 |
Next Visual Granularity Generation |
Yikai Wang et.al. |
2508.12811 |
null |
2025-08-18 |
Wavy Transformer |
Satoshi Noguchi et.al. |
2508.12787 |
null |
2025-08-18 |
Right and Wrong Ansätze for Nonlinear Waves in Stochastic PDEs |
C. H. S. Hamster et.al. |
2508.12786 |
null |
2025-08-18 |
Leveraging Diffusion Models for Stylization using Multiple Style Images |
Dan Ruta et.al. |
2508.12784 |
null |
2025-08-18 |
TURB-Scalar. A large database of passive scalar fields advected by 2D Navier-Stokes in the turbulent inverse cascade regime |
Chiara Calascibetta et.al. |
2508.12762 |
null |
2025-08-18 |
Effects of Defects on Thermal Transport across Solid/Solid Heterogeneous Interfaces |
Ershuai Yin et.al. |
2508.12744 |
null |
2025-08-18 |
Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score |
Syed Muhmmad Israr et.al. |
2508.12718 |
null |
2025-08-18 |
Hyperparameter Optimization in the Estimation of PDE and Delay-PDE models from data |
Oliver Mai et.al. |
2508.12715 |
null |
2025-08-18 |
Asymmetric Diffusion Recommendation Model |
Yongchun Zhu et.al. |
2508.12706 |
null |
2025-08-18 |
Deadline-Aware Bandwidth Allocation for Semantic Generative Communication with Diffusion Models |
Jinhyuk Choi et.al. |
2508.12701 |
null |
2025-08-18 |
MixCache: Mixture-of-Cache for Video Diffusion Transformer Acceleration |
Yuanxin Wei et.al. |
2508.12691 |
null |
2025-08-18 |
WP-CLIP: Leveraging CLIP to Predict Wölfflin’s Principles in Visual Art |
Abhijay Ghildyal et.al. |
2508.12668 |
null |
2025-08-18 |
Stable Diffusion-Based Approach for Human De-Occlusion |
Seung Young Noh et.al. |
2508.12663 |
null |
2025-08-18 |
Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery |
Jiyeon Kang et.al. |
2508.12650 |
null |
2025-08-18 |
Cognitive Structure Generation: From Educational Priors to Policy Optimization |
Hengnian Gu et.al. |
2508.12647 |
null |
2025-08-18 |
ViLaD: A Large Vision Language Diffusion Framework for End-to-End Autonomous Driving |
Can Cui et.al. |
2508.12603 |
null |
2025-08-19 |
A Tale of Two Sightlines: Comparison of Hydrocarbon Dust Absorption Bands toward Cygnus OB2-12 and the Galactic Center |
Yvonne J. Pendleton et.al. |
2508.12601 |
null |
2025-08-17 |
Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference |
Denis Blessing et.al. |
2508.12511 |
null |
2025-08-17 |
Say It, See It: A Systematic Evaluation on Speech-Based 3D Content Generation Methods in Augmented Reality |
Yanming Xiu et.al. |
2508.12498 |
null |
2025-08-19 |
Portable Laser-Pumped Rb Atomic Clock with Digital Circuits |
Qiang Hao et.al. |
2508.12437 |
null |
2025-08-17 |
Spin decoherence dynamics of Er $^{3+}$ in CeO$_2$ film |
Sagar Kumar Seth et.al. |
2508.12429 |
null |
2025-08-17 |
TiP4GEN: Text to Immersive Panorama 4D Scene Generation |
Ke Xing et.al. |
2508.12415 |
null |
2025-08-17 |
Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position |
Zhixin Xie et.al. |
2508.12398 |
null |
2025-08-17 |
DeCoT: Decomposing Complex Instructions for Enhanced Text-to-Image Generation with Large Language Models |
Xiaochuan Lin et.al. |
2508.12396 |
null |
2025-08-17 |
Navigating the Exploration-Exploitation Tradeoff in Inference-Time Scaling of Diffusion Models |
Xun Su et.al. |
2508.12361 |
null |
2025-08-17 |
Topological Dissipation as the Missing Link in Multiscale Polymer Dynamics |
Xu-Ze Zhang et.al. |
2508.12359 |
null |
2025-08-17 |
Synthetic Data is Sufficient for Zero-Shot Visual Generalization from Offline Data |
Ahmet H. Güzel et.al. |
2508.12356 |
null |
2025-08-17 |
Semantic Discrepancy-aware Detector for Image Forgery Identification |
Ziye Wang et.al. |
2508.12341 |
null |
2025-08-17 |
Geometry-Aware Video Inpainting for Joint Headset Occlusion Removal and Face Reconstruction in Social XR |
Fatemeh Ghorbani Lohesara et.al. |
2508.12336 |
null |
2025-08-17 |
Sketchar: Supporting Character Design and Illustration Prototyping Using Generative AI |
Long Ling et.al. |
2508.12333 |
null |
2025-08-17 |
Steering chiral active Brownian motion via stochastic position-orientation resetting |
Amir Shee et.al. |
2508.12223 |
null |
2025-08-17 |
Distribution Matching via Generalized Consistency Models |
Sagar Shrestha et.al. |
2508.12222 |
null |
2025-08-17 |
Self-Guided Action Diffusion |
Rhea Malhotra et.al. |
2508.12189 |
null |
2025-08-16 |
Critical Importance of Grain Boundaries to the Conductivity of Polycrystalline Molecular Crystals |
Shujit Chandra Paul et.al. |
2508.12172 |
null |
2025-08-16 |
Belief-Conditioned One-Step Diffusion: Real-Time Trajectory Planning with Just-Enough Sensing |
Gokul Puthumanaillam et.al. |
2508.12166 |
null |
2025-08-16 |
A Systematic Particle Filter for Estimating Time-Varying Parameters in Advection-Diffusion Equations with Source Terms |
Andrea Arnold et.al. |
2508.12155 |
null |
2025-08-16 |
Demystifying Foreground-Background Memorization in Diffusion Models |
Jimmy Z. Di et.al. |
2508.12148 |
null |
2025-08-16 |
Relativistic quintuple-zeta basis sets for the s block |
Marten L. Reitsma et.al. |
2508.12144 |
null |
2025-08-16 |
DualFit: A Two-Stage Virtual Try-On via Warping and Synthesis |
Minh Tran et.al. |
2508.12131 |
null |
2025-08-16 |
Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion |
Songwei Liu et.al. |
2508.12094 |
null |
2025-08-16 |
Strong overlap of deterministic and stochastic dynamics in a super-diffusive regime |
Muhammad Tayyab et.al. |
2508.12091 |
null |
2025-08-16 |
Generic Event Boundary Detection via Denoising Diffusion |
Jaejun Hwang et.al. |
2508.12084 |
null |
2025-08-16 |
Content Accuracy and Quality Aware Resource Allocation Based on LP-Guided DRL for ISAC-Driven AIGC Networks |
Ningzhe Shi et.al. |
2508.12079 |
null |
2025-08-16 |
Load-Balanced Diffusion Monte Carlo Method with Lattice Regularization |
Kousuke Nakano et.al. |
2508.12033 |
null |
2025-08-16 |
Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems |
Szymon Pawlonka et.al. |
2508.12026 |
null |
2025-08-16 |
Virtual Trading in Multi-Settlement Electricity Markets |
Agostino Capponi et.al. |
2508.11979 |
null |
2025-08-16 |
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding |
Yueming Xu et.al. |
2508.11952 |
null |
2025-08-19 |
Assessment of Using Synthetic Data in Brain Tumor Segmentation |
Aditi Jahagirdar et.al. |
2508.11922 |
null |
2025-08-16 |
SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress |
Lingyun Zhang et.al. |
2508.11904 |
null |
2025-08-16 |
OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation |
Jilei Mao et.al. |
2508.11898 |
null |
2025-08-16 |
Simulation of heavy quarkonium equilibration in the quark-gluon plasma |
Shouxing Zhao et.al. |
2508.11897 |
null |
2025-08-16 |
SimInterview: Transforming Business Education through Large Language Model-Based Simulated Multilingual Interview Training System |
Truong Thanh Hung Nguyen et.al. |
2508.11873 |
null |
2025-08-15 |
Serendipitous discovery of a young cluster of galaxies at $z \sim 0.5$ projected next to the nearby tadpole galaxy KUG 1138 + 327 |
Q. Daniel Wang et.al. |
2508.11819 |
null |
2025-08-15 |
FairTabGen: Unifying Counterfactual and Causal Fairness in Synthetic Tabular Data Generation |
Nitish Nagesh et.al. |
2508.11810 |
null |
2025-08-15 |
LoRAtorio: An intrinsic approach to LoRA Skill Composition |
Niki Foteinopoulou et.al. |
2508.11624 |
null |
2025-08-15 |
Dataset Creation for Visual Entailment using Generative AI |
Rob Reijtenbach et.al. |
2508.11605 |
null |
2025-08-15 |
CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion |
Zhe Zhu et.al. |
2508.11603 |
null |
2025-08-15 |
Low barrier ZrO $_x$ -based Josephson junctions |
Jaehong Choi et.al. |
2508.11593 |
null |
2025-08-15 |
Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model |
Zuo Zuo et.al. |
2508.11550 |
null |
2025-08-15 |
Physics-Informed Diffusion Models for Unsupervised Anomaly Detection in Multivariate Time Series |
Juhi Soni et.al. |
2508.11528 |
null |
2025-08-15 |
CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models |
Xiaoxue Wu et.al. |
2508.11484 |
null |
2025-08-15 |
SPG: Style-Prompting Guidance for Style-Specific Content Creation |
Qian Liang et.al. |
2508.11476 |
null |
2025-08-15 |
DPI-SPR: A Differentiable Physical Inversion for Shadow Profile Reconstruction Framework in Forward Scatter Radar |
ShuQi Lei et.al. |
2508.11470 |
null |
2025-08-15 |
Simulation-based inference using splitting schemes for partially observed diffusions in chemical reaction networks |
Petar Jovanovski et.al. |
2508.11438 |
null |
2025-08-15 |
MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image Generation |
Qian Liang et.al. |
2508.11433 |
null |
2025-08-15 |
Wavelength dependence of laser pulse filamentation around atomic resonances |
Gabor Demeter et.al. |
2508.11417 |
null |
2025-08-15 |
The Effect of Flow Parameters and Wall Models on Gas-Surface Interactions: A Numerical Investigation of dsmcFoam |
M. B. Agir et.al. |
2508.11403 |
null |
2025-08-15 |
Pairwise correlations of global times in one-dimensional Brownian motion under stochastic resetting |
Yihao Wang et.al. |
2508.11387 |
null |
2025-08-15 |
AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis |
Zonglin Wu et.al. |
2508.11375 |
null |
2025-08-15 |
GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition |
Md Asgor Hossain Reaj et.al. |
2508.11334 |
null |
2025-08-15 |
Noise Matters: Optimizing Matching Noise for Diffusion Classifiers |
Yanghao Wang et.al. |
2508.11330 |
null |
2025-08-18 |
TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation |
Yilin Mi et.al. |
2508.11284 |
null |
2025-08-15 |
Probing the Representational Power of Sparse Autoencoders in Vision Models |
Matthew Lyle Olson et.al. |
2508.11277 |
null |
2025-08-15 |
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception |
Junjie Wang et.al. |
2508.11256 |
null |
2025-08-15 |
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation |
MengChao Wang et.al. |
2508.11255 |
null |
2025-08-15 |
Graph Neural Diffusion via Generalized Opinion Dynamics |
Asela Hevapathige et.al. |
2508.11249 |
null |
2025-08-15 |
Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering |
Changjian Wang et.al. |
2508.11247 |
null |
2025-08-15 |
Efficient Image-to-Image Schrödinger Bridge for CT Field of View Extension |
Zhenhao Li et.al. |
2508.11211 |
null |
2025-08-15 |
StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation |
Seungmi Lee et.al. |
2508.11203 |
null |
2025-08-15 |
NGC 2392 and NGC 4361: Spectroscopic Diagnostics of Planetary Nebula Evolution |
Atul Kumar Singh et.al. |
2508.11202 |
null |
2025-08-15 |
Statistical Properties of Current Noise Induced by Electron-Phonon Scattering in Metallic Carbon Nanotubes |
Aina Sumiyoshi et.al. |
2508.11201 |
null |
2025-08-15 |
Representation Quantization for Collaborative Filtering Augmentation |
Yunze Luo et.al. |
2508.11194 |
null |
2025-08-15 |
Semi-supervised Image Dehazing via Expectation-Maximization and Bidirectional Brownian Bridge Diffusion Models |
Bing Liu et.al. |
2508.11165 |
null |
2025-08-15 |
LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction |
Maoquan Zhang et.al. |
2508.11153 |
null |
2025-08-15 |
Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation |
Bing Liu et.al. |
2508.11134 |
null |
2025-08-15 |
SQ-A: A Collision Triggered Starburst in Intra-Group Medium of Stephan’s Quintet |
C. K. Xu et.al. |
2508.11124 |
null |
2025-08-14 |
Diffusion is a code repair operator and generator |
Mukul Singh et.al. |
2508.11110 |
null |
2025-08-14 |
HierOctFusion: Multi-scale Octree-based 3D Shape Generation via Part-Whole-Hierarchy Message Passing |
Xinjie Gao et.al. |
2508.11106 |
null |
2025-08-14 |
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning |
Kelin Yu et.al. |
2508.11049 |
null |
2025-08-14 |
A porous medium equation with spatially inhomogeneous absorption. Part II: Large time behavior |
Razvan Gabriel Iagar et.al. |
2508.11046 |
null |
2025-08-14 |
3D FlowMatch Actor: Unified 3D Policy for Single- and Dual-Arm Manipulation |
Nikolaos Gkanatsios et.al. |
2508.11002 |
null |
2025-08-14 |
Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling |
Tejomay Kishor Padole et.al. |
2508.10995 |
null |
2025-08-14 |
Match & Choose: Model Selection Framework for Fine-tuning Text-to-Image Diffusion Models |
Basile Lewandowski et.al. |
2508.10993 |
null |
2025-08-14 |
The extended molecular gas of the Circinus galaxy and NGC 1097 as seen by APEX |
Akhil Lasrado et.al. |
2508.10982 |
null |
2025-08-14 |
EVCtrl: Efficient Control Adapter for Visual Generation |
Zixiang Yang et.al. |
2508.10963 |
null |
2025-08-13 |
From Promise to Practical Reality: Transforming Diffusion MRI Analysis with Fast Deep Learning Enhancement |
Xinyi Wang et.al. |
2508.10950 |
null |
2025-08-14 |
Exchange-driven self-diffusion of nanoscale crystalline parahydrogen clusters on graphite |
K. M. Kolevski et.al. |
2508.10883 |
null |
2025-08-14 |
A Survey on Diffusion Language Models |
Tianyi Li et.al. |
2508.10875 |
null |
2025-08-14 |
Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation |
Harold Haodong Chen et.al. |
2508.10858 |
null |
2025-08-16 |
Object Fidelity Diffusion for Remote Sensing Image Generation |
Ziqi Ye et.al. |
2508.10801 |
null |
2025-08-14 |
Ultra-High-Definition Reference-Based Landmark Image Super-Resolution with Generative Diffusion Prior |
Zhenning Shi et.al. |
2508.10779 |
null |
2025-08-14 |
Video-BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation |
Youping Gu et.al. |
2508.10774 |
null |
2025-08-14 |
AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences |
Jieyu Li et.al. |
2508.10771 |
null |
2025-08-14 |
Formation and protection of an Eu-Ir surface compound below hexagonal boron nitride |
Alaa Mohammed Idris Bakhit et.al. |
2508.10746 |
null |
2025-08-14 |
A Kinetic Theory Approach to Ordered Fluids |
José A. Carrillo et.al. |
2508.10744 |
null |
2025-08-14 |
Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs |
Xiangqi Jin et.al. |
2508.10736 |
null |
2025-08-14 |
Exploiting Discriminative Codebook Prior for Autoregressive Image Generation |
Longxiang Tang et.al. |
2508.10719 |
null |
2025-08-14 |
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale |
NextStep Team et.al. |
2508.10711 |
null |
2025-08-14 |
CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation |
Joohyeon Lee et.al. |
2508.10710 |
null |
2025-08-14 |
Probabilistic Forecasting Method for Offshore Wind Farm Cluster under Typhoon Conditions: a Score-Based Conditional Diffusion Model |
Jinhua He et.al. |
2508.10705 |
null |
2025-08-14 |
Effective permeability conditions for diffusive transport through impermeable membranes with gaps |
Molly Brennan et.al. |
2508.10694 |
null |
2025-08-14 |
Novel View Synthesis using DDIM Inversion |
Sehajdeep SIngh et.al. |
2508.10688 |
null |
2025-08-14 |
MDNS: Masked Diffusion Neural Sampler via Stochastic Optimal Control |
Yuchen Zhu et.al. |
2508.10684 |
null |
2025-08-14 |
Hybrid Generative Fusion for Efficient and Privacy-Preserving Face Recognition Dataset Generation |
Feiran Li et.al. |
2508.10672 |
null |
2025-08-14 |
Geospatial Diffusion for Land Cover Imperviousness Change Forecasting |
Debvrat Varshney et.al. |
2508.10649 |
null |
2025-08-14 |
Increasing the Utility of Synthetic Images through Chamfer Guidance |
Nicola Dall’Asen et.al. |
2508.10631 |
null |
2025-08-14 |
A Unified Framework from Boltzmann Transport to Proton Treatment Planning |
Andreas E. Kyprianou et.al. |
2508.10596 |
null |
2025-08-14 |
HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis |
Shiyu Liu et.al. |
2508.10566 |
null |
2025-08-14 |
Projected Coupled Diffusion for Test-Time Constrained Joint Generation |
Hao Luan et.al. |
2508.10531 |
null |
2025-08-14 |
EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba |
Quang Nguyen et.al. |
2508.10522 |
null |
2025-08-15 |
KDPE: A Kernel Density Estimation Strategy for Diffusion Policy Trajectory Selection |
Andrea Rosasco et.al. |
2508.10511 |
null |
2025-08-14 |
A Segmentation-driven Editing Method for Bolt Defect Augmentation and Detection |
Yangjie Xiao et.al. |
2508.10509 |
null |
2025-08-14 |
TweezeEdit: Consistent and Efficient Image Editing with Path Regularization |
Jianda Mao et.al. |
2508.10498 |
null |
2025-08-14 |
A Unified Multi-Agent Framework for Universal Multimodal Understanding and Generation |
Jiulin Li et.al. |
2508.10494 |
null |
2025-08-14 |
Jamming of active particles in narrow pores: Implications for ratchet effect and diffusion coefficient |
Šimon Pajger et.al. |
2508.10483 |
null |
2025-08-14 |
NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer |
Shanyuan Liu et.al. |
2508.10424 |
null |
2025-08-14 |
Extracting a stochastic model for predator-prey dynamic of turbulence and zonal flows with limited data |
J. C. Huang et.al. |
2508.10408 |
null |
2025-08-14 |
Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models |
Eunseo Koh et.al. |
2508.10407 |
null |
2025-08-14 |
PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection |
Haibin Sun et.al. |
2508.10397 |
null |
2025-08-14 |
EDIS: A Simulation Software for Dynamic Ion Intercalation/Deintercalation Processes in Electrode Materials |
Liqi Wang et.al. |
2508.10384 |
null |
2025-08-14 |
Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models |
Hyundo Lee et.al. |
2508.10382 |
null |
2025-08-14 |
A Semantic-Aware Framework for Safe and Intent-Integrative Assistance in Upper-Limb Exoskeletons |
Yu Chen et.al. |
2508.10378 |
null |
2025-08-14 |
Scalable Modeling of Nonlinear Network Dynamics in Neurodegenerative Disease |
Daniel Semchin et.al. |
2508.10343 |
null |
2025-08-14 |
ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver |
Wenxuan Song et.al. |
2508.10333 |
null |
2025-08-14 |
Cross-view Generalized Diffusion Model for Sparse-view CT Reconstruction |
Jixiang Chen et.al. |
2508.10313 |
null |
2025-08-14 |
DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space Exploration |
Arkapravo Ghosh et.al. |
2508.10303 |
null |
2025-08-14 |
Influence Maximization in Multi-layer Social Networks Based on Differentiated Graph Embeddings |
Ronghua Lin et.al. |
2508.10289 |
null |
2025-08-14 |
High Fidelity Text to Image Generation with Contrastive Alignment and Structural Guidance |
Danyi Gao et.al. |
2508.10280 |
null |
2025-08-14 |
A Spectral Solver to Capture Unsteady Dynamics in the Aerospike Nozzle Wake |
Zachary Pyle et.al. |
2508.10275 |
null |
2025-08-14 |
Non-Decaying Solutions to the 2D Dissipative Quasi-Geostrophic Equations |
David M. Ambrose et.al. |
2508.10254 |
null |
2025-08-13 |
Run-and-tumble dynamics with non-reciprocal transitions between three velocity states |
Julio C. R. Romo-Cruz et.al. |
2508.10213 |
null |
2025-08-13 |
Diffusive Braking of Penetrative Convection in Stably-Stratified Fluids |
Bradley W. Hindman et.al. |
2508.10174 |
null |
2025-08-13 |
Predicting First-Passage Dynamics in Disordered Systems Exactly: Application to Sparse Networks |
Daniel Marris et.al. |
2508.10140 |
null |
2025-08-13 |
The Perturbation Theory Approach to Stability in the Scattered Disk |
Matthew Belyakov et.al. |
2508.10119 |
null |
2025-08-13 |
Constrained Decoding of Diffusion LLMs with Context-Free Grammars |
Niels Mündler et.al. |
2508.10111 |
null |
2025-08-13 |
Quantum circuit simulation with a local time-dependent variational principle |
Aaron Sander et.al. |
2508.10096 |
null |
2025-08-13 |
Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design |
Yuhao Sun et.al. |
2508.10065 |
null |
2025-08-13 |
Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation |
Junyan Ye et.al. |
2508.09987 |
null |
2025-08-13 |
Story2Board: A Training-Free Approach for Expressive Storyboard Generation |
David Dinkevich et.al. |
2508.09983 |
null |
2025-08-13 |
Masquerade: Learning from In-the-wild Human Videos using Data-Editing |
Marion Lepert et.al. |
2508.09976 |
null |
2025-08-13 |
PERSONA: Personalized Whole-Body 3D Avatar with Pose-Driven Deformations from a Single Image |
Geonhee Sim et.al. |
2508.09973 |
null |
2025-08-13 |
Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models |
Luca Eyring et.al. |
2508.09968 |
null |
2025-08-13 |
Stable Diffusion Models are Secretly Good at Visual In-Context Learning |
Trevine Oorloff et.al. |
2508.09949 |
null |
2025-08-13 |
AST-n: A Fast Sampling Approach for Low-Dose CT Reconstruction using Diffusion Models |
Tomás de la Sotta et.al. |
2508.09943 |
null |
2025-08-13 |
Quo Vadis Handwritten Text Generation for Handwritten Text Recognition? |
Vittorio Pippi et.al. |
2508.09936 |
null |
2025-08-13 |
Active Particle Diffusion in Convection Roll Arrays |
Pulak Kumar Ghosh et.al. |
2508.09924 |
null |
2025-08-14 |
Prototype-Guided Diffusion: Visual Conditioning without External Memory |
Bilal Faye et.al. |
2508.09922 |
null |
2025-08-13 |
Hybrid Quantum-Classical Latent Diffusion Models for Medical Image Generation |
Kübra Yeter-Aydeniz et.al. |
2508.09903 |
null |
2025-08-13 |
Binary Mixtures in Linear Convection Arrays |
Pulak Kumar Ghosh et.al. |
2508.09902 |
null |
2025-08-13 |
Exploring the Physics of the Plasma Liner Experiment: A Multi-dimensional Study with FLASH, OSIRIS, and HELIOS |
E. C. Hansen et.al. |
2508.09895 |
null |
2025-08-13 |
Marketron Through the Looking Glass: From Equity Dynamics to Option Pricing in Incomplete Markets |
Igor Halperin et.al. |
2508.09863 |
null |
2025-08-13 |
HumanGenesis: Agent-Based Geometric and Generative Modeling for Synthetic Human Dynamics |
Weiqi Li et.al. |
2508.09858 |
null |
2025-08-13 |
Enhancing Diffusion Face Generation with Contrastive Embeddings and SegFormer Guidance |
Dhruvraj Singh Rawat et.al. |
2508.09847 |
null |
2025-08-13 |
On the Generalization Limits of Quantum Generative Adversarial Networks with Pure State Generators |
Jasmin Frkatovic et.al. |
2508.09844 |
null |
2025-08-13 |
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models |
Weigao Sun et.al. |
2508.09834 |
null |
2025-08-13 |
Physical Autoregressive Model for Robotic Manipulation without Action Pretraining |
Zijian Song et.al. |
2508.09822 |
null |
2025-08-13 |
Feature Impact Analysis on Top Long-Jump Performances with Quantile Random Forest and Explainable AI Techniques |
Qi Gan et.al. |
2508.09810 |
null |
2025-08-13 |
Condition number for finite element discretisation of nonlocal PDE systems with applications to biology |
Olusegun E. Adebayo et.al. |
2508.09781 |
null |
2025-08-13 |
Impacts of the duration and intensity of grazing cycle on vegetation population dynamics in semi-arid ecosystems with seasonal succession |
Junhong Gan et.al. |
2508.09760 |
null |
2025-08-13 |
Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection |
Zhiqiu Zhang et.al. |
2508.09746 |
null |
2025-08-13 |
MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers |
Qianru Qiu et.al. |
2508.09709 |
null |
2025-08-13 |
Hydrodynamic approximations for driven dense colloidal mixtures in narrow pores |
Frantisek Slanina et.al. |
2508.09686 |
null |
2025-08-13 |
Anomalous Transport of Elongated Particles in Oscillatory Vortical Flows |
Shiyuan Hu et.al. |
2508.09677 |
null |
2025-08-13 |
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors |
Xingyilang Yin et.al. |
2508.09667 |
null |
2025-08-13 |
NegFaceDiff: The Power of Negative Context in Identity-Conditioned Diffusion for Synthetic Face Generation |
Eduarda Caldeira et.al. |
2508.09661 |
null |
2025-08-13 |
Asymptotic-analysis-inspired boundary conditions aiming at eliminating polymer diffusive instability |
Ming Dong et.al. |
2508.09635 |
null |
2025-08-15 |
Preacher: Paper-to-Video Agentic System |
Jingwei Liu et.al. |
2508.09632 |
null |
2025-08-13 |
MInDI-3D: Iterative Deep Learning in 3D for Sparse-view Cone Beam Computed Tomography |
Daniel Barco et.al. |
2508.09616 |
null |
2025-08-13 |
Global uniform regularity for the 3D incompressible MHD equations with slip boundary condition near a background magnetic field |
Jincheng Gao et.al. |
2508.09609 |
null |
2025-08-13 |
Images Speak Louder Than Scores: Failure Mode Escape for Enhancing Generative Quality |
Jie Shao et.al. |
2508.09598 |
null |
2025-08-13 |
Dual Recursive Feedback on Generation and Appearance Latents for Pose-Robust Text-to-Image Diffusion |
Jiwon Kim et.al. |
2508.09575 |
null |
2025-08-13 |
Zeolitic imidazolate framework glasses emit white light |
Zhencai Li et.al. |
2508.09552 |
null |
2025-08-13 |
Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification |
Haowen Wang et.al. |
2508.09550 |
null |
2025-08-13 |
Boron Clusters for Metal-Free Water Splitting |
Masaya Fujioka et.al. |
2508.09538 |
null |
2025-08-13 |
Ehrenfest Dynamics with Spontaneous Localization |
Anderson A. Tomaz et.al. |
2508.09526 |
null |
2025-08-13 |
Generation of Indian Sign Language Letters, Numbers, and Words |
Ajeet Kumar Yadav et.al. |
2508.09522 |
null |
2025-08-13 |
A hyperbolic finite difference scheme for anisotropic diffusion equations: preserving the discrete maximum principle |
Tokuhiro Eto et.al. |
2508.09509 |
null |
2025-08-13 |
Stingrays in the radio sky: Two unusual diffuse radio relic sources in the direction of the Magellanic Stream |
Zachary J Smeaton et.al. |
2508.09495 |
null |
2025-08-13 |
SARE: Semantic-Aware Reconstruction Error for Generalizable Diffusion-Generated Image Detection |
Ju Yeon Kang et.al. |
2508.09487 |
null |
2025-08-13 |
CLIP-Flow: A Universal Discriminator for AI-Generated Images Inspired by Anomaly Detection |
Zhipeng Yuan et.al. |
2508.09477 |
null |
2025-08-14 |
From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts |
Yuji Wang et.al. |
2508.09476 |
null |
2025-08-13 |
Leveraging Failed Samples: A Few-Shot and Training-Free Framework for Generalized Deepfake Detection |
Shibo Yao et.al. |
2508.09475 |
null |
2025-08-13 |
Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy |
Hao Yu et.al. |
2508.09461 |
null |
2025-08-13 |
RASR: Retrieval-Augmented Super Resolution for Practical Reference-based Image Restoration |
Jiaqi Yan et.al. |
2508.09449 |
null |
2025-08-13 |
DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation |
Haoxiang Shi et.al. |
2508.09444 |
null |
2025-08-13 |
Scaling behaviour of rotating convection in a spherical shell with different Prandtl numbers |
Wei Fan et.al. |
2508.09416 |
null |
2025-08-13 |
Dynamos driven by top-heavy double-diffusive convection in the strong-field regime |
Wei Fan et.al. |
2508.09410 |
null |
2025-08-12 |
Understanding Dementia Speech Alignment with Diffusion-Based Image Generation |
Mansi et.al. |
2508.09385 |
null |
2025-08-12 |
X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents |
Guoxian Song et.al. |
2508.09383 |
null |
2025-08-12 |
UltraLight Med-Vision Mamba for Classification of Neoplastic Progression in Tubular Adenomas |
Aqsa Sultana et.al. |
2508.09339 |
null |
2025-08-12 |
Lung-DDPM+: Efficient Thoracic CT Image Synthesis using Diffusion Probabilistic Model |
Yifan Jiang et.al. |
2508.09327 |
null |
2025-08-12 |
Quantum correction to the Langevin cross section in resonant-exchange processes |
I. Simbotin et.al. |
2508.09302 |
null |
2025-08-12 |
Evolution of a Long-Lived Deep-Seated Main-Sequence Magnetic Field During White Dwarf Cooling |
Matias Castro-Tapia et.al. |
2508.09268 |
null |
2025-08-12 |
TFZ: Topology-Preserving Compression of 2D Symmetric and Asymmetric Second-Order Tensor Fields |
Nathaniel Gorski et.al. |
2508.09235 |
null |
2025-08-12 |
GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction |
Fan Ding et.al. |
2508.09227 |
null |
2025-08-12 |
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models |
Wen Wang et.al. |
2508.09138 |
null |
2025-08-12 |
Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices |
Ya Zou et.al. |
2508.09136 |
null |
2025-08-13 |
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer |
Zixin Yin et.al. |
2508.09131 |
null |
2025-08-13 |
Robust quantum computational advantage with programmable 3050-photon Gaussian boson sampling |
Hua-Liang Liu et.al. |
2508.09092 |
null |
2025-08-13 |
Direct Measurement of Electron Heating in Electron-Only Reconnection in a Laboratory Mini-Magnetosphere |
Lucas Rovige et.al. |
2508.09086 |
null |
2025-08-12 |
Rankin-Selberg integrals for $\mathrm{GSpin}$ groups with application to the global Gan-Gross-Prasad conjecture |
Pan Yan et.al. |
2508.09066 |
null |
2025-08-12 |
Per-Query Visual Concept Learning |
Ori Malca et.al. |
2508.09045 |
null |
2025-08-12 |
Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks |
Maxim Divilkovskiy et.al. |
2508.09029 |
null |
2025-08-12 |
Envisioning Generative Artificial Intelligence in Cartography and Mapmaking |
Yuhao Kang et.al. |
2508.09028 |
null |
2025-08-12 |
TaoCache: Structure-Maintained Video Generation Acceleration |
Zhentao Fan et.al. |
2508.08978 |
null |
2025-08-12 |
Urban-STA4CLC: Urban Theory-Informed Spatio-Temporal Attention Model for Predicting Post-Disaster Commercial Land Use Change |
Ziyi Guo et.al. |
2508.08976 |
null |
2025-08-12 |
Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation |
Soo-Whan Chung et.al. |
2508.08953 |
null |
2025-08-12 |
Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation |
Ao Ma et.al. |
2508.08949 |
null |
2025-08-12 |
EGGCodec: A Robust Neural Encodec Framework for EGG Reconstruction and F0 Extraction |
Rui Feng et.al. |
2508.08924 |
null |
2025-08-12 |
When and How Ultrasound Enhances Nanoparticle Diffusion in Hydrogels: A Stick-and-Release Mechanism |
Pablo M. Blanco et.al. |
2508.08918 |
null |
2025-08-12 |
Sound Signal Synthesis with Auxiliary Classifier GAN, COVID-19 cough as an example |
Yahya Sherif Solayman Mohamed Saleh et.al. |
2508.08892 |
null |
2025-08-12 |
Transient Noise Removal via Diffusion-based Speech Inpainting |
Mordehay Moradi et.al. |
2508.08890 |
null |
2025-08-12 |
DiffPhysCam: Differentiable Physics-Based Camera Simulation for Inverse Rendering and Embodied AI |
Bo-Hsun Chen et.al. |
2508.08831 |
null |
2025-08-12 |
Geometry-Aware Global Feature Aggregation for Real-Time Indirect Illumination |
Meng Gai et.al. |
2508.08826 |
null |
2025-08-12 |
TARA: Token-Aware LoRA for Composable Personalization in Diffusion Models |
Yuqi Peng et.al. |
2508.08812 |
null |
2025-08-12 |
Identity-Preserving Aging and De-Aging of Faces in the StyleGAN Latent Space |
Luis S. Luevano et.al. |
2508.08808 |
null |
2025-08-12 |
Anomalous Sodium Insertion in Highly Oriented Graphite: Thermodynamics, Kinetics and Evidence for Two-Sided Intercalation |
Chuanhai Gan et.al. |
2508.08806 |
null |
2025-08-14 |
Measurement-Based Quantum Diffusion Models |
Xinyu Liu et.al. |
2508.08799 |
null |
2025-08-12 |
DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation |
Tianyu Xiong et.al. |
2508.08783 |
null |
2025-08-12 |
Patient-Adaptive Focused Transmit Beamforming using Cognitive Ultrasound |
Wessel L. van Nierop et.al. |
2508.08782 |
null |
2025-08-12 |
Exploring Palette based Color Guidance in Diffusion Models |
Qianru Qiu et.al. |
2508.08754 |
null |
2025-08-12 |
Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models |
Ruofeng Yang et.al. |
2508.08735 |
null |
2025-08-13 |
A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models |
Lingzhe Zhang et.al. |
2508.08712 |
null |
2025-08-12 |
Towards Safe Imitation Learning via Potential Field-Guided Flow Matching |
Haoran Ding et.al. |
2508.08707 |
null |
2025-08-12 |
SafeFix: Targeted Model Repair via Controlled Image Generation |
Ouyang Xu et.al. |
2508.08701 |
null |
2025-08-12 |
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos |
Qi Zheng et.al. |
2508.08700 |
null |
2025-08-12 |
DiffVolume: Diffusion Models for Volume Generation in Limit Order Books |
Zhuohan Wang et.al. |
2508.08698 |
null |
2025-08-12 |
Detecting Sterile Neutrino Dark Matter at MeV Gamma-Ray Observatories |
Subaru Fujisawa et.al. |
2508.08695 |
null |
2025-08-12 |
Expert-Guided Diffusion Planner for Auto-bidding |
Yunshan Peng et.al. |
2508.08687 |
null |
2025-08-12 |
In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality |
Chenrui Liu et.al. |
2508.08673 |
null |
2025-08-12 |
Nonlinear dynamics of reaction-diffusion wave trains under large and fully nonlocalized modulations |
Joannis Alexopoulos et.al. |
2508.08637 |
null |
2025-08-14 |
Yan: Foundational Interactive Video Generation |
Deheng Ye et.al. |
2508.08601 |
null |
2025-08-12 |
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space |
Jingyun Liang et.al. |
2508.08588 |
null |
2025-08-12 |
Unlocking the Potential of Diffusion Priors in Blind Face Restoration |
Yunqi Miao et.al. |
2508.08556 |
null |
2025-08-12 |
UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction |
Dahai Yu et.al. |
2508.08551 |
null |
2025-08-12 |
Fluorescence time profile measurement of LAB based liquid scintillator in response to medium relativistic ion particles |
Xiaojie Luo et.al. |
2508.08546 |
null |
2025-08-12 |
Transition to Petschek Reconnection in Subrelativistic Pair Plasmas: Implications for Particle Acceleration |
Adam Robbins et.al. |
2508.08533 |
null |
2025-08-11 |
SynLLM: A Comparative Analysis of Large Language Models for Medical Tabular Synthetic Data Generation via Prompt Engineering |
Arshia Ilaty et.al. |
2508.08529 |
null |
2025-08-11 |
Control-affine Schrödinger Bridge and Generalized Bohm Potential |
Alexis M. H. Teter et.al. |
2508.08511 |
null |
2025-08-11 |
CObL: Toward Zero-Shot Ordinal Layering without User Prompting |
Aneel Damaraju et.al. |
2508.08498 |
null |
2025-08-11 |
MuGa-VTON: Multi-Garment Virtual Try-On via Diffusion Transformers with Prompt Customization |
Ankan Deria et.al. |
2508.08488 |
null |
2025-08-11 |
MAViS: A Multi-Agent Framework for Long-Sequence Video Storytelling |
Qian Wang et.al. |
2508.08487 |
null |
2025-08-11 |
Discrete Diffusion-Based Model-Level Explanation of Heterogeneous GNNs with Node Features |
Pallabee Das et.al. |
2508.08458 |
null |
2025-08-11 |
Hot Jupiter formation in dense stellar clusters: A Monte Carlo model applied to 47 Tucanae |
J. A. Wirth et.al. |
2508.08406 |
null |
2025-08-11 |
Wave Propagation Dynamics via Lattice Difference Equations |
Eddy Kwessi et.al. |
2508.08387 |
null |
2025-08-11 |
Spatiotemporally Consistent Indoor Lighting Estimation with Diffusion Priors |
Mutian Tong et.al. |
2508.08384 |
null |
2025-08-11 |
Exponentially Improved Constant in Quantum Solution Extraction |
Gumaro Rendon et.al. |
2508.08375 |
null |
2025-08-11 |
StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation |
Shuyuan Tu et.al. |
2508.08248 |
null |
2025-08-12 |
Cut2Next: Generating Next Shot via In-Context Tuning |
Jingwen He et.al. |
2508.08244 |
null |
2025-08-13 |
BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion |
Qiayuan Liao et.al. |
2508.08241 |
null |
2025-08-11 |
OMGSR: You Only Need One Mid-timestep Guidance for Real-World Image Super-Resolution |
Zhiqiang Wu et.al. |
2508.08227 |
null |
2025-08-11 |
Learning User Preferences for Image Generation Model |
Wenyi Mo et.al. |
2508.08220 |
null |
2025-08-11 |
Reinforcement Learning in Vision: A Survey |
Weijia Wu et.al. |
2508.08189 |
null |
2025-08-13 |
CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data |
Chongke Bi et.al. |
2508.08173 |
null |
2025-08-11 |
ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction |
Chaojun Ni et.al. |
2508.08170 |
null |
2025-08-11 |
An effective potential for generative modelling with active matter |
Adrian Baule et.al. |
2508.08146 |
null |
2025-08-11 |
Reproducing and Extending Brownian Motion in Optical Trap: A Computational Reimplementation of Volpe and Volpe (2013) |
Eyad I. B Hamid et.al. |
2508.08138 |
null |
2025-08-11 |
FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting |
Yitong Yang et.al. |
2508.08136 |
null |
2025-08-11 |
Optimal Dividend, Reinsurance, and Capital Injection Strategies for an Insurer with Two Collaborating Business Lines |
Tim J. Boonen et.al. |
2508.08130 |
null |
2025-08-11 |
Learned Regularization for Microwave Tomography |
Bowen Tong et.al. |
2508.08114 |
null |
2025-08-11 |
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning |
Junzhe Xu et.al. |
2508.08098 |
null |
2025-08-11 |
Fast and Generalizable parameter-embedded Neural Operators for Lithium-Ion Battery Simulation |
Amir Ali Panahi et.al. |
2508.08087 |
null |
2025-08-11 |
Matrix-3D: Omnidirectional Explorable 3D World Generation |
Zhongqi Yang et.al. |
2508.08086 |
null |
2025-08-12 |
Why Bohmian velocity might not be the only quantum velocity and the role of quantum diffusion flux is super-luminal wave packets |
Charalampos Antonakos et.al. |
2508.08065 |
null |
2025-08-11 |
S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix |
Peng Dai et.al. |
2508.08048 |
null |
2025-08-12 |
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation |
Fangyuan Mao et.al. |
2508.07981 |
null |
2025-08-11 |
Well-posedness for a fourth-order nonisothermal tumor growth model of Caginalp type |
Giulia Cavalleri et.al. |
2508.07979 |
null |
2025-08-12 |
Adaptive Multiple Access and Service Placement for Generative Diffusion Models |
Hamidreza Mazandarani et.al. |
2508.07978 |
null |
2025-08-11 |
Deep imaging of the galaxy Malin 2 shows new faint structures and a candidate satellite dwarf galaxy |
Junais et.al. |
2508.07930 |
null |
2025-08-11 |
Score Augmentation for Diffusion Models |
Liang Hou et.al. |
2508.07926 |
null |
2025-08-11 |
Generative Video Matting |
Yongtao Ge et.al. |
2508.07905 |
null |
2025-08-11 |
Diffusing the Blind Spot: Uterine MRI Synthesis with Diffusion Models |
Johanna P. Müller et.al. |
2508.07903 |
null |
2025-08-12 |
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation |
Bowen Xue et.al. |
2508.07901 |
null |
2025-08-11 |
NeeCo: Image Synthesis of Novel Instrument States Based on Dynamic and Deformable 3D Gaussian Reconstruction |
Tianle Zeng et.al. |
2508.07897 |
null |
2025-08-11 |
Deep Learning-Based Desikan-Killiany Parcellation of the Brain Using Diffusion MRI |
Yousef Sadegheih et.al. |
2508.07815 |
null |
2025-08-11 |
DiTVR: Zero-Shot Diffusion Transformer for Video Restoration |
Sicheng Gao et.al. |
2508.07811 |
null |
2025-08-11 |
MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks |
Yushen Xu et.al. |
2508.07803 |
null |
2025-08-11 |
Generative Inversion for Property-Targeted Materials Design: Application to Shape Memory Alloys |
Cheng Li et.al. |
2508.07798 |
null |
2025-08-11 |
Feynman-Kac formula gor general time dependent stochastic parabolic equation on a bounded domain and applications |
Yaozhong Hu et.al. |
2508.07793 |
null |
2025-08-13 |
AgentWorld: An Interactive Simulation Platform for Scene Construction and Mobile Robotic Manipulation |
Yizheng Zhang et.al. |
2508.07770 |
null |
2025-08-11 |
Dream4D: Lifting Camera-Controlled I2V towards Spatiotemporally Consistent 4D Generation |
Xiaoyan Liu et.al. |
2508.07769 |
null |
2025-08-11 |
Sea-Undistort: A Dataset for Through-Water Image Restoration in High Resolution Airborne Bathymetric Mapping |
Maximilian Kromer et.al. |
2508.07760 |
null |
2025-08-11 |
Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild |
Haoran Wang et.al. |
2508.07759 |
null |
2025-08-11 |
Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion |
Minseo Kim et.al. |
2508.07755 |
null |
2025-08-11 |
Grouped Speculative Decoding for Autoregressive Image Generation |
Junhyuk So et.al. |
2508.07747 |
null |
2025-08-11 |
Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder? |
Hui-Peng Du et.al. |
2508.07711 |
null |
2025-08-11 |
Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing |
Weitao Wang et.al. |
2508.07700 |
null |
2025-08-11 |
DiffVC-OSD: One-Step Diffusion-based Perceptual Neural Video Compression Framework |
Wenzhuo Ma et.al. |
2508.07682 |
null |
2025-08-11 |
LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering |
Xiaohang Zhan et.al. |
2508.07647 |
null |
2025-08-11 |
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning |
Jian Ma et.al. |
2508.07607 |
null |
2025-08-11 |
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation |
Wenhui Song et.al. |
2508.07603 |
null |
2025-08-11 |
ShoulderShot: Generating Over-the-Shoulder Dialogue Videos |
Yuang Zhang et.al. |
2508.07597 |
null |
2025-08-11 |
Procedural Mixture Sets |
Hendrik Rommeswinkel et.al. |
2508.07588 |
null |
2025-08-12 |
From Platform Migration to Cultural Integration: the Ingress and Diffusion of #wlw from TikTok to RedNote in Queer Women Communities |
Ziqi Pan et.al. |
2508.07579 |
null |
2025-08-11 |
UniFlow: Unifying Speech Front-End Tasks via Continuous Generative Modeling |
Ziqian Wang et.al. |
2508.07558 |
null |
2025-08-11 |
Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation |
Minghao Yin et.al. |
2508.07557 |
null |
2025-08-11 |
Physics-informed Multiresolution Wavelet Neural Network Method for Solving Partial Differential Equations |
Feng Han et.al. |
2508.07546 |
null |
2025-08-11 |
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing |
Joonghyuk Shin et.al. |
2508.07519 |
null |
2025-08-10 |
Forecasting solar power output in Ibadan: A machine learning approach leveraging weather data and system specifications |
Obarotu Peter Urhuerhi et.al. |
2508.07462 |
null |
2025-08-10 |
Unified Semiclassical Theory of Nonlinear Hall Effect:Bridging Ballistic and Diffusive Transport Regime |
Xinyu Liu et.al. |
2508.07445 |
null |
2025-08-10 |
Robust, fast, and adaptive splitting schemes for nonlinear doubly-degenerate diffusion equations |
Ayesha Javed et.al. |
2508.07420 |
null |
2025-08-10 |
CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization |
Youqi Wang et.al. |
2508.07413 |
null |
2025-08-10 |
Conditional splitting probabilities for hidden-state inference in drift-diffusive processes |
Emir Sezik et.al. |
2508.07386 |
null |
2025-08-10 |
Supercritical fluids as a distinct state of matter characterized by sub-short-range structural order |
Sha Jin et.al. |
2508.07385 |
null |
2025-08-10 |
SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts Removal |
Tingyu Yang et.al. |
2508.07346 |
null |
2025-08-10 |
CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation |
Fangtai Wu et.al. |
2508.07341 |
null |
2025-08-10 |
Linear-Quadratic Mean Field Games with Common Noise: A Direct Approach |
Wenyu Cong et.al. |
2508.07271 |
null |
2025-08-10 |
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers |
Xin Ma et.al. |
2508.07246 |
null |
2025-08-10 |
Causal Negative Sampling via Diffusion Model for Out-of-Distribution Recommendation |
Chu Zhao et.al. |
2508.07243 |
null |
2025-08-10 |
HaDM-ST: Histology-Assisted Differential Modeling for Spatial Transcriptomics Generation |
Xuepeng Liu et.al. |
2508.07225 |
null |
2025-08-10 |
Neural Bridge Processes |
Jian Xu et.al. |
2508.07220 |
null |
2025-08-10 |
Explainability-in-Action: Enabling Expressive Manipulation and Tacit Understanding by Bending Diffusion Models in ComfyUI |
Ahmed M. Abuzuraiq et.al. |
2508.07183 |
null |
2025-08-10 |
CoopDiff: Anticipating 3D Human-object Interactions via Contact-consistent Decoupled Diffusion |
Xiaotong Lin et.al. |
2508.07162 |
null |
2025-08-10 |
SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models |
Ruolin Yang et.al. |
2508.07149 |
null |
2025-08-10 |
Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction |
Yu Liu et.al. |
2508.07146 |
null |
2025-08-10 |
SketchConcept: Sketching-based Concept Recomposition for Product Design using Generative AI |
Runlin Duan et.al. |
2508.07141 |
null |
2025-08-10 |
Canvas3D: Empowering Precise Spatial Control for Image Generation with Constraints from a 3D Virtual Canvas |
Runlin Duan et.al. |
2508.07135 |
null |
2025-08-10 |
On the geometric Brownian motion with state-dependent variable exponent diffusion term |
Mustafa Avci et.al. |
2508.07130 |
null |
2025-08-10 |
Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays |
Gregory Schuit et.al. |
2508.07128 |
null |
2025-08-10 |
Modelling Human Skin Morphology and Simulating Transdermal Transport of 50 Chemicals |
Milana Tesfamarian et.al. |
2508.07123 |
null |
2025-08-09 |
DexFruit: Dexterous Manipulation and Gaussian Splatting Inspection of Fruit |
Aiden Swann et.al. |
2508.07118 |
null |
2025-08-09 |
Whisfusion: Parallel ASR Decoding via a Diffusion Transformer |
Taeyoun Kwon et.al. |
2508.07048 |
null |
2025-08-09 |
A Stage-Aware Mixture of Experts Framework for Neurodegenerative Disease Progression Modelling |
Tiantian He et.al. |
2508.07032 |
null |
2025-08-09 |
Trustworthy Medical Imaging with Large Language Models: A Study of Hallucinations Across Modalities |
Anindya Bijoy Das et.al. |
2508.07031 |
null |
2025-08-09 |
Vec2Summ: Text Summarization via Probabilistic Sentence Embeddings |
Mao Li et.al. |
2508.07017 |
null |
2025-08-12 |
HiMat: DiT-based Ultra-High Resolution SVBRDF Generation |
Zixiong Wang et.al. |
2508.07011 |
null |
2025-08-09 |
Spatio-Temporal Conditional Diffusion Models for Forecasting Future Multiple Sclerosis Lesion Masks Conditioned on Treatments |
Gian Mario Favero et.al. |
2508.07006 |
null |
2025-08-09 |
Mechanism of Anisotropic Crystallization and Phase Transitions under Van der Waals Squeezing |
Yuxiang Gao et.al. |
2508.06992 |
null |
2025-08-09 |
WeatherDiffusion: Weather-Guided Diffusion Model for Forward and Inverse Rendering |
Yixin Zhu et.al. |
2508.06982 |
null |
2025-08-09 |
Structure-Preserving Digital Twins via Conditional Neural Whitney Forms |
Brooks Kinch et.al. |
2508.06981 |
null |
2025-08-09 |
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing |
Weiyan Xie et.al. |
2508.06937 |
null |
2025-08-09 |
Unveiling the Puzzle of Brittleness in Single Crystal Iridium |
Qing Cheng et.al. |
2508.06929 |
null |
2025-08-09 |
AR-GRPO: Training Autoregressive Image Generation Models via Reinforcement Learning |
Shihao Yuan et.al. |
2508.06924 |
null |
2025-08-09 |
Talk2Image: A Multi-Agent System for Multi-Turn Image Generation and Editing |
Shichao Ma et.al. |
2508.06916 |
null |
2025-08-09 |
MultiRef: Controllable Image Generation with Multiple Visual References |
Ruoxi Chen et.al. |
2508.06905 |
null |
2025-08-09 |
Text to Speech System for Meitei Mayek Script |
Gangular Singh Irengbam et.al. |
2508.06870 |
null |
2025-08-09 |
Speech Enhancement based on cascaded two flow |
Seonggyu Lee et.al. |
2508.06842 |
null |
2025-08-09 |
FlowSE: Flow Matching-based Speech Enhancement |
Seonggyu Lee et.al. |
2508.06840 |
null |
2025-08-09 |
Towards Effective Prompt Stealing Attack against Text-to-Image Diffusion Models |
Shiqian Zhao et.al. |
2508.06837 |
null |
2025-08-09 |
A Score-based Diffusion Model Approach for Adaptive Learning of Stochastic Partial Differential Equation Solutions |
Toan Huynh et.al. |
2508.06834 |
null |
2025-08-09 |
Efficient data-driven regression for reduced-order modeling of spatial pattern formation |
Alessandro Alla et.al. |
2508.06833 |
null |
2025-08-09 |
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation |
Xiao Huang et.al. |
2508.06806 |
null |
2025-08-09 |
D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning |
Shu-Ang Yu et.al. |
2508.06804 |
null |
2025-08-09 |
GaN/InN HEMT based UV photodetector on SiC with hexagonal boron nitride passivation |
Mustafa Kilin et.al. |
2508.06782 |
null |
2025-08-08 |
Topology Generation of UAV Covert Communication Networks: A Graph Diffusion Approach with Incentive Mechanism |
Xin Tang et.al. |
2508.06746 |
null |
2025-08-08 |
Design of high-mobility p-type GaN via the piezomobility tensor |
Jie-Cheng Chen et.al. |
2508.06723 |
null |
2025-08-08 |
Restage4D: Reanimating Deformable 3D Reconstruction from a Single Video |
Jixuan He et.al. |
2508.06715 |
null |
2025-08-08 |
LightSwitch: Multi-view Relighting with Material-guided Diffusion |
Yehonathan Litman et.al. |
2508.06494 |
null |
2025-08-08 |
Weak approximation of stochastic differential equations with sticky boundary conditions |
Akash Sharma et.al. |
2508.06487 |
null |
2025-08-08 |
SlimInfer: Accelerating Long-Context LLM Inference via Dynamic Token Pruning |
Lingkun Long et.al. |
2508.06447 |
null |
2025-08-08 |
SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation |
Guido Manni et.al. |
2508.06429 |
null |
2025-08-08 |
4D operando X-ray nano-holo-tomography reveals multiscale chemomechanics in Silicon-Graphite anode |
Victor Vanpeene et.al. |
2508.06413 |
null |
2025-08-08 |
FVGen: Accelerating Novel-View Synthesis with Adversarial Video Diffusion Distillation |
Wenbin Teng et.al. |
2508.06392 |
null |
2025-08-08 |
Diffuse measures and nonlinear parabolic equations |
Francesco Petitta et.al. |
2508.06384 |
null |
2025-08-08 |
ActivityDiff: A diffusion model with Positive and Negative Activity Guidance for De Novo Drug Design |
Renyi Zhou et.al. |
2508.06364 |
null |
2025-08-08 |
Quantum Algorithm for Estimating Intrinsic Geometry |
Nhat A. Nghiem et.al. |
2508.06355 |
null |
2025-08-08 |
Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging? |
Xin Ci Wong et.al. |
2508.06327 |
null |
2025-08-08 |
OM2P: Offline Multi-Agent Mean-Flow Policy |
Zhuoran Li et.al. |
2508.06269 |
null |
2025-08-08 |
ADPro: a Test-time Adaptive Diffusion Policy for Robot Manipulation via Manifold and Initial Noise Constraints |
Zezeng Li et.al. |
2508.06266 |
null |
2025-08-08 |
Tanaka formula for SDEs driven by fractional Brownian motion |
Tommi Sottinen et.al. |
2508.06261 |
null |
2025-08-08 |
Low dimensional dynamics of a sparse balanced synaptic network of quadratic integrate-and-fire neurons |
Maria V. Ageeva et.al. |
2508.06253 |
null |
2025-08-08 |
Light-Addressable Smart Nanostructures via Resonant Nanoheating |
Victor Tabouillot et.al. |
2508.06215 |
null |
2025-08-08 |
Inverse Source Problems for the Time-Fractional Evolution Equation |
Rahmonov Askar Ahmadovich et.al. |
2508.06209 |
null |
2025-08-08 |
Clinically-guided Data Synthesis for Laryngeal Lesion Detection |
Chiara Baldini et.al. |
2508.06182 |
null |
2025-08-08 |
Synthetic Data-Driven Multi-Architecture Framework for Automated Polyp Segmentation Through Integrated Detection and Mask Generation |
Ojonugwa Oluwafemi Ejiga Peter et.al. |
2508.06170 |
null |
2025-08-08 |
Sharp non-existence threshold for a parabolic Hardy-H{é}non equation with quasilinear diffusion |
Razvan Gabriel Iagar et.al. |
2508.06164 |
null |
2025-08-08 |
Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment |
Zhenbang Du et.al. |
2508.06160 |
null |
2025-08-08 |
Revealing the Staging Structural Evolution and Li (De)Intercalation Kinetics in Graphite Anodes via Machine Learning Potential |
Liqi Wang et.al. |
2508.06156 |
null |
2025-08-08 |
VISTAR:A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation |
Kaiyuan Jiang et.al. |
2508.06152 |
null |
2025-08-08 |
Improving Diagnostic Accuracy for Oral Cancer with inpainting Synthesis Lesions Generated Using Diffusion Models |
Yong Oh Lee et.al. |
2508.06151 |
null |
2025-08-08 |
DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera |
Shaohua Pan et.al. |
2508.06139 |
null |
2025-08-08 |
GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving |
Jian Wang et.al. |
2508.06113 |
null |
2025-08-08 |
MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment |
Gui Zou et.al. |
2508.06104 |
null |
2025-08-08 |
UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization |
Yachun Mi et.al. |
2508.06101 |
null |
2025-08-08 |
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows |
Xiquan Li et.al. |
2508.06098 |
null |
2025-08-08 |
E-React: Towards Emotionally Controlled Synthesis of Human Reactions |
Chen Zhu et.al. |
2508.06093 |
null |
2025-08-08 |
SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment |
Yanxiao Sun et.al. |
2508.06082 |
null |
2025-08-08 |
DreamVE: Unified Instruction-based Image and Video Editing |
Bin Xia et.al. |
2508.06080 |
null |
2025-08-08 |
Towards MR-Based Trochleoplasty Planning |
Michael Wehrli et.al. |
2508.06076 |
null |
2025-08-08 |
Radio continuum and \HI 21-cm line observations of a nearby luminous infrared galaxy IRAS 17526+3253 |
Jianfeng Wu et.al. |
2508.06075 |
null |
2025-08-08 |
Real-time physics-informed reconstruction of transient fields using sensor guidance and higher-order time differentiation |
Hong-Kyun Noh et.al. |
2508.06070 |
null |
2025-08-08 |
ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Generation |
Daniel Lee et.al. |
2508.06065 |
null |
2025-08-08 |
NEP: Autoregressive Image Editing via Next Editing Token Prediction |
Huimin Wu et.al. |
2508.06044 |
null |
2025-08-08 |
Bayesian Radio Map Estimation: Fundamentals and Implementation via Diffusion Models |
Tien Ngoc Ha et.al. |
2508.06037 |
null |
2025-08-08 |
InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow |
Yiming Gong et.al. |
2508.06033 |
null |
2025-08-08 |
Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body Parts |
Kiran Chhatre et.al. |
2508.06032 |
null |
2025-08-08 |
Improved Sub-Visible Particle Classification in Flow Imaging Microscopy via Generative AI-Based Image Synthesis |
Utku Ozbulak et.al. |
2508.06021 |
null |
2025-08-08 |
Vacuum Dealloyed Brass as Li-Metal Battery Current Collector: Effect of Zinc and Porosity |
Eric V Woods et.al. |
2508.06015 |
null |
2025-08-08 |
ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors |
Minsu Kim et.al. |
2508.06014 |
null |
2025-08-08 |
KnapFormer: An Online Load Balancer for Efficient Diffusion Transformers Training |
Kai Zhang et.al. |
2508.06001 |
null |
2025-08-08 |
Global solutions in $L^{p}{v}L^{\infty}{x}$ for the Boltzmann equation in bounded domains |
Dingqun Deng et.al. |
2508.05985 |
null |
2025-08-08 |
Revisiting $μ$ SR Studies of Ion Dynamics in the Light of Extended Kubo-Toyabe Model |
Takashi U. Ito et.al. |
2508.05968 |
null |
2025-08-08 |
Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents |
Han Lin et.al. |
2508.05954 |
null |
2025-08-08 |
A 3DGS-Diffusion Self-Supervised Framework for Normal Estimation from a Single Image |
Yanxing Liang et.al. |
2508.05950 |
null |
2025-08-08 |
Latent Policy Barrier: Learning Robust Visuomotor Policies by Staying In-Distribution |
Zhanyi Sun et.al. |
2508.05941 |
null |
2025-08-08 |
Reverse Diffusion Sequential Monte Carlo Samplers |
Luhuan Wu et.al. |
2508.05926 |
null |
2025-08-08 |
Fast, Convex and Conditioned Network for Multi-Fidelity Vectors and Stiff Univariate Differential Equations |
Siddharth Rout et.al. |
2508.05921 |
null |
2025-08-07 |
Measurement of All Flavor PeV Neutrino Flux using Combined Datasets from IceCube |
Emre Yildizci et.al. |
2508.05886 |
null |
2025-08-07 |
Emerging ultra-wide band gap semiconductors for future high-frequency electronics |
Emily M. Garrity et.al. |
2508.05823 |
null |
2025-08-07 |
FineDialFact: A benchmark for Fine-grained Dialogue Fact Verification |
Xiangyan Chen et.al. |
2508.05782 |
null |
2025-08-07 |
MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss |
Can Zhao et.al. |
2508.05772 |
null |
2025-08-07 |
UnGuide: Learning to Forget with LoRA-Guided Diffusion Models |
Agnieszka Polowczyk et.al. |
2508.05755 |
null |
2025-08-07 |
Quantum Reservoir GAN |
Hikaru Wakaura et.al. |
2508.05716 |
null |
2025-08-07 |
High multiplicity and global structure of coexistence states in a predator-prey model with saturation |
Kousuke Kuto et.al. |
2508.05714 |
null |
2025-08-07 |
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation |
Yue Liao et.al. |
2508.05635 |
null |
2025-08-07 |
GAP: Gaussianize Any Point Clouds with Text Guidance |
Weiqi Zhang et.al. |
2508.05631 |
null |
2025-08-07 |
Latent Space Diffusion for Topology Optimization |
Aaron Lutheran et.al. |
2508.05624 |
null |
2025-08-07 |
Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision |
Luozheng Qin et.al. |
2508.05606 |
null |
2025-08-07 |
Unveiling the Lithium-Ion Transport Mechanism in Li2ZrCl6 Solid-State Electrolyte via Deep Learning-Accelerated Molecular Dynamics Simulations |
Hanzeng Guo et.al. |
2508.05598 |
null |
2025-08-07 |
Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis |
Yifan Wang et.al. |
2508.05572 |
null |
2025-08-07 |
MagicHOI: Leveraging 3D Priors for Accurate Hand-object Reconstruction from Short Monocular Video Clips |
Shibo Wang et.al. |
2508.05506 |
null |
2025-08-07 |
Heat and super-diffusive melting fronts in unsaturated porous media |
Eirik G. Flekkøy et.al. |
2508.05451 |
null |
2025-08-07 |
Whose Truth? Pluralistic Geo-Alignment for (Agentic) AI |
Krzysztof Janowicz et.al. |
2508.05432 |
null |
2025-08-07 |
MolSnap: Snap-Fast Molecular Generation with Latent Variational Mean Flow |
Md Atik Ahamed et.al. |
2508.05411 |
null |
2025-08-07 |
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation |
Wonjun Kang et.al. |
2508.05399 |
null |
2025-08-07 |
Real-Time Iteration Scheme for Diffusion Policy |
Yufei Duan et.al. |
2508.05396 |
null |
2025-08-09 |
Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms |
Jie Xiao et.al. |
2508.05387 |
null |
2025-08-07 |
Multi-Modal Multi-Behavior Sequential Recommendation with Conditional Diffusion-Based Feature Denoising |
Xiaoxi Cui et.al. |
2508.05352 |
null |
2025-08-07 |
Stranski-Krastanov Growth of Disordered ScNx Thin Films on MgO(100): Influence of Defect Densities on Electronic Structure and Transport Properties |
Susmita Chowdhury et.al. |
2508.05330 |
null |
2025-08-07 |
Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting |
Frank Ruis et.al. |
2508.05323 |
null |
2025-08-07 |
Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces |
Mathias Rose Bjare et.al. |
2508.05306 |
null |
2025-08-07 |
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens |
Nikita Dragunov et.al. |
2508.05305 |
null |
2025-08-07 |
An Investigation into the Distribution of Ratios of Particle Solver-based Likelihoods |
Emil Løvbak et.al. |
2508.05303 |
null |
2025-08-07 |
Wavelet-Guided Dual-Frequency Encoding for Remote Sensing Change Detection |
Xiaoyang Zhang et.al. |
2508.05271 |
null |
2025-08-07 |
B4DL: A Benchmark for 4D LiDAR LLM in Spatio-Temporal Understanding |
Changho Choi et.al. |
2508.05269 |
null |
2025-08-07 |
SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion |
Xiaoyang Zhang et.al. |
2508.05264 |
null |
2025-08-07 |
ArbiViewGen: Controllable Arbitrary Viewpoint Camera Data Generation for Autonomous Driving via Stable Diffusion Models |
Yatong Lan et.al. |
2508.05236 |
null |
2025-08-07 |
Parabolic abstract evolution equations in cylindrical domains and uniformly local Sobolev spaces |
Joly Romain et.al. |
2508.05220 |
null |
2025-08-07 |
An asymptotic-preserving active flux scheme for the hyperbolic heat equation in the diffusive scaling |
Junming Duan et.al. |
2508.05166 |
null |
2025-08-07 |
RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer |
Fangyu Du et.al. |
2508.05115 |
null |
2025-08-07 |
PoseGen: In-Context LoRA Finetuning for Pose-Controllable Long Human Video Generation |
Jingxuan He et.al. |
2508.05091 |
null |
2025-08-07 |
MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design |
Hao Li et.al. |
2508.05076 |
null |
2025-08-07 |
Align-for-Fusion: Harmonizing Triple Preferences via Dual-oriented Diffusion for Cross-domain Sequential Recommendation |
Yongfu Zha et.al. |
2508.05074 |
null |
2025-08-07 |
FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer |
Jian Zhu et.al. |
2508.05069 |
null |
2025-08-07 |
DualMat: PBR Material Estimation via Coherent Dual-Path Diffusion |
Yifeng Huang et.al. |
2508.05060 |
null |
2025-08-07 |
Observation of Super-ballistic Brownian Motion in Liquid |
Jason Boynewicz et.al. |
2508.05031 |
null |
2025-08-07 |
Coupled 1D Chemical Kinetic-Transport and 2D Hydrodynamic Modeling Supports a modest 1-1.5x Supersolar Oxygen Abundance in Jupiter’s Atmosphere |
Jeehyun Yang et.al. |
2508.05007 |
null |
2025-08-07 |
Switching Diffusion Systems with Past-Dependent Switching and Countable State Space: Successful Couplings and Strong Ergodicity |
Fubao Xi et.al. |
2508.04997 |
null |
2025-08-08 |
REF-VC: Robust, Expressive and Fast Zero-Shot Voice Conversion with Diffusion Transformers |
Yuepeng Jiang et.al. |
2508.04996 |
null |
2025-08-07 |
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression |
Zheng Chen et.al. |
2508.04979 |
null |
2025-08-06 |
Simulation of Non-Premixed, Supersonic Combustion using the Discontinuous Galerkin Method on Fully Unstructured Grids |
Cal J. Rising et.al. |
2508.04930 |
null |
2025-08-06 |
Taxonomy of Faults in Attention-Based Neural Networks |
Sigma Jahan et.al. |
2508.04925 |
null |
2025-08-08 |
Learning AI Auditing: A Case Study of Teenagers Auditing a Generative AI Model |
Luis Morales-Navarro et.al. |
2508.04902 |
null |
2025-08-06 |
The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models |
Leo Zhang et.al. |
2508.04884 |
null |
2025-08-06 |
Unified Flow Matching for Long Horizon Event Forecasting |
Xiao Shou et.al. |
2508.04843 |
null |
2025-08-06 |
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off |
Seungyong Lee et.al. |
2508.04825 |
null |
2025-08-06 |
Delay-constrained re-entry governs large-scale brain seizures and other network pathologies |
Paul Triebkorn et.al. |
2508.04824 |
null |
2025-08-06 |
Single-Step Reconstruction-Free Anomaly Detection and Segmentation via Diffusion Models |
Mehrdad Moradi et.al. |
2508.04818 |
null |
2025-08-06 |
Stochastic Optimal Control with Control-Dependent Diffusion and State Constraints: A Degenerate Elliptic Approach |
Anderson O. Calixto et.al. |
2508.04809 |
null |
2025-08-06 |
Electrodeless Magnetohydrodynamic Local Force Generator for Aerocapture |
Bernard Parent et.al. |
2508.04806 |
null |
2025-08-06 |
ACM Multimedia Grand Challenge on ENT Endoscopy Analysis |
Trong-Thuan Nguyen et.al. |
2508.04801 |
null |
2025-08-08 |
Quantum-impurity sensing of altermagnetic order |
V. A. S. V. Bittencourt et.al. |
2508.04788 |
null |
2025-08-06 |
Edge-Assisted Collaborative Fine-Tuning for Multi-User Personalized Artificial Intelligence Generated Content (AIGC) |
Nan Li et.al. |
2508.04745 |
null |
2025-08-06 |
A colossal dielectric response of HfxZr1-xO2 nanoparticles |
Oleksandr S. Pylypchuk et.al. |
2508.04697 |
null |
2025-08-06 |
Diffusion in a $d$ -dimensional rough potential |
Jacob Jeffries et.al. |
2508.04674 |
null |
2025-08-06 |
HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models |
Young D. Kwon et.al. |
2508.04663 |
null |
2025-08-06 |
Stochastic Calculus for Pathwise Observables of Markov-Jump Processes: Unification of Diffusion and Jump Dynamics |
Lars Torbjørn Stutzer et.al. |
2508.04647 |
null |
2025-08-06 |
A unified model for linear responses of physical networks |
José M. Ortiz-Tavárez et.al. |
2508.04616 |
null |
2025-08-06 |
Multitask Learning with Stochastic Interpolants |
Hugo Negrel et.al. |
2508.04605 |
null |
2025-08-07 |
A Comprehensive Framework for Uncertainty Quantification of Voxel-wise Supervised Models in IVIM MRI |
Nicola Casali et.al. |
2508.04588 |
null |
2025-08-06 |
Joint Communication and Indoor Positioning Based on Visible Light in the Presence of Dimming |
A. Tarik Leblebici et.al. |
2508.04570 |
null |
2025-08-06 |
DDTracking: A Deep Generative Framework for Diffusion MRI Tractography with Streamline Local-Global Spatiotemporal Modeling |
Yijie Li et.al. |
2508.04568 |
null |
2025-08-06 |
TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning |
Yunbi Liu et.al. |
2508.04565 |
null |
2025-08-06 |
Drone Detection with Event Cameras |
Gabriele Magrini et.al. |
2508.04564 |
null |
2025-08-06 |
One Model For All: Partial Diffusion for Unified Try-On and Try-Off in Any Pose |
Jinxi Liu et.al. |
2508.04559 |
null |
2025-08-06 |
Two-Way Garment Transfer: Unified Diffusion Framework for Dressing and Undressing Synthesis |
Angang Zhang et.al. |
2508.04551 |
null |
2025-08-06 |
MSC: A Marine Wildlife Video Dataset with Grounded Segmentation and Clip-Level Captioning |
Quang-Trung Truong et.al. |
2508.04549 |
null |
2025-08-06 |
X-ray thermal diffuse scattering as a texture-robust temperature diagnostic for dynamically compressed solids |
P. G. Heighway et.al. |
2508.04525 |
null |
2025-08-06 |
$β$ -Irida-Graphene: A New 2D Carbon Allotrope for Sodium-Ion Battery Anodes |
José A. S. Laranjeira et.al. |
2508.04506 |
null |
2025-08-06 |
QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution |
Bowen Chai et.al. |
2508.04485 |
null |
2025-08-06 |
Zero-Residual Concept Erasure via Progressive Alignment in Text-to-Image Model |
Hongxu Chen et.al. |
2508.04472 |
null |
2025-08-06 |
4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation |
Shuzhou Yang et.al. |
2508.04467 |
null |
2025-08-06 |
Case Studies of Generative Machine Learning Models for Dynamical Systems |
Nachiket U. Bapat et.al. |
2508.04459 |
null |
2025-08-06 |
Cognitive Effort in the Two-Step Task: An Active Inference Drift-Diffusion Model Approach |
Alvaro Garrido Perez et.al. |
2508.04435 |
null |
2025-08-06 |
Unmasking Interstitial Lung Diseases: Leveraging Masked Autoencoders for Diagnosis |
Ethan Dack et.al. |
2508.04429 |
null |
2025-08-06 |
Hydrodynamic Effects in Cryogenic Buffer Gas Cells: Design Insights from Hybrid Simulations |
Nick Vogeley et.al. |
2508.04364 |
null |
2025-08-06 |
Derivation and Numerical Simulation of a Thermodynamically Consistent Magneto Two-Phase Flow Model for Magnetic Drug Targeting |
Eberhard Bänsch et.al. |
2508.04360 |
null |
2025-08-06 |
From Split to Share: Private Inference with Distributed Feature Sharing |
Zihan Liu et.al. |
2508.04346 |
null |
2025-08-06 |
Performative Market Making |
Charalampos Kleitsikas et.al. |
2508.04344 |
null |
2025-08-06 |
TempFlow-GRPO: When Timing Matters for GRPO in Flow Models |
Xiaoxuan He et.al. |
2508.04324 |
null |
2025-08-06 |
Wave coupling in partially ionized plasmas with shear flows I. Fast-to-Alfvén transformation |
Miquel Cantallops et.al. |
2508.04319 |
null |
2025-08-06 |
Turbulent Injection assisted by Diffusion Models for Scale Resolving Simulations |
Margaux Boxho et.al. |
2508.04318 |
null |
2025-08-06 |
Parameter Estimation for Weakly Interacting Hypoelliptic Diffusions |
Yuga Iguchi et.al. |
2508.04287 |
null |
2025-08-06 |
S2M3: Split-and-Share Multi-Modal Models for Distributed Multi-Task Inference on the Edge |
JinYi Yoon et.al. |
2508.04271 |
null |
2025-08-06 |
Sparse Narrow-Band Topology Optimization for Large-Scale Thermal-Fluid Applications |
Vladislav Pimanov et.al. |
2508.04261 |
null |
2025-08-06 |
High-Dimensional Matrix-Variate Diffusion Index Models for Time Series Forecasting |
Zhiren Ma et.al. |
2508.04259 |
null |
2025-08-06 |
Suspensions of small ultra-soft colloids remain liquids in overcrowded conditions |
Nikolaos A. Burger et.al. |
2508.04244 |
null |
2025-08-06 |
PIS3R: Very Large Parallax Image Stitching via Deep 3D Reconstruction |
Muhua Zhu et.al. |
2508.04236 |
null |
2025-08-06 |
DocVCE: Diffusion-based Visual Counterfactual Explanations for Document Image Classification |
Saifullah Saifullah et.al. |
2508.04233 |
null |
2025-08-06 |
Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction |
Yu Liu et.al. |
2508.04229 |
null |
2025-08-06 |
LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation |
Kangrui Cen et.al. |
2508.04228 |
null |
2025-08-06 |
DP-DocLDM: Differentially Private Document Image Generation using Latent Diffusion Models |
Saifullah Saifullah et.al. |
2508.04208 |
null |
2025-08-06 |
A background-free signal of jet-induced diffusion wake in quark-gluon plasma |
Zhong Yang et.al. |
2508.04194 |
null |
2025-08-06 |
Deeper Inside Deep ViT |
Sungrae Hong et.al. |
2508.04181 |
null |
2025-08-06 |
Quasi-Clique Discovery via Energy Diffusion |
Yu Zhang et.al. |
2508.04174 |
null |
2025-08-06 |
Non-Equilibrium Dynamics and First-Passage Properties of Stochastic Processes: From Brownian Motion to Active Particles |
Mathis Guéneau et.al. |
2508.04154 |
null |
2025-08-06 |
IDCNet: Guided Video Diffusion for Metric-Consistent RGBD Scene Generation with Precise Camera Control |
Lijuan Liu et.al. |
2508.04147 |
null |
2025-08-06 |
Polynomial-time sampling despite disorder chaos |
Eric Ma et.al. |
2508.04133 |
null |
2025-08-06 |
Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation |
Maximilian Ulmer et.al. |
2508.04122 |
null |
2025-08-06 |
Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework |
Yi-Ting Chen et.al. |
2508.04090 |
null |
2025-08-06 |
Long time behavior and Yaglom limit for real trait-structured Birth and Death Processes |
Pierre Collet et.al. |
2508.04089 |
null |
2025-08-06 |
Convolutional autoencoders for the reconstruction of three-dimensional interfacial multiphase flows |
Murray Cutforth et.al. |
2508.04084 |
null |
2025-08-06 |
POD-based reduced order modeling of global-in-time iterative decoupled algorithms for Biot’s consolidation model |
Huipeng Gu et.al. |
2508.04082 |
null |
2025-08-06 |
Uni-DocDiff: A Unified Document Restoration Model Based on Diffusion |
Fangmin Zhao et.al. |
2508.04055 |
null |
2025-08-06 |
Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation |
Jiayi He et.al. |
2508.04049 |
null |
2025-08-06 |
Nonlinear stability of two-dimensional periodic waves in parabolic systems with conservation laws |
L. Miguel Rodrigues et.al. |
2508.04023 |
null |
2025-08-07 |
S $^2$ Q-VDiT: Accurate Quantized Video Diffusion Transformer with Salient Data and Sparse Token Distillation |
Weilun Feng et.al. |
2508.04016 |
null |
2025-08-06 |
Constructing Generalized Sample Transition Probabilities with Biased Simulations |
Yanbin Wang et.al. |
2508.03977 |
null |
2025-08-05 |
Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm |
Lin Zhang et.al. |
2508.03955 |
null |
2025-08-05 |
Point-Based Shape Representation Generation with a Correspondence-Preserving Diffusion Model |
Shen Zhu et.al. |
2508.03925 |
null |
2025-08-05 |
Coefficient Identification Problem with Integral Overdetermination Condition for Diffusion Equations |
R. R. Ashurov et.al. |
2508.03859 |
null |
2025-08-05 |
VAE-DNN: Energy-Efficient Trainable-by-Parts Surrogate Model For Parametric Partial Differential Equations |
Yifei Zong et.al. |
2508.03839 |
null |
2025-08-05 |
HPSv3: Towards Wide-Spectrum Human Preference Score |
Yuhang Ma et.al. |
2508.03789 |
null |
2025-08-05 |
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation |
Jianxiong Gao et.al. |
2508.03694 |
null |
2025-08-05 |
LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences |
Ao Liang et.al. |
2508.03692 |
null |
2025-08-05 |
La La LiDAR: Large-Scale Layout Generation from LiDAR Data |
Youquan Liu et.al. |
2508.03691 |
null |
2025-08-05 |
Veila: Panoramic LiDAR Generation from a Monocular RGB Image |
Youquan Liu et.al. |
2508.03690 |
null |
2025-08-05 |
OmniShape: Zero-Shot Multi-Hypothesis Shape and Pose Estimation in the Real World |
Katherine Liu et.al. |
2508.03669 |
null |
2025-08-05 |
Rigidity for graph product von Neumann algebras |
Camille Horbez et.al. |
2508.03662 |
null |
2025-08-05 |
DiWA: Diffusion Policy Adaptation with World Models |
Akshay L Chandra et.al. |
2508.03645 |
null |
2025-08-05 |
Likelihood Matching for Diffusion Models |
Lei Qian et.al. |
2508.03636 |
null |
2025-08-05 |
Radiative Nonideal MHD Simulations of Inner Protoplanetary Disks: Temperature Structures, Asymmetric Winds, and Episodic Surface Accretion |
Shoji Mori et.al. |
2508.03624 |
null |
2025-08-05 |
Expanding the Standard Diffusion Process to Specified Non-Gaussian Marginal Distributions |
Robert Richardson et.al. |
2508.03617 |
null |
2025-08-05 |
CADD: Context aware disease deviations via restoration of brain images using normative conditional diffusion models |
Ana Lawry Aguila et.al. |
2508.03594 |
null |
2025-08-05 |
Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and Detection |
Long Qian et.al. |
2508.03539 |
null |
2025-08-05 |
X-ray Halos of Early-Type Galaxies with AGN Feedback and Accretion from a Circumgalactic Medium: models and observations |
Silvia Pellegrini et.al. |
2508.03536 |
null |
2025-08-05 |
CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation |
Kaishen Yuan et.al. |
2508.03535 |
null |
2025-08-05 |
LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Text-to-Image Generation |
Lianwei Yang et.al. |
2508.03485 |
null |
2025-08-05 |
When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models |
Dasol Choi Jihwan Lee et.al. |
2508.03483 |
null |
2025-08-05 |
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models |
Hyungjin Kim et.al. |
2508.03481 |
null |
2025-08-05 |
VideoGuard: Protecting Video Content from Unauthorized Editing |
Junjie Cao et.al. |
2508.03480 |
null |
2025-08-05 |
Learning to Incentivize: LLM-Empowered Contract for AIGC Offloading in Teleoperation |
Zijun Zhan et.al. |
2508.03464 |
null |
2025-08-06 |
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation |
Haotian Wang et.al. |
2508.03457 |
null |
2025-08-05 |
Error Estimates of Semi-Lagrangian Schemes for Diffusive Conservation Laws |
Haruki Takemura et.al. |
2508.03455 |
null |
2025-08-05 |
RAAG: Ratio Aware Adaptive Guidance |
Shangwen Zhu et.al. |
2508.03442 |
null |
2025-08-05 |
Learning Latent Representations for Image Translation using Frequency Distributed CycleGAN |
Shivangi Nigam et.al. |
2508.03415 |
null |
2025-08-05 |
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models |
Pingchuan Ma et.al. |
2508.03402 |
null |
2025-08-05 |
Delay-facilitated self-assembly in compartmentalized systems |
Severin Angerpointner et.al. |
2508.03383 |
null |
2025-08-05 |
Diffusion Once and Done: Degradation-Aware LoRA for Efficient All-in-One Image Restoration |
Ni Tang et.al. |
2508.03373 |
null |
2025-08-05 |
A Closed-Loop Multi-Agent Framework for Aerodynamics-Aware Automotive Styling Design |
Xinyu Jin et.al. |
2508.03370 |
null |
2025-08-05 |
GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images |
Yifei Sun et.al. |
2508.03357 |
null |
2025-08-05 |
Quenching time and probability estimates for a stochastic reaction-diffusion system with coupled inner singular absorption terms driven by mixed noises |
Nikos I. Kavallaris et.al. |
2508.03354 |
null |
2025-08-06 |
Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation |
Xunzhi Xiang et.al. |
2508.03334 |
null |
2025-08-05 |
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation |
Peiyu Wang et.al. |
2508.03320 |
null |
2025-08-05 |
Thermal Metamaterials for Enhanced Non-Fourier Heat Transport |
Harry Mclean et.al. |
2508.03316 |
null |
2025-08-05 |
The non-isothermal Maxwell-Stefan asymptotics of the multi-species Boltzmann equations |
Xinqiu Chen et.al. |
2508.03311 |
null |
2025-08-05 |
Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation |
Jun Luo et.al. |
2508.03300 |
null |
2025-08-05 |
Investigation on deep learning-based galaxy image translation models |
Hengxin Ruan et.al. |
2508.03291 |
null |
2025-08-07 |
Well-Posedness of the Cauchy Problem for One-Dimensional Nonlinear Diffusion Equations with Dynamic and Fourth-Type Boundary Conditions in the Lp Lq Maximal Regularity Setting |
Ken Furukawa et.al. |
2508.03288 |
null |
2025-08-07 |
Global solvability for doubly degenerate nutrient taxis system with a wide range of bacterial responses in physical dimension |
Bao-Ngoc Tran et.al. |
2508.03268 |
null |
2025-08-05 |
Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation |
Gang Dai et.al. |
2508.03256 |
null |
2025-08-05 |
V.I.P. : Iterative Online Preference Distillation for Efficient Video Diffusion Models |
Jisoo Kim et.al. |
2508.03254 |
null |
2025-08-05 |
Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion |
Wentao Qu et.al. |
2508.03252 |
null |
2025-08-06 |
FFHQ-Makeup: Paired Synthetic Makeup Dataset with Facial Consistency Across Multiple Styles |
Xingchao Yang et.al. |
2508.03241 |
null |
2025-08-05 |
BadBlocks: Low-Cost and Stealthy Backdoor Attacks Tailored for Text-to-Image Diffusion Models |
Yu Pan et.al. |
2508.03221 |
null |
2025-08-05 |
Timing is everything: How subtle timing changes in MRI echo planar imaging can significantly alter mechanical vibrations and sound level |
Amir Seginer et.al. |
2508.03220 |
null |
2025-08-05 |
Convergence of Deterministic and Stochastic Diffusion-Model Samplers: A Simple Analysis in Wasserstein Distance |
Eliot Beyler et.al. |
2508.03210 |
null |
2025-08-05 |
Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models |
Muhammed Saeed et.al. |
2508.03199 |
null |
2025-08-05 |
An Analytic Model to Determine the Interstitial-Solute Energetics and Underlying Mechanism in Refractory High-Entropy Alloys |
Qianxi Zhu et.al. |
2508.03163 |
null |
2025-08-05 |
SARD: Segmentation-Aware Anomaly Synthesis via Region-Constrained Diffusion with Discriminative Mask Guidance |
Yanshu Wang et.al. |
2508.03143 |
null |
2025-08-05 |
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying |
Chengyu Bai et.al. |
2508.03142 |
null |
2025-08-05 |
Filtering and 1/3 Power Law for Optimal Time Discretisation in Numerical Integration of Stochastic Differential Equations |
Igor G. Vladimirov et.al. |
2508.03135 |
null |
2025-08-05 |
Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback |
Jingyi Chen et.al. |
2508.03123 |
null |
2025-08-05 |
Power System Voltage Stability Boundary: Computational Results and Applications |
Zhenyao Li et.al. |
2508.03119 |
null |
2025-08-05 |
T2UE: Generating Unlearnable Examples from Text Descriptions |
Xingjun Ma et.al. |
2508.03091 |
null |
2025-08-05 |
MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation |
Youran Zhou et.al. |
2508.03083 |
null |
2025-08-05 |
Multi-human Interactive Talking Dataset |
Zeyu Zhu et.al. |
2508.03050 |
null |
2025-08-05 |
Urban In-Context Learning: Bridging Pretraining and Inference through Masked Diffusion for Urban Profiling |
Ruixing Zhang et.al. |
2508.03042 |
null |
2025-08-05 |
Sparse Identification of Nonlinear Dynamics for Stochastic Delay Differential Equations |
Dimitri Breda et.al. |
2508.03040 |
null |
2025-08-05 |
MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention |
Qi Xie et.al. |
2508.03034 |
null |
2025-08-05 |
LiGen: GAN-Augmented Spectral Fingerprinting for Indoor Positioning |
Jie Lin et.al. |
2508.03024 |
null |
2025-08-05 |
Generating Light-based Fingerprints for Indoor Localization |
Hsun-Yu Lee et.al. |
2508.03011 |
null |
2025-08-05 |
Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models |
Fan Yang et.al. |
2508.03006 |
null |
2025-08-05 |
Diffusion Models with Adaptive Negative Sampling Without External Resources |
Alakh Desai et.al. |
2508.02973 |
null |
2025-08-05 |
Injecting Measurement Information Yields a Fast and Noise-Robust Diffusion-Based Inverse Problem Solver |
Jonathan Patsenker et.al. |
2508.02964 |
null |
2025-08-04 |
X-Actor: Emotional and Expressive Long-Range Portrait Acting from Audio |
Chenxu Zhang et.al. |
2508.02944 |
null |
2025-08-04 |
Documenting Patterns of Exoticism of Marginalized Populations within Text-to-Image Generators |
Sourojit Ghosh et.al. |
2508.02937 |
null |
2025-08-06 |
A nonstandard finite difference scheme for an SEIQR epidemiological PDE model |
Achraf Zinihi et.al. |
2508.02928 |
null |
2025-08-04 |
Goal-Oriented Adaptive Finite Element Multilevel Quasi-{M}onte {C}arlo |
Joakim Beck et.al. |
2508.02925 |
null |
2025-08-04 |
How Diffusion Prior Landscapes Shape the Posterior in Blind Deconvolution |
Minh-Hai Nguyen et.al. |
2508.02923 |
null |
2025-08-04 |
RDDPM: Robust Denoising Diffusion Probabilistic Model for Unsupervised Anomaly Segmentation |
Mehrdad Moradi et.al. |
2508.02903 |
null |
2025-08-04 |
REFLECT: Rectified Flows for Efficient Brain Anomaly Correction Transport |
Farzad Beizaee et.al. |
2508.02889 |
null |
2025-08-04 |
Memoirs of mass accretion: probing the edges of intracluster light in simulated galaxy clusters |
Tara Dacunha et.al. |
2508.02837 |
null |
2025-08-04 |
DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework |
Tongchun Zuo et.al. |
2508.02807 |
null |
2025-08-04 |
NASIM: Revealing the low surface brightness Universe from legacy VISTA data |
Elham Saremi et.al. |
2508.02780 |
null |
2025-08-04 |
D2PPO: Diffusion Policy Policy Optimization with Dispersive Loss |
Guowei Zou et.al. |
2508.02644 |
null |
2025-08-04 |
CAK: Emergent Audio Effects from Minimal Deep Learning |
Austin Rockman et.al. |
2508.02643 |
null |
2025-08-04 |
Anticipating Decoherence: a Predictive Framework for Enhancing Coherence in Quantum Emitters |
Pranshu Maan et.al. |
2508.02638 |
null |
2025-08-04 |
ReMoMask: Retrieval-Augmented Masked Motion Generation |
Zhengdao Li et.al. |
2508.02605 |
null |
2025-08-04 |
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction |
Yuerong Song et.al. |
2508.02558 |
null |
2025-08-04 |
From Pixels to Pathology: Restoration Diffusion for Diagnostic-Consistent Virtual IHC |
Jingsong Liu et.al. |
2508.02528 |
null |
2025-08-06 |
xDeepServe: Model-as-a-Service on Huawei CloudMatrix384 |
Ao Xiao et.al. |
2508.02520 |
null |
2025-08-04 |
QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots |
Sheng Wu et.al. |
2508.02512 |
null |
2025-08-04 |
Quantitative and Predictive Folding Models from Limited Single-Molecule Data Using Simulation-Based Inference |
Lars Dingeldein et.al. |
2508.02509 |
null |
2025-08-04 |
Toward Using Machine Learning as a Shape Quality Metric for Liver Point Cloud Generation |
Khoa Tuan Nguyen et.al. |
2508.02482 |
null |
2025-08-04 |
PoseGuard: Pose-Guided Generation with Safety Guardrails |
Kongxin Wang et.al. |
2508.02476 |
null |
2025-08-04 |
Efficient spin-pumping and spin-to-charge conversion in epitaxial Mn $_3$ Sn(0001) noncollinear antiferromagnetic films |
Surya N. Panda et.al. |
2508.02415 |
null |
2025-08-04 |
Hydra: Accurate Multi-Modal Leaf Wetness Sensing with mm-Wave and Camera Fusion |
Yimeng Liu et.al. |
2508.02409 |
null |
2025-08-04 |
Inference-time Scaling for Diffusion-based Audio Super-resolution |
Yizhu Jin et.al. |
2508.02391 |
null |
2025-08-04 |
Talking Surveys: How Photorealistic Embodied Conversational Agents Shape Response Quality, Engagement, and Satisfaction |
Matus Krajcovic et.al. |
2508.02376 |
null |
2025-08-04 |
Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory |
Marian Lupascu et.al. |
2508.02363 |
null |
2025-08-04 |
Qwen-Image Technical Report |
Chenfei Wu et.al. |
2508.02324 |
null |
2025-08-04 |
Dream-to-Recon: Monocular 3D Reconstruction with Diffusion-Depth Distillation from Single Images |
Philipp Wulff et.al. |
2508.02323 |
null |
2025-08-05 |
LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training |
Sikui Zhang et.al. |
2508.02308 |
null |
2025-08-05 |
Forecasting When to Forecast: Accelerating Diffusion Models with Confidence-Gated Taylor |
Xiaoliu Guan et.al. |
2508.02240 |
null |
2025-08-04 |
Abstract Formulation of Mean-Field Models and Propagation of Chaos |
Tau Shean Lim et.al. |
2508.02224 |
null |
2025-08-04 |
A theory of strange metals |
Simone Fratini et.al. |
2508.02221 |
null |
2025-08-04 |
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference |
Yuxuan Song et.al. |
2508.02193 |
null |
2025-08-04 |
DreamPainter: Image Background Inpainting for E-commerce Scenarios |
Sijie Zhao et.al. |
2508.02155 |
null |
2025-08-04 |
AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models |
Die Chen et.al. |
2508.02151 |
null |
2025-08-04 |
VDEGaussian: Video Diffusion Enhanced 4D Gaussian Splatting for Dynamic Urban Scenes Modeling |
Yuru Xiao et.al. |
2508.02129 |
null |
2025-08-04 |
AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation |
Zhiwen Li et.al. |
2508.02107 |
null |
2025-08-04 |
Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis |
Kaiyang Ji et.al. |
2508.02106 |
null |
2025-08-04 |
“Stack It Up!”: 3D Stable Structure Generation from 2D Hand-drawn Sketch |
Yiqing Xu et.al. |
2508.02093 |
null |
2025-08-04 |
Unsupervised Multi-channel Speech Dereverberation via Diffusion |
Yulun Wu et.al. |
2508.02071 |
null |
2025-08-04 |
“Set It Up”: Functional Object Arrangement with Compositional Generative Models |
Yiqing Xu et.al. |
2508.02068 |
null |
2025-08-04 |
StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion |
Haoxin Yang et.al. |
2508.02056 |
null |
2025-08-04 |
Why Generate When You Can Transform? Unleashing Generative Attention for Dynamic Recommendation |
Yuli Liu et.al. |
2508.02050 |
null |
2025-08-04 |
Conditional Diffusion Model with Anatomical-Dose Dual Constraints for End-to-End Multi-Tumor Dose Prediction |
Hui Xie et.al. |
2508.02043 |
null |
2025-08-04 |
Frequency-Domain Denoising-Based in Vivo Fluorescence Imaging |
XuHao Yu et.al. |
2508.02025 |
null |
2025-08-04 |
Significant Mobility Enhancement in Coupled AlGaN/GaN Quantum Wells considering Inter-Well Distance and Asymmetric Widths |
Le Tri Dat et.al. |
2508.02024 |
null |
2025-08-05 |
Asymptotic analysis of the Allen-Cahn equation with dynamic boundary conditions of Cahn-Hilliard type |
Pierluigi Colli et.al. |
2508.02021 |
null |
2025-08-04 |
Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention |
Kyungmin Jo et.al. |
2508.02004 |
null |
2025-08-04 |
Generative Large-Scale Pre-trained Models for Automated Ad Bidding Optimization |
Yu Lei et.al. |
2508.02002 |
null |
2025-08-04 |
Path-Integral Formulation of Bosonic Markovian Open Quantum Dynamics with Monte Carlo stochastic trajectories using the Glauber-Sudarshan P, Wigner, and Husimi Q Functions and Hybrids |
Toma Yoneya et.al. |
2508.01991 |
null |
2025-08-04 |
Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion |
Shutong Qiao et.al. |
2508.01987 |
null |
2025-08-04 |
Diffusion models for inverse problems |
Hyungjin Chung et.al. |
2508.01975 |
null |
2025-08-03 |
Distributed games with jumps: An $α$ -potential game approach |
Xin Guo et.al. |
2508.01929 |
null |
2025-08-03 |
On the Non-Markovian Navier-Stokes Framework for Turbulence Modeling – A Preliminary Analysis |
Siamak Kazemzadeh Hannani et.al. |
2508.01890 |
null |
2025-08-03 |
DiffusionFF: Face Forgery Detection via Diffusion-based Artifact Localization |
Siran Peng et.al. |
2508.01873 |
null |
2025-08-05 |
Moment Estimate and Variational Approach for Learning Generalized Diffusion with Non-gradient Structures |
Fanze Kong et.al. |
2508.01854 |
null |
2025-08-03 |
Diffusion-based 3D Hand Motion Recovery with Intuitive Physics |
Yufei Zhang et.al. |
2508.01835 |
null |
2025-08-03 |
Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder |
Runxuan Yang et.al. |
2508.01796 |
null |
2025-08-03 |
Exponential mixing for the stochastic Kuramoto-Sivashinsky equation on the 1D torus |
Peng Gao et.al. |
2508.01794 |
null |
2025-08-03 |
DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion |
Zhigang Sun et.al. |
2508.01778 |
null |
2025-08-03 |
Semantically-Guided Inference for Conditional Diffusion Models: Enhancing Covariate Consistency in Time Series Forecasting |
Rui Ding et.al. |
2508.01761 |
null |
2025-08-03 |
Dynamic Coupling of Infiltration-Soil Moisture Feedback:Emergent Vegetation Patterns in a Water-Vegetation Model |
Juan Yan et.al. |
2508.01755 |
null |
2025-08-03 |
Energy-Efficient Federated Learning for Edge Real-Time Vision via Joint Data, Computation, and Communication Design |
Xiangwang Hou et.al. |
2508.01745 |
null |
2025-08-05 |
Imbalance-Robust and Sampling-Efficient Continuous Conditional GANs via Adaptive Vicinity and Auxiliary Regularization |
Xin Ding et.al. |
2508.01725 |
null |
2025-08-03 |
ModFus-DM: Explore the Representation in Modulated Signal Diffusion Generated Models |
Haoyue Tan et.al. |
2508.01719 |
null |
2025-08-03 |
Versatile Transition Generation with Image-to-Video Diffusion |
Zuhao Yang et.al. |
2508.01698 |
null |
2025-08-03 |
DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing |
Yufeng Chi et.al. |
2508.01684 |
null |
2025-08-03 |
DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding |
Hanqing Wang et.al. |
2508.01651 |
null |
2025-08-03 |
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance |
Na Zhang et.al. |
2508.01650 |
null |
2025-08-03 |
Hamiltonian simulation for nonlinear partial differential equation by Schrödingerization |
Shoya Sasaki et.al. |
2508.01640 |
null |
2025-08-03 |
VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation |
Xuanran Zhai et.al. |
2508.01622 |
null |
2025-08-03 |
LLaDA-MedV: Exploring Large Language Diffusion Models for Biomedical Image Understanding |
Xuanzhao Dong et.al. |
2508.01617 |
null |
2025-08-03 |
TCDiff: Triplex Cascaded Diffusion for High-fidelity Multimodal EHRs Generation with Incomplete Clinical Data |
Yandong Yan et.al. |
2508.01615 |
null |
2025-08-03 |
Practical, Generalizable and Robust Backdoor Attacks on Text-to-Image Diffusion Models |
Haoran Dai et.al. |
2508.01605 |
null |
2025-08-03 |
Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment |
Lubin Gan et.al. |
2508.01602 |
null |
2025-08-03 |
CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation |
Sung-Wook Lee et.al. |
2508.01600 |
null |
2025-08-03 |
Why Heuristic Weighting Works: A Theoretical Analysis of Denoising Score Matching |
Juyan Zhang et.al. |
2508.01597 |
null |
2025-08-03 |
A Plug-and-Play Multi-Criteria Guidance for Diverse In-Betweening Human Motion Generation |
Hua Yu et.al. |
2508.01590 |
null |
2025-08-03 |
Censored Sampling for Topology Design: Guiding Diffusion with Human Preferences |
Euihyun Kim et.al. |
2508.01589 |
null |
2025-08-03 |
Diffusion Models for Future Networks and Communications: A Comprehensive Survey |
Nguyen Cong Luong et.al. |
2508.01586 |
null |
2025-08-03 |
Tractography-Guided Dual-Label Collaborative Learning for Multi-Modal Cranial Nerves Parcellation |
Lei Xie et.al. |
2508.01577 |
null |
2025-08-03 |
Sub 10 nm Nanochannels Enable Directional Quasi Ballistic Exciton Transport over 5 μm at Room Temperature |
Xiao-Jie Wang et.al. |
2508.01567 |
null |
2025-08-03 |
MGCR-Net:Multimodal Graph-Conditioned Vision-Language Reconstruction Network for Remote Sensing Change Detection |
Chengming Wang et.al. |
2508.01555 |
null |
2025-08-02 |
A Reward-Directed Diffusion Framework for Generative Design Optimization |
Hadi Keramati et.al. |
2508.01509 |
null |
2025-08-02 |
Instruction-based Time Series Editing |
Jiaxing Qiu et.al. |
2508.01504 |
null |
2025-08-02 |
The role of zealots in the spread of linguistic traits |
Vivian Dornelas et.al. |
2508.01500 |
null |
2025-08-02 |
TreeDiff: AST-Guided Code Generation with Diffusion LLMs |
Yiming Zeng et.al. |
2508.01473 |
null |
2025-08-02 |
Regression Augmentation With Data-Driven Segmentation |
Shayan Alahyari et.al. |
2508.01455 |
null |
2025-08-02 |
Physically-based Lighting Augmentation for Robotic Manipulation |
Shutong Jin et.al. |
2508.01442 |
null |
2025-08-02 |
Viscosity Stabilized Plug-and-Play Reconstruction |
Arghya Sinha et.al. |
2508.01441 |
null |
2025-08-02 |
Parabolic-elliptic and indirect-direct simplifications in chemotaxis systems driven by indirect signalling |
Le Trong Thanh Bui et.al. |
2508.01436 |
null |
2025-08-02 |
Artificial Intelligence and Misinformation in Art: Can Vision Language Models Judge the Hand or the Machine Behind the Canvas? |
Tarian Fu et.al. |
2508.01408 |
null |
2025-08-02 |
StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints |
Lingxiao Chen et.al. |
2508.01335 |
null |
2025-08-05 |
Zero-shot Segmentation of Skin Conditions: Erythema with Edit-Friendly Inversion |
Konstantinos Moutselos et.al. |
2508.01334 |
null |
2025-08-02 |
LinkQA: Synthesizing Diverse QA from Multiple Seeds Strongly Linked by Knowledge Points |
Xuemiao Zhang et.al. |
2508.01317 |
null |
2025-08-02 |
CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET Synthesis |
Alec Sargood et.al. |
2508.01292 |
null |
2025-08-02 |
PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation |
Zonglei Jing et.al. |
2508.01272 |
null |
2025-08-02 |
Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling |
Lexiao Zou et.al. |
2508.01264 |
null |
2025-08-02 |
NS-Net: Decoupling CLIP Semantic Information through NULL-Space for Generalizable AI-Generated Image Detection |
Jiazhen Yan et.al. |
2508.01248 |
null |
2025-08-02 |
Effect of protection zone on the dynamics of a diffusion-advection population-toxicant model |
Jing Gao et.al. |
2508.01246 |
null |
2025-08-02 |
Sliding two-dimensional superconductivity and charge-density-wave state in a bulk crystal |
Xiangqi Liu et.al. |
2508.01241 |
null |
2025-08-02 |
SketchAgent: Generating Structured Diagrams from Hand-Drawn Sketches |
Cheng Tan et.al. |
2508.01237 |
null |
2025-08-02 |
Point-wise Diffusion Models for Physical Systems with Shape Variations: Application to Spatio-temporal and Large-scale system |
Jiyong Kim et.al. |
2508.01230 |
null |
2025-08-02 |
StyDeco: Unsupervised Style Transfer with Distilling Priors and Semantic Decoupling |
Yuanlin Yang et.al. |
2508.01215 |
null |
2025-08-02 |
Energy-dependent anisotropy of cosmic-ray muons: A twelve-year study with IceCube Neutrino Observatory |
Nabin Upadhya Dhakal et.al. |
2508.01194 |
null |
2025-08-02 |
DELTAv2: Accelerating Dense 3D Tracking |
Tuan Duc Ngo et.al. |
2508.01170 |
null |
2025-08-02 |
RoboLinker: A Diffusion-model-based Matching Clothing Generator Between Humans and Companion Robots |
Jing Tang et.al. |
2508.01165 |
null |
2025-08-02 |
LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation |
Xinyu Yan et.al. |
2508.01152 |
null |
2025-08-02 |
Personalized Safety Alignment for Text-to-Image Diffusion Models |
Yu Lei et.al. |
2508.01151 |
null |
2025-08-02 |
Dataset Condensation with Color Compensation |
Huyu Wu et.al. |
2508.01139 |
null |
2025-08-01 |
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models |
Jinsong Li et.al. |
2508.00819 |
null |
2025-08-01 |
Multibeam High Throughput Satellite: Hardware Foundation, Resource Allocation, and Precoding |
Rui Chen et.al. |
2508.00800 |
null |
2025-08-01 |
Video Generators are Robot Policies |
Junbang Liang et.al. |
2508.00795 |
null |
2025-08-01 |
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation |
Kien T. Pham et.al. |
2508.00782 |
null |
2025-08-01 |
Diffusion-Scheduled Denoising Autoencoders for Anomaly Detection in Tabular Data |
Timur Sattarov et.al. |
2508.00758 |
null |
2025-08-01 |
LeakyCLIP: Extracting Training Data from CLIP |
Yunhao Chen et.al. |
2508.00756 |
null |
2025-08-01 |
SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation |
Prerana Ramkumar et.al. |
2508.00750 |
null |
2025-08-01 |
AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation |
Le Wang et.al. |
2508.00733 |
null |
2025-08-01 |
YOLO-Count: Differentiable Object Counting for Text-to-Image Generation |
Guanning Zeng et.al. |
2508.00728 |
null |
2025-08-01 |
Controllability of diffusive Lotka-Volterra strongly competitive systems under boundary constrained controls |
Elisa Affili et.al. |
2508.00713 |
null |
2025-08-01 |
D3: Training-Free AI-Generated Video Detection Using Second-Order Features |
Chende Zheng et.al. |
2508.00701 |
null |
2025-08-01 |
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation |
Yiming Wu et.al. |
2508.00697 |
null |
2025-08-01 |
Wind Power Scenario Generation based on the Generalized Dynamic Factor Model and Generative Adversarial Network |
Young-ho Cho et.al. |
2508.00692 |
null |
2025-08-01 |
Light-Weight Diffusion Multiplier and Uncertainty Quantification for Fourier Neural Operators |
Albert Matveev et.al. |
2508.00643 |
null |
2025-08-01 |
Minimum Data, Maximum Impact: 20 annotated samples for explainable lung nodule classification |
Luisa Gallée et.al. |
2508.00639 |
null |
2025-08-01 |
DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior |
Junzhe Lu et.al. |
2508.00599 |
null |
2025-08-01 |
Wukong Framework for Not Safe For Work Detection in Text-to-Image systems |
Mingrui Liu et.al. |
2508.00591 |
null |
2025-08-01 |
Guiding Diffusion-Based Articulated Object Generation by Partial Point Cloud Alignment and Physical Plausibility Constraints |
Jens U. Kreber et.al. |
2508.00558 |
null |
2025-08-01 |
DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification |
Chihan Huang et.al. |
2508.00552 |
null |
2025-08-01 |
Video Color Grading via Look-Up Table Generation |
Seunghyun Shin et.al. |
2508.00548 |
null |
2025-08-01 |
HannesImitation: Grasping with the Hannes Prosthetic Hand via Imitation Learning |
Carlo Alessi et.al. |
2508.00491 |
null |
2025-08-01 |
LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion Transformer |
Yuzhuo Chen et.al. |
2508.00477 |
null |
2025-08-01 |
A Conditional GAN for Tabular Data Generation with Probabilistic Sampling of Latent Subspaces |
Leonidas Akritidis et.al. |
2508.00472 |
null |
2025-08-01 |
Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution |
Yiwen Wang et.al. |
2508.00471 |
null |
2025-08-01 |
AutoDebias: Automated Framework for Debiasing Text-to-Image Models |
Hongyi Cai et.al. |
2508.00445 |
null |
2025-08-01 |
SDMatte: Grafting Diffusion Models for Interactive Matting |
Longfei Huang et.al. |
2508.00443 |
null |
2025-08-01 |
Diffusion-Based User-Guided Data Augmentation for Coronary Stenosis Detection |
Sumin Seo et.al. |
2508.00438 |
null |
2025-08-01 |
Accurate Latent Inversion for Generative Image Steganography via Rectified Flow |
Yuqi Qian et.al. |
2508.00434 |
null |
2025-08-01 |
Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation |
Nan Xiang et.al. |
2508.00428 |
null |
2025-08-01 |
Contact-Aware Amodal Completion for Human-Object Interaction via Multi-Regional Inpainting |
Seunggeun Chi et.al. |
2508.00427 |
null |
2025-08-01 |
Collimated QED Cascades with Curved Plasma Mirror |
Xuesong Geng et.al. |
2508.00417 |
null |
2025-08-01 |
DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space |
Junyu Chen et.al. |
2508.00413 |
null |
2025-08-01 |
Sortblock: Similarity-Aware Feature Reuse for Diffusion Model |
Hanqi Chen et.al. |
2508.00412 |
null |
2025-08-01 |
Predictive information criterion for jump diffusion processes |
Yuma Uehara et.al. |
2508.00411 |
null |
2025-08-01 |
Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency |
Xi Xue et.al. |
2508.00397 |
null |
2025-08-01 |
Sheaf Graph Neural Networks via PAC-Bayes Spectral Optimization |
Yoonhyuk Choi et.al. |
2508.00357 |
null |
2025-08-01 |
BOOD: Boundary-based Out-Of-Distribution Data Generation |
Qilin Liao et.al. |
2508.00350 |
null |
2025-08-01 |
Favorable modifications of Scrape-Off Layer (SOL) heat flux width through pulsed fuelling in ADITYA-U Tokamak |
SK Injamul Hoque et.al. |
2508.00339 |
null |
2025-08-01 |
Radially Locked Sun-Ray Patterns in Autocatalytic Reaction-Diffusion-Advection Systems |
Surya Narayan Maharana et.al. |
2508.00329 |
null |
2025-08-01 |
Steering Guidance for Personalized Text-to-Image Diffusion Models |
Sunghyun Park et.al. |
2508.00319 |
null |
2025-08-01 |
GV-VAD : Exploring Video Generation for Weakly-Supervised Video Anomaly Detection |
Suhang Cai et.al. |
2508.00312 |
null |
2025-08-01 |
TopoDiffuser: A Diffusion-Based Multimodal Trajectory Prediction Model with Topometric Maps |
Zehui Xu et.al. |
2508.00303 |
null |
2025-08-01 |
Controllable Pedestrian Video Editing for Multi-View Driving Scenarios via Motion Sequence |
Danzhen Fu et.al. |
2508.00299 |
null |
2025-08-01 |
AniMer+: Unified Pose and Shape Estimation Across Mammalia and Aves via Family-Aware Transformer |
Jin Lyu et.al. |
2508.00298 |
null |
2025-08-01 |
TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models |
Christian Simon et.al. |
2508.00289 |
null |
2025-08-01 |
UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents |
Jianqiang Xiao et.al. |
2508.00288 |
null |
2025-08-01 |
Towards Robust Semantic Correspondence: A Benchmark and Insights |
Wenyue Chong et.al. |
2508.00272 |
null |
2025-08-01 |
Jet Image Generation in High Energy Physics Using Diffusion Models |
Victor D. Martinez et.al. |
2508.00250 |
null |
2025-07-31 |
Reliability of 1D radiative-convective photochemical-equilibrium retrievals on transit spectra of WASP-107b |
Thomas Konings et.al. |
2508.00177 |
null |
2025-07-31 |
DiSC-Med: Diffusion-based Semantic Communications for Robust Medical Image Transmission |
Fupei Guo et.al. |
2508.00172 |
null |
2025-07-31 |
World Consistency Score: A Unified Metric for Video Generation Quality |
Akshat Rakheja et.al. |
2508.00144 |
null |
2025-07-31 |
Entanglement spreading and emergent locality in Brownian SYK chains |
Onkar Parrikar et.al. |
2508.00060 |
null |
2025-07-31 |
Predicting Large-scale Urban Network Dynamics with Energy-informed Graph Neural Diffusion |
Tong Nie et.al. |
2508.00037 |
null |
2025-07-31 |
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis |
Bowen Zhang et.al. |
2507.23785 |
null |
2025-07-31 |
SUB: Benchmarking CBM Generalization via Synthetic Attribute Substitutions |
Jessica Bader et.al. |
2507.23784 |
null |
2025-07-31 |
General diffusions on metric graphs as limits of time-space Markov Chains |
Alexis Anagnostakis et.al. |
2507.23724 |
null |
2025-07-31 |
DiffuMatch: Category-Agnostic Spectral Diffusion Priors for Robust Non-rigid Shape Matching |
Emery Pierson et.al. |
2507.23715 |
null |
2025-07-31 |
CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation |
Zhaoyue Xu et.al. |
2507.23693 |
null |
2025-07-31 |
UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration |
Zihan Cheng et.al. |
2507.23685 |
null |
2025-07-31 |
I2V-GS: Infrastructure-to-Vehicle View Transformation with Gaussian Splatting for Autonomous Driving Data Generation |
Jialei Chen et.al. |
2507.23683 |
null |
2025-07-31 |
Analysis of a Cross-Nonlinear Porous-Medium System Modeling Pressure-Driven Cell Population Dynamics |
Alexis Béjar-López et.al. |
2507.23680 |
null |
2025-07-31 |
DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data |
Rabeya Tus Sadia et.al. |
2507.23676 |
null |
2025-07-31 |
One-Step Flow Policy Mirror Descent |
Tianyi Chen et.al. |
2507.23675 |
null |
2025-07-31 |
Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis |
Kunpeng Qiu et.al. |
2507.23652 |
null |
2025-07-31 |
A stochastic heat equation with non-locally Lipschitz coefficients |
Le Chen et.al. |
2507.23637 |
null |
2025-07-31 |
DivControl: Knowledge Diversion for Controllable Image Generation |
Yucheng Xie et.al. |
2507.23620 |
null |
2025-08-02 |
MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction |
Zijian Dong et.al. |
2507.23597 |
null |
2025-07-31 |
Theory of ultrafast conductance modulation in electrochemical protonic synapses by multiphase polarization |
Michael L. Li et.al. |
2507.23576 |
null |
2025-08-01 |
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation |
Hongzhe Bi et.al. |
2507.23523 |
null |
2025-07-31 |
Conical diffraction of the synchrotron beam to probe the efficiency and morphology of blazed gratings |
K. V. Nikolaev et.al. |
2507.23513 |
null |
2025-07-31 |
Emergence of long-range non-equilibrium correlations in free liquid diffusion |
Marco Bussoletti et.al. |
2507.23507 |
null |
2025-07-31 |
Digital literacy interventions can boost humans in discerning deepfakes |
Dominique Geissler et.al. |
2507.23492 |
null |
2025-07-31 |
Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion |
Mutian Xu et.al. |
2507.23483 |
null |
2025-07-31 |
Adjoint-Based Aerodynamic Shape Optimization with a Manifold Constraint Learned by Diffusion Models |
Long Chen et.al. |
2507.23443 |
null |
2025-07-31 |
Out-of-Distribution Detection in Medical Imaging via Diffusion Trajectories |
Lemar Abdi et.al. |
2507.23411 |
null |
2025-07-31 |
An optimal preconditioner for high-order scheme arising from multi-dimensional Riesz space fractional diffusion equations with variable coefficients |
Yuan-Yuan Huang et.al. |
2507.23408 |
null |
2025-07-31 |
UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries |
Yijie Zhu et.al. |
2507.23372 |
null |
2025-07-31 |
IN45023 Neural Network Design Patterns in Computer Vision Seminar Report, Summer 2025 |
Radu-Andrei Bourceanu et.al. |
2507.23357 |
null |
2025-07-31 |
Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads |
Yingjie Zhou et.al. |
2507.23343 |
null |
2025-07-31 |
EMU and the DRAGNs I: A Catalogue of DRAGNs |
Ray P. Norris et.al. |
2507.23337 |
null |
2025-07-31 |
Classifying Compact Radio Emission in Nearby Galaxies: a 10GHz Study of Active Galactic Nuclei, Supernovae, Anomalous Microwave Emission and Star Forming Regions |
Kristen C. Dage et.al. |
2507.23332 |
null |
2025-07-31 |
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models |
Alfio Ferrara et.al. |
2507.23313 |
null |
2025-07-31 |
PriorFusion: Unified Integration of Priors for Robust Road Perception in Autonomous Driving |
Xuewei Tang et.al. |
2507.23309 |
null |
2025-08-01 |
Training-free Geometric Image Editing on Diffusion Models |
Hanshen Zhu et.al. |
2507.23300 |
null |
2025-07-31 |
UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing |
Hao Tang et.al. |
2507.23278 |
null |
2025-07-31 |
PixNerd: Pixel Neural Field Diffusion |
Shuai Wang et.al. |
2507.23268 |
null |
2025-07-31 |
Automated Mapping the Pathways of Cranial Nerve II, III, V, and VII/VIII: A Multi-Parametric Multi-Stage Diffusion Tractography Atlas |
Lei Xie et.al. |
2507.23245 |
null |
2025-07-31 |
BS-1-to-N: Diffusion-Based Environment-Aware Cross-BS Channel Knowledge Map Generation for Cell-Free Networks |
Zhuoyin Dai et.al. |
2507.23236 |
null |
2025-07-31 |
Adversarial-Guided Diffusion for Multimodal LLM Attacks |
Chengwei Xia et.al. |
2507.23202 |
null |
2025-07-30 |
X-NeMo: Expressive Neural Motion Reenactment via Disentangled Latent Attention |
Xiaochen Zhao et.al. |
2507.23143 |
null |
2025-07-30 |
Nonzero $\mathfrak{n}$ cohomology of Totally Degenerate Limit of Discrete Series representations |
Jin Kunwoo Lee et.al. |
2507.23102 |
null |
2025-07-30 |
Diffusion model for gradient preconditioning in hyperspectral imaging inverse problems |
Jonathan Monsalve et.al. |
2507.23065 |
null |
2025-07-30 |
Reference-Guided Diffusion Inpainting For Multimodal Counterfactual Generation |
Alexandru Buburuzan et.al. |
2507.23058 |
null |
2025-07-30 |
Search for Neutrinos from the Galactic 4FGL Sources with the Pion-bump Signature with IceCube |
Alejandra Granados et.al. |
2507.23040 |
null |
2025-07-30 |
Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction |
Giuseppe Cartella et.al. |
2507.23021 |
null |
2025-07-30 |
Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods |
Siwoo Park et.al. |
2507.23010 |
null |
2025-07-30 |
LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis |
Jamil Fayyad et.al. |
2507.23001 |
null |
2025-07-29 |
Neural Autoregressive Modeling of Brain Aging |
Ridvan Yesiloglu et.al. |
2507.22954 |
null |
2025-07-30 |
AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS |
Hai Ling et.al. |
2507.22880 |
null |
2025-07-30 |
Robust Contract with Career Concerns |
Tan Gan et.al. |
2507.22852 |
null |
2025-07-30 |
Morph: ChirpTransformer-based Encoder-decoder Co-design for Reliable LoRa Communication |
Yidong Ren et.al. |
2507.22851 |
null |
2025-07-30 |
DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion |
Qingcheng Zhao et.al. |
2507.22825 |
null |
2025-07-30 |
Design and Analysis of Plasmonic-Nanorod-Enhanced Lead-Free Inorganic Perovskite/Silicon Heterojunction Tandem Solar Cell Exceeding the Shockley-Queisser Limit |
Md. Sad Abdullah Sami et.al. |
2507.22803 |
null |
2025-07-31 |
G-Core: A Simple, Scalable and Balanced RLHF Trainer |
Junyu Wu et.al. |
2507.22789 |
null |
2025-07-30 |
DO-EM: Density Operator Expectation Maximization |
Adit Vishnu et.al. |
2507.22786 |
null |
2025-08-01 |
Next Tokens Denoising for Speech Synthesis |
Yanqing Liu et.al. |
2507.22746 |
null |
2025-07-30 |
Zero-Shot Image Anomaly Detection Using Generative Foundation Models |
Lemar Abdi et.al. |
2507.22692 |
null |
2025-07-30 |
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing |
Federico Girella et.al. |
2507.22627 |
null |
2025-07-30 |
Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions |
Yiting Qu et.al. |
2507.22617 |
null |
2025-07-30 |
Generative Active Learning for Long-tail Trajectory Prediction via Controllable Diffusion Model |
Daehee Park et.al. |
2507.22615 |
null |
2025-07-30 |
ShortFT: Diffusion Model Alignment via Shortcut-based Fine-Tuning |
Xiefan Guo et.al. |
2507.22604 |
null |
2025-07-30 |
Diffusion Models for Influence Maximization on Temporal Networks: A Guide to Make the Best Choice |
Aaqib Zahoor et.al. |
2507.22589 |
null |
2025-07-30 |
DACA-Net: A Degradation-Aware Conditional Diffusion Network for Underwater Image Enhancement |
Chang Huang et.al. |
2507.22501 |
null |
2025-07-30 |
LoReUn: Data Itself Implicitly Provides Cues to Improve Machine Unlearning |
Xiang Li et.al. |
2507.22499 |
null |
2025-07-30 |
Visual Language Models as Zero-Shot Deepfake Detectors |
Viacheslav Pirogov et.al. |
2507.22469 |
null |
2025-07-30 |
TopoLiDM: Topology-Aware LiDAR Diffusion Models for Interpretable and Realistic LiDAR Point Cloud Generation |
Jiuming Liu et.al. |
2507.22454 |
null |
2025-07-30 |
GVD: Guiding Video Diffusion Model for Scalable Video Distillation |
Kunyang Li et.al. |
2507.22360 |
null |
2025-07-29 |
Trade-offs in Image Generation: How Do Different Dimensions Interact? |
Sicheng Zhang et.al. |
2507.22100 |
null |
2025-07-29 |
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again |
Zigang Geng et.al. |
2507.22058 |
null |
2025-07-30 |
See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs |
Ziyun Dai et.al. |
2507.22003 |
null |
2025-07-29 |
Enhancing Generalization in Data-free Quantization via Mixup-class Prompting |
Jiwoong Park et.al. |
2507.21947 |
null |
2025-07-29 |
Anyone Can Jailbreak: Prompt-Based Attacks on LLMs and T2Is |
Ahmed B Mustafa et.al. |
2507.21820 |
null |
2025-07-29 |
Control Copy-Paste: Controllable Diffusion-Based Augmentation Method for Remote Sensing Few-Shot Object Detection |
Yanxing Liu et.al. |
2507.21816 |
null |
2025-07-29 |
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE |
Junzhe Li et.al. |
2507.21802 |
null |
2025-07-29 |
APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing |
Sangmin Han et.al. |
2507.21690 |
null |
2025-07-29 |
GuidPaint: Class-Guided Image Inpainting with Diffusion Models |
Qimin Wang et.al. |
2507.21627 |
null |
2025-07-29 |
Locally Controlled Face Aging with Latent Diffusion Models |
Lais Isabelle Alves dos Santos et.al. |
2507.21600 |
null |
2025-07-29 |
Neural network enabled wide field-of-view imaging with hyperbolic metalenses |
Joel Yeo et.al. |
2507.21562 |
null |
2025-07-29 |
Chain-of-Cooking:Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance |
Mengling Xu et.al. |
2507.21529 |
null |
2025-07-29 |
BANG: Dividing 3D Assets via Generative Exploded Dynamics |
Longwen Zhang et.al. |
2507.21493 |
null |
2025-07-29 |
Retrieve-Augmented Generation for Speeding up Diffusion Policy without Additional Training |
Sodtavilan Odonchimed et.al. |
2507.21452 |
null |
2025-07-30 |
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation |
Shijie Zhou et.al. |
2507.21391 |
null |
2025-07-28 |
Exploring Probabilistic Modeling Beyond Domain Generalization for Semantic Segmentation |
I-Hsiang Chen et.al. |
2507.21367 |
null |
2025-07-28 |
A Contrastive Diffusion-based Network (CDNet) for Time Series Classification |
Yaoyu Zhang et.al. |
2507.21357 |
null |
2025-07-28 |
HDR Environment Map Estimation with Latent Diffusion Models |
Jack Hilliard et.al. |
2507.21261 |
null |
2025-07-28 |
Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors |
Amartya Banerjee et.al. |
2507.21260 |
null |
2025-07-28 |
Learning from Limited and Imperfect Data |
Harsh Rangwani et.al. |
2507.21205 |
null |
2025-08-01 |
Flow Matching Policy Gradients |
David McAllister et.al. |
2507.21053 |
null |
2025-07-29 |
JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1 |
Xinhan Di et.al. |
2507.20987 |
null |
2025-07-28 |
Adapting Vehicle Detectors for Aerial Imagery to Unseen Domains with Weak Supervision |
Xiao Fang et.al. |
2507.20976 |
null |