pith. sign in

Gang Yu

Identifiers

  • name variant Gang Yu 0.60 · backfill

Papers (80)

  1. ShutterMuse: Capture-Time Photography Guidance with MLLMs cs.CV · 2026 · author #7
  2. FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining cs.CV · 2026 · author #12
  3. High-order synchrosqueezed wavelet-chirplet transform for instantaneous frequency and chirprate estimation eess.SP · 2026 · author #4
  4. Rethinking Memory as Continuously Evolving Connectivity cs.CL · 2026 · author #9
  5. StepAudio 2.5 Technical Report eess.AS · 2026 · author #99
  6. Vision Foundation Models as Generalist Tokenizers for Image Generation cs.CV · 2026 · author #6
  7. Head Forcing: Long Autoregressive Video Generation via Head Heterogeneity cs.CV · 2026 · author #3
  8. Step-Audio-R1.5 Technical Report eess.AS · 2026 · author #17
  9. MMPhysVideo: Scaling Physical Plausibility in Video Generation via Joint Multimodal Modeling cs.CV · 2026 · author #5
  10. Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks cs.CL · 2026 · author #14
  11. Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models cs.CL · 2025 · author #11
  12. Step-Audio 2 Technical Report cs.CL · 2025 · author #8
  13. Step1X-Edit: A Practical Framework for General Image Editing cs.CV · 2025 · author #23
  14. Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model cs.CV · 2025 · author #36
  15. ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment cs.CV · 2024 · author #6
  16. AppAgent: Multimodal Agents as Smartphone Users cs.CV · 2023 · author #8
  17. VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations cs.CV · 2023 · author #5
  18. TapMo: Shape-aware Motion Generation of Skeleton-free Characters cs.GR · 2023 · author #6
  19. Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering cs.CV · 2023 · author #3
  20. Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning cs.CV · 2023 · author #7
  21. IT3D: Improved Text-to-3D Generation with Explicit View Synthesis cs.CV · 2023 · author #5
  22. Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image cs.CV · 2023 · author #5
  23. Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation cs.CV · 2023 · author #9
  24. MotionGPT: Human Motion as a Foreign Language cs.CV · 2023 · author #5
  25. STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection cs.CV · 2023 · author #5
  26. StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation cs.CV · 2023 · author #5 as printed: Gang YU
  27. Capturing the motion of every joint: 3D human pose and shape estimation with independent tokens cs.CV · 2023 · author #6
  28. A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction cs.CV · 2023 · author #5 as printed: Gang YU
  29. End-to-End 3D Dense Captioning with Vote2Cap-DETR cs.CV · 2023 · author #6 as printed: Gang YU
  30. Executing your Commands via Motion Diffusion in Latent Space cs.CV · 2022 · author #8
  31. Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report cs.CV · 2022 · author #26
  32. Learning Variational Motion Prior for Video-based Motion Capture cs.CV · 2022 · author #7
  33. Coordinates Are NOT Lonely -- Codebook Prior Helps Implicit Neural 3D Representations cs.CV · 2022 · author #6 as printed: Gang YU
  34. Hierarchical Normalization for Robust Monocular Depth Estimation cs.CV · 2022 · author #4
  35. D&D: Learning Human Dynamics from Dynamic Camera cs.CV · 2022 · author #5
  36. Enhancing Quality of Pose-varied Face Restoration with Local Weak Feature Sensing and GAN Prior cs.CV · 2022 · author #5
  37. Designing thermal radiation metamaterials via hybrid adversarial autoencoder and Bayesian optimization cs.LG · 2022 · author #3
  38. TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation cs.CV · 2022 · author #7
  39. An Energy-concentrated Wavelet Transform for Time Frequency Analysis of Transient Signals eess.SP · 2022 · author #2
  40. Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation cs.CV · 2021 · author #3
  41. Sketch Me A Video cs.CV · 2021 · author #2
  42. Fine-grained Identity Preserving Landmark Synthesis for Face Reenactment cs.CV · 2021 · author #5
  43. A multi-stage semi-supervised improved deep embedded clustering method for bearing fault diagnosis under the situation of insufficient labeled samples cs.LG · 2021 · author #2
  44. Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification cs.CV · 2021 · author #3
  45. Identification of Pediatric Respiratory Diseases Using Fine-grained Diagnosis System cs.AI · 2021 · author #1
  46. Shuffle Transformer with Feature Alignment for Video Face Parsing cs.CV · 2021 · author #6
  47. Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer cs.CV · 2021 · author #5
  48. Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report eess.IV · 2021 · author #10
  49. Reconstruction of Supernova Gravitational Waves Waveforms: Comparing Three Time-frequency Transform Methods astro-ph.HE · 2020 · author #3
  50. BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation cs.CV · 2020 · author #4
  51. Context Prior for Scene Segmentation cs.CV · 2020 · author #4
  52. High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification cs.CV · 2020 · author #7
  53. State-Aware Tracker for Real-Time Video Object Segmentation cs.CV · 2020 · author #4
  54. Real-Time Semantic Segmentation via Multiply Spatial Fusion Network cs.CV · 2019 · author #4
  55. SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines cs.CV · 2019 · author #5
  56. Learnable Tree Filter for Structure-preserving Feature Transform cs.CV · 2019 · author #4
  57. Double Anchor R-CNN for Human Detection in a Crowd cs.CV · 2019 · author #6
  58. Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection cs.CV · 2019 · author #5
  59. Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network cs.CV · 2019 · author #7
  60. TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection cs.CV · 2019 · author #3
  61. Shape Robust Text Detection with Progressive Scale Expansion Network cs.CV · 2019 · author #6
  62. ThunderNet: Towards Real-time Generic Object Detection cs.CV · 2019 · author #5
  63. An End-to-End Network for Panoptic Segmentation cs.CV · 2019 · author #6
  64. WIDER Face and Pedestrian Challenge 2018: Methods and Results cs.CV · 2019 · author #17
  65. Rethinking on Multi-Stage Networks for Human Pose Estimation cs.CV · 2019 · author #7
  66. Scene Text Detection with Supervised Pyramid Context Network cs.CV · 2018 · author #4
  67. Modeling Local Geometric Structure of 3D Point Clouds using Geo-CNN cs.CV · 2018 · author #3
  68. BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation cs.CV · 2018 · author #5
  69. CrowdHuman: A Benchmark for Detecting Human in a Crowd cs.CV · 2018 · author #5
  70. Learning a Discriminative Feature Network for Semantic Segmentation cs.CV · 2018 · author #5
  71. SFace: An Efficient Network for Face Detection in Large Scale Variations cs.CV · 2018 · author #4
  72. DetNet: A Backbone network for Object Detection cs.CV · 2018 · author #3
  73. SOT for MOT cs.CV · 2017 · author #3
  74. Cascaded Pyramid Network for Multi-Person Pose Estimation cs.CV · 2017 · author #5
  75. Light-Head R-CNN: In Defense of Two-Stage Object Detector cs.CV · 2017 · author #3
  76. Face Attention Network: An Effective Face Detector for the Occluded Faces cs.CV · 2017 · author #3
  77. MegDet: A Large Mini-Batch Object Detector cs.CV · 2017 · author #7
  78. Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network cs.CV · 2017 · author #3
  79. Provable Secure Identity Based Generalized Signcryption Scheme cs.CR · 2010 · author #1
  80. Sieving by large integers and covering systems of congruences math.NT · 2005 · author #5

Mentions

  • 2310.14487 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2310.12678 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 2309.09724 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 2309.02999 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 2308.11473 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2307.10984 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2306.14795 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2306.17115 #9 · arxiv_oai · confidence 0.70 Gang Yu
  • 2205.14377 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2306.02763 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2305.19012 #5 · arxiv_oai · confidence 0.70 Gang YU
  • 2212.04048 #8 · arxiv_oai · confidence 0.70 Gang Yu
  • 2303.00298 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 2301.06782 #5 · arxiv_oai · confidence 0.70 Gang YU
  • 2301.02508 #6 · arxiv_oai · confidence 0.70 Gang YU
  • 2211.04470 #26 · arxiv_oai · confidence 0.70 Gang Yu
  • 2210.15134 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 2210.11170 #6 · arxiv_oai · confidence 0.70 Gang YU
  • 2210.09670 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 2209.08790 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2205.01063 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 2204.05525 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 1903.11752 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2202.10690 #2 · arxiv_oai · confidence 0.70 Gang Yu
  • 2111.13010 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 2109.13521 #2 · arxiv_oai · confidence 0.70 Gang Yu
  • 2110.04708 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2110.04710 #2 · arxiv_oai · confidence 0.70 Gang Yu
  • 2108.13098 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 2108.10818 #1 · arxiv_oai · confidence 0.70 Gang Yu
  • 2106.08650 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 2106.03650 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2105.08630 #10 · arxiv_oai · confidence 0.70 Gang Yu
  • 2010.14409 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1908.05900 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 2004.02147 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 2004.01547 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 2003.08177 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 1911.06188 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2003.00482 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 1911.07217 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 1909.12513 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 1909.09998 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 1908.09492 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 1903.12473 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 1905.13417 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1901.00148 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 1903.05027 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 1902.06854 #17 · arxiv_oai · confidence 0.70 Gang Yu
  • 1811.08605 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 1811.07782 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1808.00897 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 1805.00123 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 1804.09337 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 1804.06559 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 1804.06215 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1711.07240 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 1711.07319 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 1712.01059 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1711.07264 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1711.07246 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1703.02719 #3 · arxiv_oai · confidence 0.70 Gang Yu
  • 1004.1304 #1 · arxiv_oai · confidence 0.70 Gang Yu
  • math/0507374 #5 · arxiv_oai · confidence 0.70 Gang Yu
  • 2606.25763 #7 · arxiv_oai · confidence 0.70 Gang Yu
  • 2606.20506 #12 · arxiv_oai · confidence 0.70 Gang Yu
  • 2606.01965 #4 · arxiv_oai · confidence 0.70 Gang Yu
  • 2604.25719 #17 · arxiv_oai · confidence 0.70 Gang Yu
  • 2605.28773 #9 · arxiv_oai · confidence 0.70 Gang Yu
  • 2605.23463 #99 · arxiv_oai · confidence 0.70 Gang Yu
  • 2605.18390 #6 · arxiv_oai · confidence 0.70 Gang Yu
  • 2502.10248 #36 · arxiv_oai · confidence 0.70 Gang Yu
  • 1004.1304 #1 · backfill · confidence 0.70 Gang Yu
  • 2312.13771 #8 · arxiv_oai · confidence 0.70 Gang Yu
  • 2507.16632 #8 · arxiv_oai · confidence 0.70 Gang Yu

Frequent Coauthors