Logo serein
Serein

Serein

A passionate developer and researcher interested in 3D reconstruction, game development, and front-end technologies.

DeveloperResearcherGamerAnime Fan

About

Sun Yat-sen University

Sun Yat-sen University

M.S. Remote Sensing Science and Technology

2024 - Present

Research focus on 3D generation and reconstruction.

Sun Yat-sen University

Sun Yat-sen University

B.S. Remote Sensing Science and Technology

2020 - 2024

Graduated with honors.

News
2026.02

πŸŽ‰ MajutsuCity is accepted by CVPR 2026 !

2026.02

πŸŽ‰ UrbanFeel is accepted by ICLR 2026 !

2025.09

πŸŽ‰ Blink-Twice is accepted by NeurIPS 2025 D&B !

2025.02

πŸŽ‰ Scene4U is accepted by CVPR 2025 !

2025.02

πŸŽ‰ LOKI is accepted by ICLR 2025 as a Spotlight (8/8/8/8) !

Publications
arXiv 2026

Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation

Jun He*, Junyan Ye*, Zilong Huang , Dongzhi Jiang, Chenjue Zhang, Leqi Zhu, Renrui Zhang, Xiang Zhang, Weijia Li

arXiv 2025

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Junyan Ye*, Leiqi Zhu*, Yuncheng Guo, Dongzhi Jiang, Zilong Huang , Yifan Zhang, Haohuan Fu, Conghui He, Weijia Li

CVPR 2026

MajutsuCity: Language-driven Aesthetic-adaptive City Generation with Controllable 3D Assets and Layouts

Zilong Huang , Jun He, Xiaobin Huang, Ziyi Xiong, Yang Luo, Junyan Ye, Weijia Li, Yiping Chen, Ting Han

arXiv 2025

SatSAM2: Motion-Constrained Video Object Tracking in Satellite Imagery using Promptable SAM2 and Kalman Priors

Ruijie Fan*, Junyan Ye*, Huan Chen, Zilong Huang , Xiaolei Wang, Weijia Li

NeurIPS 2025 Datasets & Benchmarks

BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception

Junyan Ye, Dongzhi Jiang, Jun He, Baichuan Zhou, Zilong Huang , Zhiyuan Yan, Hongsheng Li, Conghui He

ICLR 2026

UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective

Jun He, Yi Lin, Zilong Huang , Jiacong Yin, Junyan Ye, Yuchuan Zhou, Weijia Li, Xiang Zhang

arXiv 2025

Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation

Junyan Ye*, Dongzhi Jiang*, Zihao Wang, Leqi Zhu, Zhenghao Hu, Zilong Huang , Jun He, Zhiyuan Yan, Jinghua Yu, Hongsheng Li, Conghui He, Weijia Li

arXiv 2025

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Zhiyuan Yan*, Junyan Ye*, Weijia Li, Zilong Huang , Shenghai Yuan, Xiangyang He, Kaiqing Lin, Jun He, Conghui He, Li Yuan

CVPR 2025

Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration

Zilong Huang , Jun He, Junyan Ye, Lihan Jiang, Weijia Li, Yiping Chen, Ting Han

ICLR 2025 Spotlight

Loki: A comprehensive synthetic data detection benchmark using large multimodal models

Junyan Ye*, Baichuan Zhou*, Zilong Huang* , Junan Zhang*, Tianyi Bai*, Hengrui Kang, Jun He, Honglin Lin, Zhihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li

arXiv 2024

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

Weijia Li*, Jun He*, Junyan Ye*, Huaping Zhong*, Zhimeng Zheng, Zilong Huang , Dahua Lin, Conghui He

Projects