About me

👋 Hello, welcome to my homepage!

My name is Zhiwei Wang(王志伟), and I am currently a Ph.D. candidate at Zhejiang University of Technology. My research primarily focuses on multi-modal image fusion and perception, as well as their efficient deployment on embedded systems for real-world applications.

🔥 News

2026.01: 🎉 Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis — Accepted by ISBI 2026

2025.10: 😄 Awarded Outstanding Graduate Student Cadre

2025.09: 🎉🎉🎉 DiFusionSeg: Diffusion-Driven Semantic Segmentation with Multi-Modal Image Fusion for Enhanced Perception — Accepted by Knowledge-Based Systems

2024.10: 🎉🎉 U-Net Semantic Segmentation-Based Calorific Value Estimation of Straw Multifuels for Combined Heat and Power Generation Processes — Accepted by Energies

2024.09: 🎉 Multi-Encoder Feature Fusion Model for Complex Combustion State Recognition — Accepted by CAC 2024

Server

Resume

Education

  1. Zhejiang University of Technology

    Ph.D. in Control Science and Engineering (Rank: 5/40)
    Core Courses: Matrix Theory, Modern Signal Processing, Optimization, Embedded Artificial Intelligence, Digital Image Processing, Audio-Visual Multimedia Technology, Embedded Systems, Graph Theory and Networks, Computer Vision…

  2. The University of Hong Kong

    Visiting Student

Internships

  1. Sightcare, China.

    2022.05 — 2024.05
    • 1. Conducted research and development of intelligent text analysis algorithms based on deep learning;
    • 2. Developed algorithms for reading order prediction and layout analysis for distorted and tilted pages;
    • 3. Optimized and deployed deep learning algorithms onto AI chips (HiSilicon, Rockchip, Bitmain) for lightweight acceleration;
    • 4. Developed assistive vision products on HiSilicon platforms (Hi3516 DV300/DV500) and Rockchip platforms (RV1126, RK3566);
    • 5. Adapted new sensors (imx214, imx586) for image acquisition modules;
    • 6. Adapted new screens (MIPI, RGB, EDP) for display modules.
  2. Silan, China.

    2020.05 — 2020.09
    • 1. Designed schematic diagrams and layouts for demo boards based on chip manuals and performed soldering and debugging;
    • 2. Conducted high/low-temperature, lightning, and EMI testing;
    • 3. Analyzed test data and completed chip test reports.
  3. Languages

    • Mandarin
      100%
    • English
      80%
    • Deutsch
      60%
    • Cantonese
      10%

    Skills

    • Linux
      80%
    • Pytorch
      70%
    • Python
      80%
    • C/C++
      80%

Publications

  • DiFusionSeg

    Knowledge-Based Systems

    DiFusionSeg: Diffusion-Driven Semantic Segmentation with Multi-Modal Image Fusion for Enhanced Perception

    Paper | Code

  • MAUGIF

    Arxiv

    MAUGIF: Mechanism-Aware Unsupervised General Image Fusion via Dual Cross-Image Autoencoders

    Paper | Code

  • IMO

    ISBI 2026

    Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis

    Paper | Code

  • Design conferences in 2022

    Conference on Automation and Control (CAC)

    Multi-Encoder Feature Fusion Model for Complex Combustion State Recognition

    Paper | Code

  • Design conferences in 2022

    Energies

    U-Net Semantic Segmentation-Based Calorific Value Estimation of Straw Multifuels for Combined Heat and Power Generation Processes

    Paper | Code

Full Paper List

2026

  • [1] Wang, Zhiwei, Yuxing Li, Meilu Zhu, Defeng He and Edmund Y.Lam. "Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis" 2026 IEEE 23nd International Symposium on Biomedical Imaging (ISBI). IEEE, 2026.
  • [2] Zhu, Meilu, Yuxing Li, Zhiwei Wang and Edmund Y.Lam. "Personalized Federated Learning with Residual Fisher Information for Medical Image Segmentation" 2026 IEEE 23nd International Symposium on Biomedical Imaging (ISBI). IEEE, 2026.

2025

  • [1] Wang, Zhiwei, Defeng He, Li Zhao, Bo Liu, Yayu Zheng and Xiaoqin Zhang. “DiFusionSeg: Diffusion-driven semantic segmentation with multi-modal image fusion for enhanced perception.” Knowl. Based Syst. 330 (2025): 114481.
  • [2] Yang, Kunjing*, Zhiwei Wang* and Minru Bai. “MAUGIF: Mechanism-Aware Unsupervised General Image Fusion via Dual Cross-Image Autoencoders.” (2025).

2024

  • [1] Li, Lianming, Zhiwei Wang and Defeng He. “U-Net Semantic Segmentation-Based Calorific Value Estimation of Straw Multifuels for Combined Heat and Power Generation Processes.” Energies (2024): n. pag.
  • [2] 王志伟, 何德峰, 李廉明, 等. 面向复杂燃烧状态识别的多编码器特征融合模型[C]//中国自动化学会. 2024中国自动化大会论文集, 2024: 383-388.

Projects

Others