|
董俊杰
Junjie Dong
个人简介
我自 2025 年起在香港城市大学数据科学学院攻读博士学位,导师为柯庆(Prof. Qing Ke)。2025 年 6 月,我在大连理工大学软件学院获得硕士学位,导师为何增有教授。我的研究聚焦于可解释学习、数据挖掘与 AI4Science 场景中的离散序列分析和语言模型,相关成果发表于 Information Sciences 与 Pattern Recognition 等期刊。
研究兴趣
- 离散序列分析
- 可解释机器学习
- 化学信息学与数据挖掘
- 科学语言模型与 AI4Science 应用
学术服务
发表论文
- Junjie Dong, Xinyi Yang, Mudi Jiang, Lianyu Hu, Zengyou He*. Interpretable Sequence Clustering. Information Sciences, 2025-01. [DOI] [arXiv] [代码]
- Junjie Dong, Mudi Jiang, Lianyu Hu, Zengyou He*. Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification. Data Mining and Knowledge Discovery, 2025-07. [DOI] [arXiv]
- Zengyou He*, Zerun Li, Junjie Dong, Xinying Liu, Mudi Jiang, Lianyu Hu. Conjunction Subspaces Test for Conformal and Selective Classification. Information Sciences, 2025-07. [DOI] [arXiv]
- Lianyu Hu, Zerun Li, Junjie Dong, Mudi Jiang, Zengyou He*. Statistical Significance of Cluster Membership for Categorical Data. Engineering Applications of Artificial Intelligence, 2025-11. [DOI]
- Lianyu Hu, Mudi Jiang, Junjie Dong, Xinying Liu, Zengyou He*. Interpretable Categorical Data Clustering via Hypothesis Testing. Pattern Recognition, 2025-06. [DOI]
- Zengyou He*, Lianyu Hu, Jinfeng He, Junjie Dong, Mudi Jiang, Xinying Liu. Significance-Based Interpretable Sequence Clustering. Information Sciences, 2025-06. [DOI]
- Lianyu Hu, Junjie Dong, Mudi Jiang, Yan Liu, Zengyou He*. Clusterability Test for Categorical Data. Knowledge and Information Systems, 2025-01-06. [DOI] [arXiv]
- Yiwen Yang, Shuang Cai, Chunhao Mo, Junjie Dong, Sheng Chen, Zhiguo Wen. Profiles of Antibiotic Resistome Risk in Diverse Water Environments. Communications Earth & Environment, 2025-02-27. [DOI]
- Lianyu Hu, Mudi Jiang, Junjie Dong, Xinying Liu, Zengyou He*. Interpretable Clustering: A Survey. arXiv, 2024. [arXiv]
- Junjie Dong, Zhuoqi Lyu, Qing Ke. Towards Understanding Evolution of Science Through Language Model Series. arXiv, 2024. [arXiv]
 |
Education
- Ph.D. Student
City University of Hong Kong (School of Data Science), 2025 - present
- Supervisor: Prof. Qing Ke
- M.S.
Dalian University of Technology (School of Software), 2022 - 2025
- Advisor: Prof. Zengyou He
- National Scholarship, Ministry of Education of China (2022)
- Outstanding Graduate of Dalian City
- B.Eng. / B.Sc.
Dalian University of Technology & University of Leicester (LIIDUT), 2018 - 2022
- Outstanding Graduate of Liaoning Province
- First-Class Honours Bachelor's Degree
- National Scholarship, Ministry of Education of China (2019)
- National Scholarship, Ministry of Education of China (2020)
Email: jd445@qq.com
Personal Website: junjie.102514.xyz
ORCID: 0000-0001-8267-9181
CV: Download
|
About
I am a Ph.D. student in the School of Data Science at City University of Hong Kong, supervised by Prof. Qing Ke. I received my M.S. in Software Engineering from Dalian University of Technology in June 2025, advised by Prof. Zengyou He. My research focuses on interpretable learning, data mining, and AI4Science applications, with an emphasis on discrete sequence analysis and scientific language models.
Research Interests
- Discrete sequence analysis
- Interpretable machine learning
- Cheminformatics and data mining
- Scientific language models and AI4Science
Academic Service
- Reviewer, Engineering Applications of Artificial Intelligence (Elsevier, 2025; listed on ORCID)
- Reviewer, Knowledge and Information Systems (Springer Nature, 2025; listed on ORCID)
Publications
- Junjie Dong, Xinyi Yang, Mudi Jiang, Lianyu Hu, Zengyou He*. Interpretable Sequence Clustering. Information Sciences, 2025-01. [DOI] [arXiv] [code]
- Junjie Dong, Mudi Jiang, Lianyu Hu, Zengyou He*. Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification. Data Mining and Knowledge Discovery, 2025-07. [DOI] [arXiv]
- Zengyou He*, Zerun Li, Junjie Dong, Xinying Liu, Mudi Jiang, Lianyu Hu. Conjunction Subspaces Test for Conformal and Selective Classification. Information Sciences, 2025-07. [DOI] [arXiv]
- Lianyu Hu, Zerun Li, Junjie Dong, Mudi Jiang, Zengyou He*. Statistical Significance of Cluster Membership for Categorical Data. Engineering Applications of Artificial Intelligence, 2025-11. [DOI]
- Lianyu Hu, Mudi Jiang, Junjie Dong, Xinying Liu, Zengyou He*. Interpretable Categorical Data Clustering via Hypothesis Testing. Pattern Recognition, 2025-06. [DOI]
- Zengyou He*, Lianyu Hu, Jinfeng He, Junjie Dong, Mudi Jiang, Xinying Liu. Significance-Based Interpretable Sequence Clustering. Information Sciences, 2025-06. [DOI]
- Lianyu Hu, Junjie Dong, Mudi Jiang, Yan Liu, Zengyou He*. Clusterability Test for Categorical Data. Knowledge and Information Systems, 2025-01-06. [DOI] [arXiv]
- Yiwen Yang, Shuang Cai, Chunhao Mo, Junjie Dong, Sheng Chen, Zhiguo Wen. Profiles of Antibiotic Resistome Risk in Diverse Water Environments. Communications Earth & Environment, 2025-02-27. [DOI]
- Lianyu Hu, Mudi Jiang, Junjie Dong, Xinying Liu, Zengyou He*. Interpretable Clustering: A Survey. arXiv, 2024. [arXiv]
- Junjie Dong, Zhuoqi Lyu, Qing Ke. Towards Understanding Evolution of Science Through Language Model Series. arXiv, 2024. [arXiv]
|