Publications

 1. Shenglin Zhang, Jun Zhu, Bowen Hao, Yongqian Sun*, Xiaohui Nie, Jingwen Zhu, Xilin Liu, Xiaoqian Li, Yuchi Ma, Dan Pei. Fault Diagnosis for Test Alarms in Microservices Through Multi-source Data. ACM International Conference on the Foundations of Software Engineering (FSE), Industry Track. Porto de Galinhas, Brazil, July 15-19, 2024 (CCF A)
 2. Shenglin Zhang, Yongxin Zhao, Xiao Xiong, Yongqian Sun*, Xiaohui Nie, Jiacheng Zhang, Fenglai Wang, Xian Zheng, Yuzhi Zhang, Dan Pei. Illuminating the Gray Zone: Non-Intrusive Gray Failure Localization in Server Operating Systems. ACM International Conference on the Foundations of Software Engineering (FSE), Industry Track. Porto de Galinhas, Brazil, July 15-19, 2024 (CCF A)
 3. Zhaoyang Yu#, Shenglin Zhang#, Mingze Sun, Li Yingke, Zhaoyankai, Xiaolei Hua, Lin Zhu, Xidao Wen, Dan Pei*. Fine-Tuning for Unsupervised KPI Anomaly Detection for Mobile Web Systems. The Web Conference 2024. Singapore, May 13 – 17, 2024 (CCF A).
 4. Pengxiang Jin#, Shenglin Zhang#, Minghua Ma, Haozhe Li, Yu Kang, Liqun Li, Yudong Liu, Bo Qian, Chaoyun Zhang, Pu Zhao, Shilin He, Federica Sarro, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang. Assess and Summarize: Improve Outage Understanding with Large Language Models. The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE). San Francisco, California, United States, December 2023 (CCF A) [paper].
 5. Shenglin Zhang, Zhongjie Pan, Heng Liu, Pengxiang Jin, Yongqian Sun*, Qianyu Ouyang, Jiaju Wang, Xueying Jia, Yuzhi Zhang, Hui Yang, Yongqiang Zou, and Dan Pei. Efficient and Robust Trace Anomaly Detection for Large-Scale Microservice Systems. The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023). Florence, Italy, October 2023 (CCF B) [paper] [slides].
 6. Dongwen Li, Shenglin Zhang, Yongqian Sun*, Yang Guo, Zeyu Che, Shiqi Chen, Zhenyu Zhong, Minghan Liang, Minyi Shao, Mingjie Li, Shuyang Liu, Yuzhi Zhang, and Dan Pei. An Empirical Analysis of Anomaly Detection Issues for Multivariate Time Series. The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023). Florence, Italy, October 2023 (CCF B) [paper] [slides].
 7. Zhaoyang Yu, Changhua Pei, Shenglin Zhang, Xidao Wen, Jianhui Li, Gaogang Xie, and Dan Pei. AutoKAD: Empowering KPI Anomaly Detection with Label-Free Deployment. The 34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023). Florence, Italy, October 2023 (CCF B) [paper] [slides].
 8. Sibo Xia, Minghua Ma, Pengxiang Jin, Liyue Cui, Shenglin Zhang*, Wa Jin, Yongqian Sun, Dan Pei. A Study on Response Time Anomaly Diagnosis for Search Service. Journal of Computer Research and Development, 2023 (CCF T1, in Chinese) [paper].
 9. Yicheng Sui, Yuzhe Zhang, Jianjun Sun, Ting Xu, Shenglin Zhang*, Zhengdan Li, Yongqian Sun, Fangrui Guo, Junyu Shen, Yuzhi Zhang, Dan Pei, Xiao Yang, Li Yu. LogKG: Log Failure Diagnosis through Knowledge Graph. IEEE Transactions on Services Computing, 2023 (CCF A) [paper].
 10. Ling Ma, Qiliang Fan, Ting Xu, Guanchen Guo, Shenglin Zhang*, Yongqian Sun, Yuzhi Zhang. A Scheduling Policy Based on Deep Reinforcement Learning in a Colocated Cloud Environment. Journal on Communications, 2023 (CCF T1, in Chinese) [paper].
 11. Shenglin Zhang, Pengxiang Jin, Zihan Lin, Yongqian Sun*, Bicheng Zhang, Sibo Xia, Zhengdan Li, Zhenyu Zhong, Minghua Ma, Wa Jin, Dai Zhang, Zhenyu Zhu, Dan Pei. Robust Failure Diagnosis of Microservice System through Multimodal Data. IEEE Transactions on Services Computing, 2023 (CCF A) [paper].
 12. Chenyu Zhao, Minghua Ma, Zhenyu Zhong, Shenglin Zhang*, Zhiyuan Tan, Xiao Xiong, Lulu Yu, Jiayi Feng, Yongqian Sun, Yuzhi Zhang, Dan Pei, Qingwei Lin, Dongmei Zhang. Robust Multimodal Failure Detection for Microservice Systems. The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Long Beach, CA, USA, August 2023 (CCF A) [paper] [slides].
 13. Yongqian Sun, Minghan Liang, Zeyu Che, Dongwen Li, Tinghua Zheng, Shenglin Zhang*, Pengtian Zhu, Yuzhi Zhang, Dan Pei. Efficient Multivariate Time Series Anomaly Detection Through Transfer Learning for Large-Scale Web services. The 2023 IEEE International Conference on Web Services, Chicago, USA, July 2023 (CCF B short paper) [paper] [slides].
 14. Yongqian Sun, Daguo Cheng, Tiankai Yang, Shenglin Zhang*, Man Zhu, Xiao Xiong, Qiliang Fan, Minghan Liang, Dan Pei, Tianchi Ma, Yu Chen. Efficient and Robust KPI Outlier Detection for Large-Scale Datacenters. IEEE Transactions on Computers, 2023 (CCF A, accepted, to appear) [paper].
 15. Zeyan Li, Junjie Chen, Yihao Chen, Chengyang Luo, Yiwei Zhao, Yongqian Sun, Kaixin Sui, Xiping Wang, Dapeng Liu, Xing Jin, Qi Wang, Dan Pei. Generic and Robust Root Cause Localization for Multi-Dimensional Data in Online Service Systems. Journal of Systems & Software, 2023 (CCF B, accepted, to appear) [paper].
 16. Qingyang Yu, Changhua Pei, Bowen Hao, Mingjie Li, Zeyan Li, Shenglin Zhang, Xianglin Lu, Rui Wang, Jiaqi Li, Zhenyu Wu, and Dan Pei. CMDiagnostor: An Ambiguity-Aware Root Cause Localization Approach Based on Call Metric Data. The 2023 ACM Web Conference (WWW), April 30 – May 4, 2023, Austin, USA (CCF A) [paper].
 17. Weibin Meng, Yuzhe Zhang, Federico Zaiter, Ying Liu, Shenglin Zhang*, Shimin Tao, Yichen Zhu, Tao Han, Yongpeng Zhao, En Wang, Yuzhi Zhang, Dan Pei. LogSummary: Unstructured Log Summarization for Software Systems. IEEE Transactions on Network and Service Management (TNSM), January 2023 (SCI Indexed, IF: 4.758) [paper].
 18. Xiaolei Hua, Lin Zhu, Shenglin Zhang, Zeyan Li, Su Wang, Dong Zhou, Shuo Wang, Chao Deng. GenAD: General Representations of Multivariate Time Series for Anomaly Detection. Electronics Letters, December 2022 (SCI indexed) [paper].
 19. Yiran Cheng, Bo Cheng, Pengxiang Jin, Yongqian Sun*, Xiaohui Nie, Nengwen Zhao, Shenglin Zhang, Dan Pei. Effective Attribute Selection for Multi-dimensional Root Cause Analysis. IEEE International Symposium on Software Reliability Engineering (ISSRE), Charlotte, North Carolina, USA, October 31 – November 3, 2022 (CCF B) [paper] [slides].
 20. Xuanrun Wang, Kanglin Yin, Qianyu Ouyang, Xidao Wen, Shenglin Zhang, Wenchi Zhang, Li Cao, Jiuxue Han, Xing Jin, Dan Pei. Identifying Erroneous Software Changes through Self-Supervised Contrastive Learning on Time Series Data. IEEE International Symposium on Software Reliability Engineering (ISSRE), Charlotte, North Carolina, USA, October 31 – November 3, 2022 (CCF B) [paper] [slides].
 21. Shenglin Zhang, Zhenyu Zhong, Dongwen Li, Qiliang Fan, Yongqian Sun*, Man Zhu, Yuzhi Zhang, Dan Pei, Jiyan Sun, Yinlong Liu, Hui Yang, Yongqiang Zou. Efficient KPI Anomaly Detection Through Transfer Learning for Large-Scale Web Services. IEEE Journal on Selected Areas in Communications (JSAC), 40 (8):2440-2455, August 2022 (CCF A, SCI Indexed, Impact Factor: 9.144) [paper].
 22. Yongqian Sun, Kunlin Jian, Liyue Cui, Guifei Jiang, Shenglin Zhang*, Yuzhi Zhang, Dan Pei. Online Malicious Domain Name Detection with Partial Labels for Large-Scale Dependable Systems. The Journal of Systems & Software, 190: 1-12, 2022 (CCF B, SCI Indexed, Impact Factor: 2.829) [paper].
 23. Yongqian Sun, Daguo Cheng, Pengxiang Jin, Quan Ding, Shenglin Zhang*, Xu Chen, Yuzhi Zhang, Minghan Liang, Dan Pei, Jianyan Zheng, Sen Luo, Xinyu Tang. Robust Anomaly Clue Localization of Multi-dimensional Derived Measure for Online Video Services. IEEE Transactions on Services Computing. 16(2): 1387-1401, March 2022 (CCF A, SCI Indexed, Impact Factor: 8.216) [paper].
 24. Xianglin Lu, Zhe Xie, Zeyan Li, Mingjie Li, Xiaohui Nie, Nengwen Zhao, Qingyang Yu, Shenglin Zhang, Kaixin Sui, Lin Zhu and Dan Pei. Generic and Robust Performance Diagnosis via Causal Inference for OLTP Database systems. IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid), May 16-19, 2022 (CCF C) [paper].
 25. Shenglin Zhang, Dongwen Li, Zhenyu Zhong, Jun Zhu, Minghan Liang, Jiexi Luo, Yongqian Sun*, Ya Su, Sibo Xia, Zhongyou Hu, Yuzhi Zhang, Dan Pei, Jiyan Sun, and Yinlong Liu. Robust System Instance Clustering for Large-Scale Web Services. The Web Conference (WWW), Virtual Conference, April 25-29, 2022 (CCF A) [paper] [slides].
 26. Shenglin Zhang, Chenyu Zhao, Yicheng Sui, Ya Su*, Yongqian Sun, Yuzhi Zhang, Dan Pei, Yizhe Wang. “Robust KPI Anomaly Detection for Large-Scale Software Services with Partial Labels”. IEEE International Symposium on Software Reliability Engineering (ISSRE), October 25-28, 2021, Wuhan, China (CCF B) [paper] [slides].
 27. Minghua Ma, Shenglin Zhang*, Junjie Chen, Haozhe Li, Yongliang Lin, Jim Xu, Xiaohui Nie, Bo Zhu, Yong Wang. “Jump-Starting Multivariate Time Series Anomaly Detection for Online Service Systems”. USENIX Annual Technical Conference (USENIX ATC), Virtual Conference, July 14-16, 2021 (CCF A) [paper] [slides].
 28. Ya Su, Youjian Zhao, Ming Sun, Shenglin Zhang*, Xidao Wen, Yongsu Zhang, Xian Liu, Xiaozhou Liu, Junliang Tang, Wenfei Wu, Dan Pei. “Detecting Outlier Machine Instances through Gaussian Mixture Variational Autoencoder with One Dimensional CNN”. IEEE Transactions on Computers (TC), 71(4):892-905. (CCF A, SCI indexed, Impact Factor: 2.711) [paper].
 29. Weibin Meng, Ying Liu, Shenglin Zhang*, Federico Zaiter, Yuzhe Zhang, Yuheng Huang, Zhaoyang Yu, Yuzhi Zhang, Lei Song, Ming Zhang, Dan Pei. “LogClass: Anomalous Log Identification and Classification with Partial Labels”. IEEE Transactions on Network and Service Management (TNSM), 18(2): 1870-1884, June 2021 (SCI indexed, Impact Factor: 3.878, CCF C) [paper].
 30. Ming Sun, Ya Su, Shenglin Zhang, Yuanpu Cao, Yuqing Liu, Dan Pei, Wenfei Wu, Yongsu Zhang, Xiaozhou Liu, Junliang Tang. “CTF: Anomaly Detection in High-Dimensional Time Series with Coarse-to-Fine Model Transfer”. IEEE International Conference on Computer Communications (INFOCOM) 2021, Virtual Conference, May 2021 (CCF A) [paper] [slides].
 31. Ping Liu, Shenglin Zhang, Yongqian Sun*, Yuan Meng, Jiahai Yang, Dan Pei. “FluxInfer: Automatic Diagnosis of Performance Anomaly for Online Database System”. International Performance Computing and Communications Conference (IPCCC) 2020, November 2020, Virtual Conference (CCF C) [paper] [slides].
 32. Rui Chen, Shenglin Zhang, Dongwen Li, Yuzhe Zhang, Fangrui Guo, Weibin Meng, Dan Pei, Yuzhi Zhang, Xu Chen, Yuqing Liu. “Cross-System Log Anomaly Detection for Software Systems”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Virtual Conference, October 2020 (CCF B) [paper] [slides].
 33. Ping Liu, Haowen Xu, Qianyu Ouyang, Rui Jiao, Zhekang Chen, Xiaoying Bai, Shenglin Zhang, Jiahai Yang, Linlin Mo, Jice Zeng, Wenman Xue, Dan Pei. “Unsupervised Detection of Microservice Trace Anomalies through Service-Level Deep Bayesian Networks”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Virtual Conference, October 2020 (CCF B) [paper] [slides].
 34. Ruming Tang, Cheng Huang, Yanti Zhou, Hanwen Wu, Xianglin Lu, Yongqian Sun, Qi Li, Jinjin Lin, Weiyao Huang, Siyuan Sun, Dan Pei. “A Practical Machine Learning-Based Framework to Detect DNS Covert Communication in Enterprises”. SecureComm 2020, Virtual Conference, October 2020 (CCF C) [paper] [slides]
 35. Minghua Ma, Zheng Yin, Shenglin Zhang, Sheng Wang, Christopher Zheng, Xinhao Jiang, Hanwen Hu, Cheng Luo, Yilin Li, Nengjun Qiu, Feifei Li, Changcheng Chen, Dan Pei. “Diagnosing Root Causes of Intermittent Slow Queries in Cloud Databases”. International Conference on Very Large Data Bases (VLDB), Virtual Conference, August 2020 (CCF A) [paper] [slides].
 36. Weibin Meng, Ying Liu, Federico Zaiter, Shenglin Zhang*, Yihao Chen, Yuzhe Zhang, Yichen Zhu, En Wang, Ruizhi Zhang, Shimin Tao, Dian Yang, Rong Zhou, Dan Pei. “LogParse: Making Log Parsing Adaptive through Word Classification”. IEEE International Conference on Computer Communications and Networks (ICCCN) 2020, Virtual Conference, August 3-6, 2020 (CCF C) [paper] [slides].
 37. Weibin Meng, Ying Liu, Yuheng Huang, Shenglin Zhang*, Federico Zaiter, Bingjin Chen, Dan Pei. “A Semantic-aware Representation Framework for Online Log Analysis”. IEEE International Conference on Computer Communications and Networks (ICCCN) 2020, Virtual Conference, August 3-6, 2020 (CCF C) [paper] [slides].
 38. Yuan Meng, Shenglin Zhang*, Yongqian Sun, Ruru Zhang, Zhilong Hu, Yiyin Zhang, Chenyang Jia, Zhaogang Wang, Dan Pei. “Localizing Failure Root Causes in a Microservice through Causality Inference”. International Symposium on Quality of Service (IWQoS), Virtual Conference, June 2020 (CCF B) [paper] [slides]
 39. Zhang Shenglin, Li Dongwen, Sun Yongqian*, Meng Weibin, Zhang Yuzhe, Zhang Yuzhi, Liu Ying, Pei Dan. “Unified Anomaly Detection for Syntactically Diverse Logs in Cloud Datacenter”. Journal of Computer Research and Development, Volume 57, Issue 4, pp 778-790, 2020 [paper]
 40. Ruming Tang, Zheng Yang, Zeyan Li, Weibin Meng, Haixin Wang, Qi Li, Yongqian Sun, Dan Pei, Tao Wei, Yanfei Xu, Yan Liu. “ZeroWall: Detecting Zero-Day Web Attacks through Encoder-Decoder Recurrent Neural Networks”. IEEE International Conference on Computer Communications (INFOCOM), Virtual Conference, Apr 27-30, 2020 (CCF A) [paper] [slides]
 41. Ping Liu, Yu Chen, Xiaohui Nie, Jing Zhu, Shenglin Zhang, Kaixin Sui, Ming Zhang, Dan Pei. “FluxRank: A Widely-Deployable Framework to Automatically Localizing Root Cause Machines for Software Service Failure Mitigation”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Berlin, Germany, October 2019 (CCF B) [paper] [slides]
 42. Zeyan Li, Chengyang Luo, Yiwei Zhao, Yongqian Sun*, Kaixin Sui, Xiping Wang, Dapeng Liu, Xing Jin, Qi Wang, Dan Pei. “Generic and Robust Localization of Multi-Dimensional Root Causes”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Berlin, Germany, October 2019 (CCF B) [paper] [slides]
 43. Weibin Meng, Ying Liu, Yichen Zhu, Shenglin Zhang*, Dan Pei, Yuqing Liu, Yihao Chen, Ruizhi Zhang, Shimin Tao, Pei Sun, Rong Zhou. “LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs”. International Joint Conference on Artificial Intelligence (IJCAI), Macao, China, August 2019 (CCF A, acceptance rate 17.9%) [paper] [slides].
 44. Yuan Meng, Shenglin Zhang*, Zijie Ye, Benliang Wang, Zhi Wang, Yongqian Sun, Qitong Liu, Shuai Wang, Dan Pei. “Causal Analysis of the Unsatisfying Experience in Realtime Mobile Multiplayer Games in the Wild”. IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China, July 2019 (CCF B, acceptance rate 30%) [paper].
 45. Jiahao Bu, Ying Liu, Shenglin Zhang*, Weibin Meng, Qitong Liu, Xiaotian Zhu, Dan Pei. “Rapid Deployment of Anomaly Detection Models for Large Number of Emerging KPI Streams”. International Performance Computing and Communications Conference (IPCCC), Orlando, Florida, USA, November 2018 (CCF C, acceptance rate 28.8%) [paper] [slides]
 46. Minghua Ma, Shenglin Zhang*, Dan Pei, Xin Huang, Hongwei Dai. “Robust and Rapid Adaption for Concept Drift in Software System Anomaly Detection”. IEEE International Symposium on Software Reliability Engineering (ISSRE), Memphis, TN, USA, October 2018 (Best Research Paper Award, CCF B, acceptance rate 23.96%, 2/96) [paper] [slides]
 47. Shenglin Zhang, Ying Liu, Weibin Meng, Zhiling Luo, Jiahao Bu, Sen Yang, Peixian Liang, Dan Pei*, Jun (Jim) Xu, Yuzhi Zhang, Yu Chen, Hui Dong, Xianping Qu, Lei Song. “PreFix: Switch Failure Prediction in Datacenter Networks”. ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems 2018, Irvine, California, USA, June 2018 (CCF B, acceptance rate 20%, 54/270, one of the only two papers with institutes in Mainland China) [paper] [slides]
 48. Weibin Meng, Ying Liu, Shenglin Zhang*, Dan Pei, Hui Dong, Lei Song, Xulong Luo. “Device-Agnostic Log Anomaly Classification with Partial Labels”, IEEE/ACM International Symposium on Quality of Service (IWQOS), Banff, Alberta, Canada, June 2018 (CCF B) [paper] [slides]
 49. Shenglin Zhang, Ying Liu, Dan Pei, and Baojun Liu*. “Measuring BGP AS Path Looping (BAPL) and Private AS Number Leaking (PANL)”, Journal of Tsinghua University (Science and Technology), Volume 23, Number 1, pp 22–34, February 2018 (SCI Indexed, IF 1.328) [paper]
 50. Shenglin Zhang, Ying Liu, Dan Pei*, Yu Chen, Xianping Qu, Shimin Tao, Zhi Zang, Xiaowei Jing, Mei Feng. “FUNNEL: Assessing Software Changes in Web-based Services”, IEEE Transactions on Services Computing, 11(1): 34-48, January – February 2018 (CCF A, SCI Indexed, IF 5.823) [paper]
 51. Shenglin Zhang, Weibin Meng, Jiahao Bu, Sen Yang, Ying Liu, Dan Pei, Jun (Jim) Xu, Yu Chen, Hui Dong, Xianping Qu, Lei Song. “Syslog Processing for Switch Failure Diagnosis and Prediction in Datacenter Networks”, IEEE/ACM International Symposium on Quality of Service (IWQOS), Vilanova i la Geltrú, Spain, June 2017 (CCF B, acceptance rate 19.99%, 29/146) [paper] [slides]
 52. Shenglin Zhang, Ying Liu, Dan Pei, Yu Chen, Xianping Qu, Shimin Tao, and Zhi Zang. “Rapid and Robust Impact Assessment of Software Changes in Large Internet-based Services”, ACM International Conference on emerging Networking EXperiments and Technologies (CoNEXT), Heidelberg, Germany, December, 2015, 13 pages (CCF B, acceptance rate 20.92%, 41/196) [paper] [slides]
 53. Ying Liu, Gang Ren, Jianping Wu, Shenglin Zhang, Lin He, Yihao Jia. “Building An IPv6 Address Generation and Traceback System With NIDTGA in Address Driven Network”, SCIENCE CHINA Information Sciences, Volume 58, Issue 12, pp 1-14, December 2015 (CCF A, SCI Indexed, IF 3.304) [paper]
 54. Ying Liu, Shenglin Zhang*, and Hongying Liu. “A bottleneck-free model for P4P”, SCIENCE CHINA Information Sciences, Volume 58, Issue 10, pp 1-15, October 2015 (CCF A, SCI Indexed, IF 3.304) [paper]
 55. Shenglin Zhang, Ying Liu, Dan Pei. “A Measurement Study on BGP AS Path Looping Behavior”. IEEE International Conference on Computer Communications and Networks (ICCCN), Shanghai, China, August 4, 2014, 7 pages. (CCF C, acceptance rate 28%) [paper] 
 56. Ying Liu, Shenglin Zhang*, Hongying Liu. “An Improved Cooperative Model of P2P and ISP”. Frontiers in Internet Technologies, 85-96, LNCS, Springer, 2013. (EI Indexed) [paper]
