Wei Xu
徐 葳
Associate Professor
Institute for Interdisciplinary Information Sciences
Tsinghua University
(first_name).(last_name).0 at gmail.com
I am an assoicate professor at the Institute for Interdisciplinary Information Sciences of Tsinghua University in Beijing.
I have a broad research interest in distributed system design, big data and financial technology. My current projects include privacy-preserving computation, data center networking, large scale system for machine learning and data mining, as well as various big data applications.
I received my Ph.D from UC Berkeley in 2010. I was a member of RAD Lab in EECS Department. My advisors are Prof. David Patterson and Prof. Armando Fox. My dissertation is on analyzing free text console logs for problem detection. I worked for Google for 2.5 years as a software engineer before joining Tsinghua.
My Google Scholar Page tracks more recent publications, and may be more up-to-date than this pape.
Announcements
Looking for postdocs
I am looking for postdocs to work with me on a variety of projects in
big data, distributed systems and financial technology. If you are
interested, please drop me an email and we can chat.s
The details are here ( Chinese | English )
Clinically applicable histopathological diagnosis system for gastric cancer detection using deep learning
Zhigang Song, Shuangmei Zou, Weixun Zhou, Yong Huang, Liwei Shao, Jing Yuan, Xiangnan Gou, Wei Jin, Zhanbo Wang, Xin Chen,
Xiaohui Ding, Jinhong Liu, Chunkai Yu, Calvin Ku, Cancheng Liu, Zhuo Sun, Gang Xu, Yuefeng Wang, Xiaoqing Zhang, Dandan Wang,
Shuhao Wang, Wei Xu, Richard C. Davis and Huaiyin Shi
In Nature Communications, volume 11, Article number: 4294 (2020)
[pdf]
Gosig: A Scalable and High-Performance Byzantine Consensus System for Consortium Blockchains on Wide Area Network
Peilun Li, Guosai Wang, Xiaoqi Chen, Fan Long and Wei Xu
To appear in Symposium on Cloud Computing (SOCC) 2020
FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems
Lu Chen, Jiao Sun and Wei Xu
In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD) 2020
[pdf|slides]
Concerto: cooperative network-wide telemetry with controllable error rate
Yiran Li, Kevin Gao, Xin Jin and Wei Xu
In Proceedings of the 11th ACM SIGOPS Asia-Pacific Workshop on Systems (APSYS) 2020
[pdf|slides|video]
Secure multiparty computation for privacy-preserving drug discovery
Rong Ma, Yi Li, Chenxing Li, Fangping Wan, Hailin Hu, Wei Xu and Jianyang Zeng
In Bioinformatics 2020
[pdf]
Modeling Heterogeneous Statistical Patterns in High-dimensional Data by Adversarial Distributions: An Unsupervised Generative Framework
Han Zhang, Wenhao Zheng, Charley Chen, Kevin Gao, Yao Hu, Ling Huang and Wei Xu
In Proceedings of The Web Conference (WWW) 2020
[pdf|slides|poster]
A Decentralized Blockchain with High Throughput and Fast Confirmation
Chenxin Li, Peilun Li, Dong Zhou, Zhe Yang, Ming Wu, Guang Yang, Wei Xu, Fan Long and Andrew Chi-Chih Yao
In Proceedings of USENIX Annual Technical Conference (USENIX-ATC) 2020
[pdf|slides|video]
FDHelper: Assist Unsupervised Fraud Detection Experts with Interactive Feature Selection and Evaluation
Jiao Sun, Yin Li, Charley Chen, Jiahe Lee, Xin Liu, Zhongping Zhang, Ling Huang, Lei Shi and Wei Xu
In Proceedings of The ACM CHI Conference on Human Factors in Computing Systems (CHI), 2020
[pdf]
SGXPy: Protecting Integrity of Python Applications with Intel SGX
Denghui Zhang, Guosai Wang, Wei Xu and Kevin Gao
In Proceedings of The 26th Asia-Pacific Software Engineering Conference (APSEC) 2019
[pdf ]
CAMEL: A Weakly Supervised Learning Framework for Histopathology Image Segmentation
Gang Xu, Zhigang Song, Zhuo Sun, Calvin Ku, Zhe Yang, Cancheng Liu, Shuhao Wang, Jianpeng Ma and Wei Xu
In Proceedings of ICCV 2019
[pdf|poster]
Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction
Shun Zheng, Wei Cao, Wei Xu and Jiang Bian
In Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP)
[pdf|poster]
PrivPy: General and Scalable Privacy-Preserving Data Mining
Yi Li, Yitao Duan, Shuoyao Zhao, Yu Yu and Wei Xu
In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD) 2019
[pdf|slides|video]
DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction
Shun Zheng, Xu Han, Yankai Lin, Peilin Yu, Lu Chen, Ling Huang, Zhiyuan Liu and Wei Xu
In Proceedings of the 57th Conference of the Association for Computational Linguistics (ACL) 2019
[pdf|poster]
No Place to Hide: Catching Fraudulent Entities in Tensors
Yikun Ban, Xin Liu, Ling Huang, Yitao Duan, Xue Liu and Wei Xu
In Proceedings of The World Wide Web Conference (WWW) 2019
[paper|poster]
Data-Driven Data Center Temperature Modeling and Prediction
Charley Chen, Guosai Wang, Jiao Sun and Wei Xu
In Proceedings of ACM SIGOPS Asia-Pacific Workshop on Systems (APSys) 2018
[pdf|slides]
Exploring Business Models and Dynamic Pricing Frameworks for SPOC Services
Zhengyang Song, Yongzheng Jia and Wei Xu
In Proceedings of the 2nd International Workshop on Data Management and Mining on MOOCs (DMMOOC) 2018
[pdf|slides]
Do Not Pull My Data for Resale: Protecting Data Providers using Data Retrieval Pattern Analysis
Guosai Wang, Shiyang Xiang, Yitao Duan, Ling Huang and Wei Xu
In Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
2018 (Short paper)
[pdf]
FraudVis: Understanding Unsupervised Fraud Detection Algorithms
Jiao Sun, Qixin Zhu, Zhifei Liu, Xin Liu, Yueming Wang, Jihae Lee, Lei Shi, Ling Huang and Wei Xu
In Proceedings of the 11th IEEE Pacific Visualization Symposium (PacificVis 2018) (Short Paper)
[pdf|slides]
DumbNet: A Smart Data Center Network Fabric with Dumb Switches
Yiran Li, Da Wei, Xiaoqi Chen, Ziheng Song, Ruihan Wu, Yuxing Li, Xin Jin and Wei Xu
In Proceedings of the European Conference on Computer Systems (EuroSys) 2018
[pdf|slides]
When Online Dating Meets Nash Social Welfare: Achieving Efficiency and Fairness
Yongzheng Jia, Xue Liu and Wei Xu
In Proceedings of the Web Conference (WWW) 2018
[pdf|slides]
A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization
Shun Zheng, Jialei Wang, Fen Xia, Wei Xu and Tong Zhang
In Journal of Machine Learning Research (JMLR) 2017
[pdf]
PEM: Practical Differentially Private System for Large-Scale Cross-Institutional Data Mining
Yi Li, Yitao Duan and Wei Xu
In Proceedings of The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (PKDD) 2017
[pdf|slides]
Session-Based Fraud Detection in Online E-Commerce Transactions Using Recurrent Neural Networks
Shuhao Wang, Cancheng Liu, Xiang Gao, Hongtao Qu and Wei Xu
In Proceedings of The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (PKDD) 2017
[pdf|slides]
Improving Click-Through Rate Prediction Accuracy in Online Advertising by Transfer Learning
Yuhan Su, Zhongming Jin, Ying Chen, Xinghai Sun, Yaming Yang, Fangzheng Qiao, Fen Xia and Wei Xu
In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence (WI) 2017
[pdf|slides]
An Optimization Framework For Online Ride-sharing Markets
Yongzheng Jia, Wei Xu and Xue Liu
In Proceedings of the IEEE International Conference On Distributed Computing Systems (ICDCS) 2017
[pdf|slides]
Joint Training for Pivot-based Neural Machine Translation
Yong Cheng, Qian Yang, Yang Liu, Maosong Sun and Wei Xu
In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) 2017
[pdf|slides]
Learning to Read Chest X-Ray Images from 16000+ Examples Using CNN
Yuxi Dong, Yuchao Pan, Jun Zhang and Wei Xu
In Proceedings of the 2nd International Workshop on Big Data Analytics for Smart and Connected Health (BIGDATA4HEALTH) 2017
[pdf]
What Can We Learn from Four Years of Data Center Hardware Failures?
Guosai Wang, Lifei Zhang and Wei Xu
In Proceedings of The 47th IEEE/IFIP International Conference on Dependable Systems and Networks (DSN) [BEST PAPER AWARD] 2017
[pdf|slides]
Identifying Carotid Plaque Composition in MRI with Convolutional Neural Networks
Yuxi Dong, Yuchao Pan, Xihai Zhao, Rui Li, Chun Yuan and Wei Xu
In Proceedings of the 3rd IEEE International Conference on Smart Computing (SMARTCOMP) 2017
[pdf|slides]
DataLab: Introducing Software Engineering Thinking into Data Science Education at Scale
Yang Zhang, Tingjian Zhang, Yongzheng Jia, Jiao Sun, Fangzhou Xu and Wei Xu
In Proceedings of The 39th International Conference on Software Engineering - Software Engineering Education and Training Track (ICSE-SEET) 2017
[pdf|slides]
Towards Economic Models for MOOC Pricing Strategy Design
Yongzheng Jia, Zhengyang Song, Xiaolan Bai and Wei Xu
In Proceedings of The 1st International Workshop on Data Management and Mining on MOOCs (DMMOOC) 2017
[pdf|slides]
Maximum Reconstruction Estimation for Generative Latent Variable Models
Yong Cheng, Yang Liu ang Wei Xu
In Proceedings of The 31st AAAI Conference on Artificial Intelligence (AAAI) 2017
[pdf|slides]
CIDS: Adapting Legacy Intrusion Detection Systems to the Cloud with Hybrid Sampling
Qingtang Xia, Tianjia Chen and Wei Xu
In Proceedings of The 6th IEEE International Symposium on Cloud and Service Computing (SC2) 2016
[pdf|slides]
Semi-supervised Learning for Neural Machine Translation
Yong Cheng, Wei Xu, Zhongjun He, Wei He, Hua Wu, Maosong Sun and Yang Liu
In Proceedings of The Annual Meeting of the Association for Computational Linguistics (ACL’16) 2016
[pdf|slides]
Debugging OpenStack Problems Using a State Graph Approach
Yong Xiang, Hu Li, Sen Wang, Charley Peter Chen and Wei Xu
In Proceedings of ACM SIGOPS Asia-Pacific Workshop on Systems (APSys'16) [BEST PAPER AWARD] Hong Kong, China, 2016
[pdf|slides]
Optimizing Hash-based Distributed Storage Using Client Choices
Peilun Li and Wei Xu
In Proceedings of ACM SIGOPS Asia-Pacific Workshop on Systems (APSys'16) Hong Kong, China, 2016
[pdf|slides]
Optimizing Bulk Transfers with Software-Defined Optical WAN
Xin Jin,Yiran Li, Da Wei, Siming Li, Jie Gao, Lei Xu, Guangzhi Li, Wei Xu and Jennifer Rexford
In Proceedings of Sigcomm 2016 Brazil, August, 2016
[pdf|slides]
A 12-Rack, 180-Server Datacenter Network (DCN) Using Multiwavelength Optical Switching and Full Stack Optimization
Da Wei, Lei Xu, Xin Jin, Yiran Li and Wei Xu
In Optical Fiber Communication Conference (OFC), (Postdeadline Paper PDP) USA, March, 2016
[pdf|slides]
Predicting Inter-Data-Center Network Traffic Using Elephant Flow and Sublink Information
Yi Li, Hong Liu, Wenjun Yang, Dianming Hu, Xiaojing Wang and Wei Xu
In IEEE Transactions on Network and Service Management, 13, no. 4 (2016): 782-792
[pdf]
DataLab: A Version Data Management and Analytics System
Yang Zhang, Fangzhou Xu, Erwin Frise, Siqi Wu, Bin Yu and Wei Xu
In Proceedings of ICSE first International Workshop on BIG Data Software Engineering, Austin USA, May, 2016
[pdf|slides]
Improving Spark Performance with Zero-copy Buffer Management and RDMA
Hu Li and Wei Xu
In proceedings of IEEE INFOCOM First International Workshop on Big Data Sciences, Technologies and Applications (BDSTA 2016),
San Francisco, USA, Apr, 2016
[pdf|slides]
cOSPREY: A Cloud-Based Distributed Algorithm for Large-Scale Computational Protein Design
Yuchao Pan, Yuxi Dong, Jingtian Zhou, Mark Hallen, Bruce R. Donald, Jianyang Zeng and Wei Xu
In Journal of computational biology (JCB), 2016
[pdf|source code]
Increasing Large-Scale Data Center Capacity by Statistical Power Control
Guosai Wang, Shuhao Wang, Bing Luo, Xin Jin, Yinghang Zhu, Wenjun Yang, Longbo Huang, Weisong Shi, Dianming Hu and Wei Xu
In proceedings of the European Conference on Computer Systems (EuroSys 2016), London, UK, Apr, 2016
[pdf|slides]
Scalable Kernel TCP Design and Implementation for Short-Lived Connections
Xiaofeng Lin, Yu Chen, Xiaodong Li, Junjie Mao, Wei Xu, Jiaquan He, and Yuanchun Shi
In proceedings of the 21th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2016), Atlanta, USA, Apr, 2016
[pdf]
MED: The Monitor-Emulator-Debugger for Software-Defined Networks
Quanquan Zhi, Wei Xu
In proceedings of IEEE International Conference on Computer Communications (INFOCOM 2016), San Francisco, USA, Apr, 2016
[pdf|slides]
Inter-Data-Center Network Traffic Prediction with Elephant Flows
Yi Li, Hong Liu, Wenjun Yang, Dianming Hu and Wei Xu
In proceedings of IEEE/IFIP Network Operations and Management Symposium (NOMS 2016), Istanbul, Turkey, Apr, 2016
[pdf|slides]
An Efficient Parallel Algorithm for Accelerating Computational Protein Design
Yichao Zhou, Wei Xu, Bruce R. Donald, Jianyang Zeng
In proceedings of ISMB 2014, Bioinformatics. Boston, Massachusetts, USA, July 2014
[PubMed Link]
Advances and Challenges in Log Analysis
Adam Oliner, Archana Ganapathi, Wei Xu
In Communications of ACM (CACM) and ACM Queue, (Invited article), Feb, 2012
[ACM Digital Library Link]
Experience on Mining Google's Production Console Logs
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In the Workshop on Managing Systems via Log Analysis and Machine Learning Techniques (SLAML '10), Vancouver, BC Oct, 2010
[pdf]
A graphical representation for identifier structure in logs
Ariel Rabkin, Avani Wildani, Randy Katz, Wei Xu, Armando Fox
In the Workshop on Managing Systems via Log Analysis and Machine Learning Techniques (SLAML '10), Vancouver, BC Oct, 2010
[pdf]
Detecting Large Scale System Problems by Mining Console Logs
Wei Xu
PhD dissertation, UC Berkeley, July, 2010
[pdf]
Using Machine Learning Techniques in Console Log Analysis
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the 27th International Conference on Machine Learning (ICML’10), (Invited application paper) Haifa, Israel, June 2010
[pdf]
Online system problem detection by mining patterns of console logs
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the IEEE International Conference on Data Mining (ICDM’ 09), Miami, FL, December 2009
[pdf]
Large-scale system problem detection by mining console logs
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the 22nd ACM Symposium on Operating Systems Principles (SOSP’ 09), Big Sky, MT, October 2009
[pdf][dataset][code]
Mining console logs for large-scale system problem detection
Wei Xu, Ling Huang, Armando Fox, David Patterson, and Michael Jordan
In Proc. of the 3rd workshop on Tackling Computer Systems Problems with Machine Learning Techniques (SysML’08), San Diego, CA, December 2008
[pdf]
Regulating workload in J2EE application servers
Wei Xu, Zhangxi Tan, Armando Fox and David Patterson
In Proc. of the 1st International Workshop on Feedback Control Implementation and Design in Computing Systems and Networks (FeBID’06), Vancouver, Canada, April 2006
[pdf]
Predictive control for dynamic resource allocation in enterprise data centers
Wei Xu, Xiaoyun Zhu, Sharad Singhal, and Zhikui Wang
In Proc. of the 10th IEEE/IFIP Network Operations & Management Symposium (NOMS'06), Vancouver, BC, Apr. 2006
[pdf]
Feedback control theory and processing system log streams
Wei Xu
Master thesis, EECS Department, UC Berkeley, December, 2005
[pdf]
Control considerations for scaling event processing
Wei Xu, Joseph L. Hellerstein, Bill Kramer and David Patterson
In Proc. of the 16th IFIP/IEEE Distributed Systems: Operations and Management (DSOM'05), Barcelona, Spain, October 2005
[pdf]
A flexible framework for statistical learning and data mining from system log streams
Wei Xu, Peter Bodik and David Patterson
In Proc. of Workshop on Temporal Data Mining: Algorithms, Theory and Applications at The Fourth IEEE International Conference
on Data Mining (ICDM'04), Brighton, UK, Nov, 2004
[pdf]
Peer-to-Peer support for massively multiplayer games
Bjorn Knutsson, Honghui Lu, Wei Xu and Bryan Hopkins
In Proc. of the 23rd Conference of the IEEE Communications Society (INFOCOM’04), Hong Kong, March 2004
[pdf]
Non-technical Publications
建立与研究生的信任——写在我的第一个博士生毕业之际 (In Chinese)
徐葳
In 科技导报 2017,35(18)[pdf]
From MOOC to SPOC: Lessons from MOOC at Tsinghua and UC Berkeley (In Chinese)
Wei Xu, Yongzheng Jia, Armando Fox and David Patterson
In Modern Distance Education Research, 2014
[pdf]
CV
Here is my CV (as of Aug, 2018).