Wei Tan

Quantitative Researcher
Citadel LLC

Email:	[Turn on javascirpt to check the link]

Interest

deep learning and NLP in quantitative finance
high-performance computing, GPU, and distributed systems

Biography

Before joining Citadel he was a Research Staff Member at the Cognitive Computing division, IBM T. J. Watson Research Center. Wei has a wide range of research interests in distributed computing, machine learning and GPU computing. Specifically, he worked on GPU accelerated platform for large-scale machine learning. He developed cuMF, by far the fastest matrix factorization library on GPUs.

Prior to that, he worked with Prof. Ian Foster on grid computing, at the University of Chicago and Argonne National Laboratory. He received his Ph.D. from Tsinghua University, China.

He is a recipient of the IEEE Peter Chen Big Data Young Researcher Award in 2016, and IBM Outstanding Technical Acoomplishment Award for many times. He also received best (student) paper awards from IEEE ICWS, IEEE SCC and IEEE ccGrid. His research and software have been incorporated in IBM products, open-source offerings and patent portfolio.

News

PC member. The International Conference for High Performance Computing, Networking, Storage, and Analysis. November 15–20, 2020. SC 20
"CuLDA: Solving Large-scale LDA Problems on GPUs", HPDC 19. (arXiv, code)
The most recent paper on cuMF, "Matrix Factorization on GPUs with Memory Optimization and Approximate Computing", ICPP, August 13-16, 2018, Eugene, Oregon, USA. (arXiv)
Area Vice Chair, IEEE IPDPS, Vancouver, Canada. May 21-25, 2018.
Invited Talk, "Matrix Factorization on GPUs: A Tale of Two Algorithms", ParLearning Workshop, IEEE IPDPS 2018.

Selected Publications

For the full publication list see Google Scholar.

Business and Scientific Workfows: A Web Service-Oriented Approach.
Wei Tan, MengChu Zhou.
Wiley-IEEE Press, 2013. [Book]
CuMF SGD: Fast and Scalable Matrix Factorization.
Xiaolong Xie, Wei Tan, Liana L. Fong, Yun Liang.
ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2017.
[arXiv Preprint] [code]
Faster and Cheaper: Parallelizing Large-Scale Matrix Factorization on GPUs.
Wei Tan, Liangliang Cao, Liana L. Fong.
ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2016.
[arXiv Preprint] [code]
SR-LDA: Mining Effective Representations for Generating Service Ecosystem Knowledge Maps.
Bing Bai, Yushun Fan, Wei Tan, Jia Zhang
IEEE International Conference on Services Computing, (SCC), 2017. [Paper]
Best Paper Award
Dilated Recurrent Neural Networks.
Shiyu Chang, Yang Zhang, Wei Han, Mo Yu, Xiaoxiao Guo, Wei Tan, Xiaodong Cui, Michael J. Witbrock, Mark A. Hasegawa-Johnson, Thomas S. Huang
Neural Information Processing Systems, (NIPS), 2017. [Paper]
DeepSea: Progressive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics.
Jiang Du, Renée J Miller, Boris Glavic, Wei Tan
International Conference on Extending Database Technology, (EDBT), 2017. [Paper]

Open-source Software

cuMF: CUDA Matrix Factorization library for recommender systems, link prediction, word embedding and other latent models. Also integrated with Spark to accelerate ALS in MLLlib (cuMF, IBMSparkGPU).

Awards

2017, 2016, 2014 Outstanding Technical Achievement Award, IBM
2017 Best Paper Award, IEEE SCC
2016 Peter Chen Big Data Young Researcher Award, IEEE
To who "has made significant contributions to Big Data research as evidenced by top publications, citations and awards"
2016 Best Student Paper Award Runner-Up, IEEE ICWS
2015 Best Paper Award, IEEE/ACM CCGrid
2014 Best Student Paper Award, IEEE ICWS
2011 Best Paper Award, IEEE SCC
2010 Pacesetter Award, Argonne National Laboratory, USA
"for excellence in achievement and performance which truly surpasses normal job expectations"
2008 caBIG Teamwork Award, National Cancer Institute, USA for who "made significant contributions to the caBIG community"
2008 Outstanding Poster Award, Biomedical Informatics Without Borders Meeting, National Cancer Institute (NCI), USA and National Cancer Research Institute (NCRI), UK
2006 IBM Ph.D. Fellowship Award
"honors exceptional Ph.D. students who have an interest in solving problems that are important to IBM and fundamental to innovation"

Patents

US10587681, Deployment of multi-task analytics applications in multi-clouds.
US10354006, System, method, and recording medium for web application programming interface recommendation with consumer provided content.
US10380222, Matrix factorization with two-stage data block dispatch associated with graphics processing units.
US10319069, Matrix factorization with approximate computing.
US10310908, Dynamic usage balance of central processing units and accelerators.
US10203988, Adaptive parallelism of task execution on machines with accelerators.
US10346505, System, method, and recording medium for differentiated and partial feature update in alternating least square.
US10169275, System, method, and recording medium for topology-aware parallel reduction in an accelerator.
US9626736, Memory-aware matrix factorization.
US10423575, Computational storage for distributed computing.
US9998531, Computer-based, balanced provisioning and optimization of data transfer resources for products and services.
US9998550, Network based service composition with variable conditions.
US9460147, Partition-based index management in hadoop-like data stores.
US9218383, Differentiated secondary index maintenance in log structured NoSQL data stores.
US9996568, Index maintenance based on a comparison of rebuild vs. update.
US9311252, Hierarchical storage for LSM-based NoSQL stores.
US9736199, Dynamic and collaborative workflow authoring with cloud-supported live feedback.
US8843894, Preferential execution of method calls in hybrid systems.

Last updated in July, 2020.