[Home]
Publications (outdated):
DBLP
Google Scholar
Patents
2021
- SODA, Coresets for Clustering in Excluded-minor Graphs and Beyond
with Shaofeng H.-C. Jiang, Robert Krauthgamer, Xuan Wu
Full version here
- NSDI, Twenty Years After: Hierarchical Core-Stateless Fair Queueing
with Zhuolong Yu, Jingfeng Wu, Ion Stoica, Xin Jin,
2020
-
Workshop on Optimization for Machine Learning (OPT), 2020,
Direction Matters: On the Implicit Regularization Effect of Stochastic Gradient Descent with Moderate Learning Rate
with Jingfeng Wu, Difan Zou, Quanquan Gu
Full version here
- IEEE BigData, Sketch and Scale: Geo-distributed tSNE and UMAP
with Viska Wei, Nikita Ivkin, Alexander Szalay
Full version here
- BMC Neurology, Longitudinal functional and imaging outcome measures in FKRP limb-girdle muscular dystrophy
with Doris Gay, Yee Leung, Michael A. Jacobs, Shivani Ahlawat, Alex E. Bocchieri, Vishwa S. Parekh, Katherine Summerton, Jennifer Mansour, Genila Bibat, Carl Morris, Shannon Marraffino, Kathryn R. Wagner,
Full version here
- SIGMETRICS, I Know What You Did Last Summer: Network Monitoring using Interval Queries
with Nikita Ivkin, Ran Ben Basat, Zaoxing Liu, Gil Einziger, Roy Friedman,
Journal Version in POMACS
- FOCS, Near Optimal Linear Algebra in the Online and Sliding Window Models
with Petros Drineas, Cameron Musco, Christopher Musco, Jalaj Upadhyay, David P. Woodruff, Samson Zhou
Full version here
- ICML, FetchSGD: Communication-Efficient Federated Learning with Sketching
with Daniel Rothchild, Ashwinee Panda, Enayat Ullah, Nikita Ivkin, Ion Stoica, Joseph Gonzalez, Raman Arora
Full version here
Code here
- ICML, On the Noisy Gradient Descent that Generalizes as SGD
with Jingfeng Wu, Wenqing Hu, Haoyi Xiong, Jun Huan, Zhanxing Zhu
Full version here
- ICML, Obtaining Adjustable Regularization for Free via Iterate Averaging
with Jingfeng Wu, Lin Yang
Full version here
- ICML, Schatten Norms in Matrix Streams: Hello Sparsity, Goodbye Dimension
with Aditya Krishnan, Roi Sinoff, Robert Krauthgamer
Full version here
- ICML, Coresets for Clustering in Graphs of Bounded Treewidth
with Daniel Baker, Lingxiao Huang, Shaofeng H.-C. Jiang, Robert Krauthgamer, Xuan Wu
Full version here
- SIGCOMM, NetLock: Fast, Centralized Lock Management Using Programmable Switches
with Zhuolong Yu, Yiwen Zhang, Mosharaf Chowdhury, Xin Jin
Full version here
- MIDL, Multitask radiological modality invariant landmark localization using deep reinforcement learning
with Vishwa S. Parekh, Alex E. Bocchieri, Michael A. Jacobs
Full version here
- APoCS, Memory-Efficient Performance Monitoring on Programmable Switches with Lean Algorithms
with Zaoxing Liu, Samson Zhou, Ori Rottenstreich, Jennifer Rexford
Full version here
2019
- CoNEXT, QPipe: Quantiles Sketch Fully in the Data Plane
with Nikita Ivkin, Zhuolong Yu, Xin Jin
Full version here
- OPT-ML, Obtaining Regularization for Free via Iterate Averaging
with Jingfeng Wu, Lin Yang
Full version here
- The Annual Conference on Astronomical Data Analysis and Software Systems (ADASS)
Six Dimensional Streaming Algorithm for Cluster Finding in N-Body Simulations
with Aidan Reilly, Nikita Ivkin, Gerard Lemson, Alex Szalay
Full version here
- RANDOM, Streaming Coresets for M-Estimators
with Dan Felmdan, Harry Lang, Daniela Rus
Full version here
- APPROX, Improved Algorithms for Time Decay Streams
with Enayat Ullah, Harry Lang, Samson Zhou
Full version here
- UAI, Online Factorization and Partition of Complex Networks by Random Walk
with Lin F. Yang, Tuo Zhao, Mengdi Wang
Full version here
- MIDL, Multiparametric Deep Learning Tissue Signatures for Muscular Dystrophy: Preliminary Results
with Alex E. Bocchieri, Vishwa S. Parekh, Kathryn R. Wagner, Shivani Ahlawat, Doris G. Leung, Michael A. Jacobs
Full version here
- SIGCOMM, NitroSketch: Robust and General Sketch-based Monitoring in Software Switches
with Zaoxing Liu, Ran Ben Basat, Gil Einziger, Yaron Kassner, Roy Friedman, Vyas Sekar
Full version here
- SIGCOMM (posters and demos), Attack Time Localization using Interval Queries
with Nikita Ivkin, Ran Ben Basat, Zaoxing Liu, Gil Einziger, Roy Friedman
Full version here
- ICML, Coresets for Ordered Weighted Clustering
with Shaofeng Jiang, Robert Krauthgamer, Xuan Wu
Full version here
- CSR, Approximations of Schatten Norms via Taylor Expansions
Full version here
- FAST, DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching, (best paper)
with Zaoxing Liu, Zhihao Bai, Zhenming Liu, Xiaozhou Li, Changhoon Kim, Xin Jin, Ion Stoica
Full version here
- SoCG, The One-Way Communication Complexity of Dynamic Time Warping Distance
with Moses Charikar, William Kuszmaul, David P. Woodruff, Lin F. Yang
Full version here
- NeurIPS, Communication-efficient Distributed SGD with Sketching
with Nikita Ivkin, Daniel Rothchild, Enayat Ullah, Ion Stoica, Raman Arora
Full version here
Code here
2018
- Astronomy and Computing, Streaming Tools for Analyzing $N$-body Simulations: Finding Halos and Investigating Excursion Sets in One Pass,
with Nikita Ivkin, Zaoxing Liu, Lin F. Yang, Srinivas Suresh Kumar, Gerard Lemson, Mark Neyrinck, Alexander S. Szalay, Tamas Budavari
Full version here
- NSF Workshop Reports, Challenges and Opportunities in Big Data Research: Outcomes from the Second Annual Joint PI Meeting of the NSF BIGDATA Research Program and the NSF Big Data Regional Innovation Hubs and Spokes Programs 2018
with Swarup Samarth, Raman Arora, Doina Caragea, Melissa Cragin, Jennifer Dy, Vasant Honavar et al
Full version here
- NeurIPS, Differentially Private Robust Low-Rank
Approximation
with Raman Arora, Jalaj Upadhyay
Full version here
- NeurIPS, The Physical Systems Behind Optimization Algorithms
with Lin F. Yang, Raman Arora, Vladimir Braverman, Tuo Zhao
Full version here
- OSDI, ASAP:Fast, Approximate Graph Pattern Mining at Scale
with Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Ion Stoica
Full version here
- APPROX, Nearly Optimal Distinct Elements and Heavy Hitters on Sliding Windows
with Elena Grigorescu, Harry Lang, David Woodruff, Samson Zhou
Full version here
- ICML, Matrix Norms in Data Streams: Faster, Multi-Pass and Row-Order
with Stephen R. Chestnut, Robert Krauthgamer, Yi Li, David P. Woodruff, Lin F. Yang
Full version here
- HotCloud, Towards Fast and Scalable Graph Pattern Mining
with Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Ion Stoica
Full version here
- ICALP, Revisiting Frequency Moment Estimation in Random Order Streams
with Emanuele Viola, David Woodruff, Lin F. Yang
Full version here
- ICALP, Approximate Convex Hull of Data Streams
with Avrim Blum, Ananya Kumar, Harry Lang, Lin F. Yang
Full version here
2017
- OPT, Dynamic Factorization and Partition of Complex Networks
with Lin F. Yang, Tuo Zhao, Mengdi Wang
Full version arXiv
- FWCG, Approximate Convex Hull of Data Streams
with Avrim Blum, Ananya Kumar, Harry Lang, Lin Yang
Full version here
- ICML, Clustering High Dimensional Dynamic Data Streams
with Gereon Frahling, Harry Lang, Christian Sohler, Lin F. Yang
Full version here
- STOC, CStreaming Symmetric Norms via Measure Concentration
with Stephen R. Chestnut, Robert Krauthgamer, Lin F. Yang
Full version here
- CALDAM, Accurate Low-Space Approximation of Metric k-Median for Insertion-Only Streams
with Harry Lang, Keith Levin
Full version here
- PODS, BPTree: an $\ell_2 $ heavy hitters algorithm using constant memory
with Stephen R. Chestnut, Nikita Ivkin, Jelani Nelson, Zhengyu Wang, and David P. Woodruff,
Full version here
2016
- Encyclopedia of Algorithms, Sliding Window Algorithms
Full version here
- RANDOM, Approximating subadditive Hadamard functions on implicit matrices
with Alan Roytman, Gregory Vorsanger
Full version here
- SIGCOMM, One Sketch to Rule Them All: Rethinking Network Flow Monitoring with UnivMon
with Zaoxing Liu, Antonis Manousis, Greg Vorsanger, Vyas Sekar
Selected as Plenary Talk ("Best of Theory") at STOC 2018.
Full version here
- PODS, Streaming Space Complexity of Nearly All Functions of One Variable on Frequency Vectors
with Stephen Chestnut, David Woodruff, Lin Yang
Full version here
- STOC, Beating CountSketch for Heavy Hitters in Insertion Streams
with Stephen Chestnut, Nikita Ivkin, David R. Woodruff
Full version here
- SODA, Clustering Problems on Sliding Windows
with Harry Lang, Keith Levin, Morteza Monemizadeh
Full version here
2015
- IPL, Weighted Sampling Without Replacement from
Data Streams
with Rafail Ostrovsky, Gregory Vorsanger
Full version here
- HotNets, Enabling a "RISC" Approach for Software-Defined Monitoring using Universal Streaming
with Zaoxing Liu, Gregory Vorsanger, Vyas Sekar
Full version here
- FSTTCS, Clustering on Sliding Windows in Polylogarithmic Space
with Harry Lang, Keith Levin, Morteza Monemizadeh
Full version here
- MFCS, New Bounds for the CLIQUE-GAP Problem Using Graph Decomposition Theory
with Zaoxing Liu, Tejasvam Singh, N. V. Vinodchandran, Lin F. Yang
Full version here
Journal version in Algorithmica
- E-Science, Streaming Algorithms for Halo Finders
with Zaoxing Liu, Nikita Ivkin, Lin Yang, Mark Neyrinck, Gerard Lemson, Alexander Szalay, Tamas Budavari, Randal Burns and Xin Wang
Full version here
- RANDOM, Universal sketches for the frequency negative moments and other decreasing streaming sums
with Stephen Chestnut
Full version here
- RANDOM, Zero-One Laws for Sliding Windows and Universal Sketches
with Rafail Ostrovsky and Alan Roytman
Full version here
2014
- RANDOM, An Optimal Algorithm for Large Frequency Moments Using $O(n^{1-2/k})$ Bits
with Jonathan Katzman, Charles Seidell, and Gregory Vorsanger
Full version here
- COCOON, Sampling from Dense Streams Without Penalty: Improved Bounds for Frequency Moments and Heavy Hitters
with Gregory Vorsanger
Full version here
2013
- ICALP, How Hard Is Counting Triangles in the Streaming Model?
with Rafail Ostrovsky, Dan Vilenchik
Full version here
- COCOON, How to Catch $L_2$-Heavy-Hitters on Sliding Windows
with Ran Gelles, Rafail Ostrovsky
Full version here
Journal version in Theoretical Computer Science (TCS), here
- RANDOM, Approximating Large Frequency Moments with Pick-and-Drop Sampling
with Rafail Ostrovsky
Full version here
- RANDOM, Generalizing the Layering Method of Indyk and Woodruff: Recursive Sketches for Frequency-Based Vectors on Streams
with Rafail Ostrovsky
Full version here
2011
- ACM-SIAM, Streaming $k$-means on Well-Clusterable Data
with Adam Meyerson, Rafail Ostrovsky, Alan Roytman, Michael Shindler, Brian Tagiku
Full version here
2010
- STOC, Measuring Independence of Datasets
with Rafail Ostrovsky
Full version here
- STOC, Zero-One Frequency Laws
with Rafail Ostrovsky
Full version here
- STACS, AMS Without $\bf{4}$-Wise Independence on Product Domains
with Kai-Min Chung, Zhenming Liu, Michael Mitzenmacher, Rafail Ostrovsky
Full version here
- SIAM Journal on Computing, Effective Computations on Sliding Windows
with Rafail Ostrovsky
Full version here
2009
- TALG, A linear algorithm for computing convex hulls for random lines
with Daniel Berend
Full version here
- PODS, Optimal Sampling from Sliding Windows
with Rafail Ostrovsky, Carlo Zaniolo
Full version here
Journal version in JCSS here
2007
- FOCS, Smooth Histograms for Sliding Windows
with Rafail Ostrovsky
Full version here
2006
- SIGMETRICS Performance Evaluation Review, Batched disk scheduling with delays
with Eitan Bachmat
Full version here
2005
- AofA, Convex Hull for Intersections of Random Lines
with Daniel Berend
Full version here