Instructors




Class Schedule


Week Date Instructor Topics Slides Notes
1 24/10/2017 Edith Cohen Distributed and streamed data.
Approximate summary structures.
Frequent elements: Misra Gries Summaries
Set memebership: Bloom filters
Counting: Morris approximate counters
Distinct counting: preview
PPT PDF
2 31/10/2017 Edith Cohen
Sampling-based summaries (MinHash sketches):
Distinct counting
Reservoir sampling, Distinct sampling
Domain-size estimators
PPT PDF
3 7/11/2017 Edith Cohen Sampling-based summaries (continued)
Set relations (union size, Jaccard, cosine similarity)
Entities as sets of features:
Broder's n-grams for text documents
Estimating set relations from MinHash sketches
weighted sampling: Poisson PPS, multiobjective
Bottom-k samples/sketches
Intro to Linear sketches (random projections)
PPT PDF
4 14/11/2017 Haim Kaplan Linear sketches (random projections)
Second frequency moment: The AMS sketch
Dimensionality reduction:
Johnson-Lindenstrauss transform
PPT PDF
5 21/11/2017 Haim Kaplan Dimensionality reduction (continued)
Matrix factorizations: PCA, SVD
notes (PDF)
6 28/11/2017 Amos Fiat Data Privacy PPT PDF
7 5/12/2017 Uri Stemmer Practical local differential privacy
Users retain their data and send safe randomizations.
notes (PDF) Guest lecture
8 12/12/2017 Haim Kaplan Nearest neighbor search/classifiers
Locality Sensitive Hashing (LSH)
PPT PDF
9 19/12/2017 Edith Cohen Graph mining/learning:
Introduction
Eigenvector centrality: Degree to PageRank
MinHash sketches of reachability sets:
(reach centrality, similarity, influence)
Efficient computation.
PPT PDF
10 26/12/2017 Edith Cohen Graph mining/learning (continued):
Distance-based graph mining
All-distance sketches (ADS):
Efficient computation, Applications
PPT PDF
11 2/1/2018 Edith Cohen Graph mining/learning (continued):
ADS application: Approximate distance oracles
Submodular set functions:
Definition, properties, applications
Maximizing submodular monotone functions:
(Approximate) Greedy maximization
PPT PDF tentative
12 9/1/2018 Edith Cohen Graph mining/learning (continued)
Graph-based influence maximization
Semi-supervised learning
PPT PDF tentative
13 16/01/2018 Haim Kaplan clustering PPT PDF

Last modified: Mon Jan 8 22:54:53 IST 2018