This course seeks a balance between foundational
but relatively basic material in algorithms, statistics, graph theory and related fields, with real-world
applications inspired by the current practice of
internet and cloud services. Specifically, this
course will look at social & information networks,
recommender systems, clustering and community
detection, search/retrieval/topic models,
dimensionality reduction, stream computing, and
online ad auctions. Together, these provide a
good coverage of the main uses for data mining
and analytics applications in social networking,
e-commerce, social media, etc. The course is a
combination of theoretical materials and weekly
laboratory sessions, where several large-scale
datasets from the real world will be explored.
For this, students will work with a dedicated
infrastructure based on Hadoop & Apache Spark.
Outcome: Not Provided