This includes the collection of homework assignment for CSCI 5523- Data Mining for spring 2025 at University of Minnesota. All the assignment are released after their extended due dates. It includes assignment on following topics:
- Introduction to Pyspark
- SON algorithm (Using APriori and PCY in first pass) for frequent itemsets identification
- MinHashing