This repository is an educational project focused on big data processing using Python. It includes scripts and datasets for tasks such as word count analysis and data manipulation.
-
Updated
Mar 11, 2025 - Python
This repository is an educational project focused on big data processing using Python. It includes scripts and datasets for tasks such as word count analysis and data manipulation.
This repository presents a 2-round coreset-based MapReduce algorithm designed to address the k-center problem with z outliers.
📘 This repository contains the assignments (from 2019, 2021, 2022, 2023, 2024, and 2025 sessions), notes (from 2025 sessions) for the Big Data Computing course offered by SWAYAM-NPTEL and final exam review & analysis for the 2025 session.
This is a small project for Big Data Computing course, applying Dimensionality Reduction, Sampling and Clustering for topic detection in text documents.
Homeworks from the Big Data Computing course, UniPD, 2021/22
Add a description, image, and links to the big-data-computing topic page so that developers can more easily learn about it.
To associate your repository with the big-data-computing topic, visit your repo's landing page and select "manage topics."