big-data
Here are 58 public repositories matching this topic...
Big Data, Docker, Data Science, Spark, Spark3, Hadoop, HDFS, Scala, Python, Artificial Intelligence, Machine Learning, Jupyter Lab Notebook
Updated Sep 29, 2024 - Python
Families In the Wild: A Kinship Recognition Toolbox.
Updated Sep 14, 2024 - Python
📈📊 Big Data Notebooks. ▫️ Large-scale data analysis with PySpark. ▫️ Data ingestion. ▫️ Machine learning algorithms on massive data. ▫️ Real-time message processing with Kafka.
Updated Aug 31, 2024 - Jupyter Notebook
Sillot T☳Converbenk Matrix (汐洛彖夲肜矩阵), dedicated to serving the intelligent new 彖乄.
Updated Sep 13, 2024 - TypeScript
Notebooks completed for the Big Data course, 8th semester, Systems Engineering, Universidad Santo Tomás.
Updated Aug 12, 2024 - Jupyter Notebook
A Python repository for storing exercise code from the Data Analysis and Big Data (Análise de Dados e Big Data) course. It also contains the course project, done in Jupyter Notebook.
Updated Jun 29, 2024 - Jupyter Notebook
Repository containing the notebook for my big data project involving EDA and Machine Learning on the NY Taxi Fare dataset.
Updated Jun 18, 2024 - Jupyter Notebook
Data science encompasses a wide range of areas, topics, and sub-domains such as Big Data, Machine & Deep learning (ETL, TensorFlow, Keras), Data Mining/Visualization (EDA), BI, Predictive Analytics, Statistical Analytics, etc.
Updated May 3, 2024
Course Content and Code Notebooks - CSE 5717 is a graduate course on Big Data Analytics at UCONN.
Updated Apr 25, 2024 - Jupyter Notebook
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Updated Mar 20, 2024 - Python
Apache Spark & Python (PySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks.
Updated Mar 16, 2024 - Jupyter Notebook
This is my final project for PSTAT 135, Big Data Analytics, using PySpark to conduct county-wide voter turnout regression analysis by demographic. This project was done in collaboration with Tyler Kim and Erasmo Rivas. The GCP storage bucket linked below contains the full project, while the Jupyter notebook and exported PDF are included here.
Updated Feb 21, 2024 - Jupyter Notebook
Build a movie recommendation data pipeline using Azure services for efficient data ingestion, transformation, and orchestration. Utilize Azure Blob Storage, Azure Databricks, and Azure Data Factory to implement collaborative filtering and PySpark ML for accurate movie recommendations.
Updated Sep 30, 2023 - Jupyter Notebook
Collection of research notebooks done by Eonian
Updated Aug 15, 2023 - Jupyter Notebook
Project developed for the Big Data Computing exam. The code is a set of Python notebooks that implement different approaches to movie shot classification and clustering.
Updated Aug 5, 2023 - Jupyter Notebook
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Updated Jul 26, 2023 - Python