Big data training material
-
Updated
Jun 29, 2023 - Python
Big data training material
Telegram-bot from Big Data contest "Maraton Big Data Entel" (3rd place)
A duplicate of the streamparse quickstart wordcount example for use with Apache Storm. This version adds significant comments in an attempt to make the learning curve of Storm and streamparse simpler for those beginning to learn the framework.
TwitterSocialAnalisis - Measure your popularity on twitter
Labs of "Big-Data Frameworks II" @ Efrei Paris
A collection of useful Map Reduce programs that provide insight on HTML documents stored on an Apache Hadoop file system maintained by the University of Notre Dame.
DESCRIÇÃO Com base no repositório disponibilizado pelo expert, te desafiamos a replicar e, porque não, melhorar o algoritmo de extração/contabilização de palavras. Para isso, você pode ordenar as palavras por ocorrência e não por ordem alfabética (apresentando as mais citadas no texto com prioridade), por exemplo. Sinta-se à vontade para evoluir…
FastApi & cassandraDb project about data search engine for huge amount of users data
MNE-CAMCAN for processing the Cambridge Centre for Ageing and Neuroscience (Cam-CAN) MEG dataset using MNE-Python
The simplest index-and-search engine for huge multiline text files. Focused primarily on bioinformatics. Inspired by tabix, but isn't its replacement. Written in Python. Works on top of Zstandard Seekable & pyzstd SeekableZstdFile.
User-friendly Python DataFrames 🔵🟡 powered by Julia 🔴🟢🟣
Library for accessing and sharing datasets for audio, computer vision, and natural language processing (NLP) tasks
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."