Apache Spark - A unified analytics engine for large-scale data processing
Getting started with machine learning
Today, machine learning—the study of algorithms that make data-based predictions—has found a new audience and a new set of possibilities.
Apache Hadoop
A curated list of awesome computer vision resources
Assorted data from the General Services Administration.
An index of all open-source data
An unofficial repository of National Park Service data.
Data and code behind the articles and graphics at FiveThirtyEight
Cool links & research papers related to Machine Learning applied to source code (MLonCode)
ID3-based implementation of the ML Decision Tree algorithm
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
A toolkit for developing and comparing reinforcement learning algorithms.
Reinforcement learning resources curated
Principal Component Analysis on music loops
Ruby gem to calculate the similarity between texts using tf*idf
Large-scale linear classification, regression and ranking in Python
scikit-learn: machine learning in Python
An Open Source Machine Learning Framework for Everyone
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…