OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
Updated Jul 8, 2024 - C++
Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also supports running data-cleaning scenarios with these algorithms. Desbordante has a console version and an easy-to-use web application.
Clink is a library that provides APIs and infrastructure to facilitate the development of parallelizable feature engineering operators that can be used in both C++ and Java runtimes.
EBIC - AI-based parallel biclustering algorithm
fast and comprehensive k-mer counting package
Generates a balanced uint64 hash for a string. Widely used to generate feature IDs in machine learning.
C++ implementation of oral cancer detection on CT images
FEATure HashER
Contains the code for Extended Histogram of Gradients for object recognition, developed by me during my PhD studies.
minimal workflow engine for data processing (POC)
An app to design your next state-of-the-art machine learning pipeline in a single place.