FindingData
DashBoard - DashB.ai π π
DashBoard - DashB.ai π π
A No code machine learning platform
π Overview
- This is a web app that automates the data preprocessing pipeline.Target is to automate the whole machine learning pipeline.But this project is final till data preprocessing pipeline.
- Currently this project is in developement phase.
- User can upload comma seperated value files or directly fetch the data from mysql database.(Make sure mysql is installed in your system).
- User's have all the command what to perform and what to not so selected operations can be passed to the pipeline to showcase the result.
- User's can visualize the data using dataviz tool comes along with Dash.ai which can visualize the data without writing any code. (Made by Dash by plotly)

β Built With
scikit learn
plotly
Dash
bootstrap
π’ Getting Started
To get a local copy up and running follow these simple steps. make sure git is installed in yout machine.
Installation
- Clone the repo
git clone https://github.com/IMsumitkumar/No-code-ML-platform-DashB.ai- create a virtual env and activate
conda create -n <env_name> python=3.7conda activate <env_name>- Install dependencies
inside-your-local-repository
pip install -r requirements.txtRUN
STEP 1 : Migrate the databse tables and create superuser
python manage.py makemigrationspython manage.py migratepython manage.py createsuperuser username : ***** email : ***** password : ******STEP 2
python manage.py runserverSTEP 3 : OPTIONAL
For email recovery you have to set our credentials in DashB -> settings.py
Set your email and password in DashB/settings.pyPreprocessing Pipeline Tree
βββ Handle DatatypesβΒ Β βββ Drop unnecessary features.βΒ Β βββ replace inf with NaN.βΒ Β βββ Make sure all the column names are of string type and clean them.βΒ Β βββ Remove the column if target column has NaN.β βββ Remove Duplicate columnsβ βββ handle numerical, catergorical and time features.βΒ Β βββ Try to determine Ml usecase and encode.βββ Handle Missing ValuesβΒ Β βββββββ Numerical FeaturesβΒ Β βββ Replace with mean.βΒ Β βββ Replace with median.βΒ Β βββ Repalce with Mode.β βββ Replace with standard deviation.β βββ Replace with zero.βΒ Β βββββββ Categorical FeaturesβΒ Β βββ Replace with mean.βΒ Β βββ Replace with "Missing".β βββ Repalce with Most frequent value.βββ Removing zero and near zero variance columnsβΒ Β βββ Eliminate the features that have zero varinace,βΒ Β βββ Eliminate the features that have near zero variace.βββ Group Similiar FeaturesβΒ Β βββ Group more than two features Make new features with them.βββ Normalization and TransformationβΒ Β βββββββ Operations to apply only on numerical featuresβΒ Β βββ ZScoreβΒ Β βββ MinMaxβΒ Β βββ Quantileβ βββ MaxAbsβΒ Β βββ Yeo-JohnsonβΒ Β βββββββ Target t7ransformation (regression)βΒ Β βββ Box-CoxβΒ Β βββ Yeo-Johnsonβββ Making Time FeaturesβΒ Β βββ Take a time feature and extract more features from itβΒ Β βββ (Day, Month, Year, Hour, Minute, Second, Quantile, Quarter, Day of week, week day name, day of year, week of year )βββ Feature EncodingβΒ Β βββββββ Ordinal EncodingβΒ Β βββ LabelEncodingβ βββ Target Guided ordinal encodingβΒ Β βββββββ One hot encodingβΒ Β βββ KDD orangeβΒ Β βββ Mean Encodingβ βββ Counter/frequency encodingβββ Removing OutliersβΒ Β βββ Isolaton ForestβΒ Β βββ KNNβΒ Β βββ PCAβΒ Β βββ Elliptical envelopeβββ Feature SelectionβΒ Β βββ Chi squared (Not working perfectly)βΒ Β βββ RFE (Not working on all the data)βΒ Β βββ Lasso (works perfectly)βΒ Β βββ Random ForestβΒ Β βββ lgbm (works perfectly)βΒ Β βββ Remove zero variance featuresβββ Imbalance Dataset (Not done yet)βΒ Β βββ Ensemble techniques automatically handles imblance datasetβΒ Β βββ Undersampling (Not a good idea)βΒ Β βββ Oversampling βΒ Β βββ SMOTEβΒ Β βββ Isolation ForestβββNExt StepDirectory Tree
βββ accounts βΒ Β ββββββββββββ # handles login, signup and password recovery. βββ DashBβΒ Β ββββββββββββ # main folder contains wsgi, routing, settings and urls.βββ dataβΒ Β ββββββββββββ # main folder for performing pipeline.βββ VizβΒ Β ββββββββββββ # project app for data visualizatio tool.βββ staticβΒ Β ββββββββββββ # contains static files.βββ mediaβΒ Β ββββββββββββ # storage folder of uploaded media.βββ templatesβΒ Β ββββββββββββ # contains landing page templatesβββ manage.pyβββ requirements.txtβββ LICENSEβββ README.mdβββ db.sqlite3Contributing
Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the Project
https://github.com/IMsumitkumar/No-code-ML-platform-DashB.ai/tree/main/DashB - Create your Feature Branch
git checkout -b feature/AmazingFeature - Commit your Changes
git commit -m 'Add some AmazingFeature' - Push to the Branch
git push origin feature/AmazingFeature - Open a Pull Request
Edit this page on GitHub