Usage¶
Setup¶
mlf-core based mlflow projects require either Conda or Docker to be installed. The usage of Docker is highly preferred, since it ensures that system-intelligence can fetch all required and accessible hardware. This cannot be guaranteed for Mac let alone Windows environments.
Conda¶
There is no further setup required besides having Conda installed and CUDA configured for GPU support. mlflow will create a new environment for every run.
Docker¶
If you use Docker you should not need to build the Docker container manually, since it should be available on Github Packages or another registry. However, if you want to build it manually for e.g. development purposes, ensure that the names matches the defined name in the ``MLproject``file. This is sufficient to train on the CPU. If you want to train using the GPU you need to have the NVIDIA Container Toolkit installed.
Training¶
Training on the CPU¶
Set your desired environment in the MLproject file. Start training using mlflow run ..
You need to disable CUDA to train on the CPU! See parameters.
Training using GPUs¶
Conda environments will automatically use the GPU if available.
Docker requires the accessible GPUs to be passed as runtime parameters. To train using all gpus run mlflow run . -A gpus=all.
You can replace all with specific GPU ids (e.g. 0) if desired.
Parameters¶
training-dataPath to the training data csv file['train.csv': string]test-dataPath to the test data csv file['test.csv': string]cudaWhether to train with CUDA support (=GPU)['True': string]max_epochsNumber of epochs to train[1000: int]general-seedPython, Random, Numpy seed[0: int]xgboost-seedXGBoost specific seed[0: int]single-precision-histogramWhether to enable single precision for histogram building['True': string]