Installation#
To use distributed data processing, training, and/or inference#
We recommend using conda to prepare the Python environment.
conda create -n py37 python=3.7 # "py37" is the conda environment name; you can use any name you like.
conda activate py37
pip install bigdl-orca
To install the nightly build of bigdl-orca, use
pip install --pre --upgrade bigdl-orca
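After installing, it can help to confirm that pip registered the package in the active conda environment. The following is a minimal sketch and not part of BigDL: the helper name `installed_version` is hypothetical, and it only reads package metadata, so it does not import `bigdl` or start Spark. It assumes the distribution name is `bigdl-orca`, as used in the pip commands above.

```python
def installed_version(dist_name):
    """Return the installed version of a pip distribution, or None if absent."""
    try:
        # Python 3.8+: standard-library metadata API.
        from importlib.metadata import PackageNotFoundError, version
    except ImportError:
        # Python 3.7 (as in the conda example above): fall back to setuptools.
        import pkg_resources
        try:
            return pkg_resources.get_distribution(dist_name).version
        except pkg_resources.DistributionNotFound:
            return None
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("bigdl-orca")
    print("bigdl-orca:", found if found else "not installed")
```

Run it with the same interpreter that pip installed into (for example, inside the activated conda environment); a result of "not installed" usually means a different environment is active.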
To use RayOnSpark#
RayOnSpark requires some additional dependencies. Use the extra key [ray] to install them.
pip install bigdl-orca[ray]
Or, to install the nightly build, use
pip install --pre --upgrade bigdl-orca[ray]
To use Orca AutoML#
Orca AutoML support requires some additional dependencies. Use the extra key [automl] to install them.
pip install bigdl-orca[automl]
Note that with the extra key [automl], pip automatically installs the additional dependencies for distributed hyper-parameter tuning,
including ray[tune]==1.9.2, scikit-learn, tensorboard, and xgboost.
To use the PyTorch Estimator, you need to install PyTorch with pip install torch==1.8.1.
To use the TensorFlow/Keras AutoEstimator, you need to install TensorFlow with pip install tensorflow==1.15.0.
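Because this section pins specific versions (ray 1.9.2, torch 1.8.1, tensorflow 1.15.0), a quick check of what is installed can catch a mismatch early. The sketch below is illustrative only: `EXPECTED`, `lookup_version`, and `check_pins` are hypothetical names, not BigDL APIs, and the comparison assumes that a local build suffix such as `+cu111` on a torch wheel can be ignored.

```python
# Versions pinned in the text above, keyed by pip distribution name.
EXPECTED = {"ray": "1.9.2", "torch": "1.8.1", "tensorflow": "1.15.0"}


def lookup_version(name):
    """Return the installed version string for a distribution, or None."""
    try:
        from importlib.metadata import PackageNotFoundError, version  # Python 3.8+
    except ImportError:
        import pkg_resources  # fallback for Python 3.7
        try:
            return pkg_resources.get_distribution(name).version
        except pkg_resources.DistributionNotFound:
            return None
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def check_pins(expected, lookup=lookup_version):
    """Map each distribution name to 'ok', 'missing', or 'mismatch (found X)'."""
    report = {}
    for name, wanted in expected.items():
        found = lookup(name)
        if found is None:
            report[name] = "missing"
            continue
        # Drop a local build suffix (e.g. "1.8.1+cu111") before comparing.
        base = found.split("+")[0]
        if base == wanted:
            report[name] = "ok"
        else:
            report[name] = "mismatch (found {})".format(found)
    return report


if __name__ == "__main__":
    for name, status in check_pins(EXPECTED).items():
        print("{}=={}: {}".format(name, EXPECTED[name], status))
```

A "missing" entry is not necessarily a problem: torch is only needed for the PyTorch Estimator and tensorflow only for the TensorFlow/Keras AutoEstimator, so install whichever one your workload uses.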