Installation
A step-by-step guide to setting up Dataverse.
Dataverse can be installed using pip:
pip install dataverse
Prerequisites
1. Install JDK
1-1. Install JDK
sudo apt-get update
sudo apt-get install openjdk-11-jdk
1-2. Set Java environment variable
echo "export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64" >> ~/.bashrc
source ~/.bashrc
2. Install PySpark
2-1. Install PySpark
pip install pyspark
2-2. Set PySpark environment variables
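The SPARK_HOME command in this step builds its value from the output of `pip show pyspark`. As a rough illustration of how that pipeline works, the sketch below runs the same `grep` and `awk` stages on sample text. The sample output, its path, and the variable names `sample_output` and `spark_home` are made up for the example, not taken from a real installation.

```shell
# Sample text shaped like `pip show pyspark` output (values are illustrative only).
sample_output='Name: pyspark
Version: 3.5.0
Location: /usr/lib/python3/site-packages'

# grep keeps the "Location:" line; awk prints its second whitespace-separated
# field (the directory) with "/pyspark" appended.
spark_home=$(printf '%s\n' "$sample_output" | grep Location | awk '{print $2 "/pyspark"}')
echo "$spark_home"
```

Because awk splits on whitespace, this approach assumes the install location contains no spaces.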
echo "export SPARK_HOME=$(pip show pyspark | grep Location | awk '{print $2 "/pyspark"}')" >> ~/.bashrc
echo "export PYSPARK_PYTHON=python3" >> ~/.bashrc
source ~/.bashrc
1. Install JDK (Java Development Kit)
2. Install Apache Spark
Method A. Manual Installation
tar -zxvf {YOUR-DOWNLOADED-SPARK-FILE}
Method B. Via Homebrew
3. Set Environment Variables
3-1. Set JAVA_HOME
cd {YOUR-JAVA-DIRECTORY}
vi ~/.bash_profile
export JAVA_HOME={YOUR-JAVA-DIRECTORY}
export PATH=$PATH:$JAVA_HOME/bin
3-2. Set SPARK_HOME and PYSPARK_PYTHON
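The environment-variable steps in this guide work by adding export lines to a shell profile, and repeating such a step adds the same line again. A minimal sketch of one way to avoid duplicates, assuming a POSIX-compatible shell: `add_export` and `demo_profile` are hypothetical names introduced here, not part of Dataverse or Spark, and the demonstration writes to a temporary file rather than a real profile.

```shell
# Hypothetical helper (not part of Dataverse): add "export NAME=VALUE" to a
# profile file unless that exact line is already present.
add_export() {
  profile="$1"
  line="export $2=$3"
  touch "$profile"
  # -q: quiet, -x: match the whole line, -F: treat the pattern as a fixed string
  grep -qxF -- "$line" "$profile" || printf '%s\n' "$line" >> "$profile"
}

# Demonstration against a throwaway file instead of the real profile.
demo_profile="$(mktemp)"
add_export "$demo_profile" PYSPARK_PYTHON python3
add_export "$demo_profile" PYSPARK_PYTHON python3   # second call adds nothing
cat "$demo_profile"
```

Passing ~/.bash_profile or ~/.bashrc as the first argument would apply the same approach to a real profile. The file still needs to be sourced, or a new shell opened, before the variable takes effect.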