Installing Spark on Ubuntu

Method 1:

Run jps to check the existing Java setup (it lists the running JVM processes; if a JDK is on the PATH it prints at least its own Jps entry)

sudo apt-get install openjdk**

sudo apt-get install scala

Choose a download mirror, then fetch the release: sudo wget <download link>

sudo tar xf spark***

cd spark**

sudo ./bin/pyspark (running it without sudo produces an error)

This still ended with an error (which stopped appearing after a reboot)
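Once the shell starts cleanly, a quick job confirms it works; inside the pyspark shell the context sc is already predefined (a minimal smoke test):

>>> sc.parallelize(range(100)).sum()  # should print 4950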

Method 2:

Extract the Java JDK (jdk-8u**): sudo tar xf *** -C /opt/

Extract Scala: sudo tar xf *** -C /opt/

Extract Spark: sudo tar xf *** -C /opt/

sudo gedit /etc/profile

Add the following:

export JAVA_HOME=/opt/jdk1.8.0_111
export JRE_HOME=${JAVA_HOME}/jre  
export CLASSPATH=.:${JAVA_HOME}/lib:${JRE_HOME}/lib  
export PATH=${JAVA_HOME}/bin:$PATH

export SCALA_HOME=/opt/scala-2.12.1
export PATH=${SCALA_HOME}/bin:$PATH

export SPARK_HOME=/opt/spark-2.1.0-bin-hadoop2.7
export PATH=${SPARK_HOME}/bin:$PATH
export PYTHONPATH=${SPARK_HOME}/python

Save and exit.

source /etc/profile
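To confirm the variables took effect, they can be inspected from a fresh Python session (a quick sanity check against the paths configured above):

import os
print(os.environ.get("JAVA_HOME"))   # expect /opt/jdk1.8.0_111
print(os.environ.get("SPARK_HOME"))  # expect /opt/spark-2.1.0-bin-hadoop2.7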

Unzip the py4j**.zip archive under spark××/python/lib/ into spark××/python/: sudo unzip -d /opt/spar*/python/ py4j**.zip
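As an alternative to unzipping, the zip itself can be put on Python's module path, which is what Spark's own bin/pyspark launcher does (a sketch; the exact py4j file name depends on the Spark release, hence the glob):

import glob
import os
import sys

spark_home = os.environ["SPARK_HOME"]
sys.path.insert(0, os.path.join(spark_home, "python"))
# the py4j zip ships in python/lib; its version suffix varies by release
for zip_path in glob.glob(os.path.join(spark_home, "python", "lib", "py4j-*.zip")):
    sys.path.insert(0, zip_path)

import pyspark  # should now import cleanly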

import pyspark now succeeds

But typing sc in the interpreter gives name 'sc' is not defined !!! This is covered in the Ubuntu Spark setup guide at http://blog.csdn.net/u010171031/article/details/51849562
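For context: sc is only created automatically by the pyspark shell itself; when pyspark is imported into a plain Python interpreter, the SparkContext has to be built by hand (a minimal sketch; the master URL and app name here are arbitrary choices):

from pyspark import SparkContext

sc = SparkContext("local[*]", "manual-sc")  # local mode; "manual-sc" is a made-up app name
print(sc.parallelize(range(100)).sum())     # 4950
sc.stop()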

Tried shutting down and rebooting; after the restart the error no longer appears