找到你要的答案

Development with Apache Spark
与Apache星火开发

java  apache-spark 

How to deal with code that runs before foreach block in Apache Spark?
如何处理代码运行在Apache Spark foreach块?

java  apache-spark  spark-streaming 

Spark 1.3.1 install failed in MLlib when I run make-distribution.sh in Ubuntu 14.04
火花1.3.1安装失败在MLlib当我运行在Ubuntu 14.04 make-distribution.sh

java  scala  apache-spark  apache-spark-mllib 

Can apache spark run without hadoop?
可以运行Hadoop的Apache的火花?

hadoop  amazon-s3  apache-spark  mapreduce  mesos 

how to handle the Exception in spark map() function?
如何在火花map()函数处理异常?

scala  apache-spark 

Building a spark streaming sample app
构建一个火花流示例应用程序

scala  maven  apache-spark  hbase  spark-streaming 

When I submit a Spark job through Pyspark, how can I ensure which Python is used on the workers?
当我提交一个火花工作的Pyspark,我怎么能保证它使用Python的工人?

python  apache-spark  pyspark 

How to tune Spark application with hadoop custom input format
如何调整与Hadoop应用自定义格式输入的火花

hadoop  mapreduce  apache-spark 

How to deal with code that runs before foreach block in Apache Spark?
如何处理代码运行在Apache Spark foreach块?

java  apache-spark  spark-streaming 

Task not serializable - Regex
任务不可序列化-正则表达式

regex  scala  apache-spark 

Spark 1.5.1 standalone cluster - Exception in thread “main” akka.actor.ActorNotFound: Actor not found for
火花1.5.1独立集群在线程的“主”akka.actor.actornotfound例外:演员没有找到

apache-spark 

Is there a better way for reduce operation on RDD[Array[Double]]
有没有减少对RDD [数组[双] ]操作更好的方式

scala  apache-spark  reduce  rdd 

Spark 1.5.1 standalone cluster - wrong Akka remoting config?
火花1.5.1独立集群错阿卡远程配置?

apache-spark  akka  akka-remote-actor 

Test Spark with Tachyon
与超光速粒子测试火花

scala  apache-spark  tachyon 

Task not serializable - Regex
任务不可序列化-正则表达式

regex  scala  apache-spark 

Need some help on setting up spark for cassandra on java
需要一些帮助建立java卡桑德拉火花

java  eclipse  apache-spark  spark-cassandra-connector 

Is there a better way for reduce operation on RDD[Array[Double]]
有没有减少对RDD [数组[双] ]操作更好的方式

scala  apache-spark  reduce  rdd 

Save ML model for future usage Spark >= 1.6 Spark < 1.6
保存ML模型以备将来使用 火花> 1.6 火花<1.6

apache-spark  pyspark  apache-spark-mllib  apache-spark-ml 

Spark: additional properties in a directory
火花:目录中的附加属性

apache-spark  apache-spark-sql 

Spark issue with the class generated from avro schema
从公司的架构产生火花问题类

serialization  apache-spark  avro 

Is there a better way for reduce operation on RDD[Array[Double]]
有没有减少对RDD [数组[双] ]操作更好的方式

scala  apache-spark  reduce  rdd 

How to print rdd in python in spark
如何在Python中打印RDD的火花

python  apache-spark  pyspark  apache-spark-sql 

How to print rdd in python in spark
如何在Python中打印RDD的火花

python  apache-spark  pyspark  apache-spark-sql 

Error in running livy spark server in hue
在色相李维火花运行服务器错误

apache-spark  bigdata  hue 

pyspark, importing schema through json file
pyspark,通过JSON文件导入模式

python  json  apache-spark  pyspark  pyspark-sql 

Mapreduce Vs Spark Vs Storm Vs Drill - For Small files
MapReduce与火花VS风暴VS钻小文件

hadoop  apache-spark  hive  apache-storm  apache-drill 

pyspark, importing schema through json file
pyspark,通过JSON文件导入模式

python  json  apache-spark  pyspark  pyspark-sql 

Task not serializable - Regex
任务不可序列化-正则表达式

regex  scala  apache-spark 

Apache spark , spark-submit, what is the behavior of --total-executor-cores option
Apache的火花,火花提交,行为--总执行内核选项是什么

multithreading  hadoop  apache-spark  pyspark  cpu-cores 

Spark JSON text field to RDD
火花的JSON文本字段RDD

scala  cassandra  apache-spark  rdd