找到你要的答案

java.lang.NoSuchMethodError Jackson databind and Spark
java.lang.nosuchmethoderror Jackson databind和火花

json  scala  jackson  apache-spark 

DataFrame.save() / sqlContext.load loses “nullable” status of schema
数据框。save() / sqlcontext.load失去“图式可空”状态

apache-spark 

How to read Azure Table Storage data from Apache Spark running on HDInsight
如何从运行于Apache的火花HDInsight读表格储存数据

azure  apache-spark  windows-azure-storage  hdinsight 

How to Convert a Column of Dataframe to A List in Apache Spark?
如何将一列数据框在Apache Spark列表?

scala  apache-spark  apache-spark-sql  spark-dataframe 

pySpark Create DataFrame from RDD with Key/Value
pyspark创建从RDD与键/值数据框

apache-spark  pyspark 

pySpark Create DataFrame from RDD with Key/Value
pyspark创建从RDD与键/值数据框

apache-spark  pyspark 

When specifying local[n1,n2,n3] for spark master, what are the three parameters?
在指定的地方[ N1、N2、N3 ]火花的主人,三个参数是什么?

apache-spark 

Spark Dataframe parallel read
火花帧并行读取

amazon-s3  apache-spark 

Reading S3 files in a for loop in a Spark application
在一个火花应用环读S3文件

apache-spark 

When specifying local[n1,n2,n3] for spark master, what are the three parameters?
在指定的地方[ N1、N2、N3 ]火花的主人,三个参数是什么?

apache-spark 

Apache Spark: Master removed our application: Failed when using saveAsTextFile on large RDD
Apache的火花:主人删除我们的应用:失败的时候使用RDD saveastextfile大

apache-spark  pyspark 

How to read Azure Table Storage data from Apache Spark running on HDInsight
如何从运行于Apache的火花HDInsight读表格储存数据

azure  apache-spark  windows-azure-storage  hdinsight 

How to read a nested collection in Spark
如何读取火花中的嵌套集合

hadoop  hive  apache-spark  parquet 

how to get scala string split to match python
如何让Scala与Python字符串分割

python  scala  split  apache-spark 

spark worker running contineuosly giving errors
火花工人运行contineuosly给错误

cassandra  apache-spark 

Loading a large Hbase table into SPARK RDD takes long time
一个大的HBase表加载到火花RDD需要很长时间

hbase  apache-spark  apache-spark-sql 

How to get term-document matrix from multiple documents with Spark?
如何从多文档的火花中获取术语文档矩阵?

java  apache-spark  text-mining  apache-spark-mllib  term-document-matrix 

RDD join : After joining two different pair RDDs, the resulted RDD key value and order has changed?
RDD连接:连接两种不同的对睡眠呼吸障碍后,导致RDD关键价值和秩序发生了变化?

java  join  apache-spark  rdd 

how to get scala string split to match python
如何让Scala与Python字符串分割

python  scala  split  apache-spark 

RDD join : After joining two different pair RDDs, the resulted RDD key value and order has changed?
RDD连接:连接两种不同的对睡眠呼吸障碍后,导致RDD关键价值和秩序发生了变化?

java  join  apache-spark  rdd 

Spark Streaming on EC2: Exception in thread “main” java.lang.ExceptionInInitializerError
Spark在EC2:在线程异常“主要”java.lang.exceptionininitializererror

scala  maven  amazon-ec2  apache-spark  spark-streaming 

how to convert directstream from kafka into data frames in spark 1.3.0
如何将directstream从卡夫卡到火花1.3.0数据帧

apache-spark  hive  streaming  apache-kafka 

How to use RDD persist and cache?
如何使用RDD坚持和缓存?

java  apache-spark  bigdata  spark-streaming 

spark worker running contineuosly giving errors
火花工人运行contineuosly给错误

cassandra  apache-spark 

Apache Spark Mysql connection suitable jdbc driver not found
Apache Spark MySQL连接JDBC驱动程序没有找到合适的

mysql  jdbc  apache-spark 

SparkContext.clean java.util.zip.ZipException: invalid LOC header (bad signature)
sparkcontext.clean java.util.zip.zipexception:LOC标头无效(不签名)

apache-spark 

Spark (pyspark) having difficulty calling statistics methods on worker node
火花(pyspark)有困难,号召工人节点统计方法

python  osx  apache-spark  pyspark 

Apache Spark - Dealing with Sliding Windows on Temporal RDDs
Apache的火花处理时态RDDS滑动窗口

algorithm  scala  apache-spark 

Apache Spark fails to process a large Cassandra column family
Apache的火花不能处理大卡桑德拉列族

java  cassandra  apache-spark  apache-spark-sql  spark-cassandra-connector 

Setting spark memory allocations for extracting 125 Gb of data…ExecutorLostFailure
提取125 GB的数据…executorlostfailure设置火花内存分配

apache-spark  apache-spark-sql  hawq