找到你要的答案

Identify trending topics in Twitter
确定趋势话题在Twitter

twitter  machine-learning  apache-spark  k-means  spark-streaming 

Setting AWS credentials on Spark program using 3 methods, None of them works
设定AWS凭据星火计划采用3种方法,他们没有工作

amazon-s3  apache-spark  hive  hdfs 

error while loading , Error accessing .ivy2/cache/org.apache.spark/spark-core_2.11/jars/spark-core_2.11-1.4.0.jar
加载错误访问时出错。ivy2 /缓存/ org.apache.spark/spark-core_2.11/jars/spark-core_2.11-1.4.0.jar

scala  apache-spark  sbt 

Is there any specific sbt version required to compile the cassandra-spark-connector
是否有任何具体的SBT版本需要编译卡桑德拉火花连接器

cassandra  apache-spark  datastax-enterprise 

Neo4J - Finding the widest path on very large graphs
Neo4j的发现最宽的路径上非常大的图

algorithm  graph  neo4j  apache-spark  bigdata 

Setting AWS credentials on Spark program using 3 methods, None of them works
设定AWS凭据星火计划采用3种方法,他们没有工作

amazon-s3  apache-spark  hive  hdfs 

How to create correct data frame for classification in Spark ML
如何创建正确的数据帧在火花ML分类

scala  apache-spark  apache-spark-sql  apache-spark-mllib 

Severe straggler tasks due to Locality Level being “Any” and a Network Fetch on cached RDD
严重落伍的任务由于当地水平的“任何”和网络读取缓存的RDD

apache-spark 

dropped from memory error with graphX query
从内存中删除与该查询错误

scala  apache-spark  spark-graphx 

How to Subset a Spark Dataframe Using Another Dataframe
如何使用另一个数据帧的帧子集的火花

scala  apache-spark  apache-spark-sql 

Distributed sum of numbers
分布数和

apache-spark  distributed-computing  apache-zookeeper 

How to iterate records spark scala?
如何遍历记录火花斯卡拉?

scala  apache-spark  avro 

Error: Must specify a primary resource (JAR or Python file) - Spark submit Python app
错误:必须指定一个主要资源(JAR或Python文件)-火花提交Python应用程序

python  deployment  apache-spark  pyspark 

How to configure Apache Spark random worker ports for tight firewalls?
如何配置Apache Spark随机工人港口严密的防火墙?

configuration  apache-spark  worker  ports 

How to convert a key and list of values to a dataframe in pyspark?
如何将一个键和值的列表中pyspark一帧?

pandas  apache-spark  apache-spark-sql  pyspark  spark-dataframe 

Spark - how to flatmap sorted RDD using a stateful mapper?
火花-如何flatmap排序RDD使用状态映射?

apache-spark  rdd 

Spark: Cannot add RDD elements into a mutable HashMap inside a closure
火花:无法添加元素到里面的一个闭合的RDD的HashMap

scala  hashmap  apache-spark  rdd 

Apache Spark DataFrame no RDD partitioning
Apache Spark帧没有RDD分区

java  parallel-processing  apache-spark 

Spark groupByKey alternative
火花groupbykey替代

python  apache-spark  pyspark 

Spark GraphX: how to insert just a node to a graph
火花GraphX:如何将只是一个节点图

scala  apache-spark  spark-graphx 

JDBC connection fails to connect Teradata from apache spark
JDBC连接无法连接Teradata从Apache的火花

java  jdbc  apache-spark  apache-spark-sql 

Where is executed the Apache Spark reductionByWindow function?
在执行Apache Spark reductionbywindow功能?

apache-spark  spark-streaming  windowed 

JDBC connection fails to connect Teradata from apache spark
JDBC连接无法连接Teradata从Apache的火花

java  jdbc  apache-spark  apache-spark-sql 

Apache Spark - Feature Extraction Word2Vec example and exception
Apache的火花特征提取Word2vec例子和例外

python  scala  exception  apache-spark  word2vec 

Apache Spark - Parallel Processing of messages from Kafka - Java
Apache的火花-并行处理的消息从卡夫卡- java

java  apache-spark  apache-kafka  spark-streaming 

How to rewrite the code to avoid using SqlContext.read() in Spark 1.3.1?
如何避免使用sqlcontext重写代码。火花1.3.1 read()?

apache-spark  apache-spark-sql 

pyspark: how to free resources
pyspark:如何免费资源

hadoop  apache-spark  pyspark 

You need to build spark before running this program
运行这个程序之前需要建立火花

apache-spark 

How to rewrite the code to avoid using SqlContext.read() in Spark 1.3.1?
如何避免使用sqlcontext重写代码。火花1.3.1 read()?

apache-spark  apache-spark-sql 

How to rewrite the code to avoid using SqlContext.read() in Spark 1.3.1?
如何避免使用sqlcontext重写代码。火花1.3.1 read()?

apache-spark  apache-spark-sql