Internship Redefined: Why it was needed and how we did it?

  Many a times, the term internship is ill-defined to both the company and the candidate. Company perceives an intern as a trainee who works without or on less pay, in order to gain some extremely confined work experience. On the other hand, candidate takes an internship to be an opportunity where he is...

Friend Follower Analysis using Apache Spark GraphX’s PageRank algorithm

GraphX is Apache Spark’s API for graphs and graph-parallel computation. This includes transformation, exploration, and graph computation. Data can be viewed both as graph & collections. This use case discusses friend follower analysis using Apache Spark GraphX’s PageRank operator. PageRank measures the importance of each vertex in a graph, by determining which vertexes have the...

Apache Spark: Where is it going?

  Databricks ran Apache Spark Survey 2016 this summer to identify how organizations are using Apache Spark. The survey results suggest that Spark’s growth continues across various industries, building sophisticated data solutions by people in various functional roles. Databricks 2016 survey results reflect answers from 900 distinct organizations and 1615 respondents, who were predominantly...