Category: BigData

Neo4j Fundamentals

December 02, 2017

category: BigData
neo4j

Neo4j is a graph database that stores connections between nodes as first-class citizens. Unlike traditional relational databases such as Oracle, a graph database infers from data connections rather than...

Preliminary Spark SQL

October 12, 2016

category: BigData
spark

The following records my test of Spark SQL in a Jupyter notebook.

Step 1 - Working with Spark Context

Invoke the Spark context: sc. The version method will return the working...

Introduction to Spark

October 05, 2016

category: BigData
spark

Apache Spark is an open-source, in-memory cluster computing framework optimized for fast, large-scale data processing. It was started at UC Berkeley's AMPLab by Matei Zaharia in...

Big Data Terminology

September 12, 2016, January 11, 2018

category: BigData
bigdata

Accumulo - A computer software project that developed a sorted, distributed key/value store based on the BigTable technology from Google. It is a system built on top of Apache Hadoop,...


Copyright 2015 - 2018 Another Peak, created by Yi Du. Powered by Jekyll and hosted on GitHub.