Bi-kipedia Hadoop Hadoop is een gratis, Java gebaseerd framework, voor het processen van grote volumes aan data in een gedistribueerde omgeving, typisch in de context van "Big Data". woensdag, 12 oktober 2016
Bi-kipedia Data Lake A data lake is a storage repository in which we hold raw data in its native format woensdag, 12 oktober 2016
Bi-kipedia Data Lake A data lake is an important element of the buzzwords related to Big Data and Advanced Analytics. A data lake is a storage repository in which we hold raw data in its native format; this can be structured (e.g. entire source data-tables), semi-structured, and unstructured (e.g., photos, tweets) data. woensdag, 12 oktober 2016
Bi-kipedia Lambda Architecture A Lambda architecture is a data-processing architecture combining both batch- and (real-time) stream-processing methods. woensdag, 12 oktober 2016
Bi-kipedia Lambda Architecture A Lambda architecture is a data-processing architecture combining both batch- and (real-time) stream-processing methods. The advantage of this dual architecture is the ability to handle massive amounts of data while maintaining real-time monitoring. As a result, it is generally used as a basis for big data architectures. woensdag, 12 oktober 2016
Bi-kipedia R language R is an open-source language and environment designed for statistical computing and graphics. woensdag, 12 oktober 2016
Bi-kipedia R language R, also called GNU S, is an open-source language and environment designed for statistical computing and graphics. Although it's been existing for 20 years, R remains the most popular statistical programming language used today. woensdag, 12 oktober 2016
Bi-kipedia Hadoop Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment, typically in a Big Data context. At it's core, Apache Hadoop is woensdag, 12 oktober 2016
Bi-kipedia Spark Apache Spark is an open source cluster computing framework released in 2014. It serves as an alternative to (part of) Apache Hadoop as it was originally developed in response to limitations in the Hadoop MapReduce cluster computing paradigm (read more on Hadoop). As such, Apache Spark complements Apache Hadoop as Spark replaces the use of Hadoop MapReduce but still requires (if used on scale) the other Hadoop elements: i.e., a file system (e.g. Hadoop Distributed File System) and a cluster manager (e.g. Hadoop YARN). woensdag, 12 oktober 2016
Bi-kipedia CIO - Chief Information Officer Chief Information Officer (CIO) is de job titel die over het algemeen gegeven wordt aan de person binnen de organisatie die verantwoordelijk is voor de information technologie (IT) en the computer systemen die de objectieven van de organisatie ondersteunen. woensdag, 12 oktober 2016