B REAKOUT S ESSION - Deep Analytics Pipeline -

BREAKOUT SESSION- Deep Analytics Pipeline - 3rd Workshop on Big Data Benchmarking July 16-17 Xi‘an, China

DEEP ANALYTICAL PIPELINE– FURTHER DEVELOPMENT (1) • Pipeline should not be enlarged to more domains, since query types are similar • Loading aspect of data: • Differentiation between staging servers and analytic server • Raw data has to be there, loading time should be restricted to the “batch”-processing • Sanity checks should be included to check for bottlenecks, e.g., client is not able to produce the amount of data • Pre-configuration of some parameter

DEEP ANALYTICAL PIPELINE– FURTHER DEVELOPMENT (2) • Multi-tendency is excluded from the pipeline • Stream-based queries are excluded, but stream-based loading shall be desired to ensure velocity • Reference implementations need data specifications to hook in between the stages

DEEP ANALYTICAL PIPELINE– OPEN ISSUES • How to deal with different metrics between the stages? • Different kinds of inputs for data generation?

B REAKOUT S ESSION - Deep Analytics Pipeline -

B REAKOUT S ESSION - Deep Analytics Pipeline -

Presentation Transcript

Differentiated Instruction W ork S ession

South Kingstown CIRCLE S trategic P lanning S ession 1

IB2 F inal S ession

In today’s s ession w e w ill :

Goals of b reakout discussion

Public School-Operated UPK I nformation S ession

B REAKOUT S ESSION - B IG B ENCH -

B B S

S ession 3

This s ession will start soon ….

In today’s s ession we will :

Report S ession 6.1.2.

b  s 

Deep Analytics Pipeline A Benchmark Proposal

A Deep Dive into Nagios Analytics

R eading S haring S ession

s b

Review of L ast S ession

INFO s ession Be lgrade , 12 th April 201 8 .