Cassandra at General Sentiment
Architecture (diagram; components shown)
• NLP
• Internet
• Spidering
• Hadoop
• UI
• Cassandra
• RESTful API
Cassandra at General Sentiment
• Schema
• Batch insertions from Hadoop
• Hosted on EC2
• Monitoring
Schema
• Row key is the entity name
• Column Family
  • Sentiment and volume counts
  • Co-reference counts
  • Entity name inverted index
• Column name is a date
• Column value is a serialized data structure
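
The layout on this slide can be sketched in plain Python. This is only an in-memory illustration, not the actual General Sentiment code: the slides do not name a client library or a serialization format, so the nested dict, the JSON encoding, the ISO date strings, and the names `put_daily_counts` and `get_range` are all assumptions made for the example.

```python
import json
from collections import defaultdict
from datetime import date

# Hypothetical stand-in for one column family:
# row key (entity name) -> {column name (ISO date) -> serialized value}
sentiment_cf = defaultdict(dict)


def put_daily_counts(cf, entity, day, counts):
    """Store one column: the name is the date, the value is a serialized structure."""
    # JSON is an assumption; the slides only say "serialized data structure".
    cf[entity][day.isoformat()] = json.dumps(counts, sort_keys=True)


def get_range(cf, entity, start, end):
    """Return (date, counts) pairs for one entity, ordered by column name."""
    row = cf.get(entity, {})  # .get() avoids creating an empty row on a miss
    lo, hi = start.isoformat(), end.isoformat()
    # ISO dates sort lexicographically in chronological order,
    # which mimics a column slice over date-named columns.
    return [(name, json.loads(row[name])) for name in sorted(row) if lo <= name <= hi]


put_daily_counts(sentiment_cf, "Acme", date(2011, 1, 1), {"volume": 10, "sentiment": 0.2})
january = get_range(sentiment_cf, "Acme", date(2011, 1, 1), date(2011, 1, 31))
```

Using the date as the column name means a time range for one entity is a single row slice, which is presumably why the schema is arranged this way.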
Batch Insertions
• From Hadoop
• No compaction during insertions
• No hinted handoffs
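
As a rough sketch of what batching on the Hadoop side might look like, the snippet below groups output records into fixed-size chunks and reshapes each chunk into a row-keyed mutation map. The record shape `(entity, day, value)` and the helpers `chunks` and `to_mutations` are hypothetical; the slides do not say which client or batch API was used, so the step that sends each batch to the cluster is deliberately left out.

```python
from itertools import islice


def chunks(records, size):
    """Yield successive lists of at most `size` records from any iterable."""
    it = iter(records)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def to_mutations(chunk):
    """Group (entity, day, value) records as {row key: {column name: value}}."""
    mutations = {}
    for entity, day, value in chunk:
        mutations.setdefault(entity, {})[day] = value
    return mutations


# Hypothetical reducer output: entity name, ISO date, serialized value.
records = [
    ("Acme", "2011-01-01", "a"),
    ("Acme", "2011-01-02", "b"),
    ("Globex", "2011-01-01", "c"),
]
batches = [to_mutations(chunk) for chunk in chunks(records, 2)]
```

Grouping columns by row key before writing means each batch touches every row once, whatever client ends up performing the write.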
Cassandra on EC2
• Instance types
  • m1.large vs m1.xlarge
• Instance disks vs EBS
• RAID-0 + xfs
• Scribe for logging
Monitoring
• Monit
• Ganglia