Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

COMP3350

Assignment 3 - Big Data Technologies

Trimester 2, 2023 Singapore

Due Date Aug 7th 5pm

This is an INDIVIDUAL assignment.

For Assignment 3 each student will

•    upload the written report to the Canvas by Aug 7th 5pm

•    present your report to class on lecture or tutorial session on Week 12, and

•   submit summary and evaluation of peer presentation by Aug 14th  5pm on Canvas.

Weighting

20% of course mark

Assignment Information

Section 1: NoSQL (20 marks)

In this assignment, each student will select one NoSQL data store (Key-Value, Graph Databases, Column-based, Document Store, etc.) and a product that implements the selected technology:

1.  Write  a  research  report  on  the  NoSQL  data  store  and  product  (approx.  1000 words). Your report needs to discuss the NoSQL:

•    Data Model and its features

•   Justification for its development (e.g. advantages vs limitations - compared to relational model, application areas, etc.)

In the NoSQL product, you have chosen, discuss the

•   Storage architecture

•   Scalability (horizontal vs vertical scaling)

•   Query capabilities

•   Concurrency control features (ACID vs BASE)

•    Fault tolerance

and present a demo of basic CRUD operations on the selected product. (10 marks)

2.  Present  your  report  to  the  class  (duration:  5-7   minutes  presentation).  Your presentation  should  include  a  demo  of  the  technology  presented  in  use.  For example,a demo of CRUD operations of a NoSQL system showcasing its features. (8 marks)

3.  Contribute  a  summary  and  evaluation  of one  peer  student  presentation  on  a different technology from your presentation.  Include the following  in the  peer- evaluation:

a.  Name of student

b.  Topic of presentation

c.  Summary of technology presented and

d.  An  evaluation of the presentation – what was presented well, what was missed, how to improve etc.(2 marks)

Sample Topic Areas and Technologies:

•    Key-Value: DynamoDB, LevelDB, Riak, Oracle NoSQL, etc.

•   Column-based: BigTable, HBase, Amazon SimpleDB, Hypertable, etc.

•    Document Store: MongoDB, CouchDB, etc.

•   Graph Databases: Neo4J, DEX, GraphBase, GraphChi, etc.

Section 2: Map-Reduce Exercise (Optional – 2 bonus marks)

In this exercise you write a MapReduce job in Java that reads any text input and computes the average length of all words that start with each character.

For any text input, the job should report the average length of words that begin with ‘a’, ‘b’, and so forth.

For example, for input:

Now is not the time

The output would be:

N            3.0

i            2.0

n            3.0

t            3.5

Submission

1.  The Research Report is to be submitted via Canvas - Assignment 3 Written Report. 2.  The report needs to be presented in class in Week 12 lecture or tutorial sessions.

3.  Peer-evaluation  of  student  presentation  submitted  via  Assignment  3  –  Peer Evaluation Canvas Discussion.

4.  The  Map  Reduce  exercise  is  to  be  submitted  via  Canvas  –  Assignment  3 MapReduce  Exercise  (zipped  file  containing  all  Java  files)  and  demonstrated during tutorial sessions (optional).

The assessment RUBRIC is given below:

 

Excellent

Good

Satisfactory

Poor

Fail

 

 

 

 

 

Written Report

(10)

(9-10)

(6-8)

(3-5)

(1-2)

(0)

A comprehensive

view and

understanding of the technology

area is presented detailing all

aspects specified

with clear detailed examples and

justification. The report is clearly

presented without any errors.

 

The report covers most areas and

presents a

detailed

understanding

and justification    of the technology area with

examples

incorporated

The report has

sufficient

information on the

topic area. Ideas and concepts

presented in

understandable form. Basic

understanding of concepts only.

Does not cover

many aspects

specified in the

assignment

 

 

 

Ideas and

concepts are

disconnected.

Lack of flow and clarity.

 

 

 

 

No submission

 

 

 

Presentation

(8)

(8-6)

(5-4)

(2-3)

(1)

(0)

The presentation demonstrates an in-depth

understanding of the topic areas

with demo,

examples and

justification.

The presentation demonstrates

understanding of many aspects of  the technology

areas and

presented with

examples and

justification

Able to present

ideas and

concepts on the

 

Lack of

demonstration on understanding of  concepts and

topic area

 

 

 

No Presentation

topic area in a clear form.

However, lack depth and

demonstration understanding the topic area.

 

of

of

of

 

 

 

Peer Evaluation

(2)

(2)

(1)

(0)

 

 

Summarises and critically evaluates peer presentation

 

Summarises some points from the

peer presentation. Demonstrates lack

of understanding and does not

critically evaluate presentation.

 

 

No submission

 

 

Map Reduce

Exercise

(2)

(2)

(1)

(0)

 

Demonstrating excellent understanding of Map-Reduce programming model -   code working without errors

 

 

Demonstrating good understanding of Map-Reduce programming model -

code working with errors

 

 

No submission