COMP3350 Assignment 3 – Big Data Technologies Trimester 2, 2023
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
COMP3350
Assignment 3 - Big Data Technologies
Trimester 2, 2023 Singapore
Due Date Aug 7th 5pm
This is an INDIVIDUAL assignment.
For Assignment 3 each student will
• upload the written report to the Canvas by Aug 7th 5pm
• present your report to class on lecture or tutorial session on Week 12, and
• submit summary and evaluation of peer presentation by Aug 14th 5pm on Canvas.
Weighting
20% of course mark
Assignment Information
Section 1: NoSQL (20 marks)
In this assignment, each student will select one NoSQL data store (Key-Value, Graph Databases, Column-based, Document Store, etc.) and a product that implements the selected technology:
1. Write a research report on the NoSQL data store and product (approx. 1000 words). Your report needs to discuss the NoSQL:
• Data Model and its features
• Justification for its development (e.g. advantages vs limitations - compared to relational model, application areas, etc.)
In the NoSQL product, you have chosen, discuss the
• Storage architecture
• Scalability (horizontal vs vertical scaling)
• Query capabilities
• Concurrency control features (ACID vs BASE)
• Fault tolerance
and present a demo of basic CRUD operations on the selected product. (10 marks)
2. Present your report to the class (duration: 5-7 minutes presentation). Your presentation should include a demo of the technology presented in use. For example,a demo of CRUD operations of a NoSQL system showcasing its features. (8 marks)
3. Contribute a summary and evaluation of one peer student presentation on a different technology from your presentation. Include the following in the peer- evaluation:
a. Name of student
b. Topic of presentation
c. Summary of technology presented and
d. An evaluation of the presentation – what was presented well, what was missed, how to improve etc.(2 marks)
Sample Topic Areas and Technologies:
• Key-Value: DynamoDB, LevelDB, Riak, Oracle NoSQL, etc.
• Column-based: BigTable, HBase, Amazon SimpleDB, Hypertable, etc.
• Document Store: MongoDB, CouchDB, etc.
• Graph Databases: Neo4J, DEX, GraphBase, GraphChi, etc.
Section 2: Map-Reduce Exercise (Optional – 2 bonus marks)
In this exercise you write a MapReduce job in Java that reads any text input and computes the average length of all words that start with each character.
For any text input, the job should report the average length of words that begin with ‘a’, ‘b’, and so forth.
For example, for input:
Now is not the time
The output would be:
N 3.0
i 2.0
n 3.0
t 3.5
Submission
1. The Research Report is to be submitted via Canvas - Assignment 3 Written Report. 2. The report needs to be presented in class in Week 12 lecture or tutorial sessions.
3. Peer-evaluation of student presentation submitted via Assignment 3 – Peer Evaluation Canvas Discussion.
4. The Map Reduce exercise is to be submitted via Canvas – Assignment 3 MapReduce Exercise (zipped file containing all Java files) and demonstrated during tutorial sessions (optional).
The assessment RUBRIC is given below:
|
Excellent |
Good |
Satisfactory |
Poor |
Fail |
|
Written Report (10) |
(9-10) |
(6-8) |
(3-5) |
(1-2) |
(0) |
|
A comprehensive view and understanding of the technology area is presented detailing all aspects specified with clear detailed examples and justification. The report is clearly presented without any errors. |
The report covers most areas and presents a detailed understanding and justification of the technology area with examples incorporated |
The report has sufficient information on the topic area. Ideas and concepts presented in understandable form. Basic understanding of concepts only. Does not cover many aspects specified in the assignment |
Ideas and concepts are disconnected. Lack of flow and clarity. |
No submission |
||
Presentation (8) |
(8-6) |
(5-4) |
(2-3) |
(1) |
(0) |
|
The presentation demonstrates an in-depth understanding of the topic areas with demo, examples and justification. |
The presentation demonstrates understanding of many aspects of the technology areas and presented with examples and justification |
Able to present ideas and concepts on the |
Lack of demonstration on understanding of concepts and topic area |
No Presentation |
||
topic area in a clear form. However, lack depth and demonstration understanding the topic area. |
of of of |
|||||
Peer Evaluation (2) |
(2) |
(1) |
(0) |
|||
Summarises and critically evaluates peer presentation |
Summarises some points from the peer presentation. Demonstrates lack of understanding and does not critically evaluate presentation. |
No submission |
||||
Map Reduce Exercise (2) |
(2) |
(1) |
(0) |
|||
Demonstrating excellent understanding of Map-Reduce programming model - code working without errors |
Demonstrating good understanding of Map-Reduce programming model - code working with errors |
No submission |
2023-08-03