42047 – Data Processing Using Python AUTUMN 2022
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
42047 – Data Processing Using Python
AUTUMN 2022
ASSIGNMENT - Data Analysis and Visualization
Summary
This assessment requires you to perform exploratory data analysis and data visualization on a publicly available dataset. Your task includes:
Understanding the dataset
Apply appropriate visualization techniques
Extract information from each for the attributes and other specifics of the data set
using statistical techniques .
Use summary statistics to spot problems such as missing values, outliers, data
ranges that are too wide or narrow, and units of data.
In the assignment report, provide an explanation about each of the techniques
applied and the inference as a result of the analysis .
Assignment Objectives
The purpose of this assignment is to demonstrate competence in the following skills.
Able to write python programs and use various python packages such as NumPy,
Matplotlib, and Pandas for data exploration and analysis
Able to read data from multiple sources and manipulate data for analysis and
visualization.
Apply statistical tests and data visualisation techniques to analyse data and interpret
the results .
Tasks
In this assignment, you need to design and implement a data analysis process, pre- processing the dataset, incorporating appropriate data visualization techniques, and statistical testing.
Choose a publicly available dataset, or a Kaggle competition dataset, or a custom dataset. Download, explore, and clean it as required to perform data analysis and visualization.
A number of sample datasets are available in the reference given below:
UCI Machine Learning Repository:https://archive.ics.uci.edu/ml/datasets.php https://www.dataquest.io/blog/free-datasets-for-projects/
Additional Information:
Assessment Submission
You must upload data files and the solution file to Canvas. This must be done by the Due Date. You may submit as many times as you like until the due date. The final submission you make is the one that will be marked. If you have not uploaded your zip file within 7 days of the Due Date, or it cannot be run in the lab, then your assignment will receive a zero mark.
PLEASE NOTE 1: It is your responsibility to make sure you have thoroughly tested
your program to make sure it is working correctly .
PLEASE NOTE 2: Your final submission to Canvas is the one that is marked. It does
not matter if earlier submissions were working; they will be ignored. Download your submission from Canvas and test it thoroughly.
Return of Assessed Assignment
It is expected that marks will be made available 2 weeks after the submission via
Canvas.
Queries
If you have a problem, such as an illness that will affect your assignment submission, contact the subject coordinator as soon as possible.
Dr. Nabin Sharma
Room: CB11.07.124
Phone: 9514 1835
Email:Nabin.Sharma@uts.edu.au
If you have a question about the assignment, please post it to the Canvas discussion board for this subject so that everyone can see the response.
If serious problems are discovered in assignment specification, the class will be informed via an announcement on Canvas. It is your responsibility to make sure you frequently check Canvas.
PLEASE NOTE : If the answer to your questions can be found directly in any of the following
Subject outline
Assignment specification
Canvas FAQ
Canvas discussion board
You will be directed to these locations rather than given a direct answer.
Extensions and Special Consideration
Please refer to subject outline.
Academic Standards and Late Penalties
Please refer to subject outline.
2022-04-23