闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

SEES0083 - Quantitative Methods

2022

PART A: Please Answer all Exercises

Exercise 1 (4 points)

A uniform distribution has the following support -50 and +50.

(a) Draw the probability density function of the distribution

(b) What is the mass of probability up to 0 (write the computation)?

(c) What is the total mass of probability of the following three intervals [- 25;0] [7;10] [47.5;50] (write the computation of the probability masses for each interval and then their sum)?

(d) What is the first, second and third quartile of this uniform distribution?

Exercise 2 (4 points)

Consider the following probability distribution function of the random variable X assuming values from -8 to 0 and with the associated probabilities, P(x), written below:

x	-8	-7	-6	-5	-4	-3	-2	- 1	0
P(x)	0.03	0.05	0.02	0.1	0.2	0.3	0.2	0.099	0.001

(a) What is P ( -8 < X ≤ 0 )?

(b) What is P ( -7 < X < - 1)?

(d) What is P ( -4 < X < -4)?

Exercise 3 (4 points)

In a car shopping centre there are 12347 cars on display. 26% of them have air-conditioning. 57% of cars with air conditioning also have CD player, while only 46% of cars without air conditioning have a CD player. Define the random variables as follows:

X = 1 if car has air conditioning; X = 0 otherwise

Y = 1 if car has CD player; Y = 0 otherwise

(a) 2 points Find the joint probability function of X and Y and draw the complete table with joint and marginal probabilities.

(b) 2 points Find the conditional probability density function of Y, given X = 0.

Exercise 4 (12 points)

A random variable x has the following probability density function:

f(x)=x+1

f(x)=1-x

f(x)=0

for - 1<x≤0;

for 0<x<1;

otherwise.

a) (4 points) Graph the probability density function for X.

b) (4 points) Show that the density function has the properties of the probability density function. What would be the area underneath the density function? Which are the boundaries of this area? Show both a geometrical solution, a mathematical solution (integrals).

c) (4 points) Find the probability that X takes value between -0.5 and 0.5 both geometrically and mathematically using integrals .

PART B: answer all questions (total 48 points).

1. Open STATA.

2. Open the dataset EXAM.dta (located into SEES0083 Moodle page)

3. Set a seed that is composed on the following way; follow the instruction keeping the order of actions

• Your own year of birth (4-digit)

• your own month of birth (2-digit)

• your own day of birth (2-digit)

Example: if the date of birth is 1972 May the 9th In STATA command window type: set seed 19720509

• generate a random number

In STATA command window type: generate u = runiform()

• drop the observations in which the newly created variable “u” (it should

be in your list of variables) is strictly greater than or equal to 0.5 In STATA command window type: drop if u>=0.5

Compliments! You have now created a unique dataset for your exercise (in other words no other student has the same database) in which you need to perform the tasks below.

Exercise 5 (12 points)

A researcher is trying to understand whether the average sales in Spain (ES) are comparable to the average sales in Italy (IT). For this purpose, she is using the database you have just created. The variable for sales is called “r_OperRevTurnThEuro” (expressed in ‘000 of Euros) and number of employees is called “numberofemployees” . The first output analysed by the researcher is what will come out as the following command is generated (summary stats for the two variables only for Spain):

In STATA command window type: sum r_OperRevTurnThEur

numberofemployees if country_ACRONYM=="ES"

• Copy and paste the result into the exam sheet.

The second output analysed by the researcher is what will come out as the following command is generated (summary stats for the two variables only for Italy):

In STATA command window type: sum r_OperRevTurnThEur

numberofemployees if country_ACRONYM=="IT"

• Copy and paste the result into the exam sheet.

1. How would the researcher interpret the output of this tables?

a. (3 points) Can the researcher conclude that there is a systematic (statistically significant) difference between the sales and number of employees in Spain and Italy?

2. The researcher runs a more appropriate test i.e. a two-sample t-test for the comparison of the mean of sales in Spain (ES) vs. Italy (IT). The researcher sets its alpha at 0.01 (1%)

In STATA command window type : ttest r_OperRevTurnThEur,

by(country_ACRONYM)

• Copy and paste the result into the exam sheet.

• (3 points) What are the three null hypotheses and the three alternative hypotheses?

• (6 points) What does the researcher conclude? Should the researcher be concerned about the difference between the two samples?

Exercise 6 (24 points)

Now the researcher runs four types of regression analyses:

1. LEVEL of Value Added (name of variable r_addedvaluetheur, unit of measure thousands of euros) on LEVEL of fixed assets (name of variable r_fixedassetstheur, unit of measure thousand EUROS)

2. LEVEL of Value Added (name of variable r_addedvaluetheur, unit of measure thousands of euros) on Natural LOG of fixed assets (name of variable LN_r_fixedassetstheur, unit of measure natural log of thousand EUROS)

3. Natural LOG of Value Added (name of variable LN_r_addedvaluetheur, unit of measure natural log thousands of euros) on LEVEL of fixed assets (name of variable r_fixedassetstheur, unit of measure thousand EUROS)

4. Natural LOG of Value Added (name of variable LN_r_addedvaluetheur, unit of measure natural log thousands of euros) on Natural LOG of fixed assets (name of variable LN_r_fixedassetstheur, unit of measure natural log of thousand EUROS)

• Report the STATA output for all 4 models

Questions:

• (8 points) FOR EACH OF THE 4 MODELS: Interpret the estimated two coefficients (constant and X variable, i.e. independent variable)

• (8 points) FOR EACH OF THE 4 MODELS: What is the predicted level of value added for a company with 1 million euros in fixed assets?

• (8 points) FOR EACH OF THE 4 MODELS: What is the expected change in value added if there is a positive change in fixed assets of 1 million euros?

Exercise 7(16 points)

The researcher from the exercise 6 decides to run a multiple regression analysis with an additional explanatory variable on the right-hand-side, which is number of employees (into STATA called numberofemployees). The researcher could use the variable in levels or in transformed natural log (in STATA LN_numberofemployees).

• Report the STATA output for all 4 NEW models level-level

level-log;

log-level;

log-log

Questions:

a) (6 points) Look at the results of the four new models. Which model would you suggest to be the “best” and why? Explain in details the logic of your answer.

b) (10 points) Comment the four regression outputs when compared to the exercise 8. You need to highlight the additional information the extra- variable can bring and explain for each regression the interpretation of the coefficients. Would a company be better off in increasing number of employees or fixed assets?

PART C

Exercise 8 (12 points)

What is the difference between exploratory factor analysis and confirmatory factor analysis?

• (6 points) Explain the logic of your answers by drawing two graphs, one for exploratory and one for confirmatory and elaborate. For the latter also show the path diagram

• (6 points) Now consider a path diagram with Structural Equation Modelling in the book. Identify exogenous constructs, endogenous contracts and explain the difference between variable in ovals and variable in squared boxes

Exercise 9 (12 points)

Look at the following dendrogram.

1 point) What is the maximum number of groups the cluster analysis can identify?

1 point) What is the minimum number of groups the cluster analysis can identify?

10 points) Find all the number and compositions of groups the cluster analysis can identify by changing the Euclidian distance cut-off points→ . In the example below the groups are identified with the cut-off point 40: ADBCEF and G: 2