Date: 2023-08-27 09:06


Assignment 1

Question 1 [Total 23 Marks]

A group of researchers are interested in studying the prevalence of obesity, diabetes, and other

cardiovascular risk factors in Subang Jaya, Selangor. To gain more insight into this question,

1150 subjects were interviewed and some of the results obtained are compiled in the data file

A1 S2 2023.xls. The columns provide the following information:

Column A: the patient ID

Column B: the level of stabilised glucose

Column C: the total level of cholesterol

Column D: the level of high-density-lipoprotein (“good” cholesterol)

Column E: the weight of the patient

Column F: the gender of the patient

Column G: the type of body frame (small, medium, large)

The data are available in the “A1 S2 2023.xls” file on Moodle. You must use your subsample

of the survey data. Your sample will consist of 200 observations starting from the respondent

whose ID is the same as the last three digits of your student number. For example, if your

student number is 20275749, you would use individuals 749 to 948.
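The subsample rule above can be sketched in a few lines of Python. The helper name `subsample_ids` is hypothetical, and the sketch assumes the IDs run consecutively; the text does not say what to do when the range would run past the last respondent, so that case is not handled here.

```python
def subsample_ids(student_number, size):
    """Return the respondent IDs for a subsample.

    The start ID is the last three digits of the student number
    (an assumption: leading zeros are dropped, so "...007" starts at 7).
    """
    start = int(str(student_number)[-3:])
    return range(start, start + size)


ids = subsample_ids(20275749, 200)
print(ids[0], ids[-1], len(ids))
```

For the student number in the example, this gives the range 749 to 948, which matches the text.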

All tables, graphs and comments for this question should be placed in the designated spaces in

the Worksheet Results.

(a) Complete Table (a). Use Countif or another method to find the frequencies for the

number of male and female patients in the sample and hence complete Table (a).

[2 marks]

(b) Display the data in Table (a) using an appropriate chart to be placed in the Graph (b)

Textbox. [2 marks]

(c) Using Countif or any other appropriate method, complete Table (c) by filling in the

frequencies of male and female patients according to their type of body frame.

[2 marks]

(d) Display the data in Table (c) using an appropriate chart to be placed in the Graph (d)

Textbox. [2 marks]

(e) Complete Table (e) containing the summary statistics for the HDL (high-density-

lipoprotein or “good” cholesterol) variable according to the patient gender.

[2 marks]

(f) Complete the grouped frequency Table for the HDL (“good” cholesterol) for female and

male patients [Table (f)]. Find the frequency and hence calculate the percentage

frequency and cumulative percentage frequency for female and male patients. [2 marks]

(g) Is the level of “good” cholesterol (HDL) different for the two groups? Use figures from

Table (e) to help you explain any differences. [3 marks]

(h) Construct percentage frequency polygons for the HDL for female and male patients as one chart

as Graph (h). [3 marks]

(i) Discuss the shape of the percentage frequency polygons for the HDL levels for female

and male patients. [3 marks]

(j) List the four measures of variability from the summary statistics. For which group

(female or male patients) does HDL show more variability? You are required to use your

sample result to answer this question. [4 marks]
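The calculations behind parts (a), (f) and (j) can be illustrated outside Excel with a minimal Python sketch. The records, the class limits and the function names below are invented for illustration and are not taken from the data file. Which four measures of variability the course intends is also an assumption; the sketch uses range, sample variance, sample standard deviation and coefficient of variation.

```python
from collections import Counter
from statistics import mean, stdev, variance

# Invented illustrative records (gender, HDL); NOT taken from the data file.
records = [
    ("female", 62), ("male", 41), ("female", 55), ("male", 38),
    ("female", 71), ("male", 47), ("female", 49), ("male", 52),
]


def gender_counts(rows):
    """One-way frequencies, the analogue of Excel's COUNTIF."""
    return Counter(gender for gender, _ in rows)


def grouped_frequency(values, edges):
    """Frequency, percentage and cumulative percentage per class.

    Classes are [edges[i], edges[i+1]); the class limits are an assumption.
    """
    n = len(values)
    table, cumulative = [], 0.0
    for lower, upper in zip(edges, edges[1:]):
        freq = sum(lower <= v < upper for v in values)
        pct = 100 * freq / n
        cumulative += pct
        table.append((lower, upper, freq, pct, cumulative))
    return table


def variability(values):
    """Range, sample variance, sample std. deviation, coefficient of variation (%)."""
    sd = stdev(values)
    return {
        "range": max(values) - min(values),
        "variance": variance(values),
        "std_dev": sd,
        "cv_percent": 100 * sd / mean(values),
    }


female_hdl = [hdl for gender, hdl in records if gender == "female"]
male_hdl = [hdl for gender, hdl in records if gender == "male"]

print(gender_counts(records))
for row in grouped_frequency(female_hdl, [40, 50, 60, 70, 80]):
    print(row)
print(variability(female_hdl))
print(variability(male_hdl))
```

The coefficient of variation is included because it compares spread relative to the mean, which helps when the two groups have different average HDL levels.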

Question 2 [Total 13 Marks]

a. Based on your sample size, construct a contingency Table between the gender of the

patient and the type of body frame. [1.5 marks]

b. Which group forms the majority of patients, and what is the probability of selecting a patient from that group? [1.5 marks]

c. What is the probability that a randomly selected patient has a medium body frame?

[2 marks]

d. What is the probability that a randomly selected patient is female and has a large

frame? [2 marks]

e. Using conditional percentages and an appropriate research question, write a short report

[about 70 words] to hospital management regarding the gender of the patient and the type

of body frame. [6 marks]
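As a rough illustration of how the quantities in parts a to e relate to one another, the following Python sketch builds a contingency table from invented (gender, frame) pairs and derives a marginal probability, a joint probability and conditional (row) percentages. The data and all names are hypothetical, and treating gender as the conditioning variable is an assumption.

```python
from collections import Counter

# Invented illustrative (gender, frame) pairs; NOT the assignment data.
patients = [
    ("female", "small"), ("female", "medium"), ("female", "large"),
    ("female", "medium"), ("male", "medium"), ("male", "large"),
    ("male", "large"), ("male", "small"), ("female", "medium"),
    ("male", "medium"),
]

cells = Counter(patients)                      # joint counts (table cells)
n = len(patients)                              # grand total
row_totals = Counter(g for g, _ in patients)   # totals by gender
col_totals = Counter(f for _, f in patients)   # totals by frame

# Marginal probability: column total divided by grand total.
p_medium = col_totals["medium"] / n

# Joint probability: single cell divided by grand total.
p_female_and_large = cells[("female", "large")] / n


def conditional_pct(gender):
    """Percentage of each frame type within one gender (row percentages)."""
    return {
        frame: 100 * cells[(gender, frame)] / row_totals[gender]
        for frame in col_totals
    }


print(p_medium, p_female_and_large)
print(conditional_pct("female"))
print(conditional_pct("male"))
```

If the row percentages differ noticeably between the two genders, that points towards an association between gender and body frame; if they are close, it points towards little or none.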

Question 3 [Total 8 Marks]

The file travellers.xls on Moodle contains a worksheet of raw data. The data have been

collected from 3999 travelers as they arrive at Kuala Lumpur International Airport. The sheet

contains the country (region) they came from and the main purpose of their visit (work, study

or tourism), so there are two categorical variables to be examined: one is Region and the other

is Purpose.

You must use your subsample of the survey data. Your sample will consist of 500 observations

starting from the respondent whose ID is the same as the last three digits of your student

number. For example, if your student number is 20275249, you would use individuals 249 to

748.

Do travelers from all regions tend to visit Kuala Lumpur for study? You are required to identify

the dependent and independent variables. Using conditional percentages and an appropriate

research question, indicate whether there is an association between region and purpose of visit to

Kuala Lumpur. [8 marks]
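One way to read this question is to treat Region as the independent variable and Purpose as the dependent variable, then compare the percentage of travellers visiting for study across regions. That reading is an assumption, and the region names, data and function name in this Python sketch are invented for illustration.

```python
from collections import defaultdict

# Invented illustrative (region, purpose) pairs; NOT from travellers.xls.
trips = [
    ("Asia", "study"), ("Asia", "tourism"), ("Asia", "study"), ("Asia", "work"),
    ("Europe", "tourism"), ("Europe", "tourism"), ("Europe", "work"),
    ("Europe", "study"),
]


def purpose_pct_by_region(pairs):
    """Percentage of each purpose within each region (conditioning on region)."""
    counts = defaultdict(lambda: defaultdict(int))
    for region, purpose in pairs:
        counts[region][purpose] += 1
    table = {}
    for region, by_purpose in counts.items():
        total = sum(by_purpose.values())
        table[region] = {p: 100 * c / total for p, c in by_purpose.items()}
    return table


table = purpose_pct_by_region(trips)
for region, pcts in table.items():
    # A purpose missing from a region counts as 0%.
    print(region, round(pcts.get("study", 0.0), 1))
```

Similar study percentages across regions would suggest no association between region and purpose, while clearly different percentages would suggest one.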

Question 4 [Total 6 Marks]

General Hospital's patient account division has compiled data on the age of accounts

receivable. The data collected indicate that the age of the accounts follows a normal

distribution with a population mean of 28 days and a population standard deviation of 8 days.

a. What proportion of the accounts are between 20 and 40 days old? [2 marks]

b. What proportion of the accounts are less than 30 days old? [1 mark]

c. Above what number of days are 75% of all accounts? [3 marks]
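The three parts map onto standard normal-distribution operations: a difference of two cumulative probabilities, a single cumulative probability, and an inverse lookup. A minimal Python sketch using the standard library's `statistics.NormalDist` follows; the variable names are illustrative. Part c is read here as the 25th percentile, since 75% of accounts lie above that value.

```python
from statistics import NormalDist

# Age of accounts: normal with mean 28 days and standard deviation 8 days.
ages = NormalDist(mu=28, sigma=8)

# a. P(20 < X < 40) is the difference of two cumulative probabilities.
p_between_20_and_40 = ages.cdf(40) - ages.cdf(20)

# b. P(X < 30) is a single cumulative probability.
p_below_30 = ages.cdf(30)

# c. The age that 75% of accounts exceed is the 25th percentile.
age_with_75_pct_above = ages.inv_cdf(0.25)

print(round(p_between_20_and_40, 4))
print(round(p_below_30, 4))
print(round(age_with_75_pct_above, 2))
```

The same figures can be checked by hand with z = (x - 28) / 8 and a standard normal table.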

