Our Services

Get 15% Discount on your First Order

[rank_math_breadcrumb]

In this assignment, we will run a correlation and bivariate regression analysis to explore the relationship between median_aqi and child_mortality (the Deaths among children under age 18 per 100,000 p

In this assignment, we will run a correlation and bivariate regression analysis to explore the relationship between median_aqi and child_mortality (the Deaths among children under age 18 per 100,000 population). We will use the R functions ggplot to create a scatterplot, cor to generate a correlation, and lm to run a bivariate regression.

Assignment 6 Instructions

  1. Read Chapters 9 & 11
  2. To test the relationship between median_aqi and child_mortality (the Deaths among children under age 18 per 100,000 population) you will need to extract the appropriate data from the textbook database using MySQL Workbench, prepare the data using Excel, then load the data into RStudio, run the appropriate analyses, and describe the results.
  3. It is expected that for this assignment you will be able to use MySQL Workbench, Excel, and RStudio more on your own but there are some additional instructions see the attachment below.
  4. For this assignment, you will submit a Word document containing a scatterplot, correlation, and regression output.
  5. You will also need to interpret the findings and answer the following questions:

a. What is the hypothesis for this analysis?

b. Did you find a relationship between median_aqi and child_mortality?

c. What conclusions might you draw from these findings?

In this assignment, we will run a correlation and bivariate regression analysis to explore the relationship between median_aqi and child_mortality (the Deaths among children under age 18 per 100,000 population). We will use the R functions ggplot to create a scatterplot, cor to generate a correlation, and lm to run a bivariate regression.

In this assignment, we will run a correlation and bivariate regression analysis to explore the relationship between median_aqi and child_mortality (the Deaths among children under age 18 per 100,000 population). We will use the R functions ggplot to create a scatterplot, cor to generate a correlation, and lm to run a bivariate regression.

Is there a relationship between the rate of air pollution particulates measured in US counties and population health—to include diabetes, low birth weight, frequent mental distress, poor or fair health, and life expectancy?

The dependent variables for the proposed study include the following:

1. Diabetes prevalence, low birth weight, frequent mental distress, poor or fair health, and life expectancy

2. In the case of the proposed research question, there is one independent variable, hazardous air pollutants. Pollutants are measured by the air quality index (AQI), which provides the number of good days, moderate days, unhealthy days, very unhealthy days, and hazardous days based on the air pollutants in the air for that specific county. The AQI is considered a measure of air quality by the EPA. For the remainder of this chapter, the independent variable will be referred to as air pollution particulate matter or PM.

Hypotheses

The research questions will require more than one hypothesis. The hypotheses will include the following:

H10: There is not a relationship between the PM and the prevalence rate of diabetes in US counties.

H1A: There is a relationship between the PM and the prevalence rate of diabetes in US counties.

H20: There is not a relationship between the PM and the rate of infants born with low birth weight in US counties.

H2A: There is a relationship between the PM and the rate of infants born with low birth weight in US counties.

H30: There is not a relationship between the PM and the rate of frequent mental distress in US counties.

H3A: There is a relationship between the PM and the rate of frequent mental distress in US counties.

H40: There is not a relationship between the PM and the rate of poor or fair health in US counties.

H4A: There is a relationship between the PM and the rate of poor or fair health in US counties.

H50: There is not a relationship between the PM and life expectancy in US counties.

H5A: There is a relationship between the PM and life expectancy in US counties.

To answer our research questions, we require a specific process of data acquisition, preparation, and discovery. The following steps will be explained in detail:

1. Extract the data sets from MySQL.

a. The corrplot package was added for use in this chapter:

library(corrplot)

## corrplot 0.84 loaded

2. Prepare the data.

a. In Microsoft Excel, we did have to find and replace all NULL values with blanks once the data set was downloaded from MySQL.

3. Import the data into RStudio.

4. Perform descriptive statistics on the variables of PM, diabetes, frequent mental distress, poor or fair health, and life expectancy.

5. Conduct similar linear regressions to examine the relationship between PM and measures of population health at the US county level.

The Analysis

This section provides a step-by-step description of obtaining and analyzing the data required to answer the proposed research question.

Step 1: Extract the Data

In order to carry out the analysis, we need the following data:

1. A list of all US counties with measures of levels of the AQI

2. A measure of the prevalence of diabetes cases for each county

3. A measure of the prevalence of births that are considered to be low birth weight for each county

4. A measure of the prevalence of frequent mental distress for each county

5. A measure of the prevalence of poor or fair health for each county

6. A measure of the age of life expectancy for each county

One data set is used to obtain the needed data to answer the proposed research questions. The MySQL query uses data from the air_pollutants, geo_fips_region, and chr_health_outcomes tables of data.

The MySQL script that was used to extract the data from the database included the following:

SELECT

f.state_name,

f.area_name,

f.region,

f.subregion,

a.days_with_aqi,

a.good_days,

a.moderate_days,

a.unhealthy_for_sensitive_groups_days,

a.unhealthy_days,

a.very_unhealthy_days,

a.hazardous_days,

a.median_aqi,

c.child_mortality,

c.diabetes_prevalence,

c.infant_mortality,

c.frequent_mental_distress,

c.poor_or_fair_health,

c.life_expectancy,

c.low_birthweight,

c.premature_age_adjusted_mortality,

c.premature_death

FROM

air_pollutants AS a

JOIN

geo_fips_region AS f ON a.state_county_fips = f.state_county_fips

JOIN

chr_health_outcomes AS c ON c.fips_code = f.state_county_fips;

The data set can be exported from MySQL by selecting the Export button and naming the file aq.csv. This data set includes days_with_aqi, good_days, moderate_days, unhealthy_for_sensitive_groups_days, unhealthy_days, very_unhealthy_days, hazardous_days, median_aqi, child_mortality, diabetes_prevalence, infant_mortality, frequent_mental_distress, poor_or_fair_health, life_expectancy, low_birthweight, premature_age_adjusted_mortality, and premature_death.

Step 2: Prepare the Data

One of the first steps in data preparation is to carefully examine the data for erroneous values or blanks. An erroneous value may be a value that was used as a filler. A filler is often used as a default value to indicate a blank field. For example, the value –1111.1 may be used to represent a blank field entry. Also, a value of NULL is common after exporting data from Microsoft Excel. One way to approach erroneous values is to replace the values with a blank entry. Filler values must be removed from your data set because the filler data often hold a numeric value, which can skew the results. If the filler values are formatted as text, the column may not support a quantitative analysis such as the calculation of a mean. If numeric fillers are included in the data set, this can skew the results when calculating the mean or alter the results when performing inferential statistical analyses.

The data set that was queried from MySQL and exported as aq.csv contains NULL values that need to be removed. Open the file in Microsoft Excel by double-clicking the file. Under the Home tab, select the magnifying glass and select Replace. In the window that appears, enter “NULL” into the “Find what:” text field and leave the “Replace with:” field blank. After you select Replace All, any NULL value will be replaced with a blank entry. See figure 9 .1 as an example of how to use the Find and Replace functionality in Microsoft Excel.

Share This Post

Email
WhatsApp
Facebook
Twitter
LinkedIn
Pinterest
Reddit

Order a Similar Paper and get 15% Discount on your First Order

Related Questions

Assignment 5 Overview This assignment allows us to run an analysis from beginning to end. We will extract data from the textbook database using MySQL Workbench, import the data into RStudio, and creat

Assignment 5 Overview This assignment allows us to run an analysis from beginning to end. We will extract data from the textbook database using MySQL Workbench, import the data into RStudio, and create a scatterplot and Pearson correlation. Assignment 5 Instructions In Chapter 7, we worked with different charge and

Looking for someone to do my assignment below Read the required material for this week. Write a short paper discussing the different types of contingencies and contract planning. For this paper, you

Looking for someone to do my assignment below  Read the required material for this week. Write a short paper discussing the different types of contingencies and contract planning. For this paper, you will need to find an article in the library that relates to this week’s learning objective or topics discussed

Course Project: Part 2 (170 Points, Due Week 5)Outline Section D: Target Market and SegmentationSection E: Value Proposition and Competitive AnalysisSection F: Marketing Promotion and Pricing Strategy

Course Project: Part 2 (170 Points, Due Week 5)Outline Section D: Target Market and Segmentation Section E: Value Proposition and Competitive Analysis Section F: Marketing Promotion and Pricing Strategy References Details Section D: Target Market and Segmentation: (50 points) Please review the Week 4 lesson for information related to Sections

Looking for someone to do my assignment below : topic: investigate the administration of contracts in Joint Contingency Contracting (JCC) operations. Prepare an abstract and annotated bibliography i

Looking for someone to do my assignment below : topic: investigate the administration of contracts in Joint Contingency Contracting (JCC) operations. Prepare an abstract and annotated bibliography in support of your DEFM421 research topic paper. This will build upon the optional outline you submitted in weeks 3 and 4. The

Looking for someone to do my assignment below Each student is required to read a journal article on issues addressing a contracting issue related to this course. This article must be different than y

Looking for someone to do my assignment below  Each student is required to read a journal article on issues addressing a contracting issue related to this course. This article must be different than your first article submission. The journals must be research oriented, such as those found at Air University or related

Assignment 3 Overview As with Microsoft Excel, knowledge and use of relational databases is an important skill for a data analyst. Often large amounts of data are stored in relational databases and a

Assignment 3 Overview As with Microsoft Excel, knowledge and use of relational databases is an important skill for a data analyst. Often large amounts of data are stored in relational databases and a data analyst needs to understand how to combine data from various tables and export those data into

Looking for someone to do my assignment below Assignment: Conduct research in the library and/or on the web that pertains to the Joint Theater Support Contracting Command (JTSCC). What are the iss

Looking for someone to do my assignment below  Assignment:  Conduct research in the library and/or on the web that pertains to the Joint Theater Support Contracting Command (JTSCC). What are the issues or problems within the framework or process of operating the JTSCC? Give real world examples. Instructions:  Write a four-page

Microsoft Excel is an important tool for a healthcare data analyst because of its widespread availability to prepare them, intuitive interface, and general functionality. Data (particularly financial

Microsoft Excel is an important tool for a healthcare data analyst because of its widespread availability to prepare them, intuitive interface, and general functionality. Data (particularly financial data) is often provided in an Excel file, and for small datasets, availability and Data (particularly financial data) is often provided in an

Looking for someone to do my assignment below Each student is required to read a journal article on issues addressing a contracting issue related to this course. The journals must be research oriente

Looking for someone to do my assignment below  Each student is required to read a journal article on issues addressing a contracting issue related to this course. The journals must be research oriented, such as those found at Air University or APUS library related journals. The selected article should cite references and

Looking for someone to do my assignment below W3: Research Paper OutlineAssignment: Select a topic that relates to Contract Management with a focus on Joint Contingency Contracting or any topic from t

Looking for someone to do my assignment below W3: Research Paper OutlineAssignment: Select a topic that relates to Contract Management with a focus on Joint Contingency Contracting or any topic from the learning objectives found in the lessons and syllabus. You can research an existing program, or you can write about the contract management process

Looking for someone to do my assignment please see details below Read the required readings and lesson for this week. Write a short paper on the issues or problems facing the Contingency Contractin

Looking for someone to do my assignment please see details below  Read the required readings and lesson for this week. Write a short paper on the issues or problems facing the Contingency Contracting Officer (CCO) or the Senior Contracting Official (SCO) and their responsibilities. Are there any ethical issues? Do research

The Merchant of Venice is a comedy by William Shakespeare that highlights the contradictions between an evolving capitalist system and the moral values of the characters. With many of the plays, confl

The Merchant of Venice is a comedy by William Shakespeare that highlights the contradictions between an evolving capitalist system and the moral values of the characters. With many of the plays, conflicts stemming from characters’ actions contradicting their values. Here, we’ll analyze scenes and character interactions in the play to

For this Market Research assignment, you will be finalizing the selection of your sole proprietorship for your final Course Project. Before beginning this assignment, please read the Course Project Ov

For this Market Research assignment, you will be finalizing the selection of your sole proprietorship for your final Course Project. Before beginning this assignment, please read the Course Project Overview and Chapters 7, 8, 9, 10, and 15 of the Successful Business Plan textbook. Here are the steps required: Identify

2-3 Assignment: Management Approaches BUS 210 Module Two Assignment Guidelines and RubricOverview Using management skills and practices that suit your team and its objectives can help to ensure the te

2-3 Assignment: Management Approaches BUS 210 Module Two Assignment Guidelines and RubricOverview Using management skills and practices that suit your team and its objectives can help to ensure the team’s efficiency and productivity. Analyze the scenario provided to determine how management skills and practices can help support the team. Scenario Imagine

Greetings, looking for someone to do my assignment please see details below Assignment: For this paper, you will need to find an article in the library that relates to the command and contracting lin

Greetings, looking for someone to do my assignment please see details below  Assignment: For this paper, you will need to find an article in the library that relates to the command and contracting lines of authority and structure and then summarize and discuss the article as well as describe command and contracting