In this assignment, we will run a correlation and bivariate regression analysis to explore the relationship between median_aqi and child_mortality (deaths among children under age 18 per 100,000 population). We will use the R functions ggplot to create a scatterplot, cor to generate a correlation, and lm to run a bivariate regression.
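As a minimal sketch of the three R calls named above, the block below assumes the ggplot2 package is installed and uses a tiny made-up data frame in place of the real export. The values are illustrative only and are not results from the textbook database.

```r
library(ggplot2)  # provides ggplot(); assumed to be installed

# Hypothetical toy data; the real values come from the textbook database.
aq <- data.frame(
  median_aqi      = c(30, 35, 40, 45, 50, 55),
  child_mortality = c(42, 47, 45, 52, 55, 58)
)

# Scatterplot with a fitted least-squares line
p <- ggplot(aq, aes(x = median_aqi, y = child_mortality)) +
  geom_point() +
  geom_smooth(method = "lm", se = FALSE, formula = y ~ x) +
  labs(x = "Median AQI", y = "Child deaths per 100,000 population")
print(p)

# Pearson correlation; use = "complete.obs" drops rows with missing values
r <- cor(aq$median_aqi, aq$child_mortality, use = "complete.obs")
r

# Bivariate regression: child_mortality is the outcome, median_aqi the predictor
fit <- lm(child_mortality ~ median_aqi, data = aq)
summary(fit)
```

With a single predictor, the R-squared reported by summary(fit) equals the square of the correlation r, which gives a quick consistency check between the two outputs.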

Assignment 6 Instructions

  1. Read Chapters 9 & 11
  2. To test the relationship between median_aqi and child_mortality (deaths among children under age 18 per 100,000 population), you will need to extract the appropriate data from the textbook database using MySQL Workbench, prepare the data using Excel, load the data into RStudio, run the appropriate analyses, and describe the results.
  3. For this assignment, you are expected to use MySQL Workbench, Excel, and RStudio more independently, but some additional instructions are provided; see the attachment below.
  4. For this assignment, you will submit a Word document containing a scatterplot, correlation, and regression output.
  5. You will also need to interpret the findings and answer the following questions:

a. What is the hypothesis for this analysis?

b. Did you find a relationship between median_aqi and child_mortality?

c. What conclusions might you draw from these findings?

Is there a relationship between the rate of air pollution particulates measured in US counties and population health, including diabetes, low birth weight, frequent mental distress, poor or fair health, and life expectancy?

The dependent variables for the proposed study are diabetes prevalence, low birth weight, frequent mental distress, poor or fair health, and life expectancy.

The proposed research question has one independent variable, hazardous air pollutants. Pollutants are measured by the air quality index (AQI), which reports the number of good, moderate, unhealthy, very unhealthy, and hazardous days for a county based on the pollutants in its air. The EPA treats the AQI as a measure of air quality. For the remainder of this chapter, the independent variable will be referred to as air pollution particulate matter, or PM.

Hypotheses

The research questions will require more than one hypothesis. The hypotheses will include the following:

H10: There is no relationship between PM and the prevalence rate of diabetes in US counties.

H1A: There is a relationship between PM and the prevalence rate of diabetes in US counties.

H20: There is no relationship between PM and the rate of infants born with low birth weight in US counties.

H2A: There is a relationship between PM and the rate of infants born with low birth weight in US counties.

H30: There is no relationship between PM and the rate of frequent mental distress in US counties.

H3A: There is a relationship between PM and the rate of frequent mental distress in US counties.

H40: There is no relationship between PM and the rate of poor or fair health in US counties.

H4A: There is a relationship between PM and the rate of poor or fair health in US counties.

H50: There is no relationship between PM and life expectancy in US counties.

H5A: There is a relationship between PM and life expectancy in US counties.
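Each null and alternative pair above can be checked with a two-sided test of whether the correlation is zero. The sketch below uses base R's cor.test on made-up values. The data and the 0.05 significance level are assumptions for illustration, since the source does not state a level.

```r
# Hypothetical toy data standing in for two columns of the exported data set.
aq <- data.frame(
  median_aqi          = c(30, 35, 40, 45, 50, 55, 60, 65),
  diabetes_prevalence = c(9.1, 9.8, 9.5, 10.4, 10.9, 10.6, 11.8, 12.1)
)

# Two-sided Pearson test of the null hypothesis that the correlation is 0
test <- cor.test(aq$median_aqi, aq$diabetes_prevalence,
                 alternative = "two.sided", method = "pearson")

alpha <- 0.05            # assumed significance level, not given in the source
test$estimate            # sample correlation
test$p.value             # p-value under the null hypothesis

# TRUE means the null hypothesis is rejected at the assumed level
reject_null <- test$p.value < alpha
reject_null
```

Rejecting the null supports only the existence of a linear association; it says nothing about causation.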

To answer our research questions, we require a specific process of data acquisition, preparation, and discovery. The following steps will be explained in detail:

1. Extract the data sets from MySQL.

a. The corrplot package was added for use in this chapter:

library(corrplot)

## corrplot 0.84 loaded

2. Prepare the data.

a. In Microsoft Excel, we had to find and replace all NULL values with blanks after the data set was downloaded from MySQL.

3. Import the data into RStudio.

4. Perform descriptive statistics on the variables of PM, diabetes, frequent mental distress, poor or fair health, and life expectancy.

5. Conduct simple linear regressions to examine the relationship between PM and measures of population health at the US county level.
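Steps 4 and 5 can be sketched in base R as follows. The data frame is a made-up stand-in for the imported file, with column names taken from the extraction query. Using median_aqi as the PM measure is an assumption made for this illustration.

```r
# Hypothetical toy data; column names follow the extraction query.
aq <- data.frame(
  median_aqi               = c(30, 35, 40, 45, 50, 55),
  diabetes_prevalence      = c(9.1, 9.8, NA, 10.4, 10.9, 11.6),
  frequent_mental_distress = c(11.0, 11.4, 12.1, 11.9, 12.8, 13.0),
  life_expectancy          = c(79.5, 79.0, 78.8, 78.1, 77.9, 77.2)
)

# Step 4: descriptive statistics (summary() also reports the count of NAs)
summary(aq)
sapply(aq, sd, na.rm = TRUE)

# Step 5: one bivariate regression per outcome, each with median_aqi as predictor.
# lm() drops rows with missing values by default.
outcomes <- setdiff(names(aq), "median_aqi")
fits <- lapply(outcomes, function(y) {
  lm(reformulate("median_aqi", response = y), data = aq)
})
names(fits) <- outcomes

# Slope of median_aqi in each model
slopes <- sapply(fits, function(f) coef(f)[["median_aqi"]])
slopes
```

Fitting the models in a loop keeps the specification identical across outcomes, so differences in the slopes reflect the data, not inconsistent code.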

The Analysis

This section provides a step-by-step description of obtaining and analyzing the data required to answer the proposed research question.

Step 1: Extract the Data

In order to carry out the analysis, we need the following data:

1. A list of all US counties with measures of levels of the AQI

2. A measure of the prevalence of diabetes cases for each county

3. A measure of the prevalence of births that are considered to be low birth weight for each county

4. A measure of the prevalence of frequent mental distress for each county

5. A measure of the prevalence of poor or fair health for each county

6. A measure of life expectancy for each county

A single data set supplies the data needed to answer the proposed research questions. The MySQL query draws on the air_pollutants, geo_fips_region, and chr_health_outcomes tables.

The MySQL script that was used to extract the data from the database included the following:

SELECT
    f.state_name,
    f.area_name,
    f.region,
    f.subregion,
    a.days_with_aqi,
    a.good_days,
    a.moderate_days,
    a.unhealthy_for_sensitive_groups_days,
    a.unhealthy_days,
    a.very_unhealthy_days,
    a.hazardous_days,
    a.median_aqi,
    c.child_mortality,
    c.diabetes_prevalence,
    c.infant_mortality,
    c.frequent_mental_distress,
    c.poor_or_fair_health,
    c.life_expectancy,
    c.low_birthweight,
    c.premature_age_adjusted_mortality,
    c.premature_death
FROM
    air_pollutants AS a
        JOIN
    geo_fips_region AS f ON a.state_county_fips = f.state_county_fips
        JOIN
    chr_health_outcomes AS c ON c.fips_code = f.state_county_fips;

The data set can be exported from MySQL by selecting the Export button and naming the file aq.csv. In addition to the geographic columns (state_name, area_name, region, and subregion), this data set includes days_with_aqi, good_days, moderate_days, unhealthy_for_sensitive_groups_days, unhealthy_days, very_unhealthy_days, hazardous_days, median_aqi, child_mortality, diabetes_prevalence, infant_mortality, frequent_mental_distress, poor_or_fair_health, life_expectancy, low_birthweight, premature_age_adjusted_mortality, and premature_death.

Step 2: Prepare the Data

One of the first steps in data preparation is to carefully examine the data for erroneous values or blanks. An erroneous value may be a filler, a default value used to indicate a blank field. For example, the value –1111.1 may be used to represent a blank field entry. A value of NULL is also common after exporting data from MySQL. One way to handle erroneous values is to replace them with a blank entry. Filler values must be removed from your data set for two reasons. If the fillers are formatted as text, the column may not support a quantitative analysis such as the calculation of a mean. If the fillers are numeric, they can skew the mean or alter the results of inferential statistical analyses.
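The effect of a filler value is easy to see in a short R sketch. The vector below is made up for illustration, using the –1111.1 filler mentioned above.

```r
# Hypothetical column in which -1111.1 marks a blank entry.
life_expectancy <- c(78.2, 79.5, -1111.1, 77.9, 80.1)

# Leaving the filler in drags the mean far below any real value
mean(life_expectancy)

# Replace the filler with NA, then tell mean() to skip missing values
cleaned <- life_expectancy
cleaned[cleaned == -1111.1] <- NA
mean(cleaned, na.rm = TRUE)

# A text filler such as "NULL" makes the whole column character data,
# so numeric summaries are not available until it is removed
as_text <- c("78.2", "NULL", "77.9")
is.numeric(as_text)
```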

The data set that was queried from MySQL and exported as aq.csv contains NULL values that need to be removed. Open the file in Microsoft Excel by double-clicking it. Under the Home tab, select the magnifying glass and then select Replace. In the window that appears, enter “NULL” into the “Find what:” text field and leave the “Replace with:” field blank. After you select Replace All, every NULL value is replaced with a blank entry. See figure 9.1 for an example of the Find and Replace functionality in Microsoft Excel.
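As an alternative to the Excel step (not the method the text describes), the NULL markers can also be handled at import time in R. The sketch below writes a small made-up stand-in for aq.csv to a temporary file, so the rows and place names are hypothetical.

```r
# Write a small stand-in for aq.csv to a temporary file (hypothetical rows).
path <- tempfile(fileext = ".csv")
writeLines(c(
  "state_name,area_name,median_aqi,child_mortality",
  "State A,County 1,41,NULL",
  "State A,County 2,NULL,52.3",
  "State B,County 3,38,47.9"
), path)

# na.strings maps both "NULL" and empty cells to NA while reading
aq <- read.csv(path, na.strings = c("NULL", ""), stringsAsFactors = FALSE)

str(aq)              # median_aqi and child_mortality are read as numbers
colSums(is.na(aq))   # count of missing values per column
```

Because the markers become NA instead of text, the numeric columns keep a numeric type and functions such as mean and cor can skip the missing entries.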
