Our Services

Get 15% Discount on your First Order

[rank_math_breadcrumb]

MINi PROJECT

2021Fall_IE342_MiniProject2 5 (Instructor: Azadeh Haghighi)

Mini Project 2

Directions:

In this project, you are going to graphically present, study, analyze and evaluate the distributions of a dataset. Upload your whole package (including any codes, plots, and the report) to the blackboard by the due date.

The Data:

Collect the size of 1000 files on your computer. Convert all sizes to the same unit (e.g., either KB or MB). You can do a quick search online (or get help form your classmates) to find out how to collect file sizes automatically using windows dir command. You can also decide to manually record the file sizes if you cannot figure out how to do it automatically.

Dataset

Use the above dataset to plot the necessary graphs and answer the following questions.

  1. Extract the first digit of collected file sizes and collect them in a table. The first digit is defined as the first nonzero digit from the left. For example, 1294 has first digit 1, and 0.34 has first digit 3.
  2. Before ploting the distribution of these first digits, which probability distribution do you think would best represent it? Why?
  3. Now plot the probability distribution of these first digits (vertical axis represents the probability while the horizontal axis represents the digit 1, 2 , 3, 4, 5, 6, 7, 8, 9). Looking at the plot, how does the probability pattern change as you move from digit 1 to 9? (i.e., does it increase or decrease?)
  4. Now use the following function to calculate and plot the probability of digit !.

“(!) = log!” )1 + 1!, ,

! = 1, 2, 3, 4, 5, 6, 7, 8, 9

5. Merge the plots obtained in part 4 with part 3 so that they are shown side by side as clustered columns (see below for a visualization example, the probability values are just provided randomly in this figure and do not reflect any realistic case). Compare the two plots side by side. What is your observation? Do you see any relation among the two plots?

1 0.8 0.6 0.4 0.2 0 … 1 Plot from 3 Plot from 4 3 2

  1. Search for “Benford’s law” online and explain what it states.
  2. Using Benford’s law explain what you observe in the plot in part 5?
  3. Find an application of Benford’s law in real life by searching the internet. Explain why this law is useful for that specific application.

Share This Post

Email
WhatsApp
Facebook
Twitter
LinkedIn
Pinterest
Reddit

Order a Similar Paper and get 15% Discount on your First Order

Related Questions

Help with statistics and economics

I need help with a tutor for two classes: statistics and also economics. I’m not sure how this website works but I want consistent help not just one time. Would prefer two separate tutors but if you are good at both subjects I’m not against that and would appreciate it.

Home work

Competencies 1. Describe the data using the measures of central tendency and measures of variability. 2. Apply the normal distribution, standard normal distribution, and central limit theorem. 3. Develop a confidence interval for a population parameter. 4. Evaluate hypothesis tests for population parameters from one population. 5. Evaluate hypothesis tests

Math Unit VII

See attached Unit VII Journal Watch  Video 4: Embracing Challenges . Reflect on your time in this course. What challenges have you faced? What strategies or mindsets did you implement to overcome these challenges? How could you improve your strategy and mindset so that they are more effective? Your journal

What is Methodology

How do I identify specific mixed-methods designs (e.g., convergent parallel, explanatory sequential) and how do I make clear the sequence and Priority of qualitative vs. quantitative strands. Which one should I do first and second? How can I integrate the two?  How do i fully explain the justification for secondary data analysis?

Discussion Activity 3

  Understanding the intricacies of data interpretation is crucial in social work practice, as it informs decision-making and helps guide interventions that impact individuals and communities. Two important concepts in this regard are standard errors and statistical significance. These concepts help social workers assess the reliability of data and the

Cardiac Rhythm Interpretation Education Poster Project

Cardiac Rhythm Interpretation Education Poster Project Due: Sat Feb 21, 2026 11:59am Ungraded, 50 Possible Points50 Points Possible Attempt In Progress Cardiac Rhythm Interpretation Education Poster Project  For this project, you will create a professional, patient-safety-focused educational poster on a selected cardiac rhythm or cardiac emergency. The poster must demonstrate your understanding of

Mathematics Homework

  Use the information needed for week 6 and answer the questions 1 Create a pie chart for the variable insurance in excel:  · Review this video to learn how to create a pie chart in excel.Links to an external site.     · First, you have to create a frequency

Excel

 What is Excel to you, and how do you use it? Is this software important in today’s business environment? 

ma

Handout #4 Math 101 Name: _____________________________ Basic Probability & Odds Two dice are rolled. Use the table from our notes to identify the sample space. Then calculate the probability of each event. 1. What is the probability that we roll a sum less than 8? 2. What is the probability

MAT DB V

See attached Unit V Discussion Board Watch  Video 3: The Power of “Yet,”  and evaluate your course progress. Are you meeting your goals for the course? What goals do you have for the remainder of the class? What strategy can you implement to help you achieve those goals? Next, respond

MAT IV JL

See Attached Watch the  video  The Power of Making Mistakes . Describe a situation where you made a mistake in this course. How can you learn from your mistake in order to better understand the course material? What did this mistake teach you? If this was another person describing their

home work

Competency Evaluate hypothesis tests for population parameters from two populations. Project Deliverable tie to Competency  In your fifth assignment, you will continue applying hypothesis testing methods to real-world situations. This assignment will require you to expand your understanding of situations that involve comparing two populations, which is a critical skill

home work

Competency Evaluate hypothesis tests for population parameters from one population. Project Deliverable tie to Competency  In the fourth assignment, you will need to make data-driven decisions by evaluating claims about a single population. You will be tasked with completing several hypothesis tests, which follow a structured process: stating hypotheses, selecting

it is a MATHEMATICS QUESTION

  Three resistors of 2 Ω, 3 Ω and 6 Ω are connected in parallel. This combination is connected in series with a 4 Ω resistor. The whole circuit is connected to a 12 V battery. Find: Total resistance of the circuit   Total current from the battery   Current

home work

Competency Develop a confidence interval for a population parameter. Project Deliverable tie to Competency  In the third assignment, you will construct and interpret confidence intervals, which is a fundamental concept in inferential statistics. By estimating population parameters such as the mean and proportion using sample data, you will gain a

Math

Which inequality is equivalent to –m ≥ 15? CLEAR CHECK m ≥ 15 m ≥ -15 m ≤ -15 m ≤ 15

Home work

Competency Apply the normal distribution, standard normal distribution, and central limit theorem. Project Deliverable tie to Competency  In module 02, you will summarize graphical and numerical methods to represent quantitative data. This includes utilizing the central limit theorem to calculate probabilities for real-world problems. It will also require you to