Data you need:
Instructions:
Follow the directions for each problem.
UseJMPfor all the calculations.
The data sets needed are available in Contents page of D2L in theComputer Projectmodule.
Createone filefor your project.
Download the project in a D2LdropboxcalledComputer Project.
Answer all the questions thoroughly hand in the answers to the questions fully with JMPoutputs. Cut and paste theJMPoutput into Word or other document software to create only one file.
Organize your responses with respect to the problems given.
If there are any questions do not hesitate to ask your TA or instructor.
Working with other students is fine. Howevereach studentsprojectmustbe a product of his/her own solutions of the problems.This is to beyour own work. Any projects that are exactly the samewill not be graded.
Total possible points 50.
Datasets:
You create your own.A sample of 45 gas station prices of regular unleaded gasoline.
Housessp15.JMP:A random sample of24houses for sale in the area around UWM. Variables are:List Price of a homePrice number of bedroomsBedandsize of the houseSquare Feet.
Airline.JMP:A Two-way table that comparesairlines to on-time status.
Problem1:(5points)Because of the talk about gas prices we want to know information about the price of regular unleaded gasoline for the gas stations in the Milwaukee area.
What is the populationof interest?
Find the gas prices of regular unleaded gasoline for45gas stations in the Milwaukee area.Hint: You can use any online source as in AAA or gasbuddy.com or you can drive around.Describe how you found the sample of45gas stations.Here is the website for AAA:HYPERLINK http://aaa.opisnet.com/index.aspxhttp://aaa.opisnet.com/index.aspx
Construct a histogramof theregular unleaded gasoline prices that you found.Describe the distribution of this variable.Give the shape center and spread according to the histogram.
Determine the following descriptive statistics from your sample of gasoline prices.
mean
standard deviation
median
Q1the first quartile
Q3the third quartile
Problem2:(22points)Weare interested in estimating the mean price of unleaded gasoline in the Milwaukee area. The following will give us this estimation.
Using yourdata that you found in Problem1 determine a 99% confidence interval for the meangasolineprices.
Give an interpretation of this confidence interval.
LastmonthAAA gave an averagegasoline price of $2.29in Milwaukee. According to your data can we say there is a significant difference in the mean gasoline prices compared tomonthago?
Give the null and alternative hypothesis
Describe the assumptions of this hypothesis test. Determine if the test statistic you are using is appropriate.Fully explain.
Determine the p-value
Give a conclusion of this hypothesis test.Use = 0.01.
Compare the results of the significance test to the 99% confidence interval for the mean gasoline price per gallon. Does the conclusion in partivstill hold for the confidence interval? Fully explain.
This part(c)is to test Quantitative Literacy and will be graded by the following rubric.
Assessment Rubric(points)
Learning Outcome
Assessment Item
4
3
2
1
Students will recognize and construct mathematical models and/or hypotheses that represent quantitative information.
Give the null and alternative hypotheses to determine if the mean gasoline price per gallon has significantly changed from last years price.
Skillfully converts relevant information into an appropriate and desired hypothesis that contributes to a further or deeper understanding.
Competently converts relevant information into an appropriate and desired hypothesis.
Completes conversion relevant information into a hypothesis but is only partially appropriate or accurate.
Completes conversion relevant information into a hypothesis but is inappropriate or inaccurate.
Students will evaluate the validity of these models and hypothesis.
Describe the assumptions of this hypothesis test. Determine if the test statistic you are using is appropriate.
Explicitly describes the assumptions of the hypothesis test and provides compelling rationale for why this test statistic is appropriate. Shows awareness that confidence in final conclusions is limited by the accuracy of this hypothesis.
Explicitly describes the assumptions of the hypothesis test and provides compelling rationale for why this test statistic is appropriate.
Explicitly describes the assumptions of the hypothesis test.
Attempts to describe the assumptions of the hypotheses test.
Students will analyze and manipulate mathematical models using quantitative information.
Determine the p-value of this significance test.
Analyses are attempted and all are successful to answer the problem. Analyses are also presented elegantly. (clearly concisely etc.)
Analyses are attempted and all are successful to answer the problem.
Analyses are attempted and some are successful to answer the problem.
Analyses are attempted but are incorrect to answer the problem.
Students will reach logical conclusions predictions or inferences.
Give the conclusion of this significance test use 0.01 as the level of significance.
Provides correct conclusion based on the quantitative information derived. Makes appropriate inferences based on that information.
Provides correct conclusion based on the quantitative information derived.
Provides somewhat correct conclusion based on the quantitative information derived but may have some wrong conclusions.
Attempts to provide correct conclusion based on the information but draws incorrect conclusions about what the information means.
Students will assess the reasonableness of their conclusions.
Compare the results of the significance test to the 99% confidence interval for the mean gasoline price per gallon.Does the conclusion in part iv stillholdfor the confidence interval?
Uses the quantitative information effectively as a basis for deep and thoughtful judgments drawing insightful carefully qualified assessment for the reasonableness of their conclusions.
Uses the quantitative information as a basis for competent judgments drawing reasonable and appropriately qualified assessment for the reasonableness of their conclusions.
Uses the quantitative information effectively as a basis for workmanlike (without inspiration) judgments drawing plausible assessment for the reasonableness of their conclusions.
Uses the quantitative information as a basis for tentative basic judgments assessment for the reasonableness of their conclusions.
Problem3: (8points)Using theJMP datasetHousessp15.JMPwe want to determine if thesize of the house(Square feet)can predict thelist price(Price).
Give a scatterplot ofPrice(y-axis) andSquare feet(x-axis). Describe the relationship between price and size by describing the form direction andstrength.Note any outliers or influential points.
Estimate the correlation coefficient betweenPriceandSquare feet.
Determine the simple linear regression lineequationto predictPricebySquare feetof the house.
What is the slopeb1? Give the interpretation of what that means aboutthePricewith respect toSquareFeet.
Using the regression equation predict thePriceof ahouse that is1800sqft.
What percent of variation inPricecanbe explained by this regressionequation?
Problem4:(10 points)Using theJMP datasetHousessp15.JMPwe are going to look at the difference between the mean price ofa 4-bedroom housecompared to the mean price ofa 3-bedroom house.
Give the summary statistics for price of 4-bedroom houses andthepriceof4-bedroom houses.Hint: you can useBedasthe by variables in the dialog box for Distribution.
Create a boxplot ofthepricebetween4-bedroom houses and 3-bedroom houses.Write out similarities or differences to thelist price of the houses compared tonumber of bedrooms.
Determinea 98% confidence for the differenceforthemeanprice4-bedroom houses and3-bedroom houses.
Are the mean house prices for a 3bedroom housesignificantly less than themeanhouse pricesfor 4bedroom houses?
Give the null and alternative hypothesis.
Give thep-value.
Make adecision of the test. Use =0.02
Give a conclusion in answer to the question above.
Problem5:(5points)Supposethe Federal Aviation Administration (FAA) would like to compare the on-time performances of different airlines on domestic nonstop flights. The following table shows three different airlines and the frequency of flights thatarrivedearly on-time and late for each.This is also in the JMP fileairline.jmp.
Airline
Status
Southwest
US Airways
Delta
Early
20
24
22
On-time
60
55
50
Late
25
30
14
We want to determine ifon-timestatusand airlineare independent of one another.
Give the null and alternative hypothesis.
In the contingency table output from JMP includecount expectedandcell chi square.
Give theP-value and decision of this test.
What can we conclude from this significance test?Using = 0.05.