Learning Goal: I’m working on a r presentation and need an explanation and answe
Learning Goal: I'm working on a r presentation and need an explanation and answer to help me learn.Find one dataset that is of interest to you. Some places to find datasets include:The R Project for Statistical Computing (Links to an external site.) Kaggle (Links to an external site.) U.S. Government's Open Data (Links to an external site.) or your own data. Your data set should have at least 700, but less than 6000, records and eight (8) attributes and the data should not be “clean”. Part of this assignment will require you to clean the data yourself.Please see any accompanying Data Dictionary to understand the fields and values in your chosen dataset are available.The assignment has three parts.Part IPlease review the Data Dictionary document as you review the datasets if one is provided. In order to understand the data we first need to run some descriptive statistics on the data set. Start by providing the following for each appropriate variable in the dataset:Summarize the data in a table. Graphs that help visualize the data. These can be bar charts, histograms, pie charts, etc. Be sure the chosen graph best represents the information you want to highlight. Explain the story the data is telling you. What business question do your descriptive analyses answer? Provide a brief discussion of the findings. If there are any unusual values, discuss them. If data values are “out of range,” clean the data as needed. Delete the out of range values and run the analysis again. If you remove out of range values for any of the variables, present both the analysis with the out of range values and the analysis without the out of range value(s). Identify additional questions that the data is leading your to ask. What new attributes are needed to answer those questions? Refer to this Assignment Part I Section Example for how the questions in Part I should be answered.Part IICreate new attributes based on the data and the questions you identified in Part 1. For your data set, compute differences between appropriate variable values and create a new variable. For example, if the data shows yearly sales for different years, by month, calculate the increase or decrease in sales from month to month.Then, compute the mean and median for each of the variables you have computed.Part IIINow that you have worked with the data, what is the data saying to you? What have you learned about the attributes? What are some follow-up questions you would like to have answered? Identify 3-5 observations or follow-up questions that you have.Complete all data management task in R.What to Submit(what you should give me)A presentation slide deck (5-8 slides not including Title and reference list slide) with your findings. FormatYour presentation must:Tell the story of your data through the use of descriptive statistics and visualizations Remember your visualizations are the primary vehicle you'll use to convey information in an analytics presentation Include very concise written information that is highly connected to the points made in the visualizations as a Notes section on each slide. Properly cite all sources using APA citation rules. Requirements: 800

Leave a Reply

Your email address will not be published.