
The project should include the following, with justifications and explanations at each step:

1. A description of the data and the goal of the analysis.
2. The type of statistical methods and models used in the analysis.
3. Transformations of variables (if needed) and interactions (if needed) in the analysis.
4. An initial analysis that includes all the variables and all the summary statistics, such as
parameter estimates, their standard errors, p-values, etc.
5. All the relevant diagnostics (as appropriate) needed for the initial analysis (step 4).
6. Model selection.
7. The recommended final model along with the summary statistics for this model (as in
part 4), and the relevant plots (as needed).
8. Summary of findings, conclusion and recommendations for further analysis (if any).
For each of the two problems, your report may include the following sections:
(i) Introduction: Statement of the problem
(ii) Materials and Methods: Description of the data and methods used in the analyses.
(iii) Results: Explanation of the results of your analyses. You can cut and paste the
relevant parts of your computer outputs and refer to them in explaining your results.
(iv) Conclusion and Discussion: Highlight the main points and discuss them.
Format:
● The report should be typed and well formatted as a complete stand-alone document (not a
list of bullet points).
● The report should not contain code or raw R output. The R code should be in an
appendix.
● There should be a title page with names and student IDs of all group members.
2. Ischemic heart disease. (file: ischemic)
Data were collected by a health insurance company on its subscribers who made claims
resulting from ischemic heart disease between January 1, 1998 and December 31, 1999.
The response is the number of emergency room visits, and the goal is to model its mean
as a function of the 8 other variables. You may try models with all the predictor variables
untransformed, and models with the predictor variables transformed by square root (except
gender). Use Poisson regression to perform data summary, goodness-of-fit and model selection.
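As a sketch, the two candidate models described above can be written as follows. The notation is introduced here for illustration and is not part of the assignment: Y_i is the number of visits for subscriber i, x_ij are the 8 predictors, and a log link is assumed because it is the usual default for Poisson regression.

```latex
% Untransformed predictors (log link assumed)
\[
Y_i \sim \mathrm{Poisson}(\mu_i), \qquad
\log \mu_i = \beta_0 + \sum_{j=1}^{8} \beta_j x_{ij}
\]

% Square-root-transformed predictors, with gender left as a 0/1 indicator
\[
\log \mu_i = \beta_0 + \beta_{g}\,\mathrm{gender}_i
           + \sum_{j \neq g} \beta_j \sqrt{x_{ij}}
\]
```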
The data are given in the file ischemic.xls. The columns are
Column 1: cost, total cost of claims made by subscriber (dollars),
Column 2: age, age of subscriber (years),
Column 3: gender, gender of subscriber (1=male, 0=otherwise),
Column 4: inter, total number of interventions or procedures carried out,
Column 5: drugs, number of tracked drugs prescribed,
Column 6: complications, number of other complications that arose during the heart disease
treatment,
Column 7: comorbidities, number of other diseases that the subscriber had during the period,
Column 8: duration, duration of the treatment condition (days),
Column 9: visits, number of emergency room visits.
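
The assignment itself calls for R, where a Poisson regression is normally fitted with glm(..., family = poisson). The Python sketch below is only meant to show what such a fit computes: Newton-Raphson estimation of the coefficients under an assumed log link, the residual deviance used for goodness-of-fit, and the square-root transformation that leaves gender untouched. The function names (solve_linear, fit_poisson, sqrt_except_gender) and the toy numbers are hypothetical and are not taken from ischemic.xls.

```python
import math


def solve_linear(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def linear_means(x_rows, beta):
    """Fitted means mu_i = exp(x_i . beta) under the log link."""
    return [math.exp(sum(b * v for b, v in zip(beta, row))) for row in x_rows]


def fit_poisson(x_rows, y, max_iter=50, tol=1e-10):
    """Newton-Raphson fit of a Poisson regression; returns (beta, deviance)."""
    p = len(x_rows[0])
    beta = [0.0] * p
    for _ in range(max_iter):
        mu = linear_means(x_rows, beta)
        # Score vector X'(y - mu) and information matrix X' diag(mu) X.
        grad = [sum(row[j] * (yi - mi) for row, yi, mi in zip(x_rows, y, mu))
                for j in range(p)]
        info = [[sum(row[j] * row[k] * mi for row, mi in zip(x_rows, mu))
                 for k in range(p)] for j in range(p)]
        step = solve_linear(info, grad)
        beta = [b + s for b, s in zip(beta, step)]
        if max(abs(s) for s in step) < tol:
            break
    mu = linear_means(x_rows, beta)
    # Residual deviance; the term y*log(y/mu) is taken as 0 when y == 0.
    deviance = 2.0 * sum(
        (yi * math.log(yi / mi) if yi > 0 else 0.0) - (yi - mi)
        for yi, mi in zip(y, mu)
    )
    return beta, deviance


def sqrt_except_gender(record):
    """Square-root transform every predictor in a record except gender."""
    return {name: (value if name == "gender" else math.sqrt(value))
            for name, value in record.items()}


# Made-up toy data (NOT from ischemic.xls): an intercept plus gender only.
toy_gender = [0, 0, 1, 1]
toy_visits = [1, 3, 4, 8]
design = [[1.0, float(g)] for g in toy_gender]

beta, deviance = fit_poisson(design, toy_visits)
print("coefficients (intercept, gender):", beta)
print("residual deviance:", deviance)
print(sqrt_except_gender({"age": 64, "gender": 1, "drugs": 4}))
```

With a single 0/1 predictor the fitted means equal the group means, so in this toy example exp(intercept) recovers the average number of visits in the gender = 0 group. For the real analysis, all 8 predictors would enter the design matrix, and the deviance of competing models (untransformed versus square-root-transformed) could be compared during model selection.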