News:
- I added a reproducible example using artificial data which you can run by calling 'example(build.panel)'. This means you can try out the package before bothering to download anything and it provides a simple test of the main function.
- I've included a suggestion to use the R survey package to analyse this dataset and made it explicit in the examples how to obtain the desired weights for each wave. Note that your results are invalid in the majority of cases if you ignore the survey design (i.e. the weights).
- I got some useful comments from Anthony Damico (thanks!) and integrated the SAScii package. (check out his tutorials at http://www.asdfree.com/). This allows one to download the data directly from the PSID server into R, thereby removing any dependency on Stata or SAS to preprocess the raw data. (As is common with large datasets, the raw data come in ASCII format that needs to be fixed up into rows and columns.) The downside is that downloading directly takes a rather long time: downloading FAM1985ER, FAM1986ER and the index IND2009ER took 3 and a half hours.
Hopefully I can get another round of feedback (particularly from a windows user: I could not test that all the paths are written correctly on a unix system) before submitting to CRAN.
I saw Anthony Damico's comments on a previous post of yours, and am thrilled to see that this has led to removing a dependency (the extra download time is, in my opinion, a fine trade-off). Your work will be extremely valuable to the R user community: there is a real need for adsfree-like code and packages that bring large social science datasets to R, so thanks to you both are in order!
ReplyDeleteHey Fr good to get some feedback. yes Anthony enlightened me with his ASCii package: I've been looking for something like it for a long time (but didn't quite know what to look for). the stuff he put up at adsfree.com is superb, and a completely different scale (and scope) than this here. That monetDB connection they've got? boy...
ReplyDeleteI wrote this package up to deal with the PSID once you have it on your disk, at which point you still have a couple of problems to solve. Nothing a more or less experienced useR couldn't deal with, but it would be case by case and truck loads of unnecessary overhead. Hopefully it'll be useful to some. Oh, please let me know how you get along if you try it out.
of course Anthony's site is www.asdfree.com (and not adsfree.com)! I always type it wrong the first time round. It's an interesting typo to make.
ReplyDelete