For loss of a better place, I'll store my recipe for homemade pizza dough here. This will make dough for 14 people.
I've got some questions regarding this issue, maybe someone out there has a clue.

The Austrian contingent was 300 out of 1000 soldiers

source: http://www.guardian.co.uk/world/2013/jun/06/israel-angry-austria-golan-heights.
I just pushed the most recent version of the PSID panel data builder introduced a little while ago. Got some user feedback and made some improvements. The package is hosted on github.

News:

I added a reproducible example using artificial data which you can run by calling 'example(build.panel)'.
I got intrigued by the numbers presented in this news article talking about the re-trial in the Amanda Knox case. The defendants, accused and initially convicted of murder, were acquitted in the appeal's instance when the judge ruled that the forensic evidence was insufficiently conclusive.
I just finished reading the extraordinary book tomorrow's table by P. Ronald and Raoul Adamchak. (I linked to Ronald's blog).
Economists frequently use public datasets. One frequently used dataset is the Panel Study of Income Dynamics, short PSID, maintained by the Institute of Social Research at the University of Michigan.

Economists frequently use public datasets. One frequently used dataset is the Panel Study of Income Dynamics, short PSID, maintained by the Institute of Social Research at the University of Michigan.

I'm introducing psidR, which is a small helper package for R here which makes constructing panels from the PSID a bit easier.

One potential difficulty with the PSID is to construct a longitudinal dataset, i.e. one where individuals are followed over several survey waves. There are several solutions.

  1. In the so-called data center, users can use drill-down menus to select relevant variables from each wave. If the user wants only recent waves, there exists a subsetting mechanism (e.g. only household heads younger than 55). As the required dataset gets larger, this becomes unhandy, as the interface gets slower and slower, and the clicking procedure is rather error prone. The main motivation for this package is that I've spent too many hours clicking on cryptic variable names only to realize after I was done that I had forgotten a variable. Unacceptable.
  2. User may download the data and attempt to merge the annual interview files in order to obtain the desired panel. Though conceptually not very difficult (there is an individual index file, which provides a link for individuals across years), it is a cumbersome accounting exercise to find the right variable names from each year and do the right merges. 
  3. One can use psidR. The main function is inspired by the Stata add on package psiduse. Here is the function's signature.
build.panel(datadir,fam.vars,ind.vars=NULL,fam.files=NULL,ind.file=NULL,heads.only=TRUE,core=TRUE,design="balanced",verbose=FALSE)
  • There is a default behaviour, where the user only points towards a data directory. otherwise one can specify custom locations for family files and individual index.
  • you can supply the PSID data in stata format or csv files
  • The user has to supply a data.frame "fam.vars" which lists the variable names for all required waves. 
  • it's possible to tell the function that a certain variable is missing in a given year (without the variable getting dropped, so you can impute it later on)
  • One can subset the data for household heads only
  • there is a switch to only get the core sample
  • There are 3 different sample designs to choose from: balanced panel (all individuals must be present in all waves), k-period panel (individuals must be at least k periods present) and unbalanced (all included)
  • with "verbose=TRUE" the function prints comments as you go along. 
An issue could be memory. The dataset is quite big. I use data.table to keep things manageable, but it's hard to get around a data.table of 628MB, which is the size of the individual file index. The verbose option prints memory load at various points, so you may be able to intervene and through out some things if you hit a limit.
7

View comments

I was recently asked by a friend whether it's worth to buy a house in the UK. That is, assuming they could put down the money, whether it was worth buying as opposed to renting.
So Paul Krugman laments in his post that policy makers across Europe have blindly signed up to the "Austerity only" ticket. He cites some evidence which I find fairly convincing. I just want to raise the point that what he says cannot be used as a critique against the Monti government.
File under: getting data.

Oh the irony. Exactly one day after I start to read the great book on Open Government by O'Reilly media (which they released in tribute to Aaron Schwartz), I come in need of time series data on unemployment rates in the United States.
Here is my suggestion: why does Ryanair in it's confirmation email not have a subject line that is actually conveying some information about the flight concerned? Currently, the subject line of such a confirmation email is

Ryanair Travel Itinerary - Don't Forget You MUST Check-in Online and Print O
Code Readability is maybe the most important part of producing reproducible research. If it's impossible (i.e.
The Deferred Acceptance Algorithm (DAA) goes back to Gale and Shapley (1962). They introduce a rather simple algorithm that finds a stable matching for example for college admissions or in a marriage market.
This is according to the latest study by uswitch.com. 46% of Britons have thought about emigrating.

flo.
I just finished reading "the big short", a rather vivid account of the events surrounding the 2008 financial crisis, as told by Michael Lewis, who is a former employee of a big - deceased - Wall Street bank called Salomon Brothers. I'll have to say more about the book.
I've started to fork out the R-related entries to www.r-bloggers.com. It's a great place to learn R and see what others are up to. The blog has a very nice mixture of topics ranging from extremely simple (stuff that I would post) to extremely advanced, to the very vanguard of R development.

flo.
have a look at this, nicely done by the NYTimes.

flo.
Following the last post, a couple of questions came up:

How could it be that bad quality mortgages in the US could have such an effect on the global economy? Is it because we are so tightly connected to the US economy or is it because something similar happened everywhere?

I would say it's a mixtu
Hi guys.

this week we have some "light entertainment". I found this little film about how all the strings (could) get together to fabricate what we call "the credit crunch". Needless to say that in a 10 minute animation one can hardly explain anything in great detail.
An interested reader asks:

And what about the government? Does it make sense to pump billions of dollars into the auto industry - as is happening in the US at the moment? In the end this is money that is taken away from taxpayers, because sooner or later they will have to pay it back - regardless i
I hope you are doing fine. The snow is gone and London is as grey/green as ever - everything back to normal.

As normal as it can get those days, that is. And apart from the weather, there are few things you would label "normal" around here.
this week has seen some spectacular action in the UK. We started off with heavy snowfalls in sunday night, covering London and the southeast of the country by as much as 20 cm of snow.
I hope this message finds you well.
I am leaving for 10 days of vacation. Exams were horrible. Tell you more about it soon. Will get sun-burned first.flo.
If you are interested in prose, find a lot of it (and other stories) in Pablo's blog. He is based in Buenos Aires. This link goes to one of my favourite stories:

pablejacioflo.
links
About Me
About Me
Blog Archive
Loading
Dynamic Views theme. Powered by Blogger. Report Abuse.