It turns out this data is very hard to match up with the car data we have. For example, a
2005 Mini Cooper has 12 entries in the EPA data. There's the
Mini Cooper,
Mini Cooper S,
Mini Cooper Convertible, and the
Mini Cooper S Convertible. All of these come in
Manual or
Auto transmission—each with separate mileage estimates. This is true across most makes/models. In some cases the differences are types of fuels, 4-wheel-drive vs. 2-wheel-drive, and/or the number of cylinders.
Our car data has some of this engine and body data, but we chose to keep the data entry form very simple. When you add a 2005 Mini Cooper with
our form, you have the option to choose convertible vs. coupe, but not the transmission type, and not the
S class. (Plus, I'm guessing a large swath of potential users don't know whether they have a V6 or a V8 under the hood, let alone some of the
more obscure differences.) Making users choose this level of detail just to get EPA estimates seems like a bad trade off.
So with these 12 records for a 2005 Mini Cooper, should I just average them all to give a "best-guess" EPA estimate? Should we provide an "advanced" car form so folks who know an S from a non-S can fill in that info?