Re: US Census database (Tiger 2004FE) - 4.4G

Поиск
Список
Период
Сортировка
От Mark Woodward
Тема Re: US Census database (Tiger 2004FE) - 4.4G
Дата
Msg-id 22672.24.91.171.78.1123190535.squirrel@mail.mohawksoft.com
обсуждение исходный текст
Ответ на Re: US Census database (Tiger 2004FE) - 4.4G  (Ron Mayer <rm_pg@cheapcomplexdevices.com>)
Список pgsql-hackers
I thought bout it, but it isn't the best program around, but it does work.
My program also reformats numbers, i.e. long/lat become properly
decimal-ed numerics, zips become integers, etc.

The question is...

Do you download the raw data and convert it into a database, or do you
download the pre-formatted database?

I would say the preformated database is easier to manage. There are
hundreds of individual zips files, in each of those files 10 or so data
files.



> Mark Woodward wrote:
>> It is 4.4G in space in a gzip package.
>>
>> I'll mail a DVD to two people who promise to host it for Hackers.
>
> Would it be easier to release the program you did to do
> this conversion?
>
>
> I use this pretty short (274 line) C program:
>     http://www.forensiclogic.com/tmp/tgr2sql.c
> to convert the raw tiger files
> from http://www.census.gov/geo/www/tiger/index.html
> into SQL statements that can be loaded by postgresql.
>
> The #define SQL line controls if it makes data
> with INSERT statements or for COPY statements.
>
> ---------------------------(end of broadcast)---------------------------
> TIP 6: explain analyze is your friend
>



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Marko Kreen
Дата:
Сообщение: Re: openbsd, plpython, missing threading symbols
Следующее
От: Ian Burrell
Дата:
Сообщение: Re: Solving the OID-collision problem