Changing csv structure and corresponding etl

From: Shekar Tippur
Subject: Changing csv structure and corresponding etl
Date:
Msg-id: CED2688C-EC81-4679-8FD8-C64E249099FD@gmail.com
List: pgsql-sql
Hello,

I am using Redshift to store data from CSV backups that arrive at a regular interval. I use pyspark (psycopg library) to perform ETL. The issue is that the CSV structure changes between backups and the ETL job fails, because the columns get mixed up.
For example, the original column list was A, B, C, D. In the next iteration, the columns can be A, B, C, X, Y, D.
I read in some of the other posts that it is not possible to alter a table to add a column at a particular position within Postgres.
The table itself currently has millions of rows, so merging tables whenever the structure changes may not be a good option, i.e. create a union of the existing table and the new data, drop the original table, and rename the union to the original.
Any pointers on how to proceed?
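One approach worth considering (a sketch, not the poster's actual pipeline): since new columns only matter positionally when you load by position, you can load by *name* instead. Read the CSV header, issue `ALTER TABLE ... ADD COLUMN` for any columns the table doesn't have yet (they land at the end, which is harmless when inserting by name), and build a COPY statement with an explicit column list. The table name `backups` and the `text` type here are illustrative assumptions; Postgres-style `COPY ... FROM STDIN` is shown, but Redshift's `COPY` (which loads from S3) also accepts an explicit column list.

```python
import csv
import io

def plan_load(existing_cols, csv_text, table="backups"):
    """Compare the CSV header against the table's known columns and
    build the statements needed to load by name, not by position.

    existing_cols: columns the target table already has.
    csv_text: the raw CSV, first row assumed to be the header.
    Returns (list of ALTER TABLE statements, COPY statement).
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)

    # Columns present in the file but not yet in the table.
    # ADD COLUMN appends them at the end; position is irrelevant
    # because the COPY below names every column explicitly.
    alters = [
        f"ALTER TABLE {table} ADD COLUMN {col} text"  # type is an assumption
        for col in header
        if col not in existing_cols
    ]

    col_list = ", ".join(header)
    copy_sql = f"COPY {table} ({col_list}) FROM STDIN WITH CSV"
    return alters, copy_sql

# The scenario from the post: table has A,B,C,D; new file has A,B,C,X,Y,D.
alters, copy_sql = plan_load(
    ["a", "b", "c", "d"],
    "a,b,c,x,y,d\n1,2,3,4,5,6\n",
)
# alters -> ALTER statements for x and y only
# copy_sql names all six columns in the file's own order
```

The statements would then be executed through the psycopg connection before the load; since every row is inserted by column name, a reordered or extended CSV no longer shifts data into the wrong columns.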

Thanks,
Shekar
Sent from my iPhone
