ignore null values when loading data into BigQuery

1.3k Views Asked by Beginner At 08 March 2022 at 11:08

I am loading data into bigquery, the data should be in order when I upload it from csv, but once loaded it gets shuffled in bigquery, as below,

actual table: This how it should be in bigquery

id	name	location
1	aaa	bbbb
2	ccc	dddd
3		eeee
4	fff	gggg

uploaded table: but in the below format it got updated once I uploaded from csv to bigquery

id	name	location
3	null	eeee
2	ccc	dddd
1	aaa	bbbb
4	fff	gggg

Even if I use pandas.sort_values(by='id'), I am not getting the table in correct order, it gets reshuffled, no idea on what basis.

What changes/steps to be done to get the actual table uploaded into the bigquery as it is in the same format and ignoring null values while loading data into the bigquery?

Thanks in advance

Original Q&A

There are 2 best solutions below

Betjens On 09 March 2022 at 11:56 BEST ANSWER

To complement Cylldby answer. If you still need that raw order, you can just add a column named index and populated with your index data from your dataframe into your table ( or working temp table )

df['index_col'] = df.index

Note: if there are multiple files, you can also set the custom index of your dataframe chunks. For more info there is this good answer which explores index reordering.

So, on you BigQuery table you can just order by index_col and preserver the raw order that got uploaded from your file.

Cylldby On 08 March 2022 at 15:29

A very important aspect of BigQuery is that

If an ORDER BY clause is not present, the order of the results of a query is not defined (ref here)

If you want to display the rows as you wish, simply add an ORDER BY clause:

ORDER BY `id`

As for ignoring the NaN values, just filter you dataframe with .notna() before using to_gbq.

ignore null values when loading data into BigQuery

There are 2 best solutions below

Related Questions in PANDAS

Related Questions in GOOGLE-CLOUD-PLATFORM

Related Questions in GOOGLE-BIGQUERY

Related Questions in PANDAS.DATAFRAME.TO-GBQ

Trending Questions

Popular # Hahtags

Popular Questions