Hi have noticed that when you set firstrow = 2 the result set has misisng rows.
This can be easily noticed:
The query below (querying a public data source) returns 41165. Setting firstrow = 3 return 41119 (my expectation is that it should only have 1 row less).
Interestingly, changing the query to select count(*) has expected behaviour (i.e. rowcount will decrease by 1 if firstrow is incremented).
I noticed the issues after troubleshooting a sum funtion which returned less than i was expecting.
select COUNT(c1)
from openrowset(
bulk 'https://pandemicdatalake.blob.core.windows.net/public/curated/covid-19/ecdc_cases/latest/ecdc_cases.csv',
format = 'csv',
parser_version = '2.0',
firstrow = 2) as rows
Thank you for raising this, we are aware of this issue. Fix for this will land soon.
In the meantime, you can use
parser_version = '1.0'.Try using this query: