I am trying to load CSV files from the data lake into Delta tables, but I am getting a duplicate column name error when the tables are created.
My CSV looks something like this -
Id,Alpha Source,Alpha source
1,AKH,null
2,AKG,
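Note the two Alpha columns differ only in letter case. I believe even a plain batch read hits the same collision, since Spark's analyzer resolves column names case-insensitively by default. This is just a sketch with a hypothetical local path:

# Sketch of the collision as I understand it; /tmp/sample_logs/ is a
# hypothetical local copy of the CSV. With the default
# spark.sql.caseSensitive=false, schema inference treats "Alpha Source"
# and "Alpha source" as the same name and fails with an AnalysisException
# about duplicate columns in the data schema.
df = (
    spark.read.format("csv")
    .option("header", "true")
    .option("inferSchema", "true")
    .load("/tmp/sample_logs/")
)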
And I am trying to load the table from abfss like this -
import dlt

@dlt.table(
    comment="load csv files in bronze",
    name="dev.bronze.logs",
    table_properties={
        "delta.columnMapping.mode": "name"
    },
)
def table():
    landing_zone_path = "abfss://[email protected]/log/"
    df = spark.readStream.format("cloudFiles") \
        .option("cloudFiles.format", "csv") \
        .option("header", "true") \
        .option("inferSchema", "true") \
        .load(landing_zone_path)
    return df
I would expect the additional column to go into _rescued_data, but that's not happening. I also tried spark.conf.set('spark.sql.caseSensitive', True), but that didn't work either.
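From what I have read, session confs set with spark.conf.set() inside a DLT notebook don't reliably reach the pipeline; spark.sql.caseSensitive would instead need to go into the pipeline's settings, something like this fragment of the pipeline settings JSON (untested assumption on my part):

{
  "configuration": {
    "spark.sql.caseSensitive": "true"
  }
}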

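The only workaround I have come up with so far is to skip inference and pass an explicit schema with de-duplicated names. My understanding is that the CSV reader's default enforceSchema=true applies a user-supplied schema positionally and ignores the header names, so the colliding headers should never become columns; I am assuming this also holds when the reader runs under cloudFiles. The Alpha_source_2 rename below is my own invention, not something from the source data:

import dlt
from pyspark.sql.types import IntegerType, StringType, StructField, StructType

# Explicit schema with manually de-duplicated names; "Alpha_source_2" is my
# own rename for the second case-colliding column.
log_schema = StructType([
    StructField("Id", IntegerType(), True),
    StructField("Alpha_Source", StringType(), True),
    StructField("Alpha_source_2", StringType(), True),
])

@dlt.table(
    comment="load csv files in bronze with an explicit schema",
    name="dev.bronze.logs",
)
def table():
    landing_zone_path = "abfss://[email protected]/log/"
    # With header=true and the default enforceSchema=true, the reader skips
    # the header row and applies log_schema positionally, so the duplicate
    # header names are never inferred or validated.
    df = spark.readStream.format("cloudFiles") \
        .option("cloudFiles.format", "csv") \
        .option("header", "true") \
        .schema(log_schema) \
        .load(landing_zone_path)
    return df

This keeps the pipeline running, but I would prefer to keep schema inference and have the extra column land in _rescued_data if there is a cleaner way.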