Selecting a column with backtick in its name - AnalysisException: cannot resolve Column


I have a data frame which has the below column:

Last Login- Date & Time(Incl. Time Zone)

When I read the data and print the schema, the column is listed as expected:
df.printSchema()

(screenshot: printSchema output, showing the column name wrapped in backticks)

But when I try selecting the column from the data frame it fails.

df.select(col("Last Login- Date & Time(Incl. Time Zone)"))

AnalysisException: cannot resolve '`Last Login- Date & Time(Incl. Time Zone)`'
given input columns: [`Last Login- Date & Time(Incl. Time Zone)`]

2 Answers

Answer by notNull (score 1):

Try replacing the backticks (`) in the column names with underscores (_), then select using the renamed column.

Example:

from pyspark.sql.functions import *
df = spark.createDataFrame([('1',)],['`Last Login- Date & Time(Incl. Time Zone)`'])
df = df.toDF(*(c.replace('`', '_') for c in df.columns))
df.selectExpr("`_Last Login- Date & Time(Incl. Time Zone)_`").show()
#+------------------------------------------+
#|_Last Login- Date & Time(Incl. Time Zone)_|
#+------------------------------------------+
#|                                         1|
#+------------------------------------------+
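More generally, the same rename-first approach can strip out every character that would otherwise need escaping, not just backticks. A minimal pure-Python sketch (the helper name `sanitize` is my own, not a Spark API):

```python
import re

def sanitize(name: str) -> str:
    """Replace each run of non-alphanumeric characters with a single underscore,
    and trim underscores from the ends."""
    return re.sub(r'[^0-9A-Za-z]+', '_', name).strip('_')

# Applied to the column name from the question:
print(sanitize('`Last Login- Date & Time(Incl. Time Zone)`'))
# Last_Login_Date_Time_Incl_Time_Zone
```

With that helper, `df = df.toDF(*(sanitize(c) for c in df.columns))` would rename every column in one pass, so no quoting is needed afterwards.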
Answer by ZygD (score 6):

As can be seen in the screenshot, the column name itself contains backticks (`). If that is not intentional, you may want to strip them out. If you do need to keep them, then when selecting the column you must escape each embedded backtick by doubling it and wrap the whole name in backticks, which yields three consecutive backticks at each end here:

from pyspark.sql import functions as F
df = spark.range(1).toDF('`Last Login- Date & Time(Incl. Time Zone)`')
df.printSchema()
# root
#  |-- `Last Login- Date & Time(Incl. Time Zone)`: long (nullable = false)

df.select(F.col("```Last Login- Date & Time(Incl. Time Zone)```")).show()
# +------------------------------------------+
# |`Last Login- Date & Time(Incl. Time Zone)`|
# +------------------------------------------+
# |                                         0|
# +------------------------------------------+
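The escaping rule applied by hand above can be captured in a small pure-Python helper (a sketch; `quote_column` is a hypothetical name, not part of the Spark API): double every backtick already in the name, then wrap the result in backticks.

```python
def quote_column(name: str) -> str:
    """Quote a column name for use in F.col()/selectExpr():
    double each embedded backtick, then wrap the whole name in backticks."""
    return '`' + name.replace('`', '``') + '`'

name = '`Last Login- Date & Time(Incl. Time Zone)`'
print(quote_column(name))
# ```Last Login- Date & Time(Incl. Time Zone)```
```

For the column in the question, the leading and trailing backticks in the name double to two, and the wrapping adds a third, reproducing the triple-backtick form used in the select above.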