Split a Column Based on first instance

41 Views Asked by Tinkinc At 06 February 2024 at 16:08

Looking to split a df | series column into 2 parts based on the first "_"

example in column:

Male_85__and_over

test['gender'] = test['column_Name_pivoted'].str.split('_').str[0]
test['age'] = test['column_Name_pivoted'].str.split('_',n=1).str[1:]

Output is not what I was looking for:

gender	age
Male	[85__and_over]

There are 3 best solutions below

Tim Biegeleisen On 06 February 2024 at 16:12

You could use str.extract here:

test[["gender", "age"]] = test.str.extract(r'([^_]+)_([^_]+)')

Tinkinc On 06 February 2024 at 16:18

test[['gender','age']] =  test["column_Name_pivoted"].str.split("_", n=1, expand=True)

brunns On 06 February 2024 at 16:18

Try str.partition():

test['gender'], _, test['age'] = test['column_Name_pivoted'].str.partition("_")