I have a DataFrame with a column text containing strings like "bla bla bla Qwerty ### X/6 bla bla bla".
First, I want to filter the DataFrame and keep only the rows containing "Qwerty ### X/6". (It's fine if a row also contains "bla bla", as in the example above.)
Second, I want to create a new column holding the number X extracted from the string in column text.
CodePudding user response:
Try this:
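# Keep only the rows whose text contains the pattern (raw strings avoid escape warnings for \d)
filtered_df = df[
    df.text.str.contains(r'Qwerty ### \d/6', regex=True)
].copy()
# Capture the single digit X that precedes "/6"; expand=False returns a Series
filtered_df['number'] = filtered_df.text.str.extract(
    r'Qwerty ### (\d)/6', expand=False
)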
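For a quick check, here is a minimal self-contained sketch of the same idea on made-up data. The sample rows and the names pattern, numbers and result are assumptions for illustration only. It also treats "###" as literal text, as the answer above does; if "###" actually stands for a number, something like \d+ would probably need to take its place. Because str.extract yields NaN for rows that do not match, dropping those rows does the filtering and the extraction in one pass.

```python
import pandas as pd

# Hypothetical sample data; the real DataFrame is assumed to have a string column "text".
df = pd.DataFrame({
    "text": [
        "bla bla bla Qwerty ### 3/6 bla bla bla",
        "no match here",
        "Qwerty ### 5/6",
    ]
})

# "###" is matched literally here (assumption); the group captures one digit before "/6".
pattern = r"Qwerty ### (\d)/6"

# Non-matching rows get NaN, so dropna removes them.
numbers = df["text"].str.extract(pattern, expand=False)
result = df.assign(number=numbers).dropna(subset=["number"])

# The captured values are strings; convert them to integers.
result["number"] = result["number"].astype(int)

print(result)
```

If the text column can contain missing values, the str.contains approach would likely need na=False so that the boolean mask has no NaN entries.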
