I have a dataset that shows which community belongs to parent community. It looks like this:
| COMMUNITY | PARENT COMMUNITY |
|---|---|
| Community 1 | Community A |
| Community 2 | Community A |
| Community 3 | NaN |
| Community 4 | Community B |
| Community 5 | Community C |
I have a second dataset with raw values, that looks like this:
| ID | COMMUNITY |
|---|---|
| 1 | Community 4 |
| 2 | Community 5 |
| 3 | Community 1 |
| 4 | Community 1 |
| 5 | Community 1 |
| 6 | Community 3 |
| 7 | Community 7 |
I need to transform second dataset using first dataset, so in the end it looks like this:
| ID | COMMUNITY | PARENT COMMUNITY |
|---|---|---|
| 1 | Community 4 | Community B |
| 2 | Community 5 | Community C |
| 3 | Community 1 | Community A |
| 4 | Community 1 | Community A |
| 5 | Community 1 | Community A |
| 6 | Community 3 | Community 3 |
| 7 | Community 7 | NaN |
Also, Please pay attention to PARENT COMMUNITY for Community 3 and 7.
CodePudding user response:
using pandas you can merge the two dataframes like so:
df3 = df2.merge(df1, on='COMMUNITY', how='left')
If you want to read more you can check out the documentaion
