| Matricule (type array) |
|---|
| [TKI1] |
| [TKI4] |
I want to obtain this DataFrame:
| Matricule (type string) |
|---|
| TKI1 |
| TKI4 |
CodePudding user response:
Since your Matricule column is an ArrayType to begin with, you can use getItem directly, as shown below.
Data Preparation
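import pandas as pd
from pyspark.sql import SparkSession, functions as F

sql = SparkSession.builder.getOrCreate()  # `sql` is assumed to be a SparkSession
df = pd.DataFrame({
    'Matricule': [['TKI1'], ['TKI4']],
})
sparkDF = sql.createDataFrame(df)
sparkDF.show()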
+---------+
|Matricule|
+---------+
|   [TKI1]|
|   [TKI4]|
+---------+
sparkDF.printSchema()
root
 |-- Matricule: array (nullable = true)
 |    |-- element: string (containsNull = true)
Get Item
sparkDF = sparkDF.withColumn('Matricule_string',F.col('Matricule').getItem(0))
sparkDF.show()
+---------+----------------+
|Matricule|Matricule_string|
+---------+----------------+
|   [TKI1]|            TKI1|
|   [TKI4]|            TKI4|
+---------+----------------+
