Python Script to convert Json to CSV file

chvsnarayana · Apr-25-2023, 04:48 PM

Hi All,

We have one requirement to convert Json to CSV file. In json file, values are there for all the columns, some columns are missing but when data populated in CSV file for those values missing columns should have blank/null, next column values should not overwrite.

Json Sample data:
{"results":[{"fruit": "Apple","size": "Large","color": "Red"},{"fruit": "Banana","color": "Yellow"},{"fruit": "Watermelon","size": "Large"},{"fruit": "Orange","color": "Orange"}]}

Expected format:
Fruit,Size,Color
Apple,Large,Red
Banana,,Yellow
Watermelon,Large,
Orange,,Orange

Require your help in script preparation. Whether we can handle all the columns/Headings while reading data from json file or do we need to handle all the columns/headings while populating data in csv file...Please advise

Thanks in advance

Axel_Erfurt · Apr-25-2023, 05:10 PM

Show what you tried and what error message you got.

**buran** · Apr-25-2023, 05:11 PM

Do you know all fields in advance or do you need to handle them dynamically?

**deanhystad** · (This post was last modified: Apr-25-2023, 07:43 PM by deanhystad.)

Can you use pandas? Using pandas this is a few lines of code.

I don't understand this:

Output:
Whether we can handle all the columns/Headings while reading data from json file

You don't read a json file, you load it. The json.load(file) command reads the entire json and returns Python objects. In this case it returns a dictionary containing a list of dictionaries. You'll have to scan through the list to find all the keys. The keys will be the column headings for your csv file. Then you'll have to unpack the dictionaries into some kind of table, or maybe just write directly to the csv file. Either way you'll have to provide a filler when a dictionary does not have all the heading keys.

DeaD_EyE · (This post was last modified: Apr-26-2023, 07:39 AM by DeaD_EyE.)

You should use first csv.

import csv
import io

results = {
    "results": [
        {"fruit": "Apple", "size": "Large", "color": "Red"},
        {"fruit": "Banana", "color": "Yellow"},
        {"fruit": "Watermelon", "size": "Large"},
        {"fruit": "Orange", "color": "Orange"},
    ]
}

fields = tuple(results["results"][0])

with io.StringIO() as fake_file:
    writer = csv.DictWriter(fake_file, fieldnames=fields)
    writer.writeheader()
    for row in results["results"]:
        writer.writerow(row)

    print(fake_file.getvalue())

The shorter version uses the method writerows, where the for-loop is inside the csv-module:

import csv
import io

results = {
    "results": [
        {"fruit": "Apple", "size": "Large", "color": "Red"},
        {"fruit": "Banana", "color": "Yellow"},
        {"fruit": "Watermelon", "size": "Large"},
        {"fruit": "Orange", "color": "Orange"},
    ]
}

fields = tuple(results["results"][0])
with io.StringIO() as fake_file:
    writer = csv.DictWriter(fake_file, fieldnames=fields)
    writer.writeheader()
    writer.writerows(results["results"])

    print(fake_file.getvalue())

Output:

Output:fruit,size,color
Apple,Large,Red
Banana,,Yellow
Watermelon,Large,
Orange,,Orange

**buran** · (This post was last modified: Apr-26-2023, 08:31 AM by buran.)

(Apr-26-2023, 07:39 AM)DeaD_EyE Wrote: fields = tuple(results["results"][0])

Note, it could be just coincidence that first element has all possible keys. That's why I asked if OP knows full list of field names in advance. Otherwise they need to loop over all dicts and combine keys. In this case it's also important to clarify if the order of the field names in the csv matters

DeaD_EyE · (This post was last modified: Apr-26-2023, 02:37 PM by DeaD_EyE.)

If each data point has different keys, you can collect them:

#modified example data

results = {
    "results": [
        {"fruit": "Apple", "size": "Large", "color": "Red"},
        {"fruit": "Banana", "color": "Yellow"},
        {"fruit": "Watermelon", "size": "Large", "humidity": 0.9},
        {"fruit": "Orange", "color": "Orange", "diameter": 33.5},
    ]
}

# getting all keys from all results and keeping the insertion order

fields = tuple(dict.fromkeys([key for row in results["results"] for key in row]))

The nested list comprehension as nested for-loops:

fields = {}
for row in results["results"]:
    for key in row:
        print(f"Adding {key} to fields.")
        fields[key] = None
    print("Next row", end="\n\n")

print()
print("Fields before it's converted to a list or tuple")
print(fields)
print()
print("After conversion to a tuple")
fields = tuple(fields)
print(fields)

If you don't care about order, you could use a set instead. If you want to sort the fields, it's also possible.

sorted_fields = sorted(set(key for result in results["results"] for key in result.keys()))

Or as a for-loop:

fields = set()
for result in results["results"]:
    for key in result:
        fields.add(key)
fields = sorted(fields)

**deanhystad** · Apr-26-2023, 03:22 PM

Though this is not posted in the Homework Forum, it is obviously a homework question.

DeaD_EyE · Apr-26-2023, 10:31 PM

Oh... the good part is, that it's not the complete solution.
Reading and writing is not included.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Best way to feed python script of a file	absolut	6	1,144	Jan-11-2025, 07:03 AM Last Post: Gribouillis
	Convert Json to table format	python_student	4	14,657	Dec-05-2024, 04:32 PM Last Post: Larz60+
	JSON File - extract only the data in a nested array for CSV file	shwfgd	2	1,085	Aug-26-2024, 10:14 PM Last Post: shwfgd
	Trying to generating multiple json files using python script	dzgn989	4	2,286	May-10-2024, 03:09 PM Last Post: deanhystad
	encrypt data in json file help	jacksfrustration	1	2,244	Mar-28-2024, 05:16 PM Last Post: deanhystad
	[SOLVED] Correct way to convert file from cp-1252 to utf-8?	Winfried	8	10,148	Feb-29-2024, 12:30 AM Last Post: Winfried
	parse json field from csv file	lebossejames	4	2,033	Nov-14-2023, 11:34 PM Last Post: snippsat
	Convert File to Data URL	michaelnicol	3	2,689	Jul-08-2023, 11:35 AM Last Post: DeaD_EyE
	Is there a .bat DOS batch script to .py Python Script converter?	pstein	3	8,125	Jun-29-2023, 11:57 AM Last Post: gologica
	Loop through json file and reset values [SOLVED]	AlphaInc	2	5,490	Apr-06-2023, 11:15 AM Last Post: AlphaInc

Python Script to convert Json to CSV file

User Panel Messages

Announcements