Merge JSON files prioritizing the updated values from most recent file

nebulae · Apr-17-2019, 10:15 AM

Hi folks!

I have series of JSON files similar to example below. "update" in "header" is the file creation timestamp. "hours" are timestamps for the values "x" and "y".

I would like to combine them in a single CSV file to import into Excel. The problem is most of the files contain updated "x" and "y" values in comparison to preceding file, e.g. first two timestamps are same as the last two timestamps from previous file, but "x" and "y" values were updated. So for the same timestamps there are updated more accurate "x" and "y" values.

With my limited knowledge I have tried to write a script which is ignoring older values from older file, comparing the "update" timestamp. It works only when I ignore "x" and "y" values and it plots series of hours.

Without further investigation why it does not work properly, I would like to ask to guide me to choose the right approach. I am sure there are more convenient ways doing it.

Thanks!

{
"header":{
"update":1555054504000
},
"data":{
"hours":[
1555038000000,
1555048800000,
1555059600000,
1555070400000,
1555081200000,
1555092000000,
1555102800000,
1555113600000
],
"x":[
241.095130810609,
235.6587698951538,
234.52988957999375,
238.14886341887896,
240.9792842156129,
234.37616327308106,
236.4281670519553,
239.34914914407685
],
"y":[
273.9192290114759,
271.7583893617311,
270.7841492576362,
277.412376380971,
279.51083939292204,
280.7639255517393,
277.92215250624633,
272.7410417669065
]
}
}

#! /usr/bin/python3
import os, json

path_to_json = 'data/'
json_files = [pos_json for pos_json in os.listdir(path_to_json) if pos_json.endswith('.json')]

files = list(enumerate(json_files))

def appendJsonContent(fileNumber):
    information = []
    fileName = files[fileNumber][1]
    with open(os.path.join(path_to_json, fileName)) as jsonFile:
        json_text = json.load(jsonFile)
        for e, hours in enumerate(json_text['data']['hours']):
            x = json_text['data']['x'][e]
            y = json_text['data']['x'][e]
            information.append(str(hours) + " | " + str(x) + " | " + str(y))
    return information

# return the number of values which is different from the second list
def uniqueDataNumber(i):
    firstSetData = appendJsonContent(i)
    secondSetData = appendJsonContent(i+1)
    mergedSetData = list(set(firstSetData + secondSetData))
    return len(mergedSetData)-len(firstSetData)

for index in range(0, len(files), 2):
    for e in range(uniqueDataNumber(0)):
        print(appendJsonContent(index)[e])
    for e in range(uniqueDataNumber(1)):
        print(appendJsonContent(index+1)[e])

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Write a dictionary with arrays as values into JSON format	paul18fr	3	5,781	Oct-20-2021, 10:38 AM Last Post: buran
	Python - Merge existing cells of Excel file created with xlsxwriter	manonB	0	3,772	Mar-10-2021, 02:17 PM Last Post: manonB
	HELP! Importing json file into csv into jupyter notebook	vilsef	2	2,616	Jan-22-2021, 11:06 AM Last Post: snippsat
	JSON file Loading issue	punna111	4	8,675	Jun-29-2020, 08:07 AM Last Post: buran
	Loading multiple JSON files to create a csv	0LI5A3A	0	2,132	Jun-28-2020, 10:35 PM Last Post: 0LI5A3A
	Indirectlty convert string to float in JSON file	WBPYTHON	6	6,029	May-06-2020, 12:09 PM Last Post: WBPYTHON
	Help batch converting .json chosen file to MySQL	BrandonKastning	2	2,384	Mar-14-2020, 09:19 PM Last Post: BrandonKastning
	Pandas merge csv files	karlito	2	3,225	Dec-16-2019, 10:59 AM Last Post: karlito
	save my sensor data from the bme680 into a json or csv file	Plastefuchs84	1	3,176	Aug-23-2019, 03:04 AM Last Post: Plastefuchs84
	More recent package/library for difflib?	twinpiques	2	8,302	Jul-12-2019, 01:34 AM Last Post: twinpiques

Merge JSON files prioritizing the updated values from most recent file

User Panel Messages

Announcements