"Dilute" JSON as a template?

I have a JSON with over 18,000 lines and would like to thin out the arrays so that in the end each JSON attribute is represented only once, so that I have this JSON in "simple" form, so that the creation of graphs is not so cumbersome and the creator does not have to feel through the thousands of lines first.

Is there a generator, code snippet or something similar that can make this easier for me?

(1 votes)
Loading...

Similar Posts

Subscribe
Notify of
9 Answers
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Erzesel
1 year ago

Are you sure your JSON looks like that? Is that supposed to be an array of objects? An object which contains several times the same property “Mattri” is not possible.

In your case, when converting into an object, only in each step of the Property Mattri would the currently object array [ {“Attribute”: “Value1”}, {“Attribute”: “Value2”}] assigned. see Powershelldemo:

$TestJson= '
{
  "MAttri":[
    {"Attribut": "Wert1"},
    {"Attribut": "Wert2"}],
   "MAttri":[
    {"Attribut": "Wert1"},
    {"Attribut": "Wert2"}],
   "MAttri":[
    {"Attribut": "Wert1"},
    {"Attribut": "Wert3"}]
}
'
$TestJson|ConvertFrom-Json

As a result, a shoe would be:

$testJSON='
[
    {
        "Matrix":  [
                      {
                          "Attribut":  "Wert1"
                      },
                      {
                          "Attribut":  "Wert7"
                      }
                  ]
    },
    {
        "Matrix":  [
                      {
                          "Attribut":  "Wert1"
                      },
                      {
                          "Attribut":  "Wert7"
                      }
                  ]
    },
    {
        "Matrix":  [
                      {
                          "Attribut":  "Wert2"
                      },
                      {
                          "Attribut":  "Wert7"
                      }
                  ]
    },
    {
        "Matrix":  [
                      {
                          "Attribut":  "Wert1"
                      },
                      {
                          "Attribut":  "Wert5"
                      }
                  ]
    },
    {
        "Matrix":  [
                      {
                          "Attribut":  "Wert9"
                      },
                      {
                          "Attribut":  "Wert2"
                      }
                  ]
    },
    {
        "Matrix":  [
                      {
                          "Attribut":  "Wert1"
                      },
                      {
                          "Attribut":  "Wert7"
                      }
                  ]
    }
]
'
$testOpjArr=$testJSON|ConvertFrom-Json


Write-Host 'Ausgangsobject' -fo Green
$testOpjArr|fl *
#$testOpjArr |Sort-Object { $_.Matrix.Attribut }|fl *
Write-Host 'Alle Duplicate entfernt' -fo Green
$testOpjArr |Sort-Object { $_.Matrix.Attribut } -unique|fl *
$testOpjArr |Sort-Object { $_.Matrix.Attribut } -unique|ConvertTo-JSON

…that all duplicates will fly out.

You just want to destroy the following duplicates

Write-Host 'Aufeinander folgende Duplicate  entfernt' -fo Green
$cleanedObjArr = $testOpjArr|%{$last=$Null}{
      #das vergleichen von Objecten ist immer Pain in the Ass
      #billiger  Powershelltrick: einen vergleichsfähigen Output in einen String speichern
      #bei diesem billigen object reicht Format-List Output
    $CurredCompStr =$_|fl|Out-String
    $LastCompStr =$Last|fl|Out-String
     #...und die  strings  vergleichen
    if ($CurredCompStr -ne $LastCompStr){
        $_  #object  zürckgeben ...oder  nicht
    }
    $last=$_ #vergleichsobject merken
}
$cleanedObjArr|fl *
$cleanedObjArr|ConvertTo-JSON

instead of my static JsonStrings it is cheap to import from/in a file: https://devblogs.microsoft.com/scripting/powertip-convert-json-file-to-powershell-object/

Otherwise, it is possible to play out the power of Powershell’s string operations and to disassemble any Json as a string to compare and resume the parts.

but no longer today…😴

Erzesel
1 year ago

to the quick way through brutal stringacrobatics

$TestJson= '
{
  "MAttri":[
  {"Attribut": "Wert1"},
  {"Attribut": "Wert2"}],
  "MAttri":[
  {"Attribut": "Wert1"},
  {"Attribut": "Wert2"}],
  "MAttri":[
  {"Attribut": "Wert1"},
  {"Attribut": "Wert3"}]
}
'

 # setze um "zerstörungsfrei"  zu splitten, einen markanten Delimite vor "MAttri"
 #der letzte block soll aussehn wie die andern, also ein Komma dran und eiene Splitmarkierung
$TestJson=$TestJson -replace '("MAttri")','SplittMarker$1' -replace '(\}\])(\r\n)','$1,$2SplittMarker'
$SplitedTxt = $TestJson -split 'SplittMarker'
 #entferne aufeinanderfolgende Dupletten
 #selct -uniqe funktioniert nur mit primitiven Datentypen
($SplitedTxt|Select -unique) -join '' -replace '(\}\]),(\r\n\})','$1$2' #wieder einen String draus machen und  das Letzte Komma entfernen

…but this is less flexible than to act with objects and convert them

The Delimiter alone needs some RegEX art (there is already enough an inconspicuous space and the thing flies around your ears)

jo135
1 year ago

The usual all-purpose weapon for JSON is jq.

I can’t give you an ad hoc command for exactly your task, but a creative Google search could bring you (“jq remove duplicate object members” or the like)

apophis
1 year ago

I like to use ChatGPT for this.
For such annoying tasks, which may last unnecessarily long as tables, rewrite values or customize Jsons, this is really good.

Of course, in the end, you have to check if everything is as you want.
But generally it works pretty well.

apophis
1 year ago

Hoppla, the part I’ve read over smoothly and rather paid attention to the example in demand. ^^