Look for the conventions used by XML, YAML, JSON and the like. Check that they have not been altered by any subsequent processing.
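A rough first pass at spotting those conventions can be sketched in shell. The function name guess_format is made up for this note, and the rules are only heuristics on the first non-blank line: a YAML flow sequence such as [a, b] would be reported as json, for example, so treat the answer as a hint rather than a verdict.

```shell
# guess_format FILE: print a rough guess (xml, json, yaml or unknown).
# Hypothetical helper; it only inspects the first non-blank line of FILE.
guess_format() {
    # awk prints the first line that has any non-blank field,
    # with leading spaces and tabs stripped, then stops.
    case "$(awk 'NF { sub(/^[ \t]+/, ""); print; exit }' "$1")" in
        '<'*)                   echo xml ;;      # <?xml ...?> or a root tag
        '{'*|'['*)              echo json ;;     # object or array opener
        '---'*|*': '*|*':')     echo yaml ;;     # document marker or "key: value"
        *)                      echo unknown ;;
    esac
}
```

Calling guess_format unknown prints one word. If the guess is json but jq then refuses to parse the file, that is a hint that later processing has changed it.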
Extract fields from JSON with jq.
cat unknown | jq '.'
cat unknown | jq '.journal'
cat unknown | jq '.journal[1]'
cat unknown | jq '.journal[] | .type'
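To see what these filters return, here is a small sketch run against made-up data. The journal entries and their fields are hypothetical; the real file may be shaped differently. Note that jq array indices start at 0, so .journal[1] is the second entry. The -c flag prints compact one-line JSON and -r prints raw strings without quotes.

```shell
# Hypothetical sample data; the structure of a real file may differ.
sample=$(mktemp)
cat > "$sample" <<'EOF'
{"journal": [{"type": "login", "user": "alice"},
             {"type": "logout", "user": "alice"}]}
EOF

# Second journal entry (indices start at 0), as compact JSON.
second=$(jq -c '.journal[1]' "$sample")

# The type of every journal entry, one per line, without JSON quotes.
types=$(jq -r '.journal[] | .type' "$sample")

printf '%s\n' "$second" "$types"
rm -f "$sample"
```

With this sample the output is the logout entry followed by the two type values, login and logout, each on its own line.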