i'm not 100% sure about this but i am starting to think that the way #jsonld context declarations propagate by default is generally an anti-pattern

jens@social.finkhaeuser.de

@trwnh Or to go back to a previous phrasing: you think in terms of knowledge graphs. In JSON, the graph *is* the structure of the serialization.

You cannot make JSON understand a knowledge graph *unless* you map it into the graph structure that JSON understands. Anything else is futile.

trwnh@mastodon.social

@jens i’m not “complaining”, and it’s not even about LD per se; it’s about the boundary between resources. what you call “two different structures” can be combined without regard for semantics. i just want to preserve semantics.

if the JSON vs JSONLD thing is tripping you up, then consider an example where someone dumps the json to a plaintext string and tries to do an even more naive string find-and-replace. or consider an example where someone manipulates the raw bytes

trwnh@mastodon.social

@jens no offense taken, btw!

trwnh@mastodon.social

@jens if the argument is “they can’t be combined while preserving JSONLD semantics”, i would argue that they can. if anything, they can’t be combined while preserving *JSON* semantics, because nesting a JSON object under a certain key in the document fundamentally alters the semantics of that nested object (if i understand you correctly)

trwnh@mastodon.social

@jens the mere act of nesting erases the boundary between parent and child; the parent’s semantics always override whatever semantics the child had.

is that correct?

jens@social.finkhaeuser.de

@trwnh It's a weirdly LD way of putting it.

In what I call structured data now, the semantics are derived from the position in the data graph, i.e. the structure, and whichever definitions are associated with this structure.

If we take e.g. https://schema.org/Person 's example 4, ".children.name" has semantics derived from the fact that ".children" has the definition "must also be a Person", not because ".children.@type" has a value of "Person". That field is, from a structured data point of...

jens@social.finkhaeuser.de

@trwnh ... view, pretty much superfluous.

It's not *entirely* superfluous if you consider that Any or Variant definitions may exist, in which case you need some type specifier. But it's superfluous in this case, because the textual definition of the meaning of ".children" is unambiguous: https://schema.org/children

The contained object must be Person.

So rather than saying that the parent *overrides*, it's better to state that the parent determines what is permissible.

But that also means...

jens@social.finkhaeuser.de

@trwnh ... that ".children" and ".follows" do not have the same semantics. Yes, the specs both state they're Person, but those aren't the semantics really. The semantics are that one Person is a child of the enclosing Person, and the other is a "uni-directional social relation" (https://schema.org/follows).

So moving those objects elsewhere in the graph modifies the semantics.

This is what LD doesn't really "get", because the knowledge graph is built (can be built) out of a flat list of triples.
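
The ".children" versus ".follows" point can be illustrated with a rough Python sketch. The `PROPERTY_DEFINITIONS` table and the `interpret` helper are hypothetical stand-ins for the textual schema.org definitions, not anything schema.org ships; the only claim is that the key an object sits under, rather than its "@type", decides what the object means.

```python
# Hypothetical, hand-written stand-in for the schema.org property definitions.
PROPERTY_DEFINITIONS = {
    "children": {"range": "Person", "relation": "child of"},
    "follows": {"range": "Person", "relation": "followed by"},
}


def interpret(parent, key):
    """Derive the meaning of parent[key] from its position, not from its @type."""
    definition = PROPERTY_DEFINITIONS[key]
    nested = parent[key]
    # @type is optional here: the property definition already fixes the type.
    declared = nested.get("@type", definition["range"])
    if declared != definition["range"]:
        raise ValueError(f".{key} must contain a {definition['range']}, got {declared}")
    return (nested["name"], definition["relation"], parent["name"])


alice = {"name": "Alice", "children": {"name": "Bob"}}
print(interpret(alice, "children"))  # ('Bob', 'child of', 'Alice')

# Moving the very same object under a different key changes what it means.
moved = {"name": "Alice", "follows": alice["children"]}
print(interpret(moved, "follows"))  # ('Bob', 'followed by', 'Alice')
```

Both nested objects are a Person with identical content; only the enclosing key differs, and so does the resulting statement.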

jens@social.finkhaeuser.de

@trwnh Structured data is never this "flat", the structure is the semantics and vice versa.

So, by assuming the JSON representation is arbitrarily malleable, you're breaking the structured data model.

You can argue that sucks, FWIW, but it's nonetheless the case for all structured data: JSON, YAML, TOML, XML, SGML, CBOR, all of those (more or less) follow the structured data model (attributes in XML/SGML and to a lesser degree YAML have additional structural properties, sure).

trwnh@mastodon.social

@jens even without LD you'd still have issues if you applied plaintext merging rules on json objects dumped to strings, which is the equivalent operation at a lower level.

the thing is, in the example you give, you're staying within the schema.org vocabulary, and you're also looking at it purely from the lens of the root object (the Person that is the parent). i'm looking at it from the lens of the nested object.

if a Person's child is also a Person, then can't that child also have children?

trwnh@mastodon.social

@jens say we have a JSON object, id: foo.

".children" has an id: bar.

".children.children" has an id: baz.

it seems entirely reasonable to me to consider foo, bar, and baz to be individual JSON objects, just nested within each other. is there anything wrong with this interpretation?

in an XML tree, you can consider a subtree to also be an XML tree. maybe that subtree originated from a different XML document, like how we can include SVG in HTML.
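
As a minimal sketch of the foo / bar / baz reading above, using only Python's standard `json` module: the `nested_objects` helper is a made-up name, and the document is the hypothetical one from the post. It treats every nested object as a JSON object in its own right and records where it sits.

```python
import json

doc = json.loads("""
{"id": "foo",
 "children": {"id": "bar",
              "children": {"id": "baz"}}}
""")


def nested_objects(node, path=""):
    """Yield (path, object) for every JSON object in the tree, root included."""
    if isinstance(node, dict):
        yield path or ".", node
        for key, value in node.items():
            yield from nested_objects(value, f"{path}.{key}")
    elif isinstance(node, list):
        for item in node:
            yield from nested_objects(item, path)


found = {obj["id"]: path for path, obj in nested_objects(doc)}
print(found)  # {'foo': '.', 'bar': '.children', 'baz': '.children.children'}

# A subtree serialized on its own is still a valid JSON document.
bar = doc["children"]
print(json.dumps(bar))  # {"id": "bar", "children": {"id": "baz"}}
```

Whether "bar" keeps its own semantics once it is nested under "foo" is exactly what the two of them are debating; the sketch only shows that the structural reading is mechanically possible.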

trwnh@mastodon.social

@jens the crossover from "HTML semantics" to "SVG semantics" is invisible to most people who aren't aware of the distinction, but that doesn't mean it doesn't exist.

jens@social.finkhaeuser.de

@trwnh Yes, you would have the same issues, but there's a semantic (haha) difference:

Lower level processors cannot make assumptions about higher level processors, without being told to make them. So a string processor cannot really perform any modifications to the string, unless they're told it's fine to do them.

Similarly, a JSON processor cannot make modifications to the document structure, unless they're told that's fine.

Here's the rub: in the example you were giving a while back, of a...

jens@social.finkhaeuser.de

@trwnh ... JSON processor resolving JSON references and replacing them with the objects they refer to, this sits somewhere on an intermediate layer between JSON and JSON-LD.

If the processor "speaks" references, then according to any and all rules about processing them, this is fine.

It isn't doing anything wrong, because the data structure derived from parsing the doc-with-refs-preserved and the one derived from parsing doc-with-refs-resolved is the same.

In that sense, it isn't changing...

jens@social.finkhaeuser.de

@trwnh ...anything at all about the represented data.

If this changes anything about the semantics of the LD, then JSON-LD shouldn't be using JSON references (brutally put, I know), because it knowingly breaks the underlying JSON semantics.
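
A rough sketch of the reference-resolving processor described here, under some loud assumptions: references are written as `{"$ref": "<id>"}` objects (the JSON Reference convention), `DOCUMENTS` is a hypothetical in-memory store standing in for HTTP fetches, and cyclic references are not handled. It shows that, for a processor that "speaks" references, the doc-with-refs-preserved and the doc-with-refs-resolved yield the same data structure.

```python
import copy

# Hypothetical document store keyed by id; a real processor would fetch over HTTP.
DOCUMENTS = {
    "https://example.com/bar": {"id": "https://example.com/bar", "name": "Bar"},
}


def resolve_refs(node, documents):
    """Return a copy of node with every known {"$ref": id} replaced by its target."""
    if isinstance(node, dict):
        if set(node) == {"$ref"} and node["$ref"] in documents:
            target = copy.deepcopy(documents[node["$ref"]])
            return resolve_refs(target, documents)  # no cycle detection in this sketch
        return {key: resolve_refs(value, documents) for key, value in node.items()}
    if isinstance(node, list):
        return [resolve_refs(item, documents) for item in node]
    return node


with_refs = {"id": "foo", "children": {"$ref": "https://example.com/bar"}}
inlined = {"id": "foo", "children": {"id": "https://example.com/bar", "name": "Bar"}}

# Both forms produce the same structure once references are understood.
print(resolve_refs(with_refs, DOCUMENTS) == resolve_refs(inlined, DOCUMENTS))  # True
```

Nothing in this layer knows about contexts or vocabularies, which is where the disagreement in the thread starts.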

jens@social.finkhaeuser.de

@trwnh Since we're working with plain text analogies here, this is like putting non-ASCII bytes into a string without specifying anything about what they mean (i.e. an encoding).

trwnh@mastodon.social

@jens we might finally be on the same page

> Lower level processors cannot make assumptions about higher level processors, without being told to make them.

this is what i was trying to do, yeah. how can a lower level processor preserve higher level semantics? what do you need to "tell" them to retain alignment?

the references are kind of unavoidable, since this is the web we're talking about. you have multiple documents, but naive processors try to be "clever" and save you HTTP requests.

trwnh@mastodon.social

@jens the "rub" is that AS2 claims to be "JSON-LD but you can totally ignore the LD part we promise lol"

you can *mostly* ignore a lot of the complexity, but only if you stay within the boundaries of activity+json semantics, i.e. "don't use external vocabs, and if you do, don't redefine activitystreams terms"

the extensibility mechanism is ld+json. you can include ld+json inside activity+json but you need to not violate the constraints of activity+json.

trwnh@mastodon.social

@jens but also, any json can become ld+json with a sufficiently complex context definition. the ld+json processor cannot understand the implicit semantics of json; it needs to be made explicit in the form of a jsonld context.

we want the jsonld context's described semantics to not diverge from the *actual* semantics. we can't be aware of every vocab or media type in the world. so the semantic boundaries between individual resources are important.

but we do want to reduce those pesky roundtrips

trwnh@mastodon.social

@jens this is why i put "clever" in quotation marks. like, yes, that's just how references work, no problems there.

the problem is that you are taking something that isn't activity+json and sticking it in an activity+json document.

more precisely, the media type no longer applies to the *entire* document; there is a fragment of the document that has a different media type (like the hypothetical schemadotorg+json i was talking about earlier)
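
To make the boundary problem concrete, here is a deliberately simplified model of default context propagation. It is not the JSON-LD expansion algorithm: `expand_keys` is a made-up helper, it only handles inline dict contexts (no remote contexts, arrays, `@vocab` or `@propagate`), and it leaves unmapped terms untouched instead of dropping them. The example terms and IRIs are illustrative.

```python
AS = "https://www.w3.org/ns/activitystreams#"


def expand_keys(node, active_context=None):
    """Map each term to an IRI using the nearest context, inheriting the parent's."""
    active = dict(active_context or {})
    active.update(node.get("@context", {}))  # local definitions win over inherited ones
    expanded = {}
    for key, value in node.items():
        if key == "@context":
            continue
        iri = active.get(key, key)  # unknown terms are left as-is in this sketch
        if isinstance(value, dict):
            value = expand_keys(value, active)  # the active context propagates downward
        expanded[iri] = value
    return expanded


# Fetched on its own, the child means schema.org "name".
child = {"@context": {"name": "https://schema.org/name"}, "name": "Bar"}
print(expand_keys(child))  # {'https://schema.org/name': 'Bar'}

parent_context = {"name": AS + "name", "attachment": AS + "attachment"}

# A "clever" processor inlines the child but drops its context:
# the parent's definition of "name" silently takes over.
naive = {"@context": parent_context, "name": "Foo", "attachment": {"name": "Bar"}}
print(expand_keys(naive)[AS + "attachment"])  # {'https://www.w3.org/ns/activitystreams#name': 'Bar'}

# Keeping the child's own context preserves what it originally said.
careful = {"@context": parent_context, "name": "Foo", "attachment": child}
print(expand_keys(careful)[AS + "attachment"])  # {'https://schema.org/name': 'Bar'}
```

Even in the careful case, any term the child does not redefine would still inherit the parent's definition, which is the propagation-by-default behaviour questioned at the top of the thread.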
