Skip to content
  • Categories
  • Recent
  • Popular
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse

NodeBB-ActivityPub Bridge Test Instance

  1. Home
  2. Categories
  3. Uncategorized
  4. just like there’s a distinction between non-information resources and information resources, or between binary resources and text resources, maybe there should be a distinction between descriptors documents and content documents

just like there’s a distinction between non-information resources and information resources, or between binary resources and text resources, maybe there should be a distinction between descriptors documents and content documents

Scheduled Pinned Locked Moved Uncategorized
15 Posts 2 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • trwnh@mastodon.socialT trwnh@mastodon.social

    just like there’s a distinction between non-information resources and information resources, or between binary resources and text resources, maybe there should be a distinction between descriptors documents and content documents

    trwnh@mastodon.socialT This user is from outside of this forum
    trwnh@mastodon.socialT This user is from outside of this forum
    trwnh@mastodon.social
    wrote on last edited by
    #2

    in other words maybe embedding content alongside metadata is actually mixing concerns

    trwnh@mastodon.socialT 1 Reply Last reply
    0
    • trwnh@mastodon.socialT trwnh@mastodon.social

      in other words maybe embedding content alongside metadata is actually mixing concerns

      trwnh@mastodon.socialT This user is from outside of this forum
      trwnh@mastodon.socialT This user is from outside of this forum
      trwnh@mastodon.social
      wrote on last edited by
      #3

      it might not be a problem with smaller content, like say an as:Note, but stuffing the full html structured contents of an entire article into even rss or atom seems like it could get out of hand really quickly. this is why feeds are often limited to 10-20 items or otherwise only include a summary, right?

      so maybe it makes sense to treat even text content as a separate thing, just like we do with binary resources.

      trwnh@mastodon.socialT 1 Reply Last reply
      0
      • trwnh@mastodon.socialT trwnh@mastodon.social

        it might not be a problem with smaller content, like say an as:Note, but stuffing the full html structured contents of an entire article into even rss or atom seems like it could get out of hand really quickly. this is why feeds are often limited to 10-20 items or otherwise only include a summary, right?

        so maybe it makes sense to treat even text content as a separate thing, just like we do with binary resources.

        trwnh@mastodon.socialT This user is from outside of this forum
        trwnh@mastodon.socialT This user is from outside of this forum
        trwnh@mastodon.social
        wrote on last edited by
        #4

        html kinda doesn’t make this distinction. there’s a head-body split but that’s not the same as a metadata-content split. you can embed metadata into body content just as equally as you can embed it in head tags (example: rdfa)

        trwnh@mastodon.socialT 1 Reply Last reply
        0
        • trwnh@mastodon.socialT trwnh@mastodon.social

          html kinda doesn’t make this distinction. there’s a head-body split but that’s not the same as a metadata-content split. you can embed metadata into body content just as equally as you can embed it in head tags (example: rdfa)

          trwnh@mastodon.socialT This user is from outside of this forum
          trwnh@mastodon.socialT This user is from outside of this forum
          trwnh@mastodon.social
          wrote on last edited by
          #5

          i guess this is basically the distinction between embedded metadata and sidecar metadata, is what i was trying to get at

          trwnh@mastodon.socialT 1 Reply Last reply
          0
          • trwnh@mastodon.socialT trwnh@mastodon.social

            i guess this is basically the distinction between embedded metadata and sidecar metadata, is what i was trying to get at

            trwnh@mastodon.socialT This user is from outside of this forum
            trwnh@mastodon.socialT This user is from outside of this forum
            trwnh@mastodon.social
            wrote on last edited by
            #6

            what i’m thinking is that sidecar metadata can be stored 1:1 or 1:n — if it’s 1:1 you might as well embed it if you can, either as some kind of frontmatter or inline with rdfa. but having frontmatter means every single processor that touches your content needs to be aware of the existence of that frontmatter (and strip it). so frontmatter isn’t as portable as i would like. basically a document with frontmatter is no longer that content type; it is a new media type for each combination.

            trwnh@mastodon.socialT 1 Reply Last reply
            0
            • trwnh@mastodon.socialT trwnh@mastodon.social

              what i’m thinking is that sidecar metadata can be stored 1:1 or 1:n — if it’s 1:1 you might as well embed it if you can, either as some kind of frontmatter or inline with rdfa. but having frontmatter means every single processor that touches your content needs to be aware of the existence of that frontmatter (and strip it). so frontmatter isn’t as portable as i would like. basically a document with frontmatter is no longer that content type; it is a new media type for each combination.

              trwnh@mastodon.socialT This user is from outside of this forum
              trwnh@mastodon.socialT This user is from outside of this forum
              trwnh@mastodon.social
              wrote on last edited by
              #7

              example: markdown is text/markdown but if you add frontmatter it is now something different. but there isn’t a standard type for this; instead, every application implements frontmatter parsing independently. there isn’t consensus on the delimiter or on the format. the definition of a new media type should include the delimiter and the format; for example, “delimit with three dashes and serialize frontmatter as yaml” or “delimit with three pluses and serialize frontmatter as toml”

              trwnh@mastodon.socialT 1 Reply Last reply
              0
              • trwnh@mastodon.socialT trwnh@mastodon.social

                example: markdown is text/markdown but if you add frontmatter it is now something different. but there isn’t a standard type for this; instead, every application implements frontmatter parsing independently. there isn’t consensus on the delimiter or on the format. the definition of a new media type should include the delimiter and the format; for example, “delimit with three dashes and serialize frontmatter as yaml” or “delimit with three pluses and serialize frontmatter as toml”

                trwnh@mastodon.socialT This user is from outside of this forum
                trwnh@mastodon.socialT This user is from outside of this forum
                trwnh@mastodon.social
                wrote on last edited by
                #8

                earlier i said that html’s head-body split is not the same as the metadata-content split i am after; after some further thought, this isn’t really true. i think what i am trying to model here is a way to be able to detect and handle arbitrary header data, by unwrapping it to get at the body content. but i’m realizing that this body content may itself have its own nested headers and body…

                trwnh@mastodon.socialT 1 Reply Last reply
                0
                • trwnh@mastodon.socialT trwnh@mastodon.social

                  earlier i said that html’s head-body split is not the same as the metadata-content split i am after; after some further thought, this isn’t really true. i think what i am trying to model here is a way to be able to detect and handle arbitrary header data, by unwrapping it to get at the body content. but i’m realizing that this body content may itself have its own nested headers and body…

                  trwnh@mastodon.socialT This user is from outside of this forum
                  trwnh@mastodon.socialT This user is from outside of this forum
                  trwnh@mastodon.social
                  wrote on last edited by
                  #9

                  more precisely there is a format to the header data and there is a format to the body content

                  an http request/response can be serialized as a text file which has http headers and http body, and then that http body can be of a certain content type like html, which itself has html headers and html body. the html body content is often also of type html

                  you can progressively wrap or unwrap “body content” with “header data” in different formats. i’m not sure how best to describe this…

                  trwnh@mastodon.socialT 1 Reply Last reply
                  0
                  • trwnh@mastodon.socialT trwnh@mastodon.social

                    more precisely there is a format to the header data and there is a format to the body content

                    an http request/response can be serialized as a text file which has http headers and http body, and then that http body can be of a certain content type like html, which itself has html headers and html body. the html body content is often also of type html

                    you can progressively wrap or unwrap “body content” with “header data” in different formats. i’m not sure how best to describe this…

                    trwnh@mastodon.socialT This user is from outside of this forum
                    trwnh@mastodon.socialT This user is from outside of this forum
                    trwnh@mastodon.social
                    wrote on last edited by
                    #10

                    how can we generalize this header+content pattern, basically

                    i’m fairly sure you need to at least define header type, delimiter type, content type

                    example:
                    - header = toml
                    - delimiter = +++ to start, +++ to end
                    - content = html

                    is this enough to describe a canonical data format?

                    trwnh@mastodon.socialT 1 Reply Last reply
                    0
                    • trwnh@mastodon.socialT trwnh@mastodon.social

                      how can we generalize this header+content pattern, basically

                      i’m fairly sure you need to at least define header type, delimiter type, content type

                      example:
                      - header = toml
                      - delimiter = +++ to start, +++ to end
                      - content = html

                      is this enough to describe a canonical data format?

                      trwnh@mastodon.socialT This user is from outside of this forum
                      trwnh@mastodon.socialT This user is from outside of this forum
                      trwnh@mastodon.social
                      wrote on last edited by
                      #11

                      side note: i wish there was a distinction between html content and a full html document… if you try to render html content in a browser and it isn’t a full html document, weird things might happen

                      trwnh@mastodon.socialT 1 Reply Last reply
                      0
                      • trwnh@mastodon.socialT trwnh@mastodon.social

                        side note: i wish there was a distinction between html content and a full html document… if you try to render html content in a browser and it isn’t a full html document, weird things might happen

                        trwnh@mastodon.socialT This user is from outside of this forum
                        trwnh@mastodon.socialT This user is from outside of this forum
                        trwnh@mastodon.social
                        wrote last edited by
                        #12

                        revisiting: i discovered the iana media type multipart/mixed which could basically be this, just with a little modification https://www.iana.org/assignments/media-types/#multipart

                        the thing is the "boundary" parameter in multipart media types expects to be concatenated to a -- so you can't express the typical --- or +++ without problems (a markdown horizontal rule --- would get parsed as a multipart boundary)

                        still there's probably some inspiration to be had there, you could define an application/subtype that does similar

                        trwnh@mastodon.socialT oblomov@sociale.networkO 2 Replies Last reply
                        0
                        • trwnh@mastodon.socialT trwnh@mastodon.social

                          revisiting: i discovered the iana media type multipart/mixed which could basically be this, just with a little modification https://www.iana.org/assignments/media-types/#multipart

                          the thing is the "boundary" parameter in multipart media types expects to be concatenated to a -- so you can't express the typical --- or +++ without problems (a markdown horizontal rule --- would get parsed as a multipart boundary)

                          still there's probably some inspiration to be had there, you could define an application/subtype that does similar

                          trwnh@mastodon.socialT This user is from outside of this forum
                          trwnh@mastodon.socialT This user is from outside of this forum
                          trwnh@mastodon.social
                          wrote last edited by
                          #13

                          it would probably be more correct to define application/mdx or whatever (since the typical intent is to be processed by something like an MDX processor), but i haven't really looked into the particulars of doing this properly and making it modular (instead of hardcoding semantics of "toml frontmatter, --- separator, markdown body")

                          1 Reply Last reply
                          0
                          • trwnh@mastodon.socialT trwnh@mastodon.social

                            revisiting: i discovered the iana media type multipart/mixed which could basically be this, just with a little modification https://www.iana.org/assignments/media-types/#multipart

                            the thing is the "boundary" parameter in multipart media types expects to be concatenated to a -- so you can't express the typical --- or +++ without problems (a markdown horizontal rule --- would get parsed as a multipart boundary)

                            still there's probably some inspiration to be had there, you could define an application/subtype that does similar

                            oblomov@sociale.networkO This user is from outside of this forum
                            oblomov@sociale.networkO This user is from outside of this forum
                            oblomov@sociale.network
                            wrote last edited by
                            #14

                            @trwnh multipart/* types have to specify the boundary signature, so why would the classic markdown ---- be identified as one if not indicated as the boundary separator?

                            trwnh@mastodon.socialT 1 Reply Last reply
                            0
                            • oblomov@sociale.networkO oblomov@sociale.network

                              @trwnh multipart/* types have to specify the boundary signature, so why would the classic markdown ---- be identified as one if not indicated as the boundary separator?

                              trwnh@mastodon.socialT This user is from outside of this forum
                              trwnh@mastodon.socialT This user is from outside of this forum
                              trwnh@mastodon.social
                              wrote last edited by
                              #15

                              @oblomov i mean if you say boundary="-" then the separator becomes --- but your markdown content might include --- as what gets rendered into an <hr> element

                              something like

                              ```
                              ---
                              foo: bar
                              ---

                              stuff.

                              ---

                              more stuff.
                              ```

                              could get parsed as 3 parts instead of 2

                              1 Reply Last reply
                              0
                              Reply
                              • Reply as topic
                              Log in to reply
                              • Oldest to Newest
                              • Newest to Oldest
                              • Most Votes


                              • Login

                              • Login or register to search.
                              Powered by NodeBB Contributors
                              • First post
                                Last post
                              0
                              • Categories
                              • Recent
                              • Popular