
Author Topic: A grammar for HTML5  (Read 447 times)

Offline mocore

  • Hero Member
  • *****
  • Posts: 633
  • ~.~
A grammar for HTML5
« on: September 27, 2024, 05:20:32 AM »


 http://trevorjim.com/a-grammar-for-html5/

Quote
Fortunately, I have spent a great deal of time over the past ten years learning about parsing and its security aspects, and I believe I can give here a very succinct grammar for HTML5.

A few preliminaries. I will be using a variant of Backus-Naur Form (BNF) grammars, in which “.” will denote any single input character, and postfix “*” will denote zero or more repetitions of the preceding construct (Kleene closure). I will use capitalized identifiers for the nonterminals of the grammar.

Here then is the grammar of HTML 5:

    HTML5 = .*



 ;D
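
For anyone who wants to see the joke run: below is a minimal Python sketch of the quoted grammar written as a regular expression. The name `is_html5` and the sample strings are made up for illustration and do not come from the linked article. As I read it, the point is that the HTML5 parsing algorithm defines error recovery for every input, so no string is ever rejected.

```python
import re

# The quoted grammar, HTML5 = .*, written as a regular expression.
# re.DOTALL makes "." match newlines too, in line with the quote's
# definition of "." as any single input character.
HTML5 = re.compile(r".*", re.DOTALL)


def is_html5(text: str) -> bool:
    """Return True if the whole input is derivable from HTML5 = .*"""
    return HTML5.fullmatch(text) is not None


# Hypothetical inputs: well-formed markup, misnested tags,
# the empty string, and text that is not markup at all.
samples = [
    "<!DOCTYPE html><p>ok</p>",
    "<b><i>misnested</b></i>",
    "",
    "not markup\nat all",
]

for sample in samples:
    print(repr(sample), is_html5(sample))
```

Every sample is accepted, which is the whole problem: a grammar that accepts everything tells you nothing about the structure of what it accepted.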

Offline mocore

the vodka is good but the meat is rotten
« Reply #1 on: September 27, 2024, 05:36:31 AM »

this succinct rule set (.*) reminds me of a quote "the vodka is good but the meat is rotten" ( via https://en.wikipedia.org/wiki/Literal_translation#Translator's_humor )

&
"repology-updater/issues/784# Tiny core linux" https://forum.tinycorelinux.net/index.php/topic,26273.0.html

also wrt parsing it appears the above grammar could also work for repo's .tcz.info files   ;)
the current info files are intended for human consumption as plain text
not for programmatic / machine readability & processing

the latter obviously implies "trying to parse the info"
....
seems more likely to bring machine readability to info files / the repo
Yes, adopting a standard would increase machine readability.
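
To make "trying to parse the info" concrete, here is a rough Python sketch of a best-effort reader. It assumes an info file is mostly `Key: value` lines with free-text continuation lines underneath; that layout, the field names in the sample, and the function name `parse_info` are assumptions for illustration, not a documented format.

```python
import re

# Assumed layout: a field starts at column 0 as "Key:" followed by a value.
# Anything else that is not blank is treated as a continuation of the
# previous field. This is a guess at the convention, not a specification.
FIELD = re.compile(r"^([A-Za-z][A-Za-z_-]*):[ \t]*(.*)$")


def parse_info(text: str) -> dict[str, str]:
    """Best-effort split of a plain-text info file into fields."""
    fields: dict[str, str] = {}
    current = None
    for line in text.splitlines():
        match = FIELD.match(line)
        if match:
            current = match.group(1)
            fields[current] = match.group(2).strip()
        elif current is not None and line.strip():
            # Free text under a field: append it to that field.
            fields[current] += "\n" + line.strip()
    return fields


# Hypothetical sample, not copied from any real extension.
SAMPLE = """\
Title:          example.tcz
Description:    A made-up extension
Version:        1.0
Comments:       First line of free text
                second line of free text
"""

info = parse_info(SAMPLE)
print(info["Title"])
print(info["Comments"])
```

The weak spot is easy to hit: a comment line that starts at column 0 and happens to look like `Note: something` gets read as a new field. Without an agreed standard the parser is only guessing, which is not far from accepting `.*`.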