Tiny Core Linux
Off-Topic => Off-Topic - The Funnies => Topic started by: mocore on September 27, 2024, 05:20:32 AM
-
http://trevorjim.com/a-grammar-for-html5/
Fortunately, I have spent a great deal of time over the past ten years learning about parsing and its security aspects, and I believe I can give here a very succinct grammar for HTML5.
A few preliminaries. I will be using a variant of Backus-Naur Form (BNF) grammars, in which “.” will denote any single input character, and postfix “*” will denote zero or more repetitions of the preceding construct (Kleene closure). I will use capitalized identifiers for the nonterminals of the grammar.
Here then is the grammar of HTML 5:
HTML5 = .*
;D
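For anyone who wants to check the joke, here is a minimal sketch of that grammar as a Python regex. The function name `is_html5` and the sample strings are made up for illustration; `re.DOTALL` is used on the assumption that "." really means any single character, newlines included.

```python
import re

# HTML5 = .*
# "." is any single input character, so DOTALL lets it match newlines too.
HTML5 = re.compile(r".*", re.DOTALL)

def is_html5(text: str) -> bool:
    """Return True if the whole input can be derived from the grammar."""
    return HTML5.fullmatch(text) is not None

# Hypothetical inputs: well-formed, misnested, empty, and not HTML at all.
samples = [
    "<!DOCTYPE html><p>ok</p>",
    "<b><i>misnested</b></i>",
    "",
    "not html\nat all",
]
results = [is_html5(s) for s in samples]
print(results)
```

Every input is accepted, which is the point: the grammar is correct only because it rules nothing out.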
-
this succinct rule set (.*) reminds me of the quote "the vodka is good but the meat is rotten" ( via https://en.wikipedia.org/wiki/Literal_translation#Translator's_humor )
&
"repology-updater/issues/784# Tiny core linux" https://forum.tinycorelinux.net/index.php/topic,26273.0.html
also, wrt parsing, it appears the above grammar would work just as well for the repo's .tcz.info files ;)
the current info files are intended for human consumption as plain text
not for programmatic / machine readability & processing
the latter obviously implies "trying to parse the info"
....
seems more likely to bring machine readability to info files / the repo
Yes, adopting a standard would increase machine readability.
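
To make the "trying to parse the info" point concrete, here is a rough best-effort sketch in Python. The sample text, its field names, and the "Key: value with indented continuation lines" layout are assumptions for illustration only, not a description of what any real .tcz.info file is guaranteed to contain; `parse_info` is a hypothetical helper.

```python
import re

# Hypothetical sample; field names and layout are assumptions for illustration.
SAMPLE = """\
Title:          example.tcz
Description:    An example extension
Version:        1.0
Comments:       First line of free text
                continued on a second line
"""

# A line counts as a field only if it starts with "Key:" in the first column.
FIELD = re.compile(r"^([A-Za-z][\w-]*):\s*(.*)$")

def parse_info(text):
    """Best-effort split of plain text into a dict of fields."""
    fields = {}
    current = None
    for line in text.splitlines():
        m = FIELD.match(line)
        if m:
            current = m.group(1)
            fields[current] = m.group(2).strip()
        elif current is not None and line.strip():
            # Anything else is guessed to continue the previous field.
            fields[current] += "\n" + line.strip()
    return fields

info = parse_info(SAMPLE)
print(info["Version"])
```

The guesswork is the weak spot: an unindented free-text line such as a bare URL would be read as a field named "http". With plain text written for humans, the parser has to guess like this; an agreed format would remove the guessing.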