Author Topic: A grammar for HTML5 (Read 2007 times)

mocore · « **on:** September 27, 2024, 05:20:32 AM »

http://trevorjim.com/a-grammar-for-html5/

Quote

Fortunately, I have spent a great deal of time over the past ten years learning about parsing and its security aspects, and I believe I can give here a very succinct grammar for HTML5.

A few preliminaries. I will be using a variant of Backus-Naur Form (BNF) grammars, in which “.” will denote any single input character, and postfix “*” will denote zero or more repetitions of the preceding construct (Kleene closure). I will use capitalized identifiers for the nonterminals of the grammar.

Here then is the grammar of HTML 5:

HTML5 = .*

mocore · « **Reply #1 on:** September 27, 2024, 05:36:31 AM »

this succinct rule set (.*) reminds me of a quote "the vodka is good but the meat is rotten" ( via https://en.wikipedia.org/wiki/Literal_translation#Translator's_humor )

&
"repology-updater/issues/784# Tiny core linux" https://forum.tinycorelinux.net/index.php/topic,26273.0.html

also wrt parsing, it appears the above grammar could work just as well for the repo's .tcz.info files
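to make that concrete, here is a minimal Python sketch of the grammar as a regex. It assumes "." is meant to match any character including newlines (hence re.DOTALL), and the sample strings are made up for illustration, not taken from a real .tcz.info file:

```python
import re

# HTML5 = .*   ("." = any single character, "*" = zero or more repetitions)
# Assumption: "." should also match newlines, so re.DOTALL is set.
HTML5 = re.compile(r".*", re.DOTALL)


def accepts(text: str) -> bool:
    """Return True if the whole input is matched by HTML5 = .*"""
    return HTML5.fullmatch(text) is not None


# Hypothetical inputs, invented for this example.
samples = [
    "<!DOCTYPE html><p>well-formed enough",
    "<b><i>misnested</b></i>",
    "Title: example.tcz\nDescription: made-up info text\n",
    "",
]

results = [accepts(s) for s in samples]
```

every sample is accepted, which is the joke: a grammar that recognizes everything says nothing about structure, for HTML or for info files.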

Quote from: CardealRusso on December 30, 2023, 07:24:16 AM

Quote from: mocore on December 29, 2023, 10:05:11 PM
the current info files are intended for human consumption as plain text
not for programmatic / machine readability & processing

the latter obviously implies "trying to parse the info"
....
seems more likely to bring machine readability to info files / the repo
Yes, adopting a standard would increase machine readability.
