Tiny Core Linux

Off-Topic => Off-Topic - The Funnies => Topic started by: mocore on September 27, 2024, 05:20:32 AM

Title: A grammar for HTML5
Post by: mocore on September 27, 2024, 05:20:32 AM: http://trevorjim.com/a-grammar-for-html5/

Quote
Fortunately, I have spent a great deal of time over the past ten years learning about parsing and its security aspects, and I believe I can give here a very succinct grammar for HTML5.

A few preliminaries. I will be using a variant of Backus-Naur Form (BNF) grammars, in which “.” will denote any single input character, and postfix “*” will denote zero or more repetitions of the preceding construct (Kleene closure). I will use capitalized identifiers for the nonterminals of the grammar.

Here then is the grammar of HTML 5:

HTML5 = .*

;D
Title: the vodka is good but the meat is rotten
Post by: mocore on September 27, 2024, 05:36:31 AM: this succinct rule set (.*) reminds me of a quote "the vodka is good but the meat is rotten" ( via https://en.wikipedia.org/wiki/Literal_translation#Translator's_humor )

&
"repology-updater/issues/784# Tiny core linux" https://forum.tinycorelinux.net/index.php/topic,26273.0.html

also wrt parsing it appears the above grammar could also work for repo's .tcz.info files ;)
Quote from: CardealRusso on December 30, 2023, 07:24:16 AM
Quote from: mocore on December 29, 2023, 10:05:11 PM
the current info files are intended for human consumption as plain text
not for programmatic / machine readability & processing

the latter obviously implies "trying to parse the info"
....
seams more likely to bring machine readability to info fles / the reop
Yes, adopting a standard would increase machine readability.