27-06-2025, 09:26 PM
I am trying to figure out the paragraph breaks in the Starred Parags (aka Recipes) section.
I will use these terms:
That saiid, a paragraph should ideally be a bunch of consecutive lines with all of the following properties:
The following table gives some relevant statistics per page, with a tentative assignment of starlets:
As it can be seen, on page You are not allowed to view links. Register or Login to view. the counts of stars, short lines, and puffed lines match and the whole texts consists of perfect parags. On other pages there are lines which cannot be placed in perfect parags. I will have to compromise on one or more of the criteria above. Stay tuned...
I will use these terms:
- parag: short for paragraph.
- head of a parag: its first line.
- tail of a parag: its last line,
- puff: a one-legged gallows, either {p} or {f}, with or without the platform slash.
- margin: the mostly text-free space between an edge of the page and the text.
- left rail: the ideal mostly vertical and straight line that runs just to the left of the majority of lines of a page, separating the left margin from the text.
- right rail: the ideal mostly vertical and possibly wavy but fairly smooth line that runs just to the right of the ends of most lines of a page, separating the text from the right margin.
- long line: a text line that starts at the left rail and ends at or beyond the right rail.
- short line: a text line that starts at the left rail but ends well before the right rail.
- baseline: the ideal usually smooth curved line that runs just below the glyphs of a text line, excluding the tails of {y}, {m}, {l}. etc..
- linegap: the vertical distance between baselines of successive lines; which often varies over the width of the text.
- wider linegap: a line gap that is wider than normal, at least in some part of the lines (e.g. left side, right side, or middle).
- topline: an ideal line parallet to the baseline, such that the distance between the two is the height of an EVA {o} in the line's handwriting.
- midline: an ideal line parallel to the baseline and the topline, equdistant from the two.
- starlet: a star in the margin that has been assigned to a unique line, like a bullet in an item list.
That saiid, a paragraph should ideally be a bunch of consecutive lines with all of the following properties:
- P1. The first of these lines follows a short line (or is the first line in the SPS, or follows a "title");
- P2. The last of these lines is short (or is the last line of the SPS, or precedes a "title").
- P3. All lines other than the last one are long lines.
- P4. There are no puffs in any of these lines except possibly in the first of them.
- P5. The first of those lines has an assigned starlet.
- P6. None of these lines, except the first one, has an assigned starlet.
The following table gives some relevant statistics per page, with a tentative assignment of starlets:
- Stars: Number of stars in the page.
- ShLns: Number of short lines in the page
- Puffd: Number of lines that contain puffs (one-leg gallows).
- PerfP: Number of perfect parags in the page.
Code:
page ! Stars ! ShLns ! Puffd ! PerfP
------+-------+-------+-------+-------
f103r | 19 | 18 | 14 | 15
f103v | 14 | 12 | 14 | 9
f104r | 13 | 13 | 13 | 13
f104v | 13 | 13 | 8 | 11
f105r | 10 | 11 | 15 | 6
f105v | 10 | 14 | 20 | 3
f106r | 16 | 15 | 17 | 13
f106v | 14 | 16 | 16 | 14
f107r | 15 | 15 | 13 | 10
f107v | 15 | 15 | 13 | 14
f108r | 16 | 17 | 13 | 8
f108v | 16 | 5 | 8 | 1
f111r | 17 | 10 | 7 | 4
f111v | 19 | 8 | 11 | 6
f112r | 12 | 11 | 13 | 8
f112v | 13 | 15 | 14 | 12
f113r | 16 | 16 | 17 | 12
f113v | 15 | 15 | 16 | 15
f114r | 13 | 11 | 13 | 11
f114v | 12 | 11 | 12 | 9
f115r | 13 | 13 | 12 | 12
f115v | 13 | 13 | 12 | 12
f116r | 10 | 8 | 10 | 5
------+-------+-------+-------+-------
TOTAL | 324 | 295 | 301 | 223
As it can be seen, on page You are not allowed to view links. Register or Login to view. the counts of stars, short lines, and puffed lines match and the whole texts consists of perfect parags. On other pages there are lines which cannot be placed in perfect parags. I will have to compromise on one or more of the criteria above. Stay tuned...