S&H-Online Development Notes
This presentation of Science and Health is based on the text from Project Gutenberg. I first wrote a program to take the PG text and convert it to XML. I was able to write this filter program because the PG text is formatted in a generally consistent manner. There were a few formatting changes I had to make for my filter. Also, in the course of working closely with this text I found and corrected a few typos from the PG text. For comparison and verification I used several different S&H books: (1) a 1910 edition published in 1911 (printed before any changes were made to Fruitage); (2) a 1910 edition published by The Mary Baker Eddy Foundation; (3) a current printing of S&H. I also used the S&H-Online at spirituality.com.
The following are my rough notes of all the changes I made to the PG text.
Italics (using underscore markup) were spanning lines. I changed to not span lines and used html entities at the following refs:
17:2-3 17:10-11 17:14-15 21:15-16 37:30-31 73:3-4 114:29-30 361:30-31 485:2-3 492:25-26 517:3-4 525:8-9 (also: had an extra underscore) 525:9-10 564:18-19 25430 - 2nd paragraph, 3rd-4th lines of HEREDITARY DISEASE OF THE LUNGS CURED 26825 - 1st paragraph, 1st-2nd lines of LIGHT OUT OF DARKNESS
Preface starts with wrong page num.
The quote at start of Preface, "Let us get up early to the vineyards;", had a ':' but should be ';'.
Linebreaks in Fruitage don't correspond to print versions. I re-wrapped to be closer to other chapters' line length but it is still not exactly as print versions.
There is an '&' at 438:31 which should not be there.
In Glossary, 2 linebreaks were missing, separating terms from each other.
An extra period after sense:
563:1 Human sense may well marvel at discord, while, to a
diviner sense., harmony is the real and discord the unreal.
An extra linebreak before 202:27.
Line 19 should begin with "ness":
28:18 world measure aright. Even his righteous-
less and purity did not hinder men from saying: He
I hard-coded 2 <q>s in sh_ch7.txt, 190:23 and 190:28.
I hard-coded pages 498 - 500 at the end of sh_ch14.txt. Note that the <p>s may cause extra <p>s when filtered. ??? (no, I think I have that ok now)
The marginal heading for para at 135:21 is not centered, so my filter did not see it as a MH. I centered it.
Label 66:1 missing, but needed by my filter. I added it.
Converting all quotes to hard-coded <q>s.
Remove extra line break after 113:15.
Add line break before 118:6.
Add line break before 123:24.
Add line break before 313:7.
Add line break before 360:24.
Add line break before 566:15.
Add line break before 566:21.
Add line break before 66:3.
The MH for paragraph 252:31 should instead belong to paragraph 252:32.
410:22 was centered and my filter didn't know what to do with it - I left-justified it.
My filter thought the MH for 371:20 (sh_ch12.txt) was a line of text because it is so long.
The comma should be a period:
532:28 matter, In the allegory the body had been
p='92' l='9' should start a new paragraph.
p='434' l='24', at the end of the line, "test-" should be "testi-".
"Principle" was not capitalized: p='270' l='13', p='91' l='6', p='132' l='12'