home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
HTML - Publishing on the Internet
/
html_cdrom.iso
/
tools
/
html
/
windows
/
check
/
announce.csm
next >
Wrap
Internet Message Format
|
1995-02-22
|
5KB
From: churchh@uts.cc.utexas.edu (Henry Churchyard)
Subject: htmlchek - HTML Error Checker and Utilities, v4.1
Newsgroups: comp.sources.misc
Organization: The University of Texas at Austin; Austin, Texas
Keywords: WWW HTML 2.0 HTML 3.0 Arena Netscape SGML awk perl error checker search-and-replace v.4.1
Submitted-by: churchh@uts.cc.utexas.edu (Henry Churchyard)
Archive-name: htmlchek/patch02
Environment: AWK || Perl
Patch-To: htmlchek: Volume 47, Issue 48-54
This is a bugfix and update to my htmlchek HTML error checker program,
adding several minor features for greater convenience of use, and
bringing the version number to 4.1. The program checks for quite a
number of possible defects in the HTML (Hyper-Text Mark-up Language)
version 2.0 SGML files used on the World-Wide Web; it makes no claim
to understand all of SGML, but is easy and relatively simple to use,
gives lots of information (including about many stylistically bad
practices), can do local cross-reference checking and generate
rudimentary reference-dependency maps, and can be run on any platform
for which an awk or perl language interpreter is available.
The htmlchek distribution also includes a number of supplemental
utilities, including the htmlsrpl.pl HTML-aware search-and-replace
program, which uses either literal strings or regular expressions;
acts either only outside HTML/SGML tags, or only within tags; can be
restricted to operate only within and/or only outside specified
elements; and can also upper-case tag names. Other utilities are:
makemenu -- Makes simple menu for HTML files, based on each file's <TITLE>;
can also make a simple table of contents based on <H1>-<H6>
headings.
xtraclnk.pl -- Extracts links/anchors from HTML files; isolates text
contained in <A> and <TITLE> elements.
dehtml -- Removes all HTML markup, preliminary to spell check.
entify -- Replaces high Latin-1 alphabetic characters with ampersand
entities for safe 7-bit transport.
metachar -- Trivial program to protect HTML/SGML metacharacters "&<>" in
plain text that is to be included in an HTML file.
The patched 4.1 version of htmlchek is (or soon will be) available
for anonymous FTP, in convenient .tar.Z and .zip formats, at the
following URL:
ftp://ftp.cs.buffalo.edu/pub/htmlchek/
The documentation to htmlchek can be browsed over the Web, at this
location:
http://uts.cc.utexas.edu/~churchh/htmlchek.html
Changes in this release include:
Don't warn about null <TEXTAREA></TEXTAREA> element; only check for
inappropriate whitespace within elements commonly rendered as
underlined (<A> and <U>); check ordering of head tags before body tags
even in absence of explicit <head>...</head>; allow comments between
list items; only output non-numeric unquoted option values in each
file; corrected processing of HTML3 <LH>; updated HTML 3 language
definition to January 19 1995 draft; tinkered with Netscape extensions
language-definition yet again; added inline=1 command-line parameter;
added listfile=/lf= command-line parameter (especially for greater
MS-DOS convenience); allow cf= as abbreviation of configfile=;
ampersands followed by non-alphabetics generate warnings rather than
errors (so corresponding erromessage was removed from entify); added
"changed"/"unchanged" STDERR messages to htmlsrpl.pl output; added
.gif's to documentation; added awk-perl.html to documentation; added
index.html menu to documentation.
New files in this release are:
README.41 Update notes
index.html HTML version of README.40, README.41, and menu
awk-perl.html Where to obtain Awk and Perl
geterr.sh Trivial script to extract only ERROR! messages
from htmlchek output
geterwrn.sh Trivial script to extract only ERROR!/Warning!
messages from htmlchek output
___
awk.gif | .gif files used
camel.gif | in htmlchek HTML
ftp.gif | documentation
htmlchek.gif | (uuencoded as .uue
htmlchks.gif | files in the
valdhtml.gif | comp.sources.misc
warning.gif ___| Usenet distribution)
To apply the patch, cd into the htmlchek source directory, unshar
this file and type:
patch < htmlc41a.ptc
patch < htmlc41b.ptc
patch < htmlc41c.ptc
Use a uudecode program to convert the .uue files to .gif's.