CP/M

home *** CD-ROM | disk | FTP | other *** search

/ CP/M / CPM_CDROM.iso / enterprs / c128 / util / lrr.arc / lrr.doc < prev next >

Wrap

Text File | 1992-11-11 | 67KB | 1,809 lines

============================================================================= LITTLE RED READER: MS-DOS file reader for the 128 and 1571/81 drives. by Craig Bruce <csbruce@neumann.uwaterloo.ca> 1. INTRODUCTION This article presents a program that reads MS-DOS files and the root directory of MS-DOS disks. The program copies only from drive to drive without buffering file data internally. This is simpler and imposes no limits on the size of the files transferred, although it requires the use of two disk drives (or a logical drive). The user-interface code is written in BASIC and presents a full-screen file selection menu. The grunt-work code is written in assembly language and operates at maximum velosity. The Burst Command Instruction Set of the 1571/81 is used to read the MS-DOS disk blocks and the standard kernel routines are used for outputting the data. (I am an operating systems specialist, so I call it a kernEl!) Thus, the MS-DOS files must be read from a 1571 or 1581 disk drive, but the output device may be any disk drive type, the screen or a printer, or a virtual drive type such as RAMLink, RAMDrive, or RAMDOS (for the REU). It is interesting to note that the data can be read in from an MS-DOS disk faster than it can be written out to a 1571, 1581, or even a RAMDOS file. A RAMLink can swallow the data only slightly faster than it can be read. Little Red Reader (LRR) supports double density 3.5" disks formatted with 80 tracks, 9 sectors per track, and 2 sides with a 1581 and 5.25" double density disks formatted with 40 tracks, 9 sectors per track, and 2 sides with a 1571. A limit of 128 directory entries and 3 File Allocation Table (FAT) sectors is imposed. There must be 2 copies of the FAT and the cluster size may be 1 or 2 sectors. The sector size must be 512 bytes. Oh, about the name. It is a play on the name of another MS-DOS file copier available for the C-128. "Little" means that it is smaller in scope than the other program, and "Red" is a different primary color to avoid any legal complications. It is also the non-white color of the flag of the country of origin of this program (no, I am not Japanese). Also, this program is Public Domain Software, as is all software I develop for 8-bit Commodore Computers. Feel free to E-mail me if you have questions or comments about this article. 2. USER GUIDE LOAD and RUN the "lrr.128" BASIC program file. When the program is first run, it will display an "initializing" message and will load in the binary machine language package from the "current" Commodore DOS drive (the current drive is obtained from PEEK(186) - the last device accessed). The binary package is loaded only on the first run and is not reloaded on subsequent runs if the package ID field is in place. 2.1. MAIN SCREEN The main screen of the program is then displayed. The main screen of the program will look something like this: MS-DEV=9 MS-TYPE=1581 CBM-DEV=8 NUM S TRN TYP FILENAME EXT LENGTH --- - --- --- -------- --- ------ 1 * ASC SEQ HACK4 TXT 120732 2 BIN PRG RAMDOS SFX 34923 D=DIRECTORY M=MS-DEV F=CBM-DEV Q=QUIT T=TOGGLE-COLUMN, C=COPY-FILES, +/- PAGE except that immediately after starting up, "<no files>" will be displayed rather than filenames. The "MS-DEV" and "MS-TYPE" fields give the device number and type of the drive containing the MS-DOS disk to copy from, and the "CBM-DEV" gives the device number of the drive/virtual drive/character device to copy file data to. Information about all MS-DOS files in the root directory of the MS-DOS disk is displayed in columns below the drive information. "NUM" gives the number of the MS-DOS file in the directory listing, and "S" indicates whether the file is "selected" or not. If the file is selected, an asterisk (*) is displayed; otherwise, a blank is displayed. When you later enter Copy Mode, only the files that have been "selected" are copied. The "TRN" field indicates the character translation scheme to be used when the file is copied. A value of "BIN" (binary) means no translation and a value of "ASC" (ascii) means the file characters are to be translated from MS-DOS ASCII (or "ASCII-CrLf") to PETSCII. The "TYP" field indicates the type of Commodore-DOS file to create for writing the MS-DOS file contents into. The possible values are "SEQ" (sequential) and "PRG" (program). The values of the TRN and TYP fileds are set independently, so you can copy binary data to SEQ files and ascii data to PRG files if you wish. The "FILENAME" and "EXT" fields give the filename and extension type of the MS-DOS files and "LENGTH" gives the exact length of the files in bytes. Note that if you perform "ASC" translation on a file, its PETSCII version will have a shorter length. 2.2. USER COMMANDS The bottom of the screen gives the command summary. After starting the program, you will want to setup the MS-DOS and CBM-DOS drives with the "M" and "F" commands. Simply press the (letter) key corresponding to the command name to activate the command. Pressing M will prompt you for the MS-DOS Drive Number and the MS-DOS Drive Type. In both cases, type the number and press RETURN. (Sorry for insulting all non-novices out there, but I want to be complete). The MS-DOS drive number cannot be the same as the CBM-DOS drive number (since the program copies from drive-to-drive without internal buffering). For the drive type, enter an "8", "81", or "1581" for a 1581 drive or anything else for a 1571 drive. Pressing F will prompt you for the CBM-DOS device number. You may enter a number from 0 to 30, except that it must not be the MS-DOS drive number. Enter a "1" for Cassette Drive (God forbid!), a "3" for the screen, a "4" for the printer (with an automatic secondary address of 7 (lowercase)), any number above 7 for a Commodore disk drive or special virtual drive, or a value of "0" for the special "null" drive. A CBM-DEV value of 0 will case the program to read MS-DOS files and do nothing with the output. You can use this feature to check out the raw reading speed of the program. After setting up the drives, press D to read in the root directory off the MS-DOS disk. The data will come blazing in from the disk but BASIC will take its good ole time sifting through it. Filenames are displayed on the screen as they are scanned in. The program will (eventually) return to the main screen and display the formatted file information. One note: the process of logging in a 1581 MS-DOS disk takes about 12 seconds (on my 1581, anyway), so be patient. An MS-DOS disk will have to be "logged in" every time you change MS-DOS disks. (Disks are logged in automatically). A couple of notes about accessing MS-DOS disks: don't try to access a device that is not present because the machine language routines cannot handle this error for some reason and will lock up, requiring a STOP+RESTORE. Also, make sure that an actual MS-DOS disk is loaded into the drive. If you accidentally place Commodore-DOS disk into the MS-DOS drive, the 1581 will report an invalid boot parameters error (#60), but a 1571 will lock up (since I don't check the sector size and my burst routines are expecting 512 bytes to come out of a sector whereas Commodore disks have only 256 bytes per sector). Now you are ready to pick what files you want copied and how you want them copied. You will notice that a "cursor" appears in the "S" column of the first file. You may move the cursor around with the cursor keys: UP, DOWN, LEFT, RIGHT, HOME, and CLR. CLR (SHIFT-HOME) will move the cursor back to the first file on the first screen. You can move the cursor among the select, translation, and file-type columns of all the files. Pressing a SPACE or a RETURN will toggle the value of the field that the cursor is on. To toggle all of the values of the "cursor" column (including files on all other screens), press T. You will notice that moving the cursor around and toggling fields is a bit sluggish, especially if you are in Slow mode on the 40-column screen. Did I mention that this program will run on either the 40 or 80-column screen? Toggling an entire column can take a couple of seconds. If there are more than 18 MS-DOS files, you can press the "+" and "-" keys to move among all of the screens of files. The cursor movement keys will wrap around on the current screen. "+" is page forward, and "-" is page backward. The screens wrap around too. After you have selected all of the files you want to copy and their translation and file-type fields have been set, press the C key to go into Copy Mode (next section). After copying, you are returned to the main screen with all of the field settings still intact. To exit from the program, press Q. 2.3. COPY MODE When you enter copy mode, the screen will clear and the name of each selected file is displayed as it is being copied. If an error is encountered on either the MS-DOS or CBM-DOS drive during copying, an error message will be displayed and copying will continue (after you press a key for MS-DOS errors). To generate a CBM-DOS filename from an MS-DOS filename, the eight filename characters are taken (including spaces) and a dot (.) and the three characters of the extension are appended. Then, all spaces are removed, and if the name ends with a dot (.) character, then that dot character is removed as well. I think this is fairly reasonable. If there already is a file with the same filename on the CBM-DOS disk, then you will be prompted if you want to overwrite the file or not. Entering an "n" will abort the copying of that file and go on to the next file, and entering a "y" (or anything else) will cause the CBM-DOS file to be "scratched" and then re-written. The physical copying of the file is done completely in machine language and nothing is displayed on the screen while this is happening, but you can follow things by looking at das blinkin lichtes and listening for clicks and grinds. You will probably be surprised by the MS-DOS file reading speed (I mean in a good way). The disk data is read in whole tracks and cached in memory and the directory information and the FAT are retained in memory as well. The result is that minimal time is spent reading disk data, and no costly seeks are required for opening a new MS-DOS file. A result is that small files are copied one after another very quickly. You will have to wait, however, on the relatively slow standard kernel/Commodore-DOS file writing. A few changes had to be made to the program to accomodate the RAMDOS program. RAMDOS uses memory from $2300 to $3FFF of RAM0, which is not really a good place for a device driver, and it uses some of the zero-page locations that I wanted to use. But, difficulties were overcome. The importance of RAMDOS compatibility is that if you only have one disk drive but you have an REU, you can use RAMDOS to store the MS-DOS files temporarily. If you only have one disk drive and no REU, you are SOL (Out of Luck) unless you can get a RamDisk-type program for an unexpanded 128. The RAMDOS program is available from FTP site "ccosun.caltech.edu" in file "/pub/rknop/util128/ramdosii.sfx". One note I found out about RAMDOS: you cannot use a DOPEN#1,(CF$),U(CD),W with it like you are supposed to be able to; you have to use a DOPEN#1,(CF$+",W"),U(CD) Here is a table of copying speeds for copying from 1571s and 1581s with ASC and BIN translation modes. All figures are in bytes/second. These results were obtained from copying a 127,280 byte text file (the text of C= Hacking Issue #3). FROM \ TO: "null" RAMLink RAMDOS JD1581 JD1571 -------+ ------ ------- ------ ------ ------ 81-bin | 5772 3441 2146 n/a 644 81-asc | 5772 3434 2164 n/a 661 71-bin | 4323 2991 1949 1821 n/a 71-asc | 4323 2982 1962 1847 n/a The "null" device is that "0" CBM-DOS device number, and a couple of entries are "n/a" since I only have one 1571 and one 1581. Note that my 71 and 81 are JiffyDOS-ified, so the performance of a stock 71/81 will be poorer. JiffyDOS gives about a 2x performance improvement for the standard file accessing calls (open, close, chrin, chrout). RAMDOS doesn't seem to be as snappy as you might think. The "null" figures are quite impressive, but the raw sector reading speed without the overhead of mucking around with file organization is 6700 bytes/sec for a 1581 and 4600 B/s for a 71. The reason that the 1571 operates so quickly is that I use a sector interleave of 4 (which is optimal) for reading the tracks. I think that other MS-DOS file copier program uses an interleave of 1 (which is not optimal). I lose some of the raw performance because I copy the file data internally once before outputting it (to simplify some of the code). In a couple of places you will notice that ASC translation gives slightly better or slightly worse performance than BIN. This is because although slightly more work is required to translate the characters, slightly fewer characters will have to be written to the CBM-DOS file, since PETSCII uses only CR where MS-DOS ASCII uses CR and LF to represent end-of-line. Translation is done by using a table (that you can change if you wish). Many entries in this table contain a value of zero, which means that no character will be output on translation. Most of the control characters and all of the characters of value 128 (0x80) or greater are thrown away on being translated. The table is set up so that CR characters are thrown away and the LF character is translated to a CBM-DOS CR character. Thus, both MS-DOS ASCII files and UNIX ASCII files can be translated correctly. 2. BURST COMMANDS Three burst commands from the 1571/81 disk drive Burst Command Instruction Set are required to allow this program to read the MS-DOS disks: Query Disk Format, Sector Interleave, and Read. The grungy details about issuing burst commands and burst mode handshaking are covered in C= Hacking Issue #3. The Query Disk Format command is used to "log in" the MS-DOS disk. The Inquire Disk burst command cannot be used with an MS-DOS disk on the 1581 for some unknown reason. I found this out the hard way. The Query Disk Format command has the following format: BYTE \ bit: 7 6 5 4 3 2 1 0 | Value -------+--------+-----+-----+-----+-----+-----+-----+-----+------- 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | "U" 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | "0" 2 | F | X | X | S | 1 | 0 | 1 | N | 10 -------+--------------------------------------------------+------- where the F, S, and N bits have a value of 0 for our purposes. A response of a burst status byte and six other throw-away bytes is given from the drive. This command takes quite a long time to execute on my 1581 but works quite quickly on my 1571. You only have to log in a disk whenever you change disks. The Sector Interleave command is used to set a soft interleave for the Read command. I use an interleave of 1 for the 1581 and an interleave of 4 for the 1571. This means that the MS-DOS sectors will come from 1571 to the computer in the following order: 1, 5, 9, 4, 8, 3, 7, 2, 6 (there are 9 sectors per track on an MS-DOS disk (both 3.5" and 5.25"), numbered from 1 to 9). LRR handles the data coming in in this order, and in straight order from the 1581. The Sector Interleave command has the following format, where the W and N bits are 0 for us: BYTE \ bit: 7 6 5 4 3 2 1 0 | Value -------+--------+-----+-----+-----+-----+-----+-----+-----+------- 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | "U" 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | "0" 2 | W | X | X | 0 | 1 | 0 | 0 | N | 8 3 | <interleave> | 1 or 4 -------+--------------------------------------------------+------- The Read command is used to transfer the nine sectors of a track to the computer in the order specified by the interleave. The format is: BYTE \ bit: 7 6 5 4 3 2 1 0 | Value -------+--------+-----+-----+-----+-----+-----+-----+-----+------- 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | "U" 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | "0" 2 | T/L | E | B/X | S | 0 | 0 | 0 | N | 0 or 16 3 | <track> | ??? 4 | <sector> | 1 5 | <number of sectors> | 9 -------+--------------------------------------------------+------- There are a couple of differences between the 1571 and 1581 versions of this command. Most important, the S bit (Side of disk to use) has the opposite meaning on the two drives. There's no good reason that I know of for this inconsistency. This is the reason that LRR needs to know what type of MS-DOS drive it is dealing with (plus interleaving). The read command returns the following data using burst mode handshaking: +-------------------+ 0 | Burst Status Byte | +-------------------+ 1 | | ... + 512 Data Bytes | 512 | | +-------------------+ for each sector transferred. If the Burst Status Byte indicates an error, then the data is not transferred and none of the following sectors are either. If the status byte gives a "Disk Changed" error, then you have to log in the disk with the Query Disk Format command before read will work properly. This is actually a good feature since it lets you know about a disk change so you can update any data structures you may have. LRR simply re-logs in the disk without updating any data structures and re-tries the failed read operation. 3. MS-DOS DISK FORMAT An MS-DOS disk is separated into 4 different parts: the Boot Sector, the FAT(s), the Root Directory, and the File Data Sectors. The logical sectors (blocks) of a disk are numbered from 0 to some maximum number (1439 for a 3.5", 719 for a 5.25" DD disk). The physical layout and the logical sector numbers typically used by a 3.5" disk are shown here: +-------------------+ 0 | Boot Sector | +-------------------+ 1..3 | FAT copy #1 | +-------------------+ 4..6 | FAT copy #2 | +-------------------+ 7..14 | Root Directory | +-------------------+ 15 | | ... | File Data Sectors | 1439 | | +-------------------+ 3.1. THE BOOT SECTOR The Boot Sector is always at logical sector number 0. It contains some important information about the format of the disk and it also contains code to boot an MS-DOS machine from. We aren't concerned with the bootstrapping code, but the important values we need to obtain from the boot sector are: ABBR OFFSET 1571 1581 DESCRIPTION ---- ------ ---- ---- ----------- CS 13 2 2 Cluster size in sectors NB 14 1 1 Number of boot sectors NF 16 2 2 Number of FATs FL 23 2 3 FAT size in sectors DE 17 112 112 Number of root directory entries TS 19,20 720 1440 Total Number of sectors NS 24 9 9 Number of sectors per track NH 26 2 2 Number of sides The 1571 and 1581 columns give the typical values of these parameters for 5.25" and 3.5" disks. The OFFSET is the address of the parameter within the boot sector. The total number of sectors is given in low-byte, high-byte order (since the 80x86 family is little-endian like the 6502 family). From the above parameters, we can derive the following important parameters: ABBR FORMULA 1571 1581 DESCRIPTION ---- ------- ---- ---- ----------- F1 NB+NF*FL 5 7 First root directory sector FS NB+NF*FL+DE 12 14 First file data sector number NC (TS-FS)/CS 354 713 Total number of file clusters LRR imposes a number of limits on these parameters and will error-out if you try to use a disk that is outside of LRR's limits. 3.2. CHEWING THE FAT MS-DOS disks use a data structure called a File Allocation Table (FAT) to record which clusters belong to which file in what order and which blocks are free. A cluster is a set of contiguous sectors which are allocated to files as a group. LRR handles cluster sizes of 1 and 2 sectors, giving a logical file block size of 512 or 1024 bytes. Typically, a cluster size of 2 sectors is used. The FAT is an array of 12-bit numbers, with an entry corresponding to each cluster that can be allocated to files. FAT entries 0 and 1 are reserved. If a FAT entry contains a value of $000, then the corresponding cluster is free and can be allocated to a file; otherwise, the cluster is allocated and the FAT entry contains the number of the NEXT FAT entry that belongs to the file. Thus, MS-DOS files are stored in a singly-linked list of clusters like Commodore-DOS files are, except that the links are not in the data sectors but rather are in the FAT. The pointer to the first FAT entry for a file is given in the file's directory entry. A special NULL/NIL pointer value of $FFF is used to indicate the end of the chain of clusters allocated to a file. This value is stored in the FAT entry of the last cluster allocated to a file (of course). Consider the following example FAT: ENTRY VALUE ----- ----- $000 $FFF $001 $FFF $002 |----$003 <------Directory Entry $003 +--> $005----+ $004 $000 | $005 $FFF <--+ Entries 0 and 1 are insignificant since they are reserved. Say that a file starts at FAT entry #2. Then, it consists of the following chain of clusters: 2, 3, and 5. Cluster #4 is free. Clusters can be allocated to a file in random order, but if they are allocated contiguously in forward order, then the file will be able to be read faster. The FAT is such an important data structure that typically two copies are kept on the disk incase one of them should become corrupted. The MS-DOS designers were a little sneaky in storing the 12-bit FAT entries - they used only 12 real bits per entry. Ie., they store two FAT entries in three bytes, where the two entries share the two nybbles of the middle byte. The following diagram shows how the nybbles 1 (high), 2 (mid), and 3 (low) are stored into FAT entries A and B: BYTE: 0 1 2 +---+---+ +---+---+ +---+---+ ENTRY: | A | A | | B | A | | B | B | NYBBLE: | 2 | 3 | | 3 | 1 | | 1 | 2 | +---+---+ +---+---+ +---+---+ Anyway, let's just say it's a bit tricky to extract the 12-bit values from this compressed data structure. On top of that, I don't think there is any saving in disk space resulting from compressing this structure; they might as well have just used a 16-bit FAT (like they do nowadays on larger disks). 3.3. THE ROOT DIRECTORY The root directory has a fixed size, although I don't think that subdirectories do. LRR cannot access subdirectories. Each 512-byte sector of the root directory contains sixteen 32-byte directory entries. One directory entry is required for each file stored on the disk. A directory entry has the following structure: OFFSET LEN DESCRIPTION ------ --- ----------- 0..7 8 Filename 8..10 3 Extension 11 1 <unused?> 12 1 Attributes: $10=Directory, $08=VolumeId 13..21 9 <unused> 22..25 4 Date 26..27 2 Starting FAT entry number 28..31 4 File length in bytes The filename and extension are stored with trailing padding spaces. If a directory entry is unused or deleted, then the first character of the filename is either a $00 or a $E5 (229). This is why you have to provide the first character of a filename if you are undeleting a file on an MS-DOS machine. Note that there is enough unused space that Microsoft or IBM could have ditched the annoying 8+3 character filename format. The attributes bits tell whether the directory entry is for a regular file, a subdirectory, a disk volume name (in which case there is no file data), and a couple of other things I can't remember. I'm not sure about the exact position or format of the date, but LRR doesn't use it anyway. The starting FAT entry number and the file length are stored in lower-byte-first order. 3.4. THE FILE DATA SPACE The ramainder of the disk space is used for storing file data in clusters of 1 or 2 sectors each. Given a cluster number (which is also the FAT entry number), the following formula is used to calculate the starting logical sector number of the cluster: (ClusterNumber - 2) * ClusterSizeInBlocks + FirstFileDataLogicalSectorNumber where "FirstFileDataLogicalSectorNumber" is the "FS" parameter derived earlier. The following consecutive logical sector numbers up to the number of sectors per cluster form the rest of the cluster. Note that a single cluster can span sectors from one side of the disk to another or from one track to another. We perform the "(ClusterNumber - 2)" portion of the calculation since the first two FAT entries are reserved. Since the Read burst command of the 1571/81 wants the side, track, and sector number of a sector rather than its logical number, we also need formulae for these conversions: Track = LogicalSectorNumber / 18 Sector = LogicalSectorNumber % 9 + 1 Side = (LogicalSectorNumber / 9) % 2 These formulae are more problematic than the previous one since they require division by 9 and 18. LRR uses the method of repeated subtraction to perfrom the necessary division (only one division is necessary). The above formulae imply that sequential logical sectors are stored on the top of the disk first and then the bottom of the disk of the same track, and then on the top of the next track, etc. This is a good sector numbering scheme (unlike the CBM-DOS scheme for 1571 sectors) since it is faster to switch sides of the disk than it is to switch tracks, so you can read the disk faster. Oh yeah, the way that you know how many file data bytes are in the last cluster of a file chain (the cluster with the NULL FAT entry) is to take the file length from the directory entry and "mod" (the C language % operator) it with the cluster size. One special case is if this calculation results in a zero, then the last cluster is completely full (rather than completely empty as the calculation would suggest). This calculation is easily done in machine language with an AND operation since the cluster size is always a power of two. 4. FILE COPYING PACKAGE This section discusses the interface to and implementation of the MS-DOS file copying package. It is written in assembly language and is loaded into memory at address $8000 on bank 0 and requires about 13K of memory. The package is loaded at this high address to be out of the way of the main BASIC program, even if RAMDOS is installed. 4.1. INTERFACE The subroutine call and parameter passing interface to the file copying package is summarized as follows: ADDRESS DESCRIPTION ------- ----------- PK InitPackage subroutine PK+3 LoadDirectory subroutine PK+6 CopyFile subroutine PK+9 two-byte package identification number PK+15 errno : error code returned PK+16 MS-DOS device number (8 to 30) PK+17 MS-DOS device type ($00=1571, $FF=1581) PK+18 two-byte starting cluster number for file copying PK+20 low and mid bytes of file length for copying where "PK" is the load address of the package. Additional subroutine parameters are passed in the processor registers. The "InitPackage" subroutine should be called when the package is first installed, whenever the MS-DOS device number is changed, and whenever a new disk is mounted to invalidate the internal track cache. It requires no parameters. The "LoadDirectory" subroutine will load the directory, FAT, and the Boot Sector parameters into the internal memory of the package from the current MS-DOS device number. No (other) input parameters are needed and the subroutine returns a pointer to the directory space in the .AY registers and the number of directory entries in the .X register. If an error occurs, then the subroutine returns with the Carry flag set and the error code is available in the "errno" interface variable. The directory entry data is in the directory space as it was read in raw from the directory sectors on the MS-DOS disk. The "CopyFile" subroutine will copy a single file from the MS-DOS disk to a specified CBM-Kernal logical file number (the CBM file must already be opened). If the CBM logical file number is zero, then the file data is simply discarded after it is read from the MS-DOS file. The starting cluster number of the file to copy and the low and mid bytes of the file length are passed in the PK+18 and PK+20 interface words. The translation mode to use is passed in the .A register ($00=binary, $FF=ascii) and the CBM logical file number to output to is passed in the .X register. If an error occurs, the routine returns with the Carry flag set and the error code in the "errno" interface variable. There are no other output parameters. Note that since the starting cluster number and low-file length of the file to be copied are required rather than the filename, it is the responsibility of the front-end application program to dig through the raw directory sector data to get this information. The application must also open the Commodore-DOS file of whatever filetype on whatever device is required; the package does not need to know the Commodore-DOS device number. The MS-DOS device number and device type interface variables allow you to set the MS-DOS drive and the package identification number allows the application program to check if the package is already loaded into memory so that it only has to load the package the first time the application is run and not on re-runs. The identification sequence is a value of $CB followed by a value of 131. 4.2. IMPLEMENTATION This section presents the code that implements the MS-DOS file reading package. It is here in a special form; each code line is preceded by a few special characters and the line number. The special characters are there to allow you to easily extract the assembler code from the rest of this magazine (and all of my ugly comments). On a Unix system, all you have to do is execute the following command line (substitute filenames as appropriate): grep '^\.%...\!' Hack4 | sed 's/^.%...\!..//' | sed 's/.%...\!//' >lrr.s You'll notice that the initial comment lines here were an afterthought. .%000! ;Little Red Reader MS-DOS file copier program .%000! ;written 92/10/03 by Craig Bruce for C= Hacking Net Magazine .%000! The code is written for the Buddy assembler and here are a couple setup directives. Note that my comments come before the section of code. .%001! .org $8000 .%002! .obj "@:lrr.bin" .%003! .%004! ;====jump table and parameters interface ==== .%005! .%006! jmp initPackage .%007! jmp loadDirectory .%008! jmp copyFile .%009! .%010! .byte $cb,131 ;identification .%011! .byte 0,0,0,0 .%012! These variables are included in the package program space to minimize unwanted interaction with other programs loaded at the same time, such as the RAMDOS device driver. .%013! errno .buf 1 ;(location pk+15) .%014! sourceDevice .buf 1 .%015! sourceType .buf 1 ;$00=1571, $ff=1581 .%016! startCluster .buf 2 .%017! lenML .buf 2 ;length medium and low bytes .%018! .%019! ;====global declaraions==== .%020! .%021! kernelListen = $ffb1 .%022! kernelSecond = $ff93 .%023! kernelUnlsn = $ffae .%024! kernelAcptr = $ffa2 .%025! kernelCiout = $ffa8 .%026! kernelSpinp = $ff47 .%027! kernelChkout = $ffc9 .%028! kernelClrchn = $ffcc .%029! kernelChrout = $ffd2 .%030! .%031! st = $d0 .%032! ciaClock = $dd00 .%033! ciaFlags = $dc0d .%034! ciaData = $dc0c .%035! These are the parameters and derived parameters from the boot sector. They are kept in the program space to avoid interactions. .%036! clusterBlockCount .buf 1 ;1 or 2 .%037! fatBlocks .buf 1 ;up to 3 .%038! rootDirBlocks .buf 1 ;up to 8 .%039! rootDirEntries .buf 1 ;up to 128 .%040! totalSectors .buf 2 ;up to 1440 .%041! firstFileBlock .buf 1 .%042! firstRootDirBlock .buf 1 .%043! fileClusterCount .buf 2 .%044! The cylinder (track) and side that is currently stored in the trach cache. .%045! bufCylinder .buf 1 .%046! bufSide .buf 1 .%047! formatParms .buf 6 .%048! This package is split into a number of levels. This level interfaces with the Kernal serial bus routines and the burst command protocol of the disk drives. .%049! ;====hardware level==== .%050! Connect to the MS-DOS device and send the "U0" burst command prefix and the burst command byte. .%051! sendU0 = * ;( .A=burstCommandCode ) : .CS=err .%052! pha .%053! lda #0 .%054! sta st .%055! lda sourceDevice .%056! jsr kernelListen .%057! lda #$6f .%058! jsr kernelSecond .%059! lda #"u" .%060! jsr kernelCiout .%061! bit st .%062! bmi sendU0Error .%063! lda #"0" .%064! jsr kernelCiout .%065! pla .%066! jsr kernelCiout .%067! bit st .%068! bmi sendU0Error .%069! clc .%070! rts .%071! .%072! sendU0Error = * .%073! lda #5 .%074! sta errno .%075! sec .%076! rts .%077! Toggle the "Data Accepted / Ready For More" clock signal for the burst transfer protocol. .%078! toggleClock = * .%079! lda ciaClock .%080! eor #$10 .%081! sta ciaClock .%082! rts .%083! Wait for a burst byte to arrive in the serial data register of CIA#1 from the fast serial bus. .%084! serialWait = * .%085! lda #$08 .%086! - bit ciaFlags .%087! beq - .%088! rts .%089! Wait for and get a burst byte from the fast serial bus, and send the "Data Accepted" signal. .%090! getBurstByte = * .%091! jsr serialWait .%092! ldx ciaData .%093! jsr toggleClock .%094! txa .%095! rts .%096! Send the burst commands to "log in" the MS-DOS disk and set the Read sector interleave factor. .%097! mountDisk = * ;() : .CS=err .%098! lda #%00011010 .%099! jsr sendU0 .%100! bcc + .%101! rts .%102! + jsr kernelUnlsn .%103! bit st .%104! bmi sendU0Error .%105! clc .%106! jsr kernelSpinp .%107! bit ciaFlags .%108! jsr toggleClock .%109! jsr getBurstByte .%110! sta errno .%111! and #$0f .%112! cmp #2 .%113! bcs mountExit Grab the throw-away parameters from the mount operation. .%114! ldy #0 .%115! - jsr getBurstByte .%116! sta formatParms,y .%117! iny .%118! cpy #6 .%119! bcc - .%120! clc Set the sector interleave to 1 for a 1581 or 4 for a 1571. .%121! ;** set interleave .%122! lda #%00001000 .%123! jsr sendU0 .%124! bcc + .%125! rts .%126! + lda #1 ;interleave of 1 for 1581 .%127! bit sourceType .%128! bmi + .%129! lda #4 ;interleave of 4 for 1571 .%130! + jsr kernelCiout .%131! jsr kernelUnlsn .%132! mountExit = * .%133! rts .%134! Read all of the sectors of a given track into the track cache. .%135! bufptr = 2 .%136! secnum = 4 .%137! .%138! readTrack = * ;( .A=cylinder, .X=side ) : trackbuf, .CS=err .%139! pha .%140! txa Get the side and put it into the command byte. Remember that we have to flip the side bit for a 1581. .%141! and #$01 .%142! asl .%143! asl .%144! asl .%145! asl .%146! bit sourceType .%147! bpl + .%148! eor #$10 .%149! + jsr sendU0 .%150! bcc + .%151! rts .%152! + pla ;cylinder number .%153! jsr kernelCiout .%154! lda #1 ;start sector number .%155! jsr kernelCiout .%156! lda #9 ;sector count .%157! jsr kernelCiout .%158! jsr kernelUnlsn Prepare to receive the track data. .%159! sei .%160! clc .%161! jsr kernelSpinp .%162! bit ciaFlags .%163! jsr toggleClock .%164! lda #<trackbuf .%165! ldy #>trackbuf .%166! sta bufptr .%167! sty bufptr+1 Get the sector data for each of the 9 sectors of the track. .%168! lda #0 .%169! sta secnum .%170! - bit sourceType .%171! bmi + If we are dealing with a 1571, we have to set the buffer pointer for the next sector, taking into account the soft interleave of 4. .%172! jsr get1571BufPtr .%173! + jsr readSector .%174! bcs trackExit .%175! inc secnum .%176! lda secnum .%177! cmp #9 .%178! bcc - .%179! clc .%180! trackExit = * .%181! cli .%182! rts .%183! Get the buffer pointer for the next 1571 sector. .%184! get1571BufPtr = * .%185! lda #<trackbuf .%186! sta bufptr .%187! ldx secnum .%188! clc .%189! lda #>trackbuf .%190! adc bufptr1571,x .%191! sta bufptr+1 .%192! rts .%193! .%194! bufptr1571 = * .%195! .byte 0,8,16,6,14,4,12,2,10 .%196! Read an individual sector into memory at the specified address. .%197! readSector = * ;( bufptr ) : .CS=err Get and check the burst status byte for errors. .%198! jsr getBurstByte .%199! sta errno .%200! and #$0f .%201! cmp #2 .%202! bcc + .%203! rts .%204! + ldx #2 .%205! ldy #0 .%206! Receive the 512 sector data bytes into memory. .%207! readByte = * .%208! lda #$08 .%209! - bit ciaFlags .%210! beq - .%211! lda ciaClock .%212! eor #$10 .%213! sta ciaClock .%214! lda ciaData .%215! sta (bufptr),y .%216! iny .%217! bne readByte .%218! inc bufptr+1 .%219! dex .%220! bne readByte .%221! rts .%222! This next level of routines deals with logical sectors and the track cache rather than with hardware. .%223! ;====logical sector level==== .%224! Invalidate the track cache if the MS-DOS drive number is changed or if a new disk is inserted. This routine has to establish a RAM configuration of $0E since it will be called from RAM0. Configuration $0E gives RAM0 from $0000 to $BFFF, Kernal ROM from $C000 to $FFFF, and the I/O space over the Kernal from $D000 to $DFFF. This configuration is set by all application interface subroutines. .%225! initPackage = * .%226! lda #$0e .%227! sta $ff00 .%228! lda #$ff .%229! sta bufCylinder .%230! sta bufSide .%231! clc .%232! rts .%233! Locate a sector (block) in the track cache, or read the corresponding physical track into the track cache if necessary. This routine accepts the cylinder, side, and sector numbers of the block. .%234! sectorSave = 5 .%235! .%236! readBlock = * ;( .A=cylinder,.X=side,.Y=sector ) : .AY=blkPtr,.CS=err Check if the correct track is in the track cache. .%237! cmp bufCylinder .%238! bne readBlockPhysical .%239! cpx bufSide .%240! bne readBlockPhysical If so, then locate the sector's address and return that. .%241! dey .%242! tya .%243! asl .%244! clc .%245! adc #>trackbuf .%246! tay .%247! lda #<trackbuf .%248! clc .%249! rts .%250! Here, we have to read the physical track into the track cache. We save the input parameters and call the hardware-level track-reading routine. .%251! readBlockPhysical = * .%252! sta bufCylinder .%253! stx bufSide .%254! sty sectorSave .%255! jsr readTrack Check for errors. .%256! bcc readBlockPhysicalOk .%257! lda errno .%258! and #$0f .%259! cmp #11 ;disk change .%260! beq + .%261! sec .%262! rts If the error that happened is a "Disk Change" error, then mount the disk and try to read the physical track again. .%263! + jsr mountDisk .%264! lda bufCylinder .%265! ldx bufSide .%266! ldy sectorSave .%267! bcc readBlockPhysical .%268! rts .%269! Here, the physical track has been read into the track cache ok, so we recover the original input parameters and try the top of the routine again. .%270! readBlockPhysicalOk = * .%271! lda bufCylinder .%272! ldx bufSide .%273! ldy sectorSave .%274! jmp readBlock .%275! Divide the given number by 18. This is needed for the calculations to convert a logical sector number to the corresponding physical cylinder, side, and sector numbers that the lower-level routines require. The method of repeated subtraction is used. This routine would probably work faster if we tried to repeatedly subtract 360 (18*20) at the top, but I didn't bother. .%276! divideBy18 = * ;( .AY=number ) : .A=quotient, .Y=remainder .%277! ;** could repeatedly subtract 360 here .%278! ldx #$ff .%279! - inx .%280! sec .%281! sbc #18 .%282! bcs - .%283! dey .%284! bpl - .%285! clc .%286! adc #18 .%287! iny .%288! tay .%289! txa .%290! rts .%291! Convert the given logical block number to the corresponding physical cylinder, side, and sector numbers. This routine follows the formulae given earlier with a few simplifying tricks. .%292! convertLogicalBlockNum = * ;( .AY=blockNum ) : .A=cyl, .X=side, .Y=sec .%293! jsr divideBy18 .%294! ldx #0 .%295! cpy #9 .%296! bcc + .%297! pha .%298! tya .%299! sbc #9 .%300! tay .%301! pla .%302! ldx #1 .%303! + iny .%304! rts .%305! Copy a sequential group of logical sectors into memory. This routine is used by the directory loading routine to load the FAT and Root Directory, and is used by the cluster reading routine to retrieve all of the blocks of a cluster. After the given starting logical sector number is converted into its physical cylinder, side, and sector equivalent, the physical values are incremented to get the address of successive sectors of the group. This avoids the overhead of the logical to physical conversion. Quite a number of temporaries are needed. .%306! destPtr = 6 .%307! curCylinder = 8 .%308! curSide = 9 .%309! curSector = 10 .%310! blockCountdown = 11 .%311! sourcePtr = 12 .%312! .%313! copyBlocks = * ;( .AY=startBlock, .X=blockCount, ($6)=dest ) : .CS=err .%314! stx blockCountdown .%315! jsr convertLogicalBlockNum .%316! sta curCylinder .%317! stx curSide .%318! sty curSector .%319! .%320! copyBlockLoop = * .%321! lda curCylinder .%322! ldx curSide .%323! ldy curSector .%324! jsr readBlock .%325! bcc + .%326! rts .%327! + sta sourcePtr .%328! sty sourcePtr+1 .%329! ldx #2 .%330! ldy #0 Here I unroll the copying loop a little bit to cut the overhead of the branch instruction in half. (A cycle saved... you know). .%331! - lda (sourcePtr),y .%332! sta (destPtr),y .%333! iny .%334! lda (sourcePtr),y .%335! sta (destPtr),y .%336! iny .%337! bne - .%338! inc sourcePtr+1 .%339! inc destPtr+1 .%340! dex .%341! bne - Increment the cylinder, side, sector values. .%342! inc curSector .%343! lda curSector .%344! cmp #10 .%345! bcc + .%346! lda #1 .%347! sta curSector .%348! inc curSide .%349! lda curSide .%350! cmp #2 .%351! bcc + .%352! lda #0 .%353! sta curSide .%354! inc curCylinder .%355! + dec blockCountdown .%356! bne copyBlockLoop .%357! clc .%358! rts .%359! Read a cluster into the Cluster Buffer, given the cluster number. The cluster number is converted to a logical sector number and then the sector copying routine is called. The formula given earlier is used for the conversion. .%360! readCluster = * ;( .AY=clusterNumber ) : clusterBuf, .CS=err .%361! ;** convert cluster number to logical block number .%362! sec .%363! sbc #2 .%364! bcs + .%365! dey .%366! + ldx clusterBlockCount .%367! cpx #1 .%368! beq + .%369! asl .%370! sty 7 .%371! rol 7 .%372! ldy 7 .%373! + clc .%374! adc firstFileBlock .%375! bcc + .%376! iny .%377! .%378! ;** read logical blocks comprising cluster .%379! + ldx #<clusterBuf .%380! stx 6 .%381! ldx #>clusterBuf .%382! stx 7 .%383! ldx clusterBlockCount .%384! jmp copyBlocks .%385! This next level of routines deal with the data structures of the MS-DOS disk format. .%386! ;====MS-DOS format level==== .%387! .%388! bootBlock = 2 .%389! Read the disk format parameters, directory, and FAT into memory. .%390! loadDirectory = * ;( ) : .AY=dirbuf, .X=dirEntries, .CS=err .%391! lda #$0e .%392! sta $ff00 .%393! Read the boot sector and extract the parameters. .%394! ;** get parameters from boot sector .%395! lda #0 .%396! ldy #0 .%397! jsr convertLogicalBlockNum .%398! jsr readBlock .%399! bcc + .%400! rts .%401! + sta bootBlock .%402! sty bootBlock+1 .%403! ldy #13 ;get cluster size .%404! lda (bootBlock),y .%405! sta clusterBlockCount .%406! cmp #3 .%407! bcc + .%408! If a disk parameter is found to exceed the limits of LRR, error code #60 is returned. .%409! invalidParms = * .%410! lda #60 .%411! sta errno .%412! sec .%413! rts .%414! .%415! + ldy #16 ;check FAT replication count, must be 2 .%416! lda (bootBlock),y .%417! cmp #2 .%418! bne invalidParms .%419! ldy #22 ;get FAT size in sectors .%420! lda (bootBlock),y .%421! sta fatBlocks .%422! cmp #4 .%423! bcs invalidParms .%424! ldy #17 ;get directory size .%425! lda (bootBlock),y .%426! sta rootDirEntries .%427! cmp #129 .%428! bcs invalidParms .%429! lsr .%430! lsr .%431! lsr .%432! lsr .%433! sta rootDirBlocks .%434! ldy #19 ;get total sector count .%435! lda (bootBlock),y .%436! sta totalSectors .%437! iny .%438! lda (bootBlock),y .%439! sta totalSectors+1 .%440! ldy #24 ;check sectors per track, must be 9 .%441! lda (bootBlock),y .%442! cmp #9 .%443! bne invalidParms .%444! ldy #26 .%445! lda (bootBlock),y .%446! cmp #2 ;check number of sides, must be 2 .%447! bne invalidParms .%448! ldy #14 ;check number of boot sectors, must be 1 .%449! lda (bootBlock),y .%450! cmp #1 .%451! bne invalidParms .%452! Calculate the derived parameters. .%453! ;** get derived parameters .%454! lda fatBlocks ;first root directory sector .%455! asl .%456! clc .%457! adc #1 .%458! sta firstRootDirBlock .%459! clc ;first file sector .%460! adc rootDirBlocks .%461! sta firstFileBlock .%462! lda totalSectors ;number of file clusters .%463! ldy totalSectors+1 .%464! sec .%465! sbc firstFileBlock .%466! bcs + .%467! dey .%468! + sta fileClusterCount .%469! sty fileClusterCount+1 .%470! lda clusterBlockCount .%471! cmp #2 .%472! bne + .%473! lsr fileClusterCount+1 .%474! ror fileClusterCount .%475! Gee, I have more comments embedded in the code than I did last issue. .%476! ;** load FAT .%477! + lda #<fatbuf .%478! ldy #>fatbuf .%479! sta 6 .%480! sty 7 .%481! lda #1 .%482! ldy #0 .%483! ldx fatBlocks .%484! jsr copyBlocks .%485! bcc + .%486! rts .%487! .%488! ;** load actual directory .%489! + lda #<dirbuf .%490! ldy #>dirbuf .%491! sta 6 .%492! sty 7 .%493! lda firstRootDirBlock .%494! ldy #0 .%495! ldx rootDirBlocks .%496! jsr copyBlocks .%497! bcc + .%498! rts .%499! + lda #<dirbuf .%500! ldy #>dirbuf .%501! ldx rootDirEntries .%502! clc .%503! rts .%504! This routine locates the given FAT table entry number and returns the value stored in it. Some work is needed to deal with the 12-bit compressed data structure. .%505! entryAddr = 2 .%506! entryWork = 4 .%507! entryBits = 5 .%508! entryData0 = 6 .%509! entryData1 = 7 .%510! entryData2 = 8 .%511! .%512! getFatEntry = * ;( .AY=fatEntryNumber ) : .AY=fatEntryValue .%513! sta entryBits Divide the FAT entry number by two and multiply by three because two FAT entries are stored in three bytes. Then add the FAT base address and we have the address of the three bytes that contain the FAT entry we are interested in. I retrieve the three bytes into zero-page memory for easy manipulation. .%514! ;** divide by two .%515! sty entryAddr+1 .%516! lsr entryAddr+1 .%517! ror .%518! .%519! ;** times three .%520! sta entryWork .%521! ldx entryAddr+1 .%522! asl .%523! rol entryAddr+1 .%524! clc .%525! adc entryWork .%526! sta entryAddr .%527! txa .%528! adc entryAddr+1 .%529! sta entryAddr+1 .%530! .%531! ;** add base, get data .%532! clc .%533! lda entryAddr .%534! adc #<fatbuf .%535! sta entryAddr .%536! lda entryAddr+1 .%537! adc #>fatbuf .%538! sta entryAddr+1 .%539! ldy #2 .%540! - lda (entryAddr),y .%541! sta entryData0,y .%542! dey .%543! bpl - .%544! lda entryBits .%545! and #1 .%546! bne + .%547! If the original given FAT entry number is even, then we want the first 12-bit compressed field. The nybbles are extracted according to the diagram shown earlier. .%548! ;** case 1: first 12-bit cluster .%549! lda entryData1 .%550! and #$0f .%551! tay .%552! lda entryData0 .%553! rts .%554! Otherwise, we want the second 12-bit field. .%555! ;** case 2: second 12-bit cluster .%556! + lda entryData1 .%557! ldx #4 .%558! - lsr entryData2 .%559! ror .%560! dex .%561! bne - .%562! ldy entryData2 .%563! rts .%564! Finally, this is the file copying level. It deals with reading the clusters of MS-DOS files and copying the data they contain to the already-open CBM Kernal file, possibly with ASCII-to-PETSCII translation. .%565! ;====file copy level==== .%566! .%567! transMode = 14 .%568! lfn = 15 .%569! cbmDataPtr = $60 .%570! cbmDataLen = $62 .%571! cluster = $64 .%572! Copy the given cluster to the CBM output file. This routine fetches the next cluster of the file for the next time this routine is called, and if it hits the NULL pointer of the last cluster of a file, it adjusts the number of valid file data bytes the current cluster contains to FileLength % ClusterLength (see note below). .%573! copyFileCluster = * ;( cluster, lfn, transMode ) : .CS=err Read the cluster and setup to copy the whole cluster to the CBM file. .%574! lda cluster .%575! ldy cluster+1 .%576! jsr readCluster .%577! bcc + .%578! rts .%579! + lda #<clusterBuf .%580! ldy #>clusterBuf .%581! sta cbmDataPtr .%582! sty cbmDataPtr+1 .%583! lda #0 .%584! sta cbmDataLen .%585! lda clusterBlockCount .%586! asl .%587! sta cbmDataLen+1 .%588! Fetch the next cluster number of the file, and adjust the cluster data length for the last cluster of the file. .%589! ;**get next cluster .%590! lda cluster .%591! ldy cluster+1 .%592! jsr getFatEntry .%593! sta cluster .%594! sty cluster+1 .%595! cmp #$ff .%596! bne copyFileClusterData .%597! cpy #$0f .%598! bne copyFileClusterData .%599! lda lenML .%600! sta cbmDataLen .%601! lda #$01 .%602! ldx clusterBlockCount .%603! cpx #1 .%604! beq + .%605! lda #$03 .%606! + and lenML+1 The following three lines were added in a last minute panic after realizing that if FileLength % ClusterSize == 0, then the last cluster of the file contains ClusterSize bytes, not zero bytes. .%000! bne + .%000! ldx lenML .%000! beq copyFileClusterData .%607! + sta cbmDataLen+1 .%608! .%609! copyFileClusterData = * .%610! jsr commieOut .%611! rts .%612! Copy the file data in the MS-DOS cluster buffer to the CBM output file. .%613! cbmDataLimit = $66 .%614! .%615! commieOut = * ;( cbmDataPtr, cbmDataLen ) : .CS=err If the the logical file number to copy to is 0 ("null device"), then don't bother copying anything. .%616! ldx lfn .%617! bne + .%618! clc .%619! rts Otherwise, prepare the logical file number for output. .%620! + jsr kernelChkout .%621! bcc commieOutMore .%622! sta errno .%623! rts .%624! .%625! commieOutMore = * Process the cluster data in chunks of up to 255 bytes or the number of data bytes remaining in the cluster. .%626! lda #255 .%627! ldx cbmDataLen+1 .%628! bne + .%629! lda cbmDataLen .%630! + sta cbmDataLimit .%631! ldy #0 .%632! - lda (cbmDataPtr),y .%633! bit transMode .%634! bpl + If we have to translate the current ASCII character, look up the PETSCII value in the translation table and output that value. If the translation table entry value is $00, then don't output a character (filter out invalid character codes). .%635! tax .%636! lda transBuf,x .%637! beq commieNext .%638! + jsr kernelChrout .%639! commieNext = * .%640! iny .%641! cpy cbmDataLimit .%642! bne - .%643! Increment the cluster buffer pointer and decrement the cluster buffer character count according to the number of bytes just processed, and repeat the above if more file data remains in the current cluster. .%644! clc .%645! lda cbmDataPtr .%646! adc cbmDataLimit .%647! sta cbmDataPtr .%648! bcc + .%649! inc cbmDataPtr+1 .%650! + sec .%651! lda cbmDataLen .%652! sbc cbmDataLimit .%653! sta cbmDataLen .%654! bcs + .%655! dec cbmDataLen+1 .%656! + lda cbmDataLen .%657! ora cbmDataLen+1 .%658! bne commieOutMore If we are finished with the cluster, then clear the CBM Kernal output channel. .%659! jsr kernelClrchn .%660! clc .%661! rts .%662! The file copying main routine. Set up for the starting cluster, and call the cluster copying routine until end-of-file is reached. Checks for a NULL cluster pointer in the directory entry to handle zero-length files. .%663! copyFile = * ;( startCluster, lenML, .A=transMode, .X=lfn ) : .CS=err .%664! ldy #$0e .%665! sty $ff00 .%666! sta transMode .%667! stx lfn .%668! lda startCluster .%669! ldy startCluster+1 .%670! sta cluster .%671! sty cluster+1 .%672! jmp + .%673! - jsr copyFileCluster .%674! bcc + .%675! rts .%676! + lda cluster .%677! cmp #$ff .%678! bne - .%679! lda cluster+1 .%680! cmp #$0f .%681! bne - .%682! clc .%683! rts .%684! This is the translation table used to convert from ASCII to PETSCII. You can modify it to suit your needs if you wish. If you cannot reassemble this file, then you can sift through the binary file and locate the tabel and change it there. An entry of $00 means the corresponding ASCII character will not be translated. You'll notice that I have set up translations for the following ASCII control characters into PETSCII: Backspace, Tab, Linefeed (CR), and Formfeed. I also translate the non-PETSCII characters such as {, |, ~, and _ according to what they probably would have been if Commodore wasn't so concerned with the graphics characters. .%685! transBuf = * .%686! ;0 1 2 3 4 5 6 7 8 9 a b c d e f .%687! .byte $00,$00,$00,$00,$00,$00,$00,$00,$14,$09,$0d,$00,$93,$00,$00,$00 ;0 .%688! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;1 .%689! .byte $20,$21,$22,$23,$24,$25,$26,$27,$28,$29,$2a,$2b,$2c,$2d,$2e,$2f ;2 .%690! .byte $30,$31,$32,$33,$34,$35,$36,$37,$38,$39,$3a,$3b,$3c,$3d,$3e,$3f ;3 .%691! .byte $40,$c1,$c2,$c3,$c4,$c5,$c6,$c7,$c8,$c9,$ca,$cb,$cc,$cd,$ce,$cf ;4 .%692! .byte $d0,$d1,$d2,$d3,$d4,$d5,$d6,$d7,$d8,$d9,$da,$5b,$5c,$5d,$5e,$5f ;5 .%693! .byte $c0,$41,$42,$43,$44,$45,$46,$47,$48,$49,$4a,$4b,$4c,$4d,$4e,$4f ;6 .%694! .byte $50,$51,$52,$53,$54,$55,$56,$57,$58,$59,$5a,$db,$dc,$dd,$de,$df ;7 .%695! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;8 .%696! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;9 .%697! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;a .%698! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;b .%699! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;c .%700! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;d .%701! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;e .%702! .byte $00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00,$00 ;f .%703! This is where the track cache, etc. are stored. This section requires 11K of storage space but does not increase the length of the binary program file since these storage areas are DEFINED rather than allocated with ".buf" directives. The Unix terminology for this type of uninitialized data is "bss". .%704! ;====bss storage==== .%705! .%706! bss = * .%707! trackbuf = bss .%708! clusterBuf = trackbuf+4608 .%709! fatbuf = clusterBuf+1024 .%710! dirbuf = fatbuf+1536 .%711! end = dirbuf+4096 5. USER-INTERFACE PROGRAM This section presents the listing of the user-interface BASIC program. You should be aware that you can easily change some of the defaults to your own preferences if you wish. This program is not listed in the ".%nnn!" format that the assembler listing is since you can recover this listing from the uuencoded binary program file. This program should be a little easier to follow than the assembler listing since BASIC is a self-commenting language. :-) 10 rem little red reader, by craig bruce, 30-sep-92, for c= hacking netmag 11 : These lines set up the default CBM-DOS and MS-DOS device numbers, taking care to disallow them to be the same device. You can change this to your own drive configuration. 20 cd=peek(186) : rem ** default cbm-dos drive ** 25 dv=9:dt=0 : rem ** ms-dos drive, type (0=1571,255=1581) 26 if dv=cd then dv=8:dt=0 : rem ** alternate ms-dos drive 27 : 30 print chr$(147);"initializing..." : print 40 bank0 : pk=dec("8000") 50 if peek(pk+9)=dec("cb") and peek(pk+10)=131 then 60 55 print"loading machine language routines..." : bload"lrr.bin",u(cd) 60 poke pk+16,dv : poke pk+17,dt : sys pk I "dim" the following variables before the arrays to avoid the overhead of pushing the arrays around when creating new scalar variables. 70 dim t,r,b,i,a$,c,dt$,fl$,il$,x,x$ 80 dim di$(128),cl(128),sz(128) 90 if dt=255 then dt$="1581" :else dt$="1571" 100 fl$=chr$(19)+chr$(17)+chr$(17)+chr$(17)+chr$(17) 110 il$=fl$:fori=1to19:il$=il$+chr$(17):next 120 goto 500 130 : 131 rem ** load ms-dos directory ** 140 print"loading directory..." : print 150 sys pk : sys pk+3 160 dl=0 The "rreg" instruction returns the return values of the .A, .X, .Y, and .S registers from the last "sys" call. I check the 1-bit of the .S register (the Carry flag) for error returns. 170 rreg bl,dc,bh,s : e=peek(pk+15) 180 if (s and 1) then gosub 380 : return 190 print"scanning directory..." : print 200 db=bl+256*bh 210 if dc=0 then 360 220 for dp=db to db+32*(dc-1) step 32 230 if peek(dp)=0 or peek(dp)=229 then 350 240 if peek(dp+12) and 24 then 350 250 dl=dl+1 This next line is where I set the default selection status, translation type, and CBM file type for the MS-DOS files. You can change these defaults simply by overtyping the string in ( | ||| ||| ) the "V" locations. V VVV VVV 260 d$=right$(" "+str$(dl),3)+" asc seq " : rem ** default sel/tr/ft ** 270 a$="" : fori=0to10 : a$=a$+chr$(peek(dp+i)) : next 280 a$=left$(a$,8)+" "+right$(a$,3) 290 print dl; a$ 300 d$=d$+a$+" " 310 cl(dl)=peek(dp+26)+256*peek(dp+27) 320 sz=peek(dp+28)+256*peek(dp+29)+65536*peek(dp+30) 330 di$(dl)=d$+right$(" "+str$(sz),6) 340 sz(dl)=sz 350 next dp 360 return 370 : 371 rem ** report ms-dos disk error ** 380 print chr$(18);"ms-dos disk error #";mid$(str$(e),2); 390 print " ($";mid$(hex$(e),3);"), press key.";chr$(146) 400 getkey a$ : return 410 : 411 rem ** screen heading ** 420 printchr$(147);"ms-dev=";mid$(str$(dv),2);" ms-type=";dt$; 430 print" cbm-dev=";mid$(str$(cd),2):print 440 return 450 : 451 rem ** screen footing ** 460 print il$;"d=directory m=ms-dev f=cbm-dev q=quit" 470 print"t=toggle-column, c=copy-files, +/- page"; 480 return 490 : 491 rem ** main routine ** 500 t=1 : c=0 510 r=0 520 gosub 420 530 print "num s trn typ filename ext length" 540 print "--- - --- --- -------- --- ------" 550 gosub 460 560 b=t+17 : if b>dl then b=dl 570 print fl$;: if t>dl then 590 580 for i=t to b : print di$(i) : next 590 if dl=0 then print chr$(18);"<no files>";chr$(146) 600 if dl=0 then 660 610 print left$(il$,r+5);chr$(18); 620 on c+1 goto 630,640,650 630 print spc(4);mid$(di$(t+r),5,3) : goto 660 640 print spc(7);mid$(di$(t+r),8,5) : goto 660 650 print spc(12);mid$(di$(t+r),13,5) : goto 660 660 getkey a$ Oh shi^Hoot. I screwed up the following line in the string after the "+chr$(13)+" part. You'll notice that I have avoided putting cursor control characters into the strings everywhere else, but I forgot to do that here. The "{stuff}" should be CursorUp, CursorDown, CursorLeft, CursorRight, CursorHome, and CursorCLR control characters, respectively. These characters give the index for the "on" statement below. 670 i=instr("dmftc+-q "+chr$(13)+"{stuff}",a$) 680 print left$(il$,r+5);di$(t+r) 690 if i=0 then 600 700 onigoto760,1050,1110,950,1150,1000,1020,730,860,860,770,790,810,830,850,500 710 stop 720 : 721 rem ** various menu options ** 730 print chr$(147);"have an awesome day." 740 end 760 gosub 420 : gosub 140 : goto 500 770 r=r-1 : if r<0 then r=b-t 780 goto 600 790 r=r+1 : if t+r>b then r=0 800 goto 600 810 c=c-1 : if c<0 then c=2 820 goto 600 830 c=c+1 : if c>2 then c=0 840 goto 600 850 r=0 : c=0 : goto 600 860 if dl=0 then 600 870 x=t+r : on c+1 gosub 890,910,930 880 print left$(il$,r+5);di$(x) : goto 600 890 if mid$(di$(x),6,1)=" " then x$="*" :else x$=" " 900 mid$(di$(x),6,1)=x$ : return 910 if mid$(di$(x),9,1)="a" then x$="bin" :else x$="asc" 920 mid$(di$(x),9,3)=x$ : return 930 if mid$(di$(x),14,1)="s" then x$="prg" :else x$="seq" 940 mid$(di$(x),14,3)=x$ : return 950 if dl=0 then 600 960 for x=1 to dl 970 on c+1 gosub 890,910,930 980 next x 990 goto 520 1000 if b=dl then t=1 : goto 510 1010 t=t+18 : goto 510 1020 if t=1 then t=dl-(dl-int(dl/18)*18)+1 : goto 510 1030 t=t-18 : if t<1 then t=1 1040 goto 510 1050 print il$;chr$(27);"@"; 1060 input"ms-dos device number (8-30)";dv 1061 if cd=dv then print"ms-dos and cbm-dos devices must be different!":goto1060 1070 input"ms-dos device type (71/81)";x 1080 if x=8 or x=81 or x=1581 then dt=255:dt$="1581" :else dt=0:dt$="1571" 1090 poke pk+16,dv : poke pk+17,dt : sys pk 1100 goto 520 1110 print il$;chr$(27);"@"; 1120 input "cbm-dos device number (0-30)";cd 1130 if cd=dv then print"ms-dos and cbm-dos devices must be different!":goto1120 1140 goto 520 1141 : 1142 rem ** copy files ** 1150 print chr$(147);"copy files":print:print 1160 if dl=0 then fc=0 : goto 1190 1170 fc=0 : for f=1 to dl : if mid$(di$(f),6,1)="*" then gosub 1200 1180 next f 1190 print : print"files copied =";fc;" - press key" 1191 getkey a$ : goto 520 1200 fc=fc+1 1210 x$=mid$(di$(f),19,8)+"."+mid$(di$(f),29,3) 1220 cf$="":fori=1tolen(x$):if mid$(x$,i,1)<>" " then cf$=cf$+mid$(x$,i,1) 1230 next 1231 if right$(cf$,1)="." then cf$=left$(cf$,len(cf$)-1) 1232 cf$=cf$+","+mid$(di$(f),14,1) 1240 print str$(fc);". ";chr$(34);cf$;chr$(34);tab(20);sz(f)"bytes"; 1245 print tab(35);mid$(di$(f),9,3) 1250 cl=cl(f) : lb=sz(f) - int(sz(f)/65536)*65536 I had to use a DOPEN statement here for disk files because the regular OPEN statment does not redirect the DS and DS$ pseudo-variables. You'll notice that the non-disk OPEN statment below has a secondary address of 7. This is to put the printer into lowercase mode if you are outputting directly to it. You can replace this with a 5 (or whatever) if you have a special interface to an IBM-compatible printer and you want to print directly in ASCII. In this case, you would select the "BIN" translation mode for the file you are routing directly to the printer. 1260 if cd>=8 then dopen#1,(cf$+",w"),u(cd) :else if cd<>0 then open 1,cd,7 1265 if cd<8 then 1288 1270 if ds<>63 then 1288 1275 x$="y" : print "file exists; overwrite (y/n)"; 1280 close 1 : input x$ : if x$="n" then fc=fc-1 : return 1285 scratch(cf$),u(cd) 1286 dopen#1,(cf$+",w"),u(cd) 1288 if cd<8 then 1320 1300 if ds<20 then 1320 1310 print chr$(18)+"cbm disk error: "+ds$ : fc=fc-1 : close1 : return 1320 poke pk+19,cl/256 : poke pk+18,cl-peek(pk+19)*256 1330 poke pk+21,lb/256 : poke pk+20,lb-peek(pk+21)*256 1340 tr=0 : if mid$(di$(f),9,1)="a" then tr=255 1346 x=1 : if cd=0 then x=0 1350 sys pk+6,tr,x 1355 rreg x,x,x,s : e=peek(pk+15) 1356 if (s and 1) then gosub 380 : fc=fc-1 1360 if cd<>0 and cd<8 then close1 1370 if cd>=8 then dclose#1 : if ds>=20 then 1310 1380 return 6. UUENCODED FILES Here are the binary executables in uuencoded form. The CRC32s of the two files are as follows: "lrr.128" 1106058594 "lrr.bin" 460671650 The "lrr.128" file is the main BASIC program and the "lrr.bin" file contains the machine lanugage disk-accessing routines. [LRR.128 and LRR.BIN are included in this archive. -RW] 7. BIBLIOGRAPHY The following works were consulted in creating this article: [1] Jim Butterfield, "Jim Butterfield's Complete C128 Memory Map", _The_Transactor_, Volume 7, Issue 01, July 1986 (A Must!). [2] Commodore Business Machines, _Commodore_1571_Disk_Drive_User's_Guide_, CBM, 1985. [3] Some program called "msdos-to-128" included with "cs-dos" by M. G-something. Originally published in COMPUTE!'s Gazzette, I think. [4] Commodore Business Machines, _Commodore_128_Programmer's_Reference_Guide_, Bantam Books, 1986. [5] _The_Transactor_, Volume 4, Issue 05 ("The Reference Issue"), May 1983. =============================================================================