home *** CD-ROM | disk | FTP | other *** search
- Copyright (C) 1987 Free Software Foundation, Inc.
- Contributed by Michael Tiemann (tiemann@mcc.com)
-
- This file is part of GNU CC.
-
- GNU CC is free software; you can redistribute it and/or modify
- it under the terms of the GNU General Public License as published by
- the Free Software Foundation; either version 1, or (at your option)
- any later version.
-
- GNU CC is distributed in the hope that it will be useful,
- but WITHOUT ANY WARRANTY; without even the implied warranty of
- MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
- GNU General Public License for more details.
-
- You should have received a copy of the GNU General Public License
- along with GNU CC; see the file COPYING. If not, write to
- the Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA.
-
-
- This file describes the implementation notes of the GNU C Compiler for
- the National Semiconductor 32032 chip (and 32000 family).
-
- The 32032 machine description and configuration file for this compiler
- is, for NS32000 family machine, primarily machine independent.
- However, since this release still depends on vendor-supplied
- assemblers and linkers, the compiler must obey the existing
- conventions of the actual machine to which this compiler is targeted.
- In this case, the actual machine which this compiler was targeted to
- is a Sequent Balance 8000, running DYNIX 2.1.
-
- The assembler for DYNIX 2.1 (and DYNIX 3.0, alas) does not cope with
- the full generality of the addressing mode REGISTER RELATIVE.
- Specifically, it generates incorrect code for operands of the
- following form:
-
- sym(rn)
-
- Where `rn' is one of the general registers. Correct code is generated
- for operands of the form
-
- sym(pn)
-
- where `pn' is one of the special processor registers (sb, fp, or sp).
-
- An equivalent operand can be generated by the form
-
- sym[rn:b]
-
- although this addressing mode is about twice as slow on the 32032.
-
- The more efficient addressing mode is controlled by defining the
- constant SEQUENT_ADDRESS_BUG to 0. It is currently defined to be 1.
-
- Another bug in the assembler makes it impossible to compute with
- explicit addresses. In order to compute with a symbolic address, it
- is necessary to load that address into a register using the "addr"
- instruction. For example, it is not possible to say
-
- cmpd _p,@_x
-
- Rather one must say
-
- addr _x,rn
- cmpd _p,rn
-
-
- The ns32032 chip has a number of known bugs. Any attempt to make the
- compiler unaware of these deficiencies will surely bring disaster.
- The current list of know bugs are as follows (list provided by Richard
- Stallman):
-
- 1) instructions with two overlapping operands in memory
- (unlikely in C code, perhaps impossible).
-
- 2) floating point conversion instructions with constant
- operands (these may never happen, but I'm not certain).
-
- 3) operands crossing a page boundary. These can be prevented
- by setting the flag in tm.h that requires strict alignment.
-
- 4) Scaled indexing in an insn following an insn that has a read-write
- operand in memory. This can be prevented by placing a no-op in
- between. I, Michael Tiemann, do not understand what exactly is meant
- by `read-write operand in memory'. If this is referring to the special
- TOS mode, for example "addd 5,tos" then one need not fear, since this
- will never be generated. However, is this includes "addd 5,-4(fp)"
- then there is room for disaster. The Sequent compiler does not insert
- a no-op for code involving the latter, and I have been informed that
- Sequent is aware of this list of bugs, so I must assume that it is not
- a problem.
-
- 5) The 32032 cannot shift by 32 bits. It shifts modulo the word size
- of the operand. Therefore, for 32-bit operations, 32-bit shifts are
- interpreted as zero bit shifts. 32-bit shifts have been removed from
- the compiler, but future hackers must be careful not to reintroduce
- them.
-
- 6) The ns32032 is a very slow chip; however, some instructions are
- still very much slower than one might expect. For example, it is
- almost always faster to double a quantity by adding it to itself than
- by shifting it by one, even if that quantity is deep in memory. The
- MOVM instruction has a 20-cycle setup time, after which it moves data
- at about the speed that normal moves would. It is also faster to use
- address generation instructions than shift instructions for left
- shifts less than 4. I do not claim that I generate optimal code for all
- given patterns, but where I did escape from National's "clean
- architecture", I did so because the timing specification from the data
- book says that I will win if I do. I suppose this is called the
- "performance gap".
-
-
- Signed bitfield extraction has not been implemented. It is not
- provided by the NS32032, and while it is most certainly possible to do
- better than the standard shift-left/shift-right sequence, it is also
- quite hairy. Also, since signed bitfields do not yet exist in C, this
- omission seems relatively harmless.
-
-
- Zero extractions could be better implemented if it were possible in
- GCC to provide sized zero extractions: i.e. a byte zero extraction
- would be allowed to yield a byte result. The current implementation
- of GCC manifests 68000-ist thinking, where bitfields are extracted
- into a register, and automatically sign/zero extended to fill the
- register. See comments in ns32k.md around the "extzv" insn for more
- details.
-
-
- It should be noted that while the NS32000 family was designed to
- provide odd-aligned addressing capability for multi-byte data (also
- provided by the 68020, but not by the 68000 or 68010), many machines
- do not opt to take advantage of this. For example, on the sequent,
- although there is no advantage to long-word aligning word data, shorts
- must be int-aligned in structs. This is an example of another
- machine-specific machine dependency.
-
-
- Because the ns32032 is has a coherent byte-order/bit-order
- architecture, many instructions which would be different for
- 68000-style machines, fold into the same instruction for the 32032.
- The classic case is push effective address, where it does not matter
- whether one is pushing a long, word, or byte address. They all will
- push the same address.
-
-
- The macro FUNCTION_VALUE_REGNO_P is probably not sufficient, what is
- needed is FUNCTION_VALUE_P, which also takes a MODE parameter. In
- this way it will be possible to determine more exactly whether a
- register is really a function value register, or just one that happens
- to look right.
-