RESTRICTION: The HP DS disk is not debugged. DO NOT enable this feature for normal operations. WARNING: Massive changes in the PDP-11 make all previous SAVEd file obsolete. Do not attempt to use a PDP-11 SAVE file from a prior release with V3.3! 1. New Features in 3.3 1.1 SCP - Added -p (powerup) qualifier to RESET - Changed SET <unit> ONLINE/OFFLINE to SET <unit> ENABLED/DISABLED - Moved SET DEBUG under SET CONSOLE hierarchy - Added optional parameter value to SHOW command - Added output file option to SHOW command 1.2 PDP-11 - Separated RH Massbus adapter from RP controller - Added TU tape support - Added model emulation framework - Added model details 1.3 VAX - Separated out CVAX-specific features from core instruction simulator - Implemented capability for CIS, octaword, compatibility mode instructions - Added instruction display and parse for compatibility mode - Changed SET CPU VIRTUAL=n to SHOW CPU VIRTUAL=n - Added =n optional parameter to SHOW CPU HISTORY 1.4 Unibus/Qbus simulators (PDP-11, VAX, PDP-10) - Simplified DMA API's - Modified DMA peripherals to use simplified API's 1.5 HP2100 (all changes from Dave Bryan) CPU - moved MP into its own device; added MP option jumpers - modified DMA to allow disabling - modified SET CPU 2100/2116 to truncate memory > 32K - added -F switch to SET CPU to force memory truncation - modified WRU to be REG_HRO - added BRK and DEL to save console settings DR - provided protected tracks and "Writing Enabled" status bit - added "parity error" status return on writes for 12606 - added track origin test for 12606 - added SCP test for 12606 - added "Sector Flag" status bit - added "Read Inhibit" status bit for 12606 - added TRACKPROT modifier LPS - added SET OFFLINE/ONLINE, POWEROFF/POWERON - added fast/realistic timing - added debug printouts LPT - added SET OFFLINE/ONLINE, POWEROFF/POWERON PTR - added paper tape loop mode, DIAG/READER modifiers to PTR - added PV_LEFT to PTR TRLLIM register CLK - modified CLK to permit disable 1.6 IBM 1401, IBM 1620, Interdata 16b, SDS 940, PDP-10 - Added instruction history 1.7 H316, PDP-15, PDP-8 - Added =n optional value to SHOW CPU HISTORY 2. Bugs Fixed in 3.3 2.1 SCP - Fixed comma-separated SET options (from Dave Bryan) - Fixed duplicate HELP displays with user-specified commands 2.2 PDP-10 - Replicated RP register state per drive - Fixed TU to set FCE on short record - Fixed TU to return bit<15> in drive type - Fixed TU format specification, 1:0 are don't cares - Fixed TU handling of TMK status - Fixed TU handling of DONE, ATA at end of operation - Implemented TU write check 2.3 PDP-11 - Replicated RP register state per drive - Fixed RQ, TQ to report correct controller type and stage 1 configuration flags on a Unibus system - Fixed HK CS2<output_ready> flag 2.4 VAX - Fixed parsing of indirect displacement modes in instruction input 2.5 HP2100 (all fixes from Dave Bryan) CPU - fixed S-register behavior on 2116 - fixed LIx/MIx behavior for DMA on 2116 and 2100 - fixed LIx/MIx behavior for empty I/O card slots DP - fixed enable/disable from either device - fixed ANY ERROR status for 12557A interface - fixed unattached drive status for 12557A interface - status cmd without prior STC DC now completes (12557A) - OTA/OTB CC on 13210A interface also does CLC CC - fixed RAR model - fixed seek check on 13210 if sector out of range DQ - fixed enable/disable from either device - shortened xtime from 5 to 3 (drive avg 156KW/second) - fixed not ready/any error status - fixed RAR model DR - fixed enable/disable from either device - fixed sector return in status word - fixed DMA last word write, incomplete sector fill value - fixed 12610 SFC operation - fixed current-sector determination IPL - fixed enable/disable from either device LPS - fixed status returns for error conditions - fixed handling of non-printing characters - fixed handling of characters after column 80 - improved timing model accuracy for RTE LPT - fixed status returns for error conditions - fixed TOF handling so form remains on line 0 SYS - fixed display of CCA/CCB/CCE instructions 2.5 PDP-15 FPP - fixed URFST to mask low 9b of fraction - fixed exception PC setting
1378 lines
42 KiB
C
1378 lines
42 KiB
C
/* vax_fpa.c - VAX f_, d_, g_floating instructions
|
||
|
||
Copyright (c) 1998-2004, Robert M Supnik
|
||
|
||
Permission is hereby granted, free of charge, to any person obtaining a
|
||
copy of this software and associated documentation files (the "Software"),
|
||
to deal in the Software without restriction, including without limitation
|
||
the rights to use, copy, modify, merge, publish, distribute, sublicense,
|
||
and/or sell copies of the Software, and to permit persons to whom the
|
||
Software is furnished to do so, subject to the following conditions:
|
||
|
||
The above copyright notice and this permission notice shall be included in
|
||
all copies or substantial portions of the Software.
|
||
|
||
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
||
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
||
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL
|
||
ROBERT M SUPNIK BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
|
||
IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
|
||
CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
|
||
|
||
Except as contained in this notice, the name of Robert M Supnik shall not
|
||
be used in advertising or otherwise to promote the sale, use or other dealings
|
||
in this Software without prior written authorization from Robert M Supnik.
|
||
|
||
30-Sep-04 RMS Comment and formating changes based on vax_octa.c
|
||
18-Apr-04 RMS Moved format definitions to vax_defs.h
|
||
19-Jun-03 RMS Simplified add algorithm
|
||
16-May-03 RMS Fixed bug in floating to integer convert overflow
|
||
Fixed multiple bugs in EMODx
|
||
Integrated 32b only code
|
||
05-Jul-02 RMS Changed internal routine names for C library conflict
|
||
17-Apr-02 RMS Fixed bug in EDIV zero quotient
|
||
|
||
This module contains the instruction simulators for
|
||
|
||
- 64 bit arithmetic (ASHQ, EMUL, EDIV)
|
||
- single precision floating point
|
||
- double precision floating point, D and G format
|
||
*/
|
||
|
||
#include "vax_defs.h"
|
||
#include <setjmp.h>
|
||
|
||
extern int32 R[16];
|
||
extern int32 PSL;
|
||
extern int32 p1;
|
||
extern jmp_buf save_env;
|
||
|
||
extern int32 Read (uint32 va, int32 size, int32 acc);
|
||
|
||
#if defined (USE_INT64)
|
||
|
||
#define M64 0xFFFFFFFFFFFFFFFF /* 64b */
|
||
#define FD_FRACW (0xFFFF & ~(FD_EXP | FPSIGN))
|
||
#define FD_FRACL (FD_FRACW | 0xFFFF0000) /* f/d fraction */
|
||
#define G_FRACW (0xFFFF & ~(G_EXP | FPSIGN))
|
||
#define G_FRACL (G_FRACW | 0xFFFF0000) /* g fraction */
|
||
#define UNSCRAM(h,l) (((((t_uint64) (h)) << 48) & 0xFFFF000000000000) | \
|
||
((((t_uint64) (h)) << 16) & 0x0000FFFF00000000) | \
|
||
((((t_uint64) (l)) << 16) & 0x00000000FFFF0000) | \
|
||
((((t_uint64) (l)) >> 16) & 0x000000000000FFFF))
|
||
#define CONCAT(h,l) ((((t_uint64) (h)) << 32) | ((uint32) (l)))
|
||
|
||
struct ufp {
|
||
int32 sign;
|
||
int32 exp;
|
||
t_uint64 frac; };
|
||
|
||
typedef struct ufp UFP;
|
||
|
||
#define UF_NM 0x8000000000000000 /* normalized */
|
||
#define UF_FRND 0x0000008000000000 /* F round */
|
||
#define UF_DRND 0x0000000000000080 /* D round */
|
||
#define UF_GRND 0x0000000000000400 /* G round */
|
||
#define UF_V_NM 63
|
||
#define UF_V_FDHI 40
|
||
#define UF_V_FDLO (UF_V_FDHI - 32)
|
||
#define UF_V_GHI 43
|
||
#define UF_V_GLO (UF_V_GHI - 32)
|
||
#define UF_GETFDHI(x) (int32) ((((x) >> (16 + UF_V_FDHI)) & FD_FRACW) | \
|
||
(((x) >> (UF_V_FDHI - 16)) & ~0xFFFF))
|
||
#define UF_GETFDLO(x) (int32) ((((x) >> (16 + UF_V_FDLO)) & 0xFFFF) | \
|
||
(((x) << (16 - UF_V_FDLO)) & ~0xFFFF))
|
||
#define UF_GETGHI(x) (int32) ((((x) >> (16 + UF_V_GHI)) & G_FRACW) | \
|
||
(((x) >> (UF_V_GHI - 16)) & ~0xFFFF))
|
||
#define UF_GETGLO(x) (int32) ((((x) >> (16 + UF_V_GLO)) & 0xFFFF) | \
|
||
(((x) << (16 - UF_V_GLO)) & ~0xFFFF))
|
||
|
||
void unpackf (int32 hi, UFP *a);
|
||
void unpackd (int32 hi, int32 lo, UFP *a);
|
||
void unpackg (int32 hi, int32 lo, UFP *a);
|
||
void norm (UFP *a);
|
||
int32 rpackfd (UFP *a, int32 *rh);
|
||
int32 rpackg (UFP *a, int32 *rh);
|
||
void vax_fadd (UFP *a, UFP *b, t_int64 mask);
|
||
void vax_fmul (UFP *a, UFP *b, int32 prec, int32 bias, t_int64 mask);
|
||
void vax_fdiv (UFP *b, UFP *a, int32 prec, int32 bias);
|
||
void vax_fmod (UFP *a, int32 bias, int32 *intgr, int32 *flg);
|
||
|
||
/* Quadword arithmetic shift
|
||
|
||
opnd[0] = shift count (cnt.rb)
|
||
opnd[1:2] = source (src.rq)
|
||
opnd[3:4] = destination (dst.wq)
|
||
*/
|
||
|
||
int32 op_ashq (int32 *opnd, int32 *rh, int32 *flg)
|
||
{
|
||
t_int64 src, r;
|
||
int32 sc = opnd[0];
|
||
|
||
src = CONCAT (opnd[2], opnd[1]); /* build src */
|
||
if (sc & BSIGN) { /* right shift? */
|
||
*flg = 0; /* no ovflo */
|
||
sc = 0x100 - sc; /* |shift| */
|
||
if (sc > 63) r = (opnd[2] & LSIGN)? -1: 0; /* sc > 63? */
|
||
else r = src >> sc; }
|
||
else { if (sc > 63) { /* left shift */
|
||
r = 0; /* sc > 63? */
|
||
*flg = (src != 0); } /* ovflo test */
|
||
else {
|
||
r = src << sc; /* do shift */
|
||
*flg = (src != (r >> sc)); } } /* ovflo test */
|
||
*rh = (int32) ((r >> 32) & LMASK); /* hi result */
|
||
return ((int32) (r & LMASK)); /* lo result */
|
||
}
|
||
|
||
/* Extended multiply subroutine */
|
||
|
||
int32 op_emul (int32 mpy, int32 mpc, int32 *rh)
|
||
{
|
||
t_int64 lmpy = mpy;
|
||
t_int64 lmpc = mpc;
|
||
|
||
lmpy = lmpy * lmpc;
|
||
*rh = (int32) ((lmpy >> 32) & LMASK);
|
||
return ((int32) (lmpy & LMASK));
|
||
}
|
||
|
||
/* Extended divide
|
||
|
||
opnd[0] = divisor (non-zero)
|
||
opnd[1:2] = dividend
|
||
*/
|
||
|
||
int32 op_ediv (int32 *opnd, int32 *rh, int32 *flg)
|
||
{
|
||
t_int64 ldvd, ldvr;
|
||
int32 quo, rem;
|
||
|
||
*flg = CC_V; /* assume error */
|
||
*rh = 0;
|
||
ldvr = ((opnd[0] & LSIGN)? -opnd[0]: opnd[0]) & LMASK; /* |divisor| */
|
||
ldvd = CONCAT (opnd[2], opnd[1]); /* 64b dividend */
|
||
if (opnd[2] & LSIGN) ldvd = -ldvd; /* |dividend| */
|
||
if (((ldvd >> 32) & LMASK) >= ldvr) return opnd[1]; /* divide work? */
|
||
quo = (int32) (ldvd / ldvr); /* do divide */
|
||
rem = (int32) (ldvd % ldvr);
|
||
if ((opnd[0] ^ opnd[2]) & LSIGN) { /* result -? */
|
||
quo = -quo; /* negate */
|
||
if (quo && ((quo & LSIGN) == 0)) return opnd[1]; } /* right sign? */
|
||
else if (quo & LSIGN) return opnd[1];
|
||
if (opnd[2] & LSIGN) rem = -rem; /* sign of rem */
|
||
*flg = 0; /* no overflow */
|
||
*rh = rem & LMASK; /* set rem */
|
||
return (quo & LMASK); /* return quo */
|
||
}
|
||
|
||
/* Compare floating */
|
||
|
||
int32 op_cmpfd (int32 h1, int32 l1, int32 h2, int32 l2)
|
||
{
|
||
t_uint64 n1, n2;
|
||
|
||
if ((h1 & FD_EXP) == 0) {
|
||
if (h1 & FPSIGN) RSVD_OPND_FAULT;
|
||
h1 = l1 = 0; }
|
||
if ((h2 & FD_EXP) == 0) {
|
||
if (h2 & FPSIGN) RSVD_OPND_FAULT;
|
||
h2 = l2 = 0; }
|
||
if ((h1 ^ h2) & FPSIGN) return ((h1 & FPSIGN)? CC_N: 0);
|
||
n1 = UNSCRAM (h1, l1);
|
||
n2 = UNSCRAM (h2, l2);
|
||
if (n1 == n2) return CC_Z;
|
||
return (((n1 < n2) ^ ((h1 & FPSIGN) != 0))? CC_N: 0);
|
||
}
|
||
|
||
int32 op_cmpg (int32 h1, int32 l1, int32 h2, int32 l2)
|
||
{
|
||
t_uint64 n1, n2;
|
||
|
||
if ((h1 & G_EXP) == 0) {
|
||
if (h1 & FPSIGN) RSVD_OPND_FAULT;
|
||
h1 = l1 = 0; }
|
||
if ((h2 & G_EXP) == 0) {
|
||
if (h2 & FPSIGN) RSVD_OPND_FAULT;
|
||
h2 = l2 = 0; }
|
||
if ((h1 ^ h2) & FPSIGN) return ((h1 & FPSIGN)? CC_N: 0);
|
||
n1 = UNSCRAM (h1, l1);
|
||
n2 = UNSCRAM (h2, l2);
|
||
if (n1 == n2) return CC_Z;
|
||
return (((n1 < n2) ^ ((h1 & FPSIGN) != 0))? CC_N: 0);
|
||
}
|
||
|
||
/* Integer to floating convert */
|
||
|
||
int32 op_cvtifdg (int32 val, int32 *rh, int32 opc)
|
||
{
|
||
UFP a;
|
||
|
||
if (val == 0) {
|
||
if (rh) *rh = 0;
|
||
return 0; }
|
||
if (val < 0) {
|
||
a.sign = FPSIGN;
|
||
val = - val; }
|
||
else a.sign = 0;
|
||
a.exp = 32 + ((opc & 0x100)? G_BIAS: FD_BIAS);
|
||
a.frac = ((t_uint64) val) << (UF_V_NM - 31);
|
||
norm (&a);
|
||
if (opc & 0x100) return rpackg (&a, rh);
|
||
return rpackfd (&a, rh);
|
||
}
|
||
|
||
/* Floating to integer convert */
|
||
|
||
int32 op_cvtfdgi (int32 *opnd, int32 *flg, int32 opc)
|
||
{
|
||
UFP a;
|
||
int32 lnt = opc & 03;
|
||
int32 ubexp;
|
||
static t_uint64 maxv[4] = { 0x7F, 0x7FFF, 0x7FFFFFFF, 0x7FFFFFFF };
|
||
|
||
*flg = 0;
|
||
if (opc & 0x100) {
|
||
unpackg (opnd[0], opnd[1], &a);
|
||
ubexp = a.exp - G_BIAS; }
|
||
else { if (opc & 0x20) unpackd (opnd[0], opnd[1], &a);
|
||
else unpackf (opnd[0], &a);
|
||
ubexp = a.exp - FD_BIAS; }
|
||
if ((a.exp == 0) || (ubexp < 0)) return 0;
|
||
if (ubexp <= UF_V_NM) {
|
||
a.frac = a.frac >> (UF_V_NM - ubexp); /* leave rnd bit */
|
||
if ((opc & 03) == 03) a.frac = a.frac + 1; /* if CVTR, round */
|
||
a.frac = a.frac >> 1; /* now justified */
|
||
if (a.frac > (maxv[lnt] + (a.sign? 1: 0))) *flg = CC_V; }
|
||
else { *flg = CC_V; /* set overflow */
|
||
if (ubexp > (UF_V_NM + 32)) return 0;
|
||
a.frac = a.frac << (ubexp - UF_V_NM - 1); } /* no rnd bit */
|
||
return ((int32) ((a.sign? (a.frac ^ LMASK) + 1: a.frac) & LMASK));
|
||
}
|
||
|
||
/* Extended modularize
|
||
|
||
One of three floating point instructions dropped from the architecture,
|
||
EMOD presents two sets of complications. First, it requires an extended
|
||
fraction multiply, with precise (and unusual) truncation conditions.
|
||
Second, it has two write operands, a dubious distinction it shares
|
||
with EDIV.
|
||
*/
|
||
|
||
int32 op_emodf (int32 *opnd, int32 *intgr, int32 *flg)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackf (opnd[0], &a); /* unpack operands */
|
||
unpackf (opnd[2], &b);
|
||
a.frac = a.frac | (((t_uint64) opnd[1]) << 32); /* extend src1 */
|
||
vax_fmul (&a, &b, 32, FD_BIAS, LMASK); /* multiply */
|
||
vax_fmod (&a, FD_BIAS, intgr, flg); /* sep int & frac */
|
||
return rpackfd (&a, NULL); /* return frac */
|
||
}
|
||
|
||
int32 op_emodd (int32 *opnd, int32 *flo, int32 *intgr, int32 *flg)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackd (opnd[0], opnd[1], &a); /* unpack operands */
|
||
unpackd (opnd[3], opnd[4], &b);
|
||
a.frac = a.frac | opnd[2]; /* extend src1 */
|
||
vax_fmul (&a, &b, 64, FD_BIAS, 0); /* multiply */
|
||
vax_fmod (&a, FD_BIAS, intgr, flg); /* sep int & frac */
|
||
return rpackfd (&a, flo); /* return frac */
|
||
}
|
||
|
||
int32 op_emodg (int32 *opnd, int32 *flo, int32 *intgr, int32 *flg)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackg (opnd[0], opnd[1], &a); /* unpack operands */
|
||
unpackg (opnd[3], opnd[4], &b);
|
||
a.frac = a.frac | (opnd[2] >> 5); /* extend src1 */
|
||
vax_fmul (&a, &b, 64, G_BIAS, 0); /* multiply */
|
||
vax_fmod (&a, G_BIAS, intgr, flg); /* sep int & frac */
|
||
return rpackg (&a, flo); /* return frac */
|
||
}
|
||
|
||
/* Unpacked floating point routines */
|
||
|
||
void vax_fadd (UFP *a, UFP *b, t_int64 mask)
|
||
{
|
||
int32 ediff;
|
||
UFP t;
|
||
|
||
if (a->exp == 0) { /* s1 = 0? */
|
||
*a = *b;
|
||
return; }
|
||
if (b->exp == 0) return; /* s2 = 0? */
|
||
if ((a->exp < b->exp) || /* |s1| < |s2|? swap */
|
||
((a->exp == b->exp) && (a->frac < b->frac))) {
|
||
t = *a;
|
||
*a = *b;
|
||
*b = t; }
|
||
ediff = a->exp - b->exp; /* exp diff */
|
||
if (a->sign ^ b->sign) { /* eff sub? */
|
||
if (ediff) { /* exp diff? */
|
||
if (ediff > 63) b->frac = M64; /* retain sticky */
|
||
else b->frac = ((-((t_int64) b->frac) >> ediff) | /* denormalize */
|
||
(M64 << (64 - ediff))); /* preserve sign */
|
||
a->frac = a->frac + b->frac; } /* add frac */
|
||
else a->frac = a->frac - b->frac; /* sub frac */
|
||
a->frac = a->frac & ~mask;
|
||
norm (a); } /* normalize */
|
||
else { if (ediff > 63) b->frac = 0; /* add */
|
||
else if (ediff) b->frac = b->frac >> ediff; /* denormalize */
|
||
a->frac = a->frac + b->frac; /* add frac */
|
||
if (a->frac < b->frac) { /* chk for carry */
|
||
a->frac = UF_NM | (a->frac >> 1); /* shift in carry */
|
||
a->exp = a->exp + 1; } /* skip norm */
|
||
a->frac = a->frac & ~mask; }
|
||
return;
|
||
}
|
||
|
||
/* Floating multiply - 64b * 64b with cross products */
|
||
|
||
void vax_fmul (UFP *a, UFP *b, int32 prec, int32 bias, t_int64 mask)
|
||
{
|
||
t_uint64 ah, bh, al, bl, rhi, rlo, rmid1, rmid2;
|
||
|
||
if ((a->exp == 0) || (b->exp == 0)) { /* zero argument? */
|
||
a->frac = a->sign = a->exp = 0; /* result is zero */
|
||
return; }
|
||
a->sign = a->sign ^ b->sign; /* sign of result */
|
||
a->exp = a->exp + b->exp - bias; /* add exponents */
|
||
ah = (a->frac >> 32) & LMASK; /* split operands */
|
||
bh = (b->frac >> 32) & LMASK; /* into 32b chunks */
|
||
rhi = ah * bh; /* high result */
|
||
if (prec > 32) { /* 64b needed? */
|
||
al = a->frac & LMASK;
|
||
bl = b->frac & LMASK;
|
||
rmid1 = ah * bl;
|
||
rmid2 = al * bh;
|
||
rlo = al * bl;
|
||
rhi = rhi + ((rmid1 >> 32) & LMASK) + ((rmid2 >> 32) & LMASK);
|
||
rmid1 = rlo + (rmid1 << 32); /* add mid1 to lo */
|
||
if (rmid1 < rlo) rhi = rhi + 1; /* carry? incr hi */
|
||
rmid2 = rmid1 + (rmid2 << 32); /* add mid2 to to */
|
||
if (rmid2 < rmid1) rhi = rhi + 1; } /* carry? incr hi */
|
||
a->frac = rhi & ~mask; /* mask out */
|
||
norm (a); /* normalize */
|
||
return;
|
||
}
|
||
|
||
/* Floating modulus - there are three cases
|
||
|
||
exp <= bias - integer is 0, fraction is input,
|
||
no overflow
|
||
bias < exp <= bias+64 - separate integer and fraction,
|
||
integer overflow may occur
|
||
bias+64 < exp - result is integer, fraction is 0
|
||
integer overflow
|
||
*/
|
||
|
||
void vax_fmod (UFP *a, int32 bias, int32 *intgr, int32 *flg)
|
||
{
|
||
if (a->exp <= bias) *intgr = *flg = 0; /* 0 or <1? int = 0 */
|
||
else if (a->exp <= (bias + 64)) { /* in range? */
|
||
*intgr = (int32) (a->frac >> (64 - (a->exp - bias)));
|
||
if ((a->exp > (bias + 32)) || /* test ovflo */
|
||
((a->exp == (bias + 32)) &&
|
||
(((uint32) *intgr) > (a->sign? 0x80000000: 0x7FFFFFFF))))
|
||
*flg = CC_V;
|
||
else *flg = 0;
|
||
if (a->sign) *intgr = -*intgr; /* -? comp int */
|
||
a->frac = a->frac << (a->exp - bias);
|
||
a->exp = bias; }
|
||
else { *intgr = 0; /* out of range */
|
||
a->frac = a->sign = a->exp = 0; /* result 0 */
|
||
*flg = CC_V; } /* overflow */
|
||
norm (a); /* normalize */
|
||
return;
|
||
}
|
||
|
||
/* Floating divide
|
||
Needs to develop at least one rounding bit. Since the first
|
||
divide step can fail, caller should specify 2 more bits than
|
||
the precision of the fraction.
|
||
*/
|
||
|
||
void vax_fdiv (UFP *a, UFP *b, int32 prec, int32 bias)
|
||
{
|
||
int32 i;
|
||
t_uint64 quo = 0;
|
||
|
||
if (a->exp == 0) FLT_DZRO_FAULT; /* divr = 0? */
|
||
if (b->exp == 0) return; /* divd = 0? */
|
||
b->sign = b->sign ^ a->sign; /* result sign */
|
||
b->exp = b->exp - a->exp + bias + 1; /* unbiased exp */
|
||
a->frac = a->frac >> 1; /* allow 1 bit left */
|
||
b->frac = b->frac >> 1;
|
||
for (i = 0; (i < prec) && b->frac; i++) { /* divide loop */
|
||
quo = quo << 1; /* shift quo */
|
||
if (b->frac >= a->frac) { /* div step ok? */
|
||
b->frac = b->frac - a->frac; /* subtract */
|
||
quo = quo + 1; } /* quo bit = 1 */
|
||
b->frac = b->frac << 1; } /* shift divd */
|
||
b->frac = quo << (UF_V_NM - i + 1); /* shift quo */
|
||
norm (b); /* normalize */
|
||
return;
|
||
}
|
||
|
||
/* Support routines */
|
||
|
||
void unpackf (int32 hi, UFP *r)
|
||
{
|
||
r->sign = hi & FPSIGN; /* get sign */
|
||
r->exp = FD_GETEXP (hi); /* get exponent */
|
||
if (r->exp == 0) { /* exp = 0? */
|
||
if (r->sign) RSVD_OPND_FAULT; /* if -, rsvd op */
|
||
r->frac = 0; /* else 0 */
|
||
return; }
|
||
hi = (((hi & FD_FRACW) | FD_HB) << 16) | ((hi >> 16) & 0xFFFF);
|
||
r->frac = ((t_uint64) hi) << (32 + UF_V_FDLO);
|
||
return;
|
||
}
|
||
|
||
void unpackd (int32 hi, int32 lo, UFP *r)
|
||
{
|
||
r->sign = hi & FPSIGN; /* get sign */
|
||
r->exp = FD_GETEXP (hi); /* get exponent */
|
||
if (r->exp == 0) { /* exp = 0? */
|
||
if (r->sign) RSVD_OPND_FAULT; /* if -, rsvd op */
|
||
r->frac = 0; /* else 0 */
|
||
return; }
|
||
hi = (hi & FD_FRACL) | FD_HB; /* canonical form */
|
||
r->frac = UNSCRAM (hi, lo) << UF_V_FDLO; /* guard bits */
|
||
return;
|
||
}
|
||
|
||
void unpackg (int32 hi, int32 lo, UFP *r)
|
||
{
|
||
r->sign = hi & FPSIGN; /* get sign */
|
||
r->exp = G_GETEXP (hi); /* get exponent */
|
||
if (r->exp == 0) { /* exp = 0? */
|
||
if (r->sign) RSVD_OPND_FAULT; /* if -, rsvd op */
|
||
r->frac = 0; /* else 0 */
|
||
return; }
|
||
hi = (hi & G_FRACL) | G_HB; /* canonical form */
|
||
r->frac = UNSCRAM (hi, lo) << UF_V_GLO; /* guard bits */
|
||
return;
|
||
}
|
||
|
||
void norm (UFP *r)
|
||
{
|
||
int32 i;
|
||
static t_uint64 normmask[5] = {
|
||
0xc000000000000000, 0xf000000000000000, 0xff00000000000000,
|
||
0xffff000000000000, 0xffffffff00000000 };
|
||
static int32 normtab[6] = { 1, 2, 4, 8, 16, 32};
|
||
|
||
if (r->frac == 0) { /* if fraction = 0 */
|
||
r->sign = r->exp = 0; /* result is 0 */
|
||
return; }
|
||
while ((r->frac & UF_NM) == 0) { /* normalized? */
|
||
for (i = 0; i < 5; i++) { /* find first 1 */
|
||
if (r->frac & normmask[i]) break; }
|
||
r->frac = r->frac << normtab[i]; /* shift frac */
|
||
r->exp = r->exp - normtab[i]; } /* decr exp */
|
||
return;
|
||
}
|
||
|
||
int32 rpackfd (UFP *r, int32 *rh)
|
||
{
|
||
if (rh) *rh = 0; /* assume 0 */
|
||
if (r->frac == 0) return 0; /* result 0? */
|
||
r->frac = r->frac + (rh? UF_DRND: UF_FRND); /* round */
|
||
if ((r->frac & UF_NM) == 0) { /* carry out? */
|
||
r->frac = r->frac >> 1; /* renormalize */
|
||
r->exp = r->exp + 1; }
|
||
if (r->exp > (int32) FD_M_EXP) FLT_OVFL_FAULT; /* ovflo? fault */
|
||
if (r->exp <= 0) { /* underflow? */
|
||
if (PSL & PSW_FU) FLT_UNFL_FAULT; /* fault if fu */
|
||
return 0; } /* else 0 */
|
||
if (rh) *rh = UF_GETFDLO (r->frac); /* get low */
|
||
return r->sign | (r->exp << FD_V_EXP) | UF_GETFDHI (r->frac);
|
||
}
|
||
|
||
int32 rpackg (UFP *r, int32 *rh)
|
||
{
|
||
*rh = 0; /* assume 0 */
|
||
if (r->frac == 0) return 0; /* result 0? */
|
||
r->frac = r->frac + UF_GRND; /* round */
|
||
if ((r->frac & UF_NM) == 0) { /* carry out? */
|
||
r->frac = r->frac >> 1; /* renormalize */
|
||
r->exp = r->exp + 1; }
|
||
if (r->exp > (int32) G_M_EXP) FLT_OVFL_FAULT; /* ovflo? fault */
|
||
if (r->exp <= 0) { /* underflow? */
|
||
if (PSL & PSW_FU) FLT_UNFL_FAULT; /* fault if fu */
|
||
return 0; } /* else 0 */
|
||
if (rh) *rh = UF_GETGLO (r->frac); /* get low */
|
||
return r->sign | (r->exp << G_V_EXP) | UF_GETGHI (r->frac);
|
||
}
|
||
|
||
#else /* 32b code */
|
||
|
||
#define WORDSWAP(x) ((((x) & WMASK) << 16) | (((x) >> 16) & WMASK))
|
||
|
||
struct udp {
|
||
uint32 lo;
|
||
uint32 hi;
|
||
};
|
||
|
||
typedef struct udp UDP;
|
||
|
||
struct ufp {
|
||
int32 sign;
|
||
int32 exp;
|
||
struct udp frac; };
|
||
|
||
typedef struct ufp UFP;
|
||
|
||
#define UF_NM_H 0x80000000 /* normalized */
|
||
#define UF_FRND_H 0x00000080 /* F round */
|
||
#define UF_FRND_L 0x00000000
|
||
#define UF_DRND_H 0x00000000 /* D round */
|
||
#define UF_DRND_L 0x00000080
|
||
#define UF_GRND_H 0x00000000 /* G round */
|
||
#define UF_GRND_L 0x00000400
|
||
#define UF_V_NM 63
|
||
|
||
void unpackf (uint32 hi, UFP *a);
|
||
void unpackd (uint32 hi, uint32 lo, UFP *a);
|
||
void unpackg (uint32 hi, uint32 lo, UFP *a);
|
||
void norm (UFP *a);
|
||
int32 rpackfd (UFP *a, int32 *rh);
|
||
int32 rpackg (UFP *a, int32 *rh);
|
||
void vax_fadd (UFP *a, UFP *b, uint32 mask);
|
||
void vax_fmul (UFP *a, UFP *b, int32 prec, int32 bias, uint32 mask);
|
||
void vax_fmod (UFP *a, int32 bias, int32 *intgr, int32 *flg);
|
||
void vax_fdiv (UFP *b, UFP *a, int32 prec, int32 bias);
|
||
int32 vax_fcmp (UFP *a, UFP *b);
|
||
void dp_add (UDP *a, UDP *b);
|
||
void dp_inc (UDP *a);
|
||
void dp_sub (UDP *a, UDP *b);
|
||
void dp_imul (uint32 a, uint32 b, UDP *r);
|
||
void dp_lsh (UDP *a, uint32 sc);
|
||
void dp_rsh (UDP *a, uint32 sc);
|
||
void dp_rsh_s (UDP *a, uint32 sc, uint32 neg);
|
||
void dp_neg (UDP *a);
|
||
int32 dp_cmp (UDP *a, UDP *b);
|
||
|
||
/* Quadword arithmetic shift
|
||
|
||
opnd[0] = shift count (cnt.rb)
|
||
opnd[1:2] = source (src.rq)
|
||
opnd[3:4] = destination (dst.wq)
|
||
*/
|
||
|
||
int32 op_ashq (int32 *opnd, int32 *rh, int32 *flg)
|
||
{
|
||
UDP r, sr;
|
||
uint32 sc = opnd[0];
|
||
|
||
r.lo = opnd[1]; /* get source */
|
||
r.hi = opnd[2];
|
||
*flg = 0; /* assume no ovflo */
|
||
if (sc & BSIGN) /* right shift? */
|
||
dp_rsh_s (&r, 0x100 - sc, r.hi & LSIGN); /* signed right */
|
||
else { dp_lsh (&r, sc); /* left shift */
|
||
sr = r; /* copy result */
|
||
dp_rsh_s (&sr, sc, sr.hi & LSIGN); /* signed right */
|
||
if ((sr.hi != ((uint32) opnd[2])) || /* reshift != orig? */
|
||
(sr.lo != ((uint32) opnd[1]))) *flg = 1; } /* overflow */
|
||
*rh = r.hi; /* hi result */
|
||
return r.lo; /* lo result */
|
||
}
|
||
|
||
/* Extended multiply subroutine */
|
||
|
||
int32 op_emul (int32 mpy, int32 mpc, int32 *rh)
|
||
{
|
||
UDP r;
|
||
int32 sign = mpy ^ mpc; /* sign of result */
|
||
|
||
if (mpy & LSIGN) mpy = -mpy; /* abs value */
|
||
if (mpc & LSIGN) mpc = -mpc;
|
||
dp_imul (mpy & LMASK, mpc & LMASK, &r); /* 32b * 32b -> 64b */
|
||
if (sign & LSIGN) dp_neg (&r); /* negative result? */
|
||
*rh = r.hi;
|
||
return r.lo;
|
||
}
|
||
|
||
/* Extended divide
|
||
|
||
opnd[0] = divisor (non-zero)
|
||
opnd[1:2] = dividend
|
||
*/
|
||
|
||
int32 op_ediv (int32 *opnd, int32 *rh, int32 *flg)
|
||
{
|
||
UDP dvd;
|
||
uint32 i, dvr, quo;
|
||
|
||
dvr = opnd[0]; /* get divisor */
|
||
dvd.lo = opnd[1]; /* get dividend */
|
||
dvd.hi = opnd[2];
|
||
*flg = CC_V; /* assume error */
|
||
*rh = 0;
|
||
if (dvd.hi & LSIGN) dp_neg (&dvd); /* |dividend| */
|
||
if (dvr & LSIGN) dvr = NEG (dvr); /* |divisor| */
|
||
if (dvd.hi >= dvr) return opnd[1]; /* divide work? */
|
||
for (i = quo = 0; i < 32; i++) { /* 32 iterations */
|
||
quo = quo << 1; /* shift quotient */
|
||
dp_lsh (&dvd, 1); /* shift dividend */
|
||
if (dvd.hi >= dvr) { /* step work? */
|
||
dvd.hi = (dvd.hi - dvr) & LMASK; /* subtract dvr */
|
||
quo = quo + 1; } }
|
||
if ((opnd[0] ^ opnd[2]) & LSIGN) { /* result -? */
|
||
quo = NEG (quo); /* negate */
|
||
if (quo && ((quo & LSIGN) == 0)) return opnd[1]; } /* right sign? */
|
||
else if (quo & LSIGN) return opnd[1];
|
||
if (opnd[2] & LSIGN) *rh = NEG (dvd.hi); /* sign of rem */
|
||
else *rh = dvd.hi;
|
||
*flg = 0; /* no overflow */
|
||
return quo; /* return quo */
|
||
}
|
||
|
||
/* Compare floating */
|
||
|
||
int32 op_cmpfd (int32 h1, int32 l1, int32 h2, int32 l2)
|
||
{
|
||
UFP a, b;
|
||
int32 r;
|
||
|
||
unpackd (h1, l1, &a);
|
||
unpackd (h2, l2, &b);
|
||
if (a.sign != b.sign) return (a.sign? CC_N: 0);
|
||
r = a.exp - b.exp;
|
||
if (r == 0) r = dp_cmp (&a.frac, &b.frac);
|
||
if (r < 0) return (a.sign? 0: CC_N);
|
||
if (r > 0) return (a.sign? CC_N: 0);
|
||
return CC_Z;
|
||
}
|
||
|
||
int32 op_cmpg (int32 h1, int32 l1, int32 h2, int32 l2)
|
||
{
|
||
UFP a, b;
|
||
int32 r;
|
||
|
||
unpackg (h1, l1, &a);
|
||
unpackg (h2, l2, &b);
|
||
if (a.sign != b.sign) return (a.sign? CC_N: 0);
|
||
r = a.exp - b.exp;
|
||
if (r == 0) r = dp_cmp (&a.frac, &b.frac);
|
||
if (r < 0) return (a.sign? 0: CC_N);
|
||
if (r > 0) return (a.sign? CC_N: 0);
|
||
return CC_Z;
|
||
}
|
||
|
||
/* Integer to floating convert */
|
||
|
||
int32 op_cvtifdg (int32 val, int32 *rh, int32 opc)
|
||
{
|
||
UFP a;
|
||
|
||
if (val == 0) { /* zero? */
|
||
if (rh) *rh = 0; /* return true 0 */
|
||
return 0; }
|
||
if (val < 0) { /* negative? */
|
||
a.sign = FPSIGN; /* sign = - */
|
||
val = -val; }
|
||
else a.sign = 0; /* else sign = + */
|
||
a.exp = 32 + ((opc & 0x100)? G_BIAS: FD_BIAS); /* initial exp */
|
||
a.frac.hi = val & LMASK; /* fraction */
|
||
a.frac.lo = 0;
|
||
norm (&a); /* normalize */
|
||
if (opc & 0x100) return rpackg (&a, rh); /* pack and return */
|
||
return rpackfd (&a, rh);
|
||
}
|
||
|
||
/* Floating to integer convert */
|
||
|
||
int32 op_cvtfdgi (int32 *opnd, int32 *flg, int32 opc)
|
||
{
|
||
UFP a;
|
||
int32 lnt = opc & 03;
|
||
int32 ubexp;
|
||
static uint32 maxv[4] = { 0x7F, 0x7FFF, 0x7FFFFFFF, 0x7FFFFFFF };
|
||
|
||
*flg = 0;
|
||
if (opc & 0x100) { /* G? */
|
||
unpackg (opnd[0], opnd[1], &a); /* unpack */
|
||
ubexp = a.exp - G_BIAS; } /* unbiased exp */
|
||
else { if (opc & 0x20) unpackd (opnd[0], opnd[1], &a); /* F or D */
|
||
else unpackf (opnd[0], &a); /* unpack */
|
||
ubexp = a.exp - FD_BIAS; } /* unbiased exp */
|
||
if ((a.exp == 0) || (ubexp < 0)) return 0; /* true zero or frac? */
|
||
if (ubexp <= UF_V_NM) { /* exp in range? */
|
||
dp_rsh (&a.frac, UF_V_NM - ubexp); /* leave rnd bit */
|
||
if (lnt == 03) dp_inc (&a.frac); /* if CVTR, round */
|
||
dp_rsh (&a.frac, 1); /* now justified */
|
||
if ((a.frac.hi != 0) ||
|
||
(a.frac.lo > (maxv[lnt] + (a.sign? 1: 0)))) *flg = CC_V; }
|
||
else { *flg = CC_V; /* always ovflo */
|
||
if (ubexp > (UF_V_NM + 32)) return 0; /* in ext range? */
|
||
dp_lsh (&a.frac, ubexp - UF_V_NM - 1); } /* no rnd bit */
|
||
return (a.sign? NEG (a.frac.lo): a.frac.lo); /* return lo frac */
|
||
}
|
||
|
||
/* Extended modularize
|
||
|
||
One of three floating point instructions dropped from the architecture,
|
||
EMOD presents two sets of complications. First, it requires an extended
|
||
fraction multiply, with precise (and unusual) truncation conditions.
|
||
Second, it has two write operands, a dubious distinction it shares
|
||
with EDIV.
|
||
*/
|
||
|
||
int32 op_emodf (int32 *opnd, int32 *intgr, int32 *flg)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackf (opnd[0], &a); /* unpack operands */
|
||
unpackf (opnd[2], &b);
|
||
a.frac.hi = a.frac.hi | opnd[1]; /* extend src1 */
|
||
vax_fmul (&a, &b, 32, FD_BIAS, LMASK); /* multiply */
|
||
vax_fmod (&a, FD_BIAS, intgr, flg); /* sep int & frac */
|
||
return rpackfd (&a, NULL); /* return frac */
|
||
}
|
||
|
||
int32 op_emodd (int32 *opnd, int32 *flo, int32 *intgr, int32 *flg)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackd (opnd[0], opnd[1], &a); /* unpack operands */
|
||
unpackd (opnd[3], opnd[4], &b);
|
||
a.frac.lo = a.frac.lo | opnd[2]; /* extend src1 */
|
||
vax_fmul (&a, &b, 64, FD_BIAS, 0); /* multiply */
|
||
vax_fmod (&a, FD_BIAS, intgr, flg); /* sep int & frac */
|
||
return rpackfd (&a, flo); /* return frac */
|
||
}
|
||
|
||
int32 op_emodg (int32 *opnd, int32 *flo, int32 *intgr, int32 *flg)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackg (opnd[0], opnd[1], &a); /* unpack operands */
|
||
unpackg (opnd[3], opnd[4], &b);
|
||
a.frac.lo = a.frac.lo | (opnd[2] >> 5); /* extend src1 */
|
||
vax_fmul (&a, &b, 64, G_BIAS, 0); /* multiply */
|
||
vax_fmod (&a, G_BIAS, intgr, flg); /* sep int & frac */
|
||
return rpackg (&a, flo); /* return frac */
|
||
}
|
||
|
||
/* Unpacked floating point routines */
|
||
|
||
/* Floating add */
|
||
|
||
void vax_fadd (UFP *a, UFP *b, uint32 mask)
|
||
{
|
||
int32 ediff;
|
||
UFP t;
|
||
|
||
if (a->exp == 0) { /* s1 = 0? */
|
||
*a = *b;
|
||
return; }
|
||
if (b->exp == 0) return; /* s2 = 0? */
|
||
if ((a->exp < b->exp) || /* |s1| < |s2|? swap */
|
||
((a->exp == b->exp) && (dp_cmp (&a->frac, &b->frac) < 0))) {
|
||
t = *a;
|
||
*a = *b;
|
||
*b = t; }
|
||
ediff = a->exp - b->exp; /* exp diff */
|
||
if (a->sign ^ b->sign) { /* eff sub? */
|
||
if (ediff) { /* exp diff? */
|
||
dp_neg (&b->frac); /* negate fraction */
|
||
dp_rsh_s (&b->frac, ediff, 1); /* signed right */
|
||
dp_add (&a->frac, &b->frac); } /* "add" frac */
|
||
else dp_sub (&a->frac, &b->frac); /* a >= b */
|
||
a->frac.lo = a->frac.lo & ~mask; /* trim low result */
|
||
norm (a); } /* normalize */
|
||
else { if (ediff) dp_rsh (&b->frac, ediff); /* add, denormalize */
|
||
dp_add (&a->frac, &b->frac); /* add frac */
|
||
if (dp_cmp (&a->frac, &b->frac) < 0) { /* chk for carry */
|
||
dp_rsh (&a->frac, 1); /* renormalize */
|
||
a->frac.hi = a->frac.hi | UF_NM_H; /* add norm bit */
|
||
a->exp = a->exp + 1; } /* skip norm */
|
||
a->frac.lo = a->frac.lo & ~mask; } /* trim low result */
|
||
return;
|
||
}
|
||
|
||
/* Floating compare */
|
||
|
||
int32 vax_fcmp (UFP *a, UFP *b)
|
||
{
|
||
int32 r;
|
||
|
||
if (a->sign != b->sign) return ((a->sign)? CC_N: 0);
|
||
r = a->exp - b->exp;
|
||
if (r == 0) r = dp_cmp (&a->frac, &b->frac);
|
||
if (r == 0) return CC_Z;
|
||
if (r < 0) return (a->sign? 0: CC_N);
|
||
else return (a->sign? CC_N: 0);
|
||
}
|
||
|
||
/* Floating multiply - 64b * 64b with cross products */
|
||
|
||
void vax_fmul (UFP *a, UFP *b, int32 prec, int32 bias, uint32 mask)
|
||
{
|
||
UDP rhi, rlo, rmid1, rmid2;
|
||
|
||
if ((a->exp == 0) || (b->exp == 0)) { /* zero argument? */
|
||
a->frac.hi = a->frac.lo = 0; /* result is zero */
|
||
a->sign = a->exp = 0;
|
||
return; }
|
||
a->sign = a->sign ^ b->sign; /* sign of result */
|
||
a->exp = a->exp + b->exp - bias; /* add exponents */
|
||
dp_imul (a->frac.hi, b->frac.hi, &rhi); /* high result */
|
||
if (prec > 32) { /* 64b needed? */
|
||
dp_imul (a->frac.hi, b->frac.lo, &rmid1); /* cross products */
|
||
dp_imul (a->frac.lo, b->frac.hi, &rmid2);
|
||
dp_imul (a->frac.lo, b->frac.lo, &rlo); /* low result */
|
||
rhi.lo = (rhi.lo + rmid1.hi) & LMASK; /* add hi cross */
|
||
if (rhi.lo < rmid1.hi) /* to low high res */
|
||
rhi.hi = (rhi.hi + 1) & LMASK;
|
||
rhi.lo = (rhi.lo + rmid2.hi) & LMASK;
|
||
if (rhi.lo < rmid2.hi)
|
||
rhi.hi = (rhi.hi + 1) & LMASK;
|
||
rlo.hi = (rlo.hi + rmid1.lo) & LMASK; /* add mid1 to low res */
|
||
if (rlo.hi < rmid1.lo) dp_inc (&rhi); /* carry? incr high res */
|
||
rlo.hi = (rlo.hi + rmid2.lo) & LMASK; /* add mid2 to low res */
|
||
if (rlo.hi < rmid1.hi) dp_inc (&rhi); } /* carry? incr high res */
|
||
a->frac.hi = rhi.hi; /* mask low fraction */
|
||
a->frac.lo = rhi.lo & ~mask;
|
||
norm (a); /* normalize */
|
||
return;
|
||
}
|
||
|
||
/* Floating modulus - there are three cases
|
||
|
||
exp <= bias - integer is 0, fraction is input,
|
||
no overflow
|
||
bias < exp <= bias+64 - separate integer and fraction,
|
||
integer overflow may occur
|
||
bias+64 < exp - result is integer, fraction is 0
|
||
integer overflow
|
||
*/
|
||
|
||
void vax_fmod (UFP *a, int32 bias, int32 *intgr, int32 *flg)
|
||
{
|
||
UDP ifr;
|
||
|
||
if (a->exp <= bias) *intgr = *flg = 0; /* 0 or <1? int = 0 */
|
||
else if (a->exp <= (bias + 64)) { /* in range? */
|
||
ifr = a->frac;
|
||
dp_rsh (&ifr, 64 - (a->exp - bias)); /* separate integer */
|
||
if ((a->exp > (bias + 32)) || /* test ovflo */
|
||
((a->exp == (bias + 32)) &&
|
||
(ifr.lo > (a->sign? 0x80000000: 0x7FFFFFFF))))
|
||
*flg = CC_V;
|
||
else *flg = 0;
|
||
*intgr = ifr.lo;
|
||
if (a->sign) *intgr = -*intgr; /* -? comp int */
|
||
dp_lsh (&a->frac, a->exp - bias); /* excise integer */
|
||
a->exp = bias; }
|
||
else { *intgr = 0; /* out of range */
|
||
a->frac.hi = a->frac.lo = a->sign = a->exp = 0; /* result 0 */
|
||
*flg = CC_V; } /* overflow */
|
||
norm (a); /* normalize */
|
||
return;
|
||
}
|
||
|
||
/* Floating divide
|
||
Needs to develop at least one rounding bit. Since the first
|
||
divide step can fail, caller should specify 2 more bits than
|
||
the precision of the fraction.
|
||
*/
|
||
|
||
void vax_fdiv (UFP *a, UFP *b, int32 prec, int32 bias)
|
||
{
|
||
int32 i;
|
||
UDP quo = { 0, 0 };
|
||
|
||
if (a->exp == 0) FLT_DZRO_FAULT; /* divr = 0? */
|
||
if (b->exp == 0) return; /* divd = 0? */
|
||
b->sign = b->sign ^ a->sign; /* result sign */
|
||
b->exp = b->exp - a->exp + bias + 1; /* unbiased exp */
|
||
dp_rsh (&a->frac, 1); /* allow 1 bit left */
|
||
dp_rsh (&b->frac, 1);
|
||
for (i = 0; i < prec; i++) { /* divide loop */
|
||
dp_lsh (&quo, 1); /* shift quo */
|
||
if (dp_cmp (&b->frac, &a->frac) >= 0) { /* div step ok? */
|
||
dp_sub (&b->frac, &a->frac); /* subtract */
|
||
quo.lo = quo.lo + 1; } /* quo bit = 1 */
|
||
dp_lsh (&b->frac, 1); } /* shift divd */
|
||
dp_lsh (&quo, UF_V_NM - prec + 1); /* put in position */
|
||
b->frac = quo;
|
||
norm (b); /* normalize */
|
||
return;
|
||
}
|
||
|
||
/* Double precision integer routines */
|
||
|
||
int32 dp_cmp (UDP *a, UDP *b)
|
||
{
|
||
if (a->hi < b->hi) return -1; /* compare hi */
|
||
if (a->hi > b->hi) return +1;
|
||
if (a->lo < b->lo) return -1; /* hi =, compare lo */
|
||
if (a->lo > b->lo) return +1;
|
||
return 0; /* hi, lo equal */
|
||
}
|
||
|
||
void dp_add (UDP *a, UDP *b)
|
||
{
|
||
a->lo = (a->lo + b->lo) & LMASK; /* add lo */
|
||
if (a->lo < b->lo) a->hi = a->hi + 1; /* carry? */
|
||
a->hi = (a->hi + b->hi) & LMASK; /* add hi */
|
||
return;
|
||
}
|
||
|
||
void dp_inc (UDP *a)
|
||
{
|
||
a->lo = (a->lo + 1) & LMASK; /* inc lo */
|
||
if (a->lo == 0) a->hi = (a->hi + 1) & LMASK; /* carry? inc hi */
|
||
return;
|
||
}
|
||
|
||
void dp_sub (UDP *a, UDP *b)
|
||
{
|
||
if (a->lo < b->lo) a->hi = a->hi - 1; /* borrow? decr hi */
|
||
a->lo = (a->lo - b->lo) & LMASK; /* sub lo */
|
||
a->hi = (a->hi - b->hi) & LMASK; /* sub hi */
|
||
return;
|
||
}
|
||
|
||
void dp_lsh (UDP *r, uint32 sc)
|
||
{
|
||
if (sc > 63) r->hi = r->lo = 0; /* > 63? result 0 */
|
||
else if (sc > 31) { /* [32,63]? */
|
||
r->hi = (r->lo << (sc - 32)) & LMASK;
|
||
r->lo = 0; }
|
||
else if (sc != 0) {
|
||
r->hi = ((r->hi << sc) | (r->lo >> (32 - sc))) & LMASK;
|
||
r->lo = (r->lo << sc) & LMASK; }
|
||
return;
|
||
}
|
||
|
||
void dp_rsh (UDP *r, uint32 sc)
|
||
{
|
||
if (sc > 63) r->hi = r->lo = 0; /* > 63? result 0 */
|
||
else if (sc > 31) { /* [32,63]? */
|
||
r->lo = (r->hi >> (sc - 32)) & LMASK;
|
||
r->hi = 0; }
|
||
else if (sc != 0) {
|
||
r->lo = ((r->lo >> sc) | (r->hi << (32 - sc))) & LMASK;
|
||
r->hi = (r->hi >> sc) & LMASK; }
|
||
return;
|
||
}
|
||
|
||
void dp_rsh_s (UDP *r, uint32 sc, uint32 neg)
|
||
{
|
||
dp_rsh (r, sc); /* do unsigned right */
|
||
if (neg && sc) { /* negative? */
|
||
if (sc > 63) r->hi = r->lo = LMASK; /* > 63? result -1 */
|
||
else {
|
||
UDP ones = { LMASK, LMASK };
|
||
dp_lsh (&ones, 64 - sc); /* shift ones */
|
||
r->hi = r->hi | ones.hi; /* or into result */
|
||
r->lo = r->lo | ones.lo; } }
|
||
return;
|
||
}
|
||
|
||
void dp_imul (uint32 a, uint32 b, UDP *r)
|
||
{
|
||
uint32 ah, bh, al, bl, rhi, rlo, rmid1, rmid2;
|
||
|
||
if ((a == 0) || (b == 0)) { /* zero argument? */
|
||
r->hi = r->lo = 0; /* result is zero */
|
||
return; }
|
||
ah = (a >> 16) & WMASK; /* split operands */
|
||
bh = (b >> 16) & WMASK; /* into 16b chunks */
|
||
al = a & WMASK;
|
||
bl = b & WMASK;
|
||
rhi = ah * bh; /* high result */
|
||
rmid1 = ah * bl;
|
||
rmid2 = al * bh;
|
||
rlo = al * bl;
|
||
rhi = rhi + ((rmid1 >> 16) & WMASK) + ((rmid2 >> 16) & WMASK);
|
||
rmid1 = (rlo + (rmid1 << 16)) & LMASK; /* add mid1 to lo */
|
||
if (rmid1 < rlo) rhi = rhi + 1; /* carry? incr hi */
|
||
rmid2 = (rmid1 + (rmid2 << 16)) & LMASK; /* add mid2 to to */
|
||
if (rmid2 < rmid1) rhi = rhi + 1; /* carry? incr hi */
|
||
r->hi = rhi & LMASK; /* mask result */
|
||
r->lo = rmid2;
|
||
return;
|
||
}
|
||
|
||
void dp_neg (UDP *r)
|
||
{
|
||
r->lo = NEG (r->lo);
|
||
r->hi = (~r->hi + (r->lo == 0)) & LMASK;
|
||
return;
|
||
}
|
||
|
||
/* Support routines */
|
||
|
||
void unpackf (uint32 hi, UFP *r)
|
||
{
|
||
r->sign = hi & FPSIGN; /* get sign */
|
||
r->exp = FD_GETEXP (hi); /* get exponent */
|
||
if (r->exp == 0) { /* exp = 0? */
|
||
if (r->sign) RSVD_OPND_FAULT; /* if -, rsvd op */
|
||
r->frac.hi = r->frac.lo = 0; /* else 0 */
|
||
return; }
|
||
r->frac.hi = WORDSWAP ((hi & ~(FPSIGN | FD_EXP)) | FD_HB);
|
||
r->frac.lo = 0;
|
||
dp_lsh (&r->frac, FD_GUARD);
|
||
return;
|
||
}
|
||
|
||
void unpackd (uint32 hi, uint32 lo, UFP *r)
|
||
{
|
||
r->sign = hi & FPSIGN; /* get sign */
|
||
r->exp = FD_GETEXP (hi); /* get exponent */
|
||
if (r->exp == 0) { /* exp = 0? */
|
||
if (r->sign) RSVD_OPND_FAULT; /* if -, rsvd op */
|
||
r->frac.hi = r->frac.lo = 0; /* else 0 */
|
||
return; }
|
||
r->frac.hi = WORDSWAP ((hi & ~(FPSIGN | FD_EXP)) | FD_HB);
|
||
r->frac.lo = WORDSWAP (lo);
|
||
dp_lsh (&r->frac, FD_GUARD);
|
||
return;
|
||
}
|
||
|
||
void unpackg (uint32 hi, uint32 lo, UFP *r)
|
||
{
|
||
r->sign = hi & FPSIGN; /* get sign */
|
||
r->exp = G_GETEXP (hi); /* get exponent */
|
||
if (r->exp == 0) { /* exp = 0? */
|
||
if (r->sign) RSVD_OPND_FAULT; /* if -, rsvd op */
|
||
r->frac.hi = r->frac.lo = 0; /* else 0 */
|
||
return; }
|
||
r->frac.hi = WORDSWAP ((hi & ~(FPSIGN | G_EXP)) | G_HB);
|
||
r->frac.lo = WORDSWAP (lo);
|
||
dp_lsh (&r->frac, G_GUARD);
|
||
return;
|
||
}
|
||
|
||
void norm (UFP *r)
|
||
{
|
||
int32 i;
|
||
static uint32 normmask[5] = {
|
||
0xc0000000, 0xf0000000, 0xff000000, 0xffff0000, 0xffffffff };
|
||
static int32 normtab[6] = { 1, 2, 4, 8, 16, 32};
|
||
|
||
if ((r->frac.hi == 0) && (r->frac.lo == 0)) { /* if fraction = 0 */
|
||
r->sign = r->exp = 0; /* result is 0 */
|
||
return; }
|
||
while ((r->frac.hi & UF_NM_H) == 0) { /* normalized? */
|
||
for (i = 0; i < 5; i++) { /* find first 1 */
|
||
if (r->frac.hi & normmask[i]) break; }
|
||
dp_lsh (&r->frac, normtab[i]); /* shift frac */
|
||
r->exp = r->exp - normtab[i]; } /* decr exp */
|
||
return;
|
||
}
|
||
|
||
int32 rpackfd (UFP *r, int32 *rh)
|
||
{
|
||
static UDP f_round = { UF_FRND_L, UF_FRND_H };
|
||
static UDP d_round = { UF_DRND_L, UF_DRND_H };
|
||
|
||
if (rh) *rh = 0; /* assume 0 */
|
||
if ((r->frac.hi == 0) && (r->frac.lo == 0)) return 0; /* result 0? */
|
||
if (rh) dp_add (&r->frac, &d_round); /* round */
|
||
else dp_add (&r->frac, &f_round);
|
||
if ((r->frac.hi & UF_NM_H) == 0) { /* carry out? */
|
||
dp_rsh (&r->frac, 1); /* renormalize */
|
||
r->exp = r->exp + 1; }
|
||
if (r->exp > (int32) FD_M_EXP) FLT_OVFL_FAULT; /* ovflo? fault */
|
||
if (r->exp <= 0) { /* underflow? */
|
||
if (PSL & PSW_FU) FLT_UNFL_FAULT; /* fault if fu */
|
||
return 0; } /* else 0 */
|
||
dp_rsh (&r->frac, FD_GUARD); /* remove guard */
|
||
if (rh) *rh = WORDSWAP (r->frac.lo); /* get low */
|
||
return r->sign | (r->exp << FD_V_EXP) |
|
||
(WORDSWAP (r->frac.hi) & ~(FD_HB | FPSIGN | FD_EXP));
|
||
}
|
||
|
||
int32 rpackg (UFP *r, int32 *rh)
|
||
{
|
||
static UDP g_round = { UF_GRND_L, UF_GRND_H };
|
||
|
||
*rh = 0; /* assume 0 */
|
||
if ((r->frac.hi == 0) && (r->frac.lo == 0)) return 0; /* result 0? */
|
||
dp_add (&r->frac, &g_round); /* round */
|
||
if ((r->frac.hi & UF_NM_H) == 0) { /* carry out? */
|
||
dp_rsh (&r->frac, 1); /* renormalize */
|
||
r->exp = r->exp + 1; }
|
||
if (r->exp > (int32) G_M_EXP) FLT_OVFL_FAULT; /* ovflo? fault */
|
||
if (r->exp <= 0) { /* underflow? */
|
||
if (PSL & PSW_FU) FLT_UNFL_FAULT; /* fault if fu */
|
||
return 0; } /* else 0 */
|
||
dp_rsh (&r->frac, G_GUARD); /* remove guard */
|
||
*rh = WORDSWAP (r->frac.lo); /* get low */
|
||
return r->sign | (r->exp << G_V_EXP) |
|
||
(WORDSWAP (r->frac.hi) & ~(G_HB | FPSIGN | G_EXP));
|
||
}
|
||
|
||
#endif
|
||
|
||
/* Floating point instructions */
|
||
|
||
/* Move/test/move negated floating
|
||
|
||
Note that only the high 32b is processed.
|
||
If the high 32b is not zero, it is unchanged.
|
||
*/
|
||
|
||
int32 op_movfd (int32 val)
|
||
{
|
||
if (val & FD_EXP) return val;
|
||
if (val & FPSIGN) RSVD_OPND_FAULT;
|
||
return 0;
|
||
}
|
||
|
||
int32 op_mnegfd (int32 val)
|
||
{
|
||
if (val & FD_EXP) return (val ^ FPSIGN);
|
||
if (val & FPSIGN) RSVD_OPND_FAULT;
|
||
return 0;
|
||
}
|
||
|
||
int32 op_movg (int32 val)
|
||
{
|
||
if (val & G_EXP) return val;
|
||
if (val & FPSIGN) RSVD_OPND_FAULT;
|
||
return 0;
|
||
}
|
||
|
||
int32 op_mnegg (int32 val)
|
||
{
|
||
if (val & G_EXP) return (val ^ FPSIGN);
|
||
if (val & FPSIGN) RSVD_OPND_FAULT;
|
||
return 0;
|
||
}
|
||
|
||
/* Floating to floating convert - F to D is essentially done with MOVFD */
|
||
|
||
int32 op_cvtdf (int32 *opnd)
|
||
{
|
||
UFP a;
|
||
|
||
unpackd (opnd[0], opnd[1], &a);
|
||
return rpackfd (&a, NULL);
|
||
}
|
||
|
||
int32 op_cvtfg (int32 *opnd, int32 *rh)
|
||
{
|
||
UFP a;
|
||
|
||
unpackf (opnd[0], &a);
|
||
a.exp = a.exp - FD_BIAS + G_BIAS;
|
||
return rpackg (&a, rh);
|
||
}
|
||
|
||
int32 op_cvtgf (int32 *opnd)
|
||
{
|
||
UFP a;
|
||
|
||
unpackg (opnd[0], opnd[1], &a);
|
||
a.exp = a.exp - G_BIAS + FD_BIAS;
|
||
return rpackfd (&a, NULL);
|
||
}
|
||
|
||
/* Floating add and subtract */
|
||
|
||
int32 op_addf (int32 *opnd, t_bool sub)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackf (opnd[0], &a); /* F format */
|
||
unpackf (opnd[1], &b);
|
||
if (sub) a.sign = a.sign ^ FPSIGN; /* sub? -s1 */
|
||
vax_fadd (&a, &b, 0); /* add fractions */
|
||
return rpackfd (&a, NULL);
|
||
}
|
||
|
||
int32 op_addd (int32 *opnd, int32 *rh, t_bool sub)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackd (opnd[0], opnd[1], &a);
|
||
unpackd (opnd[2], opnd[3], &b);
|
||
if (sub) a.sign = a.sign ^ FPSIGN; /* sub? -s1 */
|
||
vax_fadd (&a, &b, 0); /* add fractions */
|
||
return rpackfd (&a, rh);
|
||
}
|
||
|
||
int32 op_addg (int32 *opnd, int32 *rh, t_bool sub)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackg (opnd[0], opnd[1], &a);
|
||
unpackg (opnd[2], opnd[3], &b);
|
||
if (sub) a.sign = a.sign ^ FPSIGN; /* sub? -s1 */
|
||
vax_fadd (&a, &b, 0); /* add fractions */
|
||
return rpackg (&a, rh); /* round and pack */
|
||
}
|
||
|
||
/* Floating multiply */
|
||
|
||
int32 op_mulf (int32 *opnd)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackf (opnd[0], &a); /* F format */
|
||
unpackf (opnd[1], &b);
|
||
vax_fmul (&a, &b, 24, FD_BIAS, 0); /* do multiply */
|
||
return rpackfd (&a, NULL); /* round and pack */
|
||
}
|
||
|
||
int32 op_muld (int32 *opnd, int32 *rh)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackd (opnd[0], opnd[1], &a);
|
||
unpackd (opnd[2], opnd[3], &b);
|
||
vax_fmul (&a, &b, 56, FD_BIAS, 0); /* do multiply */
|
||
return rpackfd (&a, rh); /* round and pack */
|
||
}
|
||
|
||
int32 op_mulg (int32 *opnd, int32 *rh)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackg (opnd[0], opnd[1], &a); /* G format */
|
||
unpackg (opnd[2], opnd[3], &b);
|
||
vax_fmul (&a, &b, 53, G_BIAS, 0); /* do multiply */
|
||
return rpackg (&a, rh); /* round and pack */
|
||
}
|
||
|
||
/* Floating divide */
|
||
|
||
int32 op_divf (int32 *opnd)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackf (opnd[0], &a); /* F format */
|
||
unpackf (opnd[1], &b);
|
||
vax_fdiv (&a, &b, 26, FD_BIAS); /* do divide */
|
||
return rpackfd (&b, NULL); /* round and pack */
|
||
}
|
||
|
||
int32 op_divd (int32 *opnd, int32 *rh)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackd (opnd[0], opnd[1], &a);
|
||
unpackd (opnd[2], opnd[3], &b);
|
||
vax_fdiv (&a, &b, 58, FD_BIAS); /* do divide */
|
||
return rpackfd (&b, rh); /* round and pack */
|
||
}
|
||
|
||
int32 op_divg (int32 *opnd, int32 *rh)
|
||
{
|
||
UFP a, b;
|
||
|
||
unpackg (opnd[0], opnd[1], &a); /* G format */
|
||
unpackg (opnd[2], opnd[3], &b);
|
||
vax_fdiv (&a, &b, 55, G_BIAS); /* do divide */
|
||
return rpackg (&b, rh); /* round and pack */
|
||
}
|
||
|
||
/* Polynomial evaluation
|
||
The most mis-implemented instruction in the VAX (probably here too).
|
||
POLY requires a precise combination of masking versus normalizing
|
||
to achieve the desired answer. In particular, both the multiply
|
||
and add steps are masked prior to normalization. In addition,
|
||
negative small fractions must not be treated as 0 during denorm.
|
||
*/
|
||
|
||
void op_polyf (int32 *opnd, int32 acc)
|
||
{
|
||
UFP r, a, c;
|
||
int32 deg = opnd[1];
|
||
int32 ptr = opnd[2];
|
||
int32 i, wd, res;
|
||
|
||
if (deg > 31) RSVD_OPND_FAULT; /* degree > 31? fault */
|
||
unpackf (opnd[0], &a); /* unpack arg */
|
||
wd = Read (ptr, L_LONG, RD); /* get C0 */
|
||
ptr = ptr + 4;
|
||
unpackf (wd, &r); /* unpack C0 */
|
||
res = rpackfd (&r, NULL); /* first result */
|
||
for (i = 0; (i < deg) && a.exp; i++) { /* loop */
|
||
unpackf (res, &r); /* unpack result */
|
||
vax_fmul (&r, &a, 32, FD_BIAS, LMASK); /* r = r * arg */
|
||
wd = Read (ptr, L_LONG, RD); /* get Cnext */
|
||
ptr = ptr + 4;
|
||
unpackf (wd, &c); /* unpack Cnext */
|
||
vax_fadd (&r, &c, LMASK); /* r = r + Cnext */
|
||
res = rpackfd (&r, NULL); } /* round and pack */
|
||
R[0] = res;
|
||
R[1] = R[2] = 0;
|
||
R[3] = opnd[2] + 4 + (opnd[1] << 2);
|
||
return;
|
||
}
|
||
|
||
void op_polyd (int32 *opnd, int32 acc)
|
||
{
|
||
UFP r, a, c;
|
||
int32 deg = opnd[2];
|
||
int32 ptr = opnd[3];
|
||
int32 i, wd, wd1, res, resh;
|
||
|
||
if (deg > 31) RSVD_OPND_FAULT; /* degree > 31? fault */
|
||
unpackd (opnd[0], opnd[1], &a); /* unpack arg */
|
||
wd = Read (ptr, L_LONG, RD); /* get C0 */
|
||
wd1 = Read (ptr + 4, L_LONG, RD);
|
||
ptr = ptr + 8;
|
||
unpackd (wd, wd1, &r); /* unpack C0 */
|
||
res = rpackfd (&r, &resh); /* first result */
|
||
for (i = 0; (i < deg) && a.exp; i++) { /* loop */
|
||
unpackd (res, resh, &r); /* unpack result */
|
||
vax_fmul (&r, &a, 32, FD_BIAS, 0); /* r = r * arg */
|
||
wd = Read (ptr, L_LONG, RD); /* get Cnext */
|
||
wd1 = Read (ptr + 4, L_LONG, RD);
|
||
ptr = ptr + 8;
|
||
unpackd (wd, wd1, &c); /* unpack Cnext */
|
||
vax_fadd (&r, &c, 0); /* r = r + Cnext */
|
||
res = rpackfd (&r, &resh); } /* round and pack */
|
||
R[0] = res;
|
||
R[1] = resh;
|
||
R[2] = 0;
|
||
R[3] = opnd[3] + 4 + (opnd[2] << 2);
|
||
return;
|
||
}
|
||
|
||
void op_polyg (int32 *opnd, int32 acc)
|
||
{
|
||
UFP r, a, c;
|
||
int32 deg = opnd[2];
|
||
int32 ptr = opnd[3];
|
||
int32 i, wd, wd1, res, resh;
|
||
|
||
if (deg > 31) RSVD_OPND_FAULT; /* degree > 31? fault */
|
||
unpackg (opnd[0], opnd[1], &a); /* unpack arg */
|
||
wd = Read (ptr, L_LONG, RD); /* get C0 */
|
||
wd1 = Read (ptr + 4, L_LONG, RD);
|
||
ptr = ptr + 8;
|
||
unpackg (wd, wd1, &r); /* unpack C0 */
|
||
res = rpackg (&r, &resh); /* first result */
|
||
for (i = 0; (i < deg) && a.exp; i++) { /* loop */
|
||
unpackg (res, resh, &r); /* unpack result */
|
||
vax_fmul (&r, &a, 32, G_BIAS, 0); /* r = r * arg */
|
||
wd = Read (ptr, L_LONG, RD); /* get Cnext */
|
||
wd1 = Read (ptr + 4, L_LONG, RD);
|
||
ptr = ptr + 8;
|
||
unpackg (wd, wd1, &c); /* unpack Cnext */
|
||
vax_fadd (&r, &c, 0); /* r = r + Cnext */
|
||
res = rpackg (&r, &resh); } /* round and pack */
|
||
R[0] = res;
|
||
R[1] = resh;
|
||
R[2] = 0;
|
||
R[3] = opnd[3] + 4 + (opnd[2] << 2);
|
||
return;
|
||
}
|