I’ve always wanted to have the option to have simulated devices behave more naturally with respect to I/O operations. By more naturally I mean that the current simulator model I/O is either polled (for asynchronous things link Muxes and Network), or it is performed in the middle of some instruction execution taking possibly many milliseconds (disk and/or tapes). The existing model creates quite deterministic behavior which helps to debug and understand issues, but it trades off potential instruction execution while performing these I/O operations in between instruction execution. To address this concept (while still retaining the potential advantages of the original model), I’ve designed an Asynch I/O model extension for simh. In order to flesh-out and debug this design, I’ve also refactored several devices to utilize this capability. Please read the attached 0readmeAsynchIO.txt file for concept details about the approach. In order to make disk devices easy to implement (within or without the AsynchIO framework), I’ve created a sim_disk.c library which is modeled on the sim_tape.c library to generalize disk I/O like tape I/O is generalized in sim_tape.c. This sim_disk.c library now provides that natural place to implement support for various disk implementation formats (just like sim_tape support several formats, and one day will be the place to add direct physical tape access). The current sim_disk library provides the framework for direct support of 3 different disk formats: 1) standard simh disk format 2) platform specific physical disk access and 3) platform independent Virtual Disk format. The Virtual Disk format is an implementation of the format described in the ”Microsoft Virtual Hard Disk (VHD) Image Format Specification”. The VHD specification is available for anyone to implement under the "Microsoft Open Specification Promise" described at http://www.microsoft.com/interop/osp/default.mspx. The VHD implementation includes support for: 1) Fixed sized disks 2) Dynamically expanding disks and 3) Differencing Disks. Dynamically expanding disks don’t change their “Virtual Size”, but they don’t consume disk space on the containing storage until the virtual sectors in the disk are actually written to (i.e. an RA81 or RA92 VHD with a VMS installed on it may initially only contain 30+ MB of files, and the resulting VHD will be 30+ MB). The VHD format contains meta data which describes the virtual device. Amongst this meta data is the simh device type which the VHD was originally created as. This metadata is therefore available whenever that VHD is attached to an emulated disk device in the future so the device type & size can be automatically be configured. Sim_disk_attach is used by device emulations to attach a simh/vhd/raw device to a simulated device. The following simh command switches are used by the sim_disk_attach API: -R Attach Read Only. -E Must Exist (if not specified an attempt to create the indicated virtual disk will be attempted). -F Open the indicated disk container in a specific format (default is to autodetect VHD defaulting to simh if the indicated container is not a VHD). -X When creating a VHD, create a fixed sized VHD (vs a Dynamically expanding one). -C Create a VHD and copy its contents from another disk (simh, VHD, or RAW format). -D Create a Differencing VHD (relative to an already existing VHD disk) Examples: sim> show rq RQ, address=20001468-2000146B*, no vector, 4 units RQ0, 159MB, not attached, write enabled, RD54, autosize, SIMH format RQ1, 159MB, not attached, write enabled, RD54, autosize, SIMH format RQ2, 159MB, not attached, write enabled, RD54, autosize, SIMH format RQ3, 409KB, not attached, write enabled, RX50, autosize, SIMH format sim> atta rq0 RA81.vhd sim> show rq0 RQ0, 456MB, attached to RA81.vhd, write enabled, RA81, autosize, VHD format sim> set rq2 ra92 sim> att rq2 -f vhd RA92.vhd RQ2: creating new file sim> sho rq2 RQ2, 1505MB, attached to RA92.vhd, write enabled, RA92, autosize, VHD format sim> ! dir RA92.vhd Volume in drive H is New Volume Volume Serial Number is F8DE-510C Directory of H:\Data 04/14/2011 12:57 PM 5,120 RA92.vhd 1 File(s) 5,120 bytes 0 Dir(s) 3,074,412,544 bytes free sim> atta rq3 -c RA92-1.vhd RA92.vhd sim> atta rq3 -c RA92-1.vhd RA92.vhd RQ3: creating new virtual disk 'RA92-1.vhd' RQ3: Copied 1505MB. 99% complete. RQ3: Copied 1505MB. Done. sim> sh rq3 RQ3, 1505MB, attached to RA92-1.vhd, write enabled, RA92, autosize, VHD format sim> ! dir RA92* Volume in drive H is New Volume Volume Serial Number is F8DE-510C Directory of H:\Data 04/14/2011 01:12 PM 5,120 RA92-1.vhd 04/14/2011 12:58 PM 5,120 RA92.vhd 2 File(s) 10,240 bytes 0 Dir(s) 3,074,404,352 bytes free sim> sho rq2 RQ2, 1505MB, not attached, write enabled, RA92, autosize, VHD format sim> set rq2 ra81 sim> set rq2 noauto sim> sho rq2 RQ2, 456MB, not attached, write enabled, RA81, noautosize, VHD format sim> set rq2 format=simh sim> sho rq2 RQ2, 456MB, not attached, write enabled, RA81, noautosize, SIMH format sim> atta rq2 -c RA81-Copy.vhd VMS055.dsk RQ2: creating new virtual disk 'RA81-Copy.vhd' RQ2: Copied 456MB. 99% complete. RQ2: Copied 456MB. Done. sim> sho rq2 RQ2, 456MB, attached to RA81-Copy.vhd, write enabled, RA81, noautosize, VHD format sim> det rq2 sim> ! dir RA81-Copy.vhd Volume in drive H is New Volume Volume Serial Number is F8DE-510C Directory of H:\Data 04/14/2011 01:22 PM 178,304,512 RA81-Copy.vhd 1 File(s) 178,304,512 bytes 0 Dir(s) 2,896,097,280 bytes free sim> ! dir VMS055.dsk Volume in drive H is New Volume Volume Serial Number is F8DE-510C Directory of H:\Data 03/08/2011 01:42 PM 403,663,872 VMS055.dsk 1 File(s) 403,663,872 bytes 0 Dir(s) 2,896,097,280 bytes free sim>
161 lines
8.4 KiB
Text
161 lines
8.4 KiB
Text
SIM_ASYNCH_IO
|
|
|
|
Theory of operation.
|
|
|
|
Features.
|
|
- Optional Use. Build with or without SIM_ASYNCH_IO defined and
|
|
simulators will still build and perform correctly when run.
|
|
- Consistent Save/Restore state. The state of a simulator saved
|
|
on a simulator with (or without) Asynch support can be restored
|
|
on any simulator of the same version with or without Asynch
|
|
support.
|
|
- Optimal behavior/performance with simulator running with or
|
|
without CPU idling enabled.
|
|
- Consistent minimum instruction scheduling delays when operating
|
|
with or without SIM_ASYNCH_IO. When SIM_ASYNCH_IO is emabled,
|
|
any operation which would have been scheduled to occurr in 'n'
|
|
instructions will still occur (from the simulated computer's
|
|
point of view) at least 'n' instructions after it was initiated.
|
|
|
|
Benefits.
|
|
Allows a simulator to execute simulated instructions concurrently
|
|
with I/O operations which may take numerous milliseconds to perform.
|
|
Allows a simulated device to potentially avoid polling for the arrival
|
|
of data. Polling consumes host processor CPU cycles which may better
|
|
be spent executing simulated instructions or letting other host
|
|
processes run. Measurements made of available instruction execution
|
|
easily demonstrate the benefits of parallel instruction and I/O
|
|
activities. A VAX simulator with a process running a disk intensive
|
|
application in one process was able to process 11 X the number of
|
|
Dhrystone operations with Asynch I/O enabled.
|
|
|
|
Asynch I/O is provided through a callback model.
|
|
SimH Libraries which provide Asynch I/O support:
|
|
sim_disk
|
|
sim_tape
|
|
sim_ether
|
|
|
|
Requirements to use:
|
|
The Simulator's instruction loop needs to be modified to include a single
|
|
line which checks for asynchronouzly arrived events. The vax_cpu.c
|
|
module added the following line indicated by >>>:
|
|
|
|
/* Main instruction loop */
|
|
|
|
for ( ;; ) {
|
|
|
|
[...]
|
|
>>> AIO_CHECK_EVENT;
|
|
if (sim_interval <= 0) { /* chk clock queue */
|
|
temp = sim_process_event ();
|
|
if (temp)
|
|
ABORT (temp);
|
|
SET_IRQL; /* update interrupts */
|
|
}
|
|
|
|
A global variable (sim_asynch_latency) is used to indicate the "interrupt
|
|
dispatch latency". This variable is the number of nanoseconds between checks
|
|
for completed asynchronous I/O. The default value is 4000 (4 usec) which
|
|
corresponds reasonably with simulated hardware. This variable controls
|
|
the computation of sim_asynch_inst_latency which is the number of simulated
|
|
instructions in the sim_asynch_latency interval. We are trying to avoid
|
|
checking for completed asynchronous I/O after every instruction since the
|
|
actual checking every instruction can slow down execution. Periodic checks
|
|
provide a balance which allows response similar to real hardware while also
|
|
providing minimal impact on actual instruction execution. Meanwhile, if
|
|
maximal response is desired, then the value of sim_asynch_latency can be
|
|
set sufficiently low to assure that sim_asynch_inst_latency computes to 1.
|
|
The sim_asynch_inst_latency is dynamically updated once per second in the
|
|
sim_rtcn_calb routine where clock to instruction execution is dynamically
|
|
determined. A simulator would usually add register definitions
|
|
to enable viewing and setting of these variables via scp:
|
|
|
|
#if defined (SIM_ASYNCH_IO)
|
|
{ DRDATA (LATENCY, sim_asynch_latency, 32), PV_LEFT },
|
|
{ DRDATA (INST_LATENCY, sim_asynch_inst_latency, 32), PV_LEFT },
|
|
#endif
|
|
|
|
|
|
Naming conventions:
|
|
All of the routines implemented in sim_disk and sim_tape have been kept
|
|
in place. All routines which perform I/O have a variant routine available
|
|
with a "_a" appended to the the routine name with the addition of a single
|
|
parameter which indicates the asynch completion callback routine. For
|
|
example there now exists the routines:
|
|
t_stat sim_tape_rdrecf (UNIT *uptr, uint8 *buf, t_mtrlnt *bc, t_mtrlnt max);
|
|
t_stat sim_tape_rdrecf_a (UNIT *uptr, uint8 *buf, t_mtrlnt *bc, t_mtrlnt max, TAPE_PCALLBACK callback);
|
|
|
|
The Purpose of the callback function is to record the I/O completion status
|
|
and then to schedule the activation of the unit.
|
|
|
|
Considerations:
|
|
Avoiding multiple concurrent users of the unit structure. While asynch
|
|
I/O is pending on a Unit, the unit should not otherwise be on the event
|
|
queue. The I/O completion will cause the Unit to be scheduled to run
|
|
immediately to actually dispatch control flow to the callback routine.
|
|
The callback routine is always called in the same thread which is
|
|
executing instructions. Since all simulator device data structures are
|
|
only referenced from this thread there are no host multi-processor cache
|
|
coherency issues to be concerned about.
|
|
|
|
Arguments to the callback routine:
|
|
UNIT *, and IO Status
|
|
Requirements of the Callback routine.
|
|
The callback routine must save the I/O completion status in a place
|
|
which the next invocation of the unit service routine will reference
|
|
and act on it. This allows device code to return error conditions
|
|
back to scp in a consistent way without regard to how the callback
|
|
routine (and the actual I/O) may have been executed.
|
|
|
|
Required change in device coding.
|
|
Devices which wish to leverage the benefits of asynch I/O must rearrange
|
|
the code which implements the unit service routine. This rearrangement
|
|
usually entails breaking the activities into two phases. The first phase
|
|
(I'll call the top half) involves performing whatever is needed to
|
|
initiate a call to perform an I/O operation with a callback argument.
|
|
Control is then immediately returned to the scp event dispatcher.
|
|
The callback routine needs to be coded to stash away the io completion
|
|
status and some indicator that an I/O has completed.
|
|
The top/bottom half separation of the unit service routine would be
|
|
coded to examine the I/O completion indicator and invoke the bottom half
|
|
code upon completion. The bottom half code should clear the I/O
|
|
completion indicator and then perform any activities which normally
|
|
need to occur after the I/O completes. Care should be taken while
|
|
performing these top/bottom half activities to return to the scp event
|
|
dispatcher with either SCPE_OK or an appropriate error code when needed.
|
|
The need to return error indications to the scp event dispatcher is why
|
|
the bottom half activities can't simply be performed in the
|
|
callback routine (the callback routine does not return a status).
|
|
Care should also be taken to realize that local variables in the
|
|
unit service routine will not directly survive between the separate
|
|
top and bottom half calls to the unit service routine. If any such
|
|
information must be referenced in both the top and bottom half code paths
|
|
then it must either be recomputed prior to the top/bottom half check
|
|
or not stored in local variables of the unit service routine.
|
|
|
|
Run time requirements to use SIM_ASYNCH_IO.
|
|
The Posix threads API (pthreads) is required for asynchronous execution.
|
|
Most *nix platforms have these APIs available and on these platforms
|
|
simh is typically built with these available since on these platforms,
|
|
pthreads is required for simh networking support. Windows can also
|
|
utilize the pthreads APIs if the compile and run time support for the
|
|
win32Pthreads package has been installed on the build system and the
|
|
run time dll is available in the execution environment.
|
|
|
|
Sample Asynch I/O device implementations.
|
|
The pdp11_rq.c module has been refactored to leverage the asynch I/O
|
|
features of the sim_disk library. The impact to this code to adopt the
|
|
asynch I/O paradigm was quite minimal.
|
|
The pdp11_rp.c module has also been refactored to leverage the asynch I/O
|
|
features of the sim_disk library.
|
|
The pdp11_tq.c module has been refactored to leverage the asynch I/O
|
|
features of the sim_tape library. The impact to this code to adopt the
|
|
asynch I/O paradigm was very significant. This was due to the two facts:
|
|
1) there are many different operations which can be requested of tape
|
|
devices and 2) some of the tmscp operations required many separate
|
|
operations on the physical device layer to perform a single tmscp request.
|
|
This issue was addressed by adding additional routines to the physical
|
|
device layer (in sim_tape.c) which combined these multiple operations.
|
|
This approach will dovetail well with a potential future addition of
|
|
operations on physical tapes as yet another supported tape format.
|
|
|