Count key data

From The Right Wiki
(Redirected from Count Key Data)
Jump to navigationJump to search

Count key data (CKD) is a direct-access storage device (DASD)[lower-alpha 1] data recording format introduced in 1964, by IBM with its IBM System/360 and still being emulated on IBM mainframes. It is a self-defining format with each data record represented by a Count Area that identifies the record and provides the number of bytes in an optional Key Area and an optional Data Area. This is in contrast to devices using fixed sector size or a separate format track. Count key data (CKD) also refers to the set of channel commands (collectively Channel Command Words, CCWs) that are generated by an IBM mainframe for execution by a DASD subsystem employing the CKD recording format.[1] The initial set of CKD CCWs, introduced in 1964, was substantially enhanced and improved into the 1990s.

CKD track format

File:CKD Track Format.png
Block diagram of count key data track format used on IBM mainframe computers beginning with S/360 shipment in 1965

The reason for CKD track format is to allow data field lengths to vary, each recorded block of data on a DASD track, called a record has an associated count field which identifies the record and indicates the size of the key, if used (user-defined up to 255 bytes), and the size of the data area, if used.[2][lower-alpha 2] The count field has the identification of the record[lower-alpha 3] in cylinder-head-record format, the length of the key, and the length of the data. The key may be omitted or consist of a string of characters. "The beginning of a track is signalled when the index marker (index point) is detected. ... The marker is automatically recognized by a special sensing device."[3]: 5  Following the index marker is the home address, which indicates the location of this track on the disk, and contains other control information internal to the control unit. A fixed-length gap follows the home address. Next, each track contains a Record 0 (R0), the track descriptor record, which is "designed to enable the entire content of a track to be moved to alternate tracks if a portion of the primary track becomes defective."[3]: 7  Following R0 are the data records, separated by gaps.[3]: 9  Because of the gaps and other information, the recorded space is larger than that required for just the count data, key data, or user data. IBM provides a "reference card" for each device, which can be used to compute the number of records per track for various key and data field sizes, and to optimize the capacity of the device.[4] Later, programs were written to do these calculations. Because records are normally not split between tracks, specification of an incorrect record size create problems. Most often, the key is omitted and the record is located sequentially or by direct cylinder-head-record addressing. If it is present, the key is any data used to find the record, usually using the Search Key Equal or Search Key High or Equal CCW. The key (and hence the record) is locatable via hardware commands.[5][page needed] Since the introduction of IBM's System/360 in 1964, nearly all IBM large and intermediate system DASDs have used the count key data record format.[6][page needed] The advantages of count key data record format are:

  • The record size can be exactly matched to the application data block sizes
  • CPU and memory requirements can be reduced by exploiting search-key commands.
  • IBM CKD subsystems initially operated synchronously with the system channel and can process information in the gaps between the various fields, thereby achieving higher performance by avoiding the redundant transfer of information to the host.[6] Both synchronous and asynchronous operations are supported on later subsystems.[6]

Reduced CPU and memory prices and higher device and interface speeds have somewhat nullified the advantages of CKD, and it is retained only because IBM's flagship operating system z/OS does not support sector-oriented interfaces. Originally CKD records had a one-to-one correspondence to a physical track of a DASD device; however over time the records have become more and more virtualized such that in modern IBM mainframes there is no longer a direct correspondence between a CKD record ID and the physical layout of a track.

IBM's CKD DASD subsystems

Packaging

File:IBM CKD Arch.png
IBM S/360 & S/370 Input/Output operations for CKD DASD showing channel, storage control unit and DASD device

Initially there was a high degree of correspondence between the logical view of DASD accesses and the actual hardware, as shown in the illustration. Three digit labels were typically affixed[lower-alpha 4] to identify the address of channel, control unit and device. On low end systems the Channel and the Control Unit were frequently physically integrated but remained logically separate. IBM's New Attachment Strategy[7] beginning with the 3830 Model 2 in 1972 physically separated the SCU into two physical entities, a director and a controller while keeping them logically the same. The controller handles the CKD track formatting and is packaged with the first drive or drives in a string of drives and having a model number with the letter "A" as a prefix, an "A-Unit" (or "A-Box") as in 3350 Model A2 containing a controller and two DASDs. DASD without a controller, that is B-Units, have a "B" prefix in their model number. CKD subsystems[lower-alpha 5] and directors were offered by IBM and plug compatible competitors until at least 1996 (2301 to 3390 Model 9);[8] in total 22 unique[lower-alpha 6] DASD offered by IBM configured in at least 35 different subsystem configurations. Plug-compatible offered many of the same DASD including 4 CKD subsystems featuring unique DASD.[lower-alpha 7]

Programming

Access to specific classes of I/O devices by an IBM mainframe is under the control of Channel Command Words (CCWs), some of which are generic (e.g. No Operation) but many of which are specific to the type of I/O device (e.g. Read Backwards for a tape drive). The group of CCWs defined by IBM for DASD fall into five broad categories:

  • Control – control of the DASD including the path thereto
  • Sense – sense status of the DASD including the path thereto; some sense commands affect the status of the controller and DASD in a fashion more in keeping with a control command, e.g., RESERVE, RELEASE
  • Write – write information to the controller or DASD (which may be buffered or cached in the path)
  • Search[lower-alpha 8] – compare information from the CPU with information stored in the DASD; the Channel operates in the Write mode while the storage unit operates in the Read mode.
  • Read – read information from the DASD (which may be buffered or cached in the path)

CKD CCWs are the specific set of CCWs used to access CKD DASD subsystems. This is in contrast to fixed block architecture (FBA) CCWs which are used to access FBA DASD subsystems. CKD DASD are addressed like other Input/Output devices; for System/360 and System/370 DASD are addressed directly, through channels and the associated control units[9][10][11] (SCU or Storage Control Unit), initially using three hexadecimal digits, one for channel and two for control unit and device, providing addressing for up to 16 channels, for up to 256 DASD access mechanisms/channel and 4,096 DASD addresses total. Modern IBM mainframes use four hexadecimal digits as an arbitrary subchannel number within a channel subsystem subset, whose definition includes the actual channels, control units and device, providing addressing for up to 65,536 DASD per channel subsystem subset. In practice, physical and design constraints of the channel and of the controllers limited the maximum number of attached DASD attachable to a system to a smaller amount than the number that could be addressed.

Initial CKD feature set

The initial feature set provided by IBM with its 1964 introduction of the CKD track format and associated CCWs included: .

  • Defective/Alternative Track – enables an alternate track to replace a defective track transparent to the access method in use.
  • Record overflow – AKA track overflow  – The Write Special Count, Key and Data CCW can write a record segment that is logically part of a record segment on the next track. While an individual record segment can not exceed the maximum track length of a DASD[12] track, a sequence of record segments is not so limited.
  • Multitrack operations – specific CCWs can continue onto the next sequential head[12]
  • Command chaining – CCWs could be chained together to construct complex channel programs. The gaps in a CKD track format provided sufficient time between the commands so that all channel and SCU activity necessary to complete a command can be performed in the a gap between appropriate fields.[13] Such programs can search a large amount of information stored on a DASD, upon successful completion returning only the desired data and thereby freeing CPU resources for other activity.[14] This mode of operating synchronous to the gap was later enhanced by additional CCWs enabling a nonsynchronous mode of operation.
  • Channel switching – an SCU can be shared between channels – initially two channel switching was provided and it was expanded to up to eight channels in later SCUs. The channels can be on the same or different CPUS.

A Scan feature set was also provided but not continued into future CKD subsystems beyond the 2314. Forty one CCWs implemented the feature set:

IBM S/360 DASD Channel Commands[15]
Command Class Command‡ 2301[16] 2302[12] 2303[12]
7320[17][lower-alpha 9]
2311[12] 2321[12] 2314
2319[18]
MT
Off
MT
On †
Count Length
Control No Op S S S S S S 03
Seek S S S S S S 07 6
Seek Cylinder S S S S S S 0B 6
Seek Head S S S S S S 1B 6
Set File Mask S S S S S S 1F 1
Space Count S S S S S S 0F 3
Recalibrate S S 13 Not zero
Restore S 17 Not zero
Sense Sense I/O S S S S S S 04 6
Release Device O O O O O O 94 6
Reserve Device O O O O O O B4 6
Search Home Address EQ S S S S S S 39 B9 4 (usually)
Identifier EQ S S S S S S 31 B1 5 (usually)
Identifier HI S S S S S S 51 D1 5 (usually)
Identifier EQ or HI S S S S S S 71 FI 5 (usually)
Key EQ S S S S S S 29 A9 1 to 255
Key HI S S S S S S 49 C9 1 to 255
Key EQ or HI S S S S S S 69 E9 1 to 255
Key & Data EQ O O O S 2D AD See Note 2
Key & Data HI O O O S 4D CD See Note 2
Key & Data EQ or HI O O O S 6D ED See Note 2
Continue Scan
(see Note 1)  
Search EQ O O O S 25 A5 See Note 2
Search HI O O O S 45 C5 See Note 2
Search HI or EQ O O O S 65 E5 See Note 2
Set Compare O O O S 35 B5 See Note 2
Set Compare O O O S 75 F5 See Note 2
No Compare O O O S 55 D5 See Note 2
Read Home Address S S S S S S 1A 9A 5
Count S S S S S S 12 92 8
Record 0 S S S S S S 16 96 Number of bytes transferred
Data S S S S S S 06 86
Key & Data S S S S S S 0E 8E
Count. Key & Data S S S S S S 1E 9E
IPL S S S S S S 02
Write Home Address S S S S S S 19 5 (usually)
Record 0 S S S S S S 15 8*KL*DL of RO
Count, Key & Data S S S S S S 1D 8+KL+DL
Special Count, Key & Data S S S S S S 01 8+KL+DL
Data S S S S S S 05 DL
Key & Data S S S S S S 0D KL*DL
Erase S S S S S S 11 8*KL*DL
Total CCWs 41 30 39 30 40 40 40

Notes:

O = optional feature
S = standard feature
MT = multitrack: when supported CCW will continue to operate on next heads in sequence to end of cylinder
‡ = TIC (Transfer In Channel) and other standard commands not shown.
† = code same as MT Off except as listed
1. File Scan Feature (9 CCWs) only available on 2841 for 2302, 2311 and 2321; they were not available on subsequent DASD controllers for DASD later than 2314.
2. Count is number of bytes in search argument, including mask bytes

The CCWs were initially executed by two types of SCU attached to the system's high speed Selector Channels. The 2820 SCU[16] controlled the 2301 Drum while the 2841 SCU[12] controlled combinations of the 2302 Disk Storage, 2311 Disk Drive, 2321 Data Cell and/or 7320 Drum Storage. IBM quickly replaced the 7320 with the faster and larger 2303. Subsequently, the feature set was implemented on the 2314 family of storage controls and an integrated attachment of the System 370 Model 25. The following example of a channel program[18] reads a disk record identified by a Key field. The track containing the record and the desired value of the key is known. The SCU will search the track to find the requested record. In this example <> indicate that the channel program contains the storage address of the specified field.

SEEK             <cylinder/head number>
SEARCH KEY EQUAL <key value>
TIC              *-8 Back to search if not equal
READ DATA        <buffer>
The TIC (transfer in channel) will cause the channel program to branch to the SEARCH command until a record with a matching key (or the end of the track) is encountered. When a record with a matching key is found the SCU will include Status Modifier in the channel status, causing the channel to skip the TIC CCW; thus the channel program will not branch and the channel will execute the READ command.

Block multiplexer channel enhancements

The block multiplexor channel was introduced beginning in 1971 on some high end System/360 systems along with the 2835 Control Unit and associated 2305 DASD,[19] This channel was then standard on IBM System/370 and subsequent mainframes; when contrasted to the prior Selector channel it offered performance improvements for high speed devices such as DASD, including:

Multiple Requesting

Allowed multiple channel programs,[lower-alpha 10] to be simultaneously active in the facility[19] as opposed to only one with a Selector channel. The actual number of subchannels provided depends upon the system model and its configuration.[20] Sometimes described as disconnected command chaining, the control unit could disconnect at various times during a chained set of CCWs, for example, disconnection for a Seek CCW, freeing the channel for another subchannel.

Command Retry

The channel and storage control under certain conditions can inter-operate to cause a CCW to be retried without an I/O interruption.[19] This procedure is initiated by the storage control and used to recover from correctable errors.

Rotational Position Sensing

Rotational position sensing (RPS) was implemented with two new CCWs, SET SECTOR and READ SECTOR enabled the channel to delay command chaining until the disk rotated to a specified angular track position. RPS permits channel disconnection during most of the rotational delay period and thus contributes to increased channel utilization. The control unit implements RPS by dividing each track into equal angular segments.[19]

Example Channel Program

The following example channel program will format a track with an R0 and three CKD records.[19]

SEEK             <cylinder/head number>
SET FILE MASK    <allow write operations>
SET SECTOR       <sector number=0>
WRITE R0         <cylinder/head/R0, key length=0, data length=6>
WRITE CKD        <cylinder/head/R1, key length, data length>
WRITE CKD        <cylinder/head/R2, key length, data length>
WRITE CKD        <cylinder/head/R3, key length, data length>

In this example the Record 0 conforms to IBM programming standards. With a block multiplexer channel the channel is free during the time the DASD is seeking and again while the disk rotates to beginning of the track. A selector channel would be busy for the entire duration of this sample program.

Defect skipping

Defect skipping allows data to be written before and after one of more surface defects[lower-alpha 11] allowing all of a track to be used except for that portion that has the defect. This also eliminates the time that was formerly required to seek to an alternate track.[21] Only a limited number of defects could be skipped so alternate tracks remained supported for those tracks with excess defects. Defect skipping was introduced in 1974 with the 3340 attached via the 3830 Model 2 Storage Control Unit[21] or integrated attachments on small systems. Defect skipping was essentially a factory only feature until 1981 when CCWs for management along with associated utilities were released.[22]

Dynamic paths

First introduced with the 3380 DASD on the 3880 Storage Control Unit[23] in 1981 the feature was included with the later CKD DASD subsystems. The dynamic path selection function controls operation of the two controllers, including simultaneous data transfer over the two paths. When supported by the operating system, each controller can serve as an alternate path in the event the other controller is unavailable.[24] Three additional commands, Set Path Group ID, Sense Path Group ID, and Suspend Multipath Reconnection, are used to support attachment of the 3380 Models having two controllers at the head of a string.[23] The Set Path Group ID command, with the dynamic path selection (DPS) function, provides greater flexibility in operations on reserved devices. Once a path group for a device has been established, it may be accessed over any path which is a member of the group to which it is reserved. In addition, on 370-XA systems which set the multipath mode bit in the function control byte (byte 0) to a 1, block multiplex reconnections will occur on the first available path which is a member of the group over which the channel program was initiated (regardless of the reservation state of the device).[23] If the controller designated in the I/O address is busy or disabled, the dynamic path selection allows an alternate path to the device to be established via another storage director and the other controller in the model AA.[24]

Nonsynchronous operation

Prior to the 1981 introduction of the 3880 director, CKD records were synchronously accessed, all activities required that one CCW be ended and the next initiated in the gaps between the CKD fields.[13] The gap size placed limitations on cable length but did provide for very high performance since complex chains of CCWs could be performed by the subsystem in real time without use of CPU memory or cycles. Nonsynchronous operation provided by the Extended CKD ("ECKD") set of CCWs removed the gap timing constraint.[13] The five additional ECKD CCWs are Define Extent, Locate Record, Write Update Data, Write Update Key and Data, and Write CKD Next Track.[23] In nonsynchronous operation, the transfer of data between the channel and the storage control is not synchronized with the transfer of data between the storage control and the device. Channel programs can be executed such that channel and storage control activities required to end execution of one command and advance to the next do not have to occur during the inter-record gap between two adjacent fields. An intermediate buffer in the storage control allows independent operations between the channel and the device. A major advantage of ECKDs is far longer cables; depending upon application it may improve performance.[13] ECKD CCWs are supported on all subsequent CKD subsystems. This example nonsynchronous channel program reads records R1 and R2 from track X'0E' in cylinder X'007F'. Both records have a key length of 8 and a data length of X'64' (10010) bytes.[23]

Define Extent       <extent= X'007F 0000' through track X'0081 000E'>
Locate Record       <cylinder = X'007F', head = X'000E'
Read Key and Data   <key record = X'001038'>
Read Data           <record = X'001108'>

Caching

Caching first introduced in DASD CKD subsystems by Memorex[25] (1978) and StorageTek[lower-alpha 12] (1981) was subsequently introduced in late 1981 by IBM on the 3880 Model 13 for models of the 3380 with dynamic pathing.[lower-alpha 13] The cache is dynamically managed by an algorithm; high activity data is accessed from the high-performance cache and low activity data is accessed from less-expensive DASD storage. A large memory in the Director, the cache, is divided into track slots that store data from the 3380 tracks. A smaller area is a directory that contains entries that allow data to be located in the cache.[26] Caches were also provided on subsequently introduced storage controls.

Other extensions

Over time a number of path control, diagnostic and/or error recovery CCWs were implemented on one or more storage controls. For example:

  • Unconditional Reserve allowed the releasing a device reserved to another channel and reserving the device to the channel issuing the command.
  • Read Multiple Count Key Data could more efficiently read full tracks allowing for more efficient backups.

Beyond System/370

Reduced CPU and memory prices and higher device and interface speeds have somewhat nullified the advantages of CKD, and support continues by IBM to this date because its flagship operating system z/OS continues to use CKD CCWs for many functions. Originally CKD records had a one-to-one correspondence to a physical track of a DASD device; however over time the records have become more and more virtualized such that in a modern IBM mainframe there is no longer a direct correspondence between the a CKD record ID and a physical layout of a track. An IBM mainframe constructs CKD track images in memory and executes the ECKD and CKD channel programs against the image. To bridge between the native fixed block sized disks and the variable length ECKD/CKD record format, the CKD track images in memory are mapped onto a series of fixed blocks suitable for transfer to and from an FBA disk subsystem.[27] Of the 83 CKD CCWs implemented for System/360 and System/370 channels 56 are emulated on System/390 and later systems.[27]

See also

Notes

  1. Most commonly a hard disk drive.
  2. data length 0 indicates EOF
  3. Because records are variable length and because the record number on the track need not be unique, the record number does not correspond to an angular displacement.
  4. In some early DASD the label was on a plug which allowed the address to be moved between DASD
  5. That is the combination of an SCU and one or more DASDs or an A-Unit with any attached B-Units.
  6. A unique combination of number of tracks and maximum track length. With this definition a double density DASD counts as a unique DASD.
  7. double density 2314, triple density 3330, double density 3350 and solid state disk
  8. Technically writes
  9. The 2302 replaced the 7230. Datamation, March 1966, p. 81
  10. In the case of the 2305, up to 8 channel programs for the same disk drive and 16 on the same SCU
  11. Number of skippable defects varies by DASD model
  12. STK 8890 CyberCache for STK 3350 compatibles
  13. Simultaneously announced was the 3880-11 using its cache in a paging mode 3350s as paging devices

References

  1. IBM 3990 Storage Control Introduction – 6th Ed. IBM. February 1994. GA32-0098-05.
  2. "Count key data". IBM Knowledge Center. IBM. Retrieved 6 August 2014.
  3. 3.0 3.1 3.2 IBM Corporation (September 1969). IBM System/360 Component Descriptions 2314 Direct Access Storage Facility and 2844 Auxiliary Storage Control (PDF). Archived from the original (PDF) on March 30, 2020. Retrieved Dec 5, 2019.
  4. IBM Corporation (November 1973). 3330 Series Disk Storage 3333 Models 1 and 11 3330 Models 1, 2, and 11 Reference Summary (PDF). Retrieved Dec 5, 2019.
  5. Houtekamer, Gilbert E.; Artis, H. Pat (1993). MVS I/O Subsystems: Configuration Management and Performance Analysis. New York: McGraw-Hill. ISBN 978-0-07-002553-0. OCLC 26096983.
  6. 6.0 6.1 6.2 "Synchronous DASD Operations". Introduction to Nonsynchronous Direct Access Storage Subsystems. International Business Machines Corporation. January 1990. GC46–4519–0.
  7. "Historical Narrative of the 1970s, US v IBM, Exhibit 14971". July 1980. p. 1051.
  8. "Direct Access Storage * 22.7GB, 12 actuators". Archived from the original on December 22, 2015.
  9. IBM System/360 Principles of Operation (PDF). IBM. Input/Output Operations. GA22-6821-7.
  10. IBM System/370 Principles of Operation (PDF). IBM. Input/Output Operations. GA22-7000-0.
  11. "Chapter 13. I/O Overview". IBM Enterprise Systems Architecture/370 Principles of Operation (PDF). IBM. SA22-7000-0.
  12. 12.0 12.1 12.2 12.3 12.4 12.5 12.6 IBM System/360 Component Descriptions - 2841 and Associated DASD (PDF). Eighth Edition. IBM. December 1969. GA26-5988-7. Archived (PDF) from the original on 2011-10-14. Retrieved 2015-12-07.
  13. 13.0 13.1 13.2 13.3 Introduction to Nonsynchronous Direct Access Storage Subsystems. IBM. January 1990. GC26-4519-0.
  14. J. Buzen (June 1975). "I/O Subsystem Architecture". Proceedings of the IEEE. 63 (6): 871. doi:10.1109/PROC.1975.9852. S2CID 68000.
  15. Derived from IBM System/360 Reference Data (Green) Card, GX20-1703-9
  16. 16.0 16.1 IBM System/360 Component Descriptions-2820 Storage Control And 2301 Drum Storage (PDF). Archived (PDF) from the original on 2016-03-04. Retrieved 2015-08-28.
  17. IBM System/360 Component Descriptions - 2841 and Associated DASD (PDF). First Edition. IBM. A26-5988-0.
  18. 18.0 18.1 IBM System/360 Component Descriptions 2314 Direct Access Storage Facility and 2844 Auxiliary Storage Control (PDF) (Seventh ed.), November 1971, GA26-3599-6
  19. 19.0 19.1 19.2 19.3 19.4 Reference Manual for IBM 2835 Storage Control and IBM 2305 Fixed Head Storage Module (PDF). October 1983. GA26-1589-5. Archived (PDF) from the original on 2016-03-04. Retrieved 2015-12-21.
  20. J. Kettner (November 2007). "Input/Output - A White Paper" (PDF). IBM. Archived from the original (PDF) on March 4, 2016.
  21. 21.0 21.1 Reference Manual for 3830 Model 1. March 1974.
  22. "Device Support Facilities, User's Guide and Reference. Release 4.0" (PDF). May 1981. pp. vi, 46, 61, 87.
  23. 23.0 23.1 23.2 23.3 23.4 IBM 3880 Storage Control Models 1, 2, 3, and 4 Description Manual. IBM. September 1987. Section 4. GA26-1661-9.
  24. 24.0 24.1 IBM 3380 Direct Access Storage Description and User's Guide (PDF). IBM. December 1981. GA26-1664-1.
  25. "Now Memorex fills the gap in your system's performance" (PDF). Datamation. August 1978. pp. 85–86.
  26. Introduction to IBM 3880 Storage Control Model 13 (PDF). IBM. September 1981. GA32-0062-0.
  27. 27.0 27.1 IBM S/390 Multiprise 3000 Enterprise Server, Internal Disk Subsystem: Reference Guide. IBM. November 1999. Archived from the original on March 4, 2016.

Further reading