next up previous
Next: Cycle Time Up: Total Access and Cycle Previous: Total Access and Cycle

Access Time

There are two potential critical paths in a cache read access. If the time to read the tag array, perform the comparison, and drive the multiplexor select signals is larger than the time to read the data array, then the tag side is the critical path, while if it takes longer to read the data array, then the data side is the critical path. In many cache implementations, the designer would try to margin the cache design such that the tag path is slightly faster than the data path so that the multiplexor select signals are valid by the time the data is ready. Often, however, this is not possible. Therefore, either side could determine the access time, meaning both sides must be modeled in detail.

In a direct-mapped cache, the access time is the larger of the two paths:

displaymath1398

where tex2html_wrap_inline1402 is the delay of the decoder, wordline, bitline, and sense amplifier for the data array, tex2html_wrap_inline1404 is the delay of the decoder, wordline, bitline, sense amplifier, and comparator for the tag array, tex2html_wrap_inline1406 is the delay of the cache data output driver, and tex2html_wrap_inline1408 is the delay of the valid signal driver.

In a set-associative cache, the tag array must be read before the data signals can be driven. Thus, the access time is:

displaymath1399

where tex2html_wrap_inline1410 is the same as tex2html_wrap_inline1404 , except that it includes the time to drive the select lines of the output multiplexors.

   figure411
Figure 13: Direct mapped: tex2html_wrap_inline1414

   figure417
Figure 14: Direct mapped: tex2html_wrap_inline1404

Figures 13 to 16 show analytical and Hspice estimations of the data and tag sides for direct-mapped and 4-way set-associative caches. A tex2html_wrap_inline1418 CMOS process was assumed [6]. To gather these results, the model was first used to find the array organization parameters which resulted in the lowest access time via exhaustive search for each cache size. These optimum parameters are shown in the figures (the six numbers associated with each point correspond to tex2html_wrap_inline1140 , tex2html_wrap_inline1142 , tex2html_wrap_inline1100 , tex2html_wrap_inline1154 , tex2html_wrap_inline1156 , and tex2html_wrap_inline1158 in that order). The parameters were then used in the Hspice model. As the graphs show, the difference between the analytical and Hspice results is less than 6% in every case.

   figure431
Figure: 4-way set associative: tex2html_wrap_inline1414

   figure437
Figure: 4-way set associative: tex2html_wrap_inline1112


next up previous
Next: Cycle Time Up: Total Access and Cycle Previous: Total Access and Cycle

Steve Wilton
Tue Jul 30 15:38:35 EDT 1996