Computer Organization: Cache
School of Computer Science, University of Aeronautics and Astronautics
2014/11/13

Outline (material drawn mainly from CS61C, Lectures 11 and 12):
- Memory hierarchy overview
- Direct-mapped caches
- Direct-mapped cache examples
- Cache reads and writes
- Cache performance
- Multilevel caches
- Set-associative caches
- Improving cache performance
- Multilevel cache performance in practice
- A contemporary cache example

Great Idea #3: Principle of Locality / Memory Hierarchy (memory hierarchy/層次)

Storage in a Computer
- Processor: holds data in register files (~100 bytes); registers are accessed on a sub-nanosecond timescale.
- Memory ("main memory"): more capacity than registers (~GBytes), but an access time of ~50-100 ns — hundreds of clock cycles per memory access?!
[Figure: Processor-Memory Performance Gap. Log-scale performance vs. year, per "Moore's Law": processor performance grows ~55%/year (2X/1.5yr) while DRAM performance grows ~7%/year (2X/10yrs), so the processor-memory gap grows ~50%/year. 1989: first CPU with cache on chip; 1998: the Pentium III has two cache levels on chip.]
Principle of Locality (1/3) (locality/局部性)
- Principle of Locality: programs access only a small portion of the full address space at any instant of time.
- Recall: the address space holds both code and data.
- Loops and sequential instruction execution mean generally localized code access.
- The stack and heap try to keep your data together.
- Arrays and structs naturally group data you would access together.
Principle of Locality (2/3)
- Temporal locality (locality in time; temporal locality/時間局部性): you go back to the same book on your desk multiple times. If a memory location is referenced, it will tend to be referenced again soon.
- Spatial locality (locality in space; spatial locality/空間局部性): when you go to the book shelf, you grab many books by J.D. Salinger, since the library stores related books together. If a memory location is referenced, locations with nearby addresses will tend to be referenced soon.
Principle of Locality (3/3)
- We exploit the principle of locality in hardware via a memory hierarchy, where:
  – Levels closer to the processor are faster (and more expensive per bit, so smaller)
  – Levels farther from the processor are larger (and less expensive per bit, so slower)
- Goal: create the illusion of memory that is almost as fast as the fastest memory and almost as large as the biggest memory in the hierarchy.
[Figure: Memory Hierarchy Schematic. The processor sits above Level 1, Level 2, Level 3, ..., Level n. Levels toward the processor are smaller, faster, and more expensive; levels toward the bottom are bigger, slower, and cheaper.]
Cache Concept
- Introduce an intermediate hierarchy level: a memory cache, which holds a copy of a subset of main memory.
  – As a pun, we often use $ ("cash") to abbreviate cache (e.g. D$ = Data Cache, L1$ = Level 1 Cache).
- Modern processors have separate caches for instructions and data, as well as several levels of caches implemented in different sizes.
- Caches are implemented with the same IC processing technology as the CPU and integrated on-chip — faster but more expensive than main memory.
Memory Hierarchy Technologies
- Caches use static RAM (SRAM):
  + Fast (typical access times of 0.5 to 2.5 ns)
  – Low density (6-transistor cells), higher power, expensive ($2000 to $4000 per GB in 2011)
  – Static: content will last as long as power is on.
- Main memory uses dynamic RAM (DRAM):
  + High density (1-transistor cells), lower power, cheaper ($20 to $40 per GB in 2011)
  – Slower (typical access times of 50 to 70 ns)
  – Dynamic: needs to be "refreshed" regularly (~ every 8 ms).
Memory Transfer in the Hierarchy
- Processor ? L1$ ? L2$ ? Main Memory ? Secondary Memory
- Inclusive: data in L1$ ? data in L2$ ? data in MM ? data in SM (each level holds a subset of the level below it).
- Block: the unit of transfer between memory and cache.
Managing the Hierarchy
- cache ? main memory
  – By the cache controller hardware
- registers ? memory
  – By the compiler (or assembly-level programmer)
- main memory ? disks (secondary storage)
  – By the OS (virtual memory, which is a later topic); the virtual-to-physical address map is assisted by the hardware (TLB)
  – By the programmer (files)
We are here.

[Figure: Typical Memory Hierarchy. On-chip components: control, datapath, register file, and separate instruction and data caches; then a second-level cache (SRAM), main memory (DRAM), and secondary memory (disk or flash). Speed in cycles: ?'s → 1's → 10's → 100's → 1,000,000's; size in bytes: 100's → 10K's → M's → G's → T's; cost per bit: highest near the processor, lowest at secondary memory.]
Cache Management
- Library analogy: organization is necessary!
- What is the overall organization of blocks we impose on our cache?
  – Where do we put a block of data from memory?
  – How do we know if a block is already in cache?
  – How do we quickly find a block when we need it?
  – When do we replace something in the cache?
General Notes on Caches
- Recall: memory is byte-addressed.
- We haven't specified the size of our "blocks," but it will be a multiple of the word size (32 bits).
- How do we access individual words or bytes within a block? → OFFSET
- The cache is smaller than memory: we can't fit all blocks at once, so multiple blocks in memory map to the same cache slot (row). → INDEX
- We need some way of identifying which memory block is currently in the row. → TAG
Direct-Mapped Caches (1/3)
- Each memory block is mapped to exactly one row in the cache (direct-mapped).
  – Use a simple hash function.
- Effect of block size:
  – Spatial locality dictates that our blocks consist of adjacent bytes, which differ in address by 1.
  – Offset field: the lowest bits of the memory address can be used to index to specific bytes within a block.
  – Block size needs to be a power of two (in bytes).
Direct-Mapped Caches (2/3)
- Effect of cache size (total stored data):
  – Determines the number of blocks the cache holds; if it could hold all of memory, it would use all remaining bits (minus the offset bits) to select the appropriate row of the cache.
- Index field: apply a hash function to the remaining bits to determine which row the block goes in:
  (block address) modulo (# of blocks in the cache)
- Tag field: the leftover upper bits of the memory address determine which portion of memory the block came from (an identifier).
TIO Address Breakdown
- Memory address fields, from bit 31 down to bit 0: Tag (T bits) | Index (I bits) | Offset (O bits)
- Meaning of the field sizes:
  – O bits ? 2^O bytes/block = 2^(O-2) words/block
  – I bits ? 2^I rows in cache = cache size / block size
  – T bits = A – I – O, where A = # of address bits (A = 32 here)
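A quick sketch of the T/I/O arithmetic above, in Python (the function name and structure are mine, not from the slides):

    import math

    def tio_breakdown(addr_bits, cache_bytes, block_bytes):
        # Split an address into Tag/Index/Offset field widths for a
        # direct-mapped cache; sizes must be powers of two.
        o = int(math.log2(block_bytes))                 # byte within block
        i = int(math.log2(cache_bytes // block_bytes))  # cache row
        t = addr_bits - i - o                           # remaining tag bits
        return t, i, o

    # The 64 B address space / 4 B blocks / 16 B cache example coming up:
    print(tio_breakdown(6, 16, 4))   # -> (2, 2, 2)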
Direct-Mapped Caches (3/3)
- What's actually in the cache?
  – Each row contains the actual data block to store: B bits = 8 × 2^O bits.
  – In addition, we must save the Tag field of the address as an identifier (T bits).
  – Valid bit: indicates whether the block in that row is valid or not.
- Total bits in cache = # rows × (B + T + 1) = 2^I × (8 × 2^O + T + 1) bits.
Cache Example (1/2)
- Cache parameters:
  – Address space of 64 B, block size of 1 word, cache size of 4 words.
- TIO breakdown:
  – 1 word = 4 bytes, so O = log2(4) = 2.
  – Cache size / block size = 4, so I = log2(4) = 2.
  – A = log2(64) = 6 bits, so T = 6 – 2 – 2 = 2.
- Bits in cache = 2^2 × (8×2^2 + 2 + 1) = 140 bits.
- Memory addresses in the accompanying figure are shown as block addresses.
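To double-check the 140-bit arithmetic, a small self-contained sketch mirroring the slide's formula (function name is mine):

    import math

    def cache_bits(addr_bits, cache_bytes, block_bytes):
        # Total direct-mapped cache storage (data + tag + valid bits),
        # per the formula 2^I * (8*2^O + T + 1).
        o = int(math.log2(block_bytes))
        i = int(math.log2(cache_bytes // block_bytes))
        t = addr_bits - i - o
        return (2 ** i) * (8 * 2 ** o + t + 1)

    print(cache_bits(6, 16, 4))  # -> 140, matching the slide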
Cache Example (2/2)
- Main memory is shown in blocks, so the offset bits are not shown (x's). Which blocks map to each row of the cache? Every block whose Index bits match that row (the original figure shows this with colors).
- The cache has four rows (Index 00, 01, 10, 11), each holding a Valid bit, a Tag, and a Data block.
- On a memory request (let's say the address 001011):
  1) Take the Index field (10); cache rows exactly match the Index field.
  2) Check if the Valid bit is true in that row of the cache.
  3) If valid, then check if the Tag matches.

Direct-Mapped Cache Internals
[Figure: a direct-mapped cache with 4 words/block and a cache size of 1 Ki words. The 32-bit address splits into a 20-bit Tag (bits 31-12), an 8-bit Index (bits 11-4) selecting one of 256 rows, a 2-bit block offset (bits 3-2) selecting one of four 32-bit data words, and a 2-bit byte offset (bits 1-0). The stored Tag is compared with the address Tag and ANDed with the Valid bit to produce the Hit signal alongside the 32-bit Data output.]
Caching Terminology (1/2) (terminology/術(shù)語, miss/缺失)
- When reading memory, 3 things can happen:
  – Cache hit: the cache block is valid and contains the proper address, so read the desired word.
  – Cache miss: nothing in that row of the cache (not valid), so fetch from memory.
  – Cache miss with block replacement: the wrong block is in the row, so discard it and fetch the desired data from memory.
Caching Terminology (2/2) (penalty/代價)
- How effective is your cache? You want to maximize cache hits and minimize cache misses.
  – Hit rate (HR): the percentage of memory accesses in a program or set of instructions that result in a cache hit.
  – Miss rate (MR): like hit rate, but for cache misses; MR = 1 – HR.
- How fast is your cache?
  – Hit time (HT): time to access the cache (including the Tag comparison).
  – Miss penalty (MP): time to replace a block in the cache from a lower level in the memory hierarchy.
Sources of Cache Misses: The 3Cs (compulsory/強制)
- Compulsory (cold start or process migration, 1st reference): the first access to a block is impossible to avoid; the effect is small for long-running programs.
- Capacity: the cache cannot contain all the blocks accessed by the program.
- Conflict (collision): multiple memory locations are mapped to the same cache location.
Direct-Mapped Cache Example (modified by GXP)
- Consider the sequence of memory address accesses 0 1 2 3 4 3 4 15. Start with an empty cache — all blocks initially marked as not valid.
- Address space: 16 B, block size: 1 B, cache size: 4 B, so TIO = 2-2-0.
  – Addresses in binary: 0000, 0001, 0010, 0011, 0100, 0011, 0100, 1111.
  – 0 miss, 1 miss, 2 miss, 3 miss (compulsory misses fill rows 00-11); 4 miss (replaces Mem(0) in row 00); 3 hit; 4 hit; 15 miss (replaces Mem(3) in row 11).
  – 8 requests, 6 misses (HR = 0.25, MR = 0.75).

Taking Advantage of Spatial Locality
- Let a cache block hold more than one byte. Same access sequence, again starting with an empty cache.
- Address space: 16 B, block size: 2 B, cache size: 4 B, so TIO = 2-1-1.
  – 0 miss (brings in Mem(1)/Mem(0)); 1 hit; 2 miss (brings in Mem(3)/Mem(2)); 3 hit; 4 miss (replaces the Mem(1)/Mem(0) block with Mem(5)/Mem(4)); 3 hit; 4 hit; 15 miss (replaces Mem(3)/Mem(2) with Mem(15)/Mem(14)).
  – 8 requests, 4 misses (HR = 0.5, MR = 0.5).
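Both traces can be reproduced with a tiny direct-mapped simulator — a sketch (names are mine, not from the slides):

    def simulate_direct_mapped(accesses, block_bytes, cache_bytes):
        # Count hits/misses for a sequence of byte addresses in a
        # direct-mapped cache with the given block and cache sizes.
        n_rows = cache_bytes // block_bytes
        rows = [None] * n_rows            # block address held by each row
        hits = 0
        for addr in accesses:
            block = addr // block_bytes
            row = block % n_rows
            if rows[row] == block:
                hits += 1
            else:
                rows[row] = block         # miss: fetch block, maybe replacing
        return hits, len(accesses) - hits

    seq = [0, 1, 2, 3, 4, 3, 4, 15]
    print(simulate_direct_mapped(seq, 1, 4))  # -> (2, 6): HR = 0.25
    print(simulate_direct_mapped(seq, 2, 4))  # -> (4, 4): HR = 0.5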
Effect of Block and Cache Sizes on Miss Rate
[Figure: miss rate (%, 0-10) vs. block size (16 to 256 bytes) for cache sizes of 4 KB, 16 KB, 64 KB, and 256 KB.]
- Miss rate goes up if the block size becomes a significant fraction of the cache size, because the number of blocks that can be held in the same size cache is smaller (increasing capacity misses).
The Cache Consistency Problem

Time | Event            | Cache contents | Main memory at location X
 0   |                  |                | 1
 1   | A reads X        | 1              | 1
 2   | A writes 0 to X  | 0              | 1

Cache Reads and Writes
- We want to handle reads and writes quickly while maintaining consistency (consistency/一致性) between cache and memory (i.e. both know about all updates).
  – Policies for cache hits and misses are independent.
- Here we assume the use of separate instruction and data caches (I$ and D$):
  – Read from both.
  – Write only to D$ (assume no self-modifying code).
Handling Cache Hits (Write-Through)
- Read hits (I$ and D$):
  – The fastest possible scenario, so we want more of these.
- Write hits (D$):
  – Write-Through Policy: always write data to the cache and to memory (through the cache).
  – Forces cache and memory to always be consistent.
  – Slow! (every memory access is long)
  – Include a Write Buffer that updates memory in parallel with the processor (assumed present in all schemes when writing to memory).
Handling Cache Hits (Write-Back)
- Read hits (I$ and D$):
  – The fastest possible scenario, so we want more of these.
- Write hits (D$):
  – Write-Back Policy: write data only to the cache, then update memory when the block is removed.
  – Allows cache and memory to be inconsistent: multiple writes are collected in the cache, with a single write to memory per block.
  – Dirty bit: an extra bit per cache row that is set if the block was written to (is "dirty") and needs to be written back.
  – (A small write-policy sketch follows the miss-handling slides below.)
Handling Cache Misses (Write Allocate)
- The miss penalty grows as the block size does.
- Read misses (I$ and D$):
  – Stall execution, fetch the block from memory, put it in the cache, send the requested word to the processor, resume.
- Write misses (D$):
  – Write allocate: fetch the block from memory, put it in the cache, then execute a write hit.
  – Works with either write-through or write-back.
  – Ensures the cache is up-to-date after a write miss.
Handling Cache Misses (No-Write Allocate)
- The miss penalty grows as the block size does.
- Read misses (I$ and D$):
  – Stall execution, fetch the block from memory, put it in the cache, send the requested word to the processor, resume.
- Write misses (D$):
  – No-write allocate: skip the cache altogether and write directly to memory.
  – The cache is never up-to-date after a write miss.
  – Ensures memory is always up-to-date.
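A runnable toy model of the four write policies above (write-through vs. write-back on hits, write allocate vs. no-write allocate on misses). This is an illustrative sketch with one word per block; the class and method names are mine, not slide or hardware code:

    class ToyCache:
        def __init__(self, n_rows, write_back=True, write_allocate=True):
            self.rows = [{"block": None, "data": 0, "dirty": False}
                         for _ in range(n_rows)]
            self.write_back = write_back
            self.write_allocate = write_allocate

        def _fetch(self, block, memory):
            row = self.rows[block % len(self.rows)]
            if row["block"] != block:                    # miss in this row
                if self.write_back and row["dirty"]:
                    memory[row["block"]] = row["data"]   # write back old block
                row.update(block=block, data=memory.get(block, 0), dirty=False)
            return row

        def write(self, block, value, memory):
            row = self.rows[block % len(self.rows)]
            if row["block"] != block and not self.write_allocate:
                memory[block] = value                    # no-write allocate: bypass cache
                return
            row = self._fetch(block, memory)             # write allocate: fetch, then hit
            row["data"] = value
            if self.write_back:
                row["dirty"] = True                      # defer update until eviction
            else:
                memory[block] = value                    # write-through: update memory now

    memory = {}
    c = ToyCache(4)
    c.write(0, 42, memory)
    print(memory.get(0))   # None: write-back has not reached memory yet
    c._fetch(4, memory)    # block 4 maps to the same row, evicting dirty block 0
    print(memory.get(0))   # 42: the dirty block was written back on eviction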
Summary
- The memory hierarchy exploits the principle of locality to deliver lots of memory at fast speeds.
- Direct-Mapped Cache: each block in memory maps to exactly one row in the cache.
  – Index to determine which row.
  – Offset to determine which byte within the block.
  – Tag to identify if it's the block you want.
- Cache read and write policies:
  – Write-back and write-through for hits.
  – Write allocate and no-write allocate for misses.
Great Idea #3: Principle of Locality / Memory Hierarchy

Cache Performance
- Two things hurt the performance of a cache:
  – Miss rate and miss penalty.
- Average Memory Access Time (AMAT): the average time to access memory, considering both hits and misses:
  AMAT = Hit time + Miss rate × Miss penalty
  (abbreviated AMAT = HT + MR × MP)
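As a one-line sketch in Python (units are whatever you pass in — cycles here):

    def amat(ht, mr, mp):
        # AMAT = hit time + miss rate * miss penalty
        return ht + mr * mp

    print(amat(1, 0.02, 50))  # -> 2.0 cycles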
AMAT Example Usage
- Processor specs: 200 ps clock, MP of 50 clock cycles, MR of 0.02 misses/instruction, and HT of 1 clock cycle.
  AMAT = 1 + 0.02 × 50 = 2 clock cycles = 400 ps
- Which improvement would be best?
  – 190 ps clock: 380 ps
  – MP of 40 clock cycles: 360 ps
  – MR of 0.015 misses/instruction: 350 ps
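Checking the three candidates numerically (a sketch; the function name is mine):

    def amat_ps(clock_ps, ht_cycles, mr, mp_cycles):
        # AMAT in picoseconds for a given clock period
        return clock_ps * (ht_cycles + mr * mp_cycles)

    print(amat_ps(200, 1, 0.02, 50))   # baseline     -> 400.0 ps
    print(amat_ps(190, 1, 0.02, 50))   # faster clock -> 380.0 ps
    print(amat_ps(200, 1, 0.02, 40))   # lower MP     -> 360.0 ps
    print(amat_ps(200, 1, 0.015, 50))  # lower MR     -> 350.0 ps (best)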
Cache Parameter Example
- What is the potential impact of a much larger cache on AMAT? (same block size)
  – Increases HR.
  – Longer HT: smaller is faster.
  – At some point, the increase in hit time for a larger cache may overcome the improvement in hit rate, yielding a decrease in performance.
- Effect on TIO? Bits in cache? Cost?
Effect of Cache Performance on CPI
- Recall CPU performance:
  CPU Time = Instructions (IC) × CPI × Clock Cycle Time (CC)
- Include memory accesses in CPI:
  CPIstall = CPIbase + Average Memory-stall Cycles
  CPU Time = IC × CPIstall × CC
- Simplified model for memory-stall cycles:
  Memory-stall cycles = accesses/instruction × miss rate × miss penalty
- We will discuss more complicated models soon.
CPI Example (setup)
- Processor specs: CPIbase of 1, a 100-cycle MP, 36% load/store instructions, and 2% I$ and 4% D$ MRs.
  – How many times per instruction do we access the I$? The D$?
  – MP is assumed the same for both I$ and D$.
  – Memory-stall cycles will be the sum of the stall cycles for both I$ and D$.
CPI Example (solution)
- Processor specs: CPIbase of 1, a 100-cycle MP, 36% load/store instructions, and 2% I$ and 4% D$ MRs.
  Memory-stall cycles = (100% × 2% + 36% × 4%) × 100 = 3.44
                          (I$)       (D$)
  CPIstall = 1 + 3.44 = 4.44 (more than 3x CPIbase!)
- What if the CPIbase is reduced to 1?
- What if the D$ miss rate went up by 1%?
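The same arithmetic as a sketch, assuming (as the slide does) that every instruction accesses the I$ and only loads/stores access the D$:

    def cpi_stall(cpi_base, i_mr, d_mr, ls_frac, mp):
        # CPIstall = CPIbase + memory-stall cycles
        return cpi_base + (1.0 * i_mr + ls_frac * d_mr) * mp

    print(cpi_stall(1, 0.02, 0.04, 0.36, 100))  # -> 4.44
    print(cpi_stall(1, 0.02, 0.05, 0.36, 100))  # D$ MR up 1% -> 4.8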
The 3Cs Revisited: Design Solutions
- Compulsory: increase block size (increases MP; too-large blocks could increase MR).
- Capacity: increase cache size (may increase HT).
- Conflict: increase cache size; increase associativity (may increase HT).
Multiple Cache Levels
- With advancing technology, we have more room on the die for bigger L1 caches and for L2 (and in some cases even L3) caches.
  – Normally the lower-level caches are unified (i.e. each holds both instructions and data).
- Multilevel caching is a way to reduce the miss penalty.
- So what does this look like?
Multilevel Cache Diagram
[Figure: CPU → L1$ → L2$ → ... → Main Memory. A memory access that hits in L1$ returns data immediately; on a miss, the request goes to L2$, and so on down to main memory. On the way back, each level stores the block as the data returns to the CPU. Legend: request for data, return of data, store, path of data back to CPU.]
Multilevel Cache AMAT
- AMAT = L1 HT + L1 MR × L1 MP
  – Now the L1 MP depends on the other cache levels:
    L1 MP = L2 HT + L2 MR × L2 MP
  – If there are more levels, continue this chain (i.e. MP_i = HT_(i+1) + MR_(i+1) × MP_(i+1)); the final MP is the main memory access time.
- For two levels:
  AMAT = L1 HT + L1 MR × (L2 HT + L2 MR × L2 MP)
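The chain generalizes naturally; a sketch for any number of levels (function name mine):

    def chained_amat(hts, mrs, mem_cycles):
        # Fold MP_i = HT_(i+1) + MR_(i+1) * MP_(i+1), starting from the
        # main memory access time as the innermost miss penalty.
        mp = mem_cycles
        for ht, mr in zip(reversed(hts[1:]), reversed(mrs[1:])):
            mp = ht + mr * mp
        return hts[0] + mrs[0] * mp

    print(chained_amat([1, 5], [0.02, 0.05], 100))  # -> 1.2 (next slide's example)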
Multilevel Cache AMAT Example
- Processor specs: 1-cycle L1 HT, 2% L1 MR, 5-cycle L2 HT, 5% L2 MR, 100-cycle main memory HT.
  – Here assuming a unified L1$.
- Without L2$: AMAT1 = 1 + 0.02 × 100 = 3
- With L2$:    AMAT2 = 1 + 0.02 × (5 + 0.05 × 100) = 1.2
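Verifying both numbers directly:

    l1_ht, l1_mr = 1, 0.02
    l2_ht, l2_mr, mem = 5, 0.05, 100
    print(l1_ht + l1_mr * mem)                    # without L2$ -> 3.0
    print(l1_ht + l1_mr * (l2_ht + l2_mr * mem))  # with L2$    -> 1.2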
Local vs. Global Miss Rates
- Local miss rate: the fraction of references to one level of a cache that miss.
  – e.g. L2$ local MR = L2$ misses / L1$ misses.
  – Specific to a level of caching (as used in AMAT).
- Global miss rate: the fraction of all references that miss in all levels of a multilevel cache.
  – A property of the overall memory hierarchy.
  – The global MR is the product of all the local MRs: start with Global MR = Ln misses / L1 accesses and expand.
  – So by definition, global MR ≤ any local MR.
Memory Hierarchy with Two Cache Levels
- CPU → L1$ (1 cycle) → L2$ (10 cycles) → MM (100 cycles); per 1000 memory references, 40 reach L2$ and 20 reach MM.
- For every 1000 CPU-to-memory references:
  – 40 will miss in L1$; what is the local MR? 0.04
  – 20 will miss in L2$; what is the local MR? 0.5
  – Global miss rate? 0.02
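The same numbers in a couple of lines:

    refs, l1_misses, l2_misses = 1000, 40, 20
    print(l1_misses / refs)        # L1 local MR -> 0.04
    print(l2_misses / l1_misses)   # L2 local MR -> 0.5
    print(l2_misses / refs)        # global MR   -> 0.02 (= 0.04 * 0.5)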
Design Considerations
- L1$ focuses on low hit time (fast access):
  – Minimize HT to achieve a shorter clock cycle.
  – L1 MP is significantly reduced by the presence of an L2$, so L1 can be smaller/faster even with a higher MR (e.g. a smaller $ with fewer rows).
- L2$, L3$ focus on low miss rate:
  – Avoid reaching main memory (heavy penalty) as much as possible (e.g. a larger $ with larger block sizes, same # of rows).
Reducing Cache Misses
- Allow more flexible block placement in the cache:
  – Direct-mapped: a memory block maps to exactly one cache block.
  – Fully associative: a memory block can go in any slot.
  – N-way set-associative: divide the $ into sets, each of which consists of n slots in which to place a memory block. A memory block maps to a set determined by the Index field and is placed in any of the n slots of that set.
    Hash function: (block address) modulo (# sets in the cache).
Block Placement Schemes
- Place memory block 12 in a cache that holds 8 blocks:
  – Direct-mapped: can only go in row (12 mod 8) = 4.
  – Fully associative: can go in any of the slots (1 set/row).
  – 2-way set associative: can go in either slot of set (12 mod 4) = 0.
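A sketch that enumerates the candidate slots for each placement scheme (function name mine, not from the slides):

    def candidate_slots(block_addr, n_blocks, n_ways):
        # n_ways=1 is direct-mapped; n_ways=n_blocks is fully associative
        n_sets = n_blocks // n_ways
        s = block_addr % n_sets                 # the set index
        return [s * n_ways + way for way in range(n_ways)]

    print(candidate_slots(12, 8, 1))  # direct-mapped     -> [4]
    print(candidate_slots(12, 8, 8))  # fully associative -> [0, 1, ..., 7]
    print(candidate_slots(12, 8, 2))  # 2-way, set 0      -> [0, 1]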
Effect of Associativity on TIO (1/2)
- Here we assume a cache of fixed size (C).
- Offset: # of bytes in a block (same as before).
- Index: instead of pointing to a row, it now points to a set, so the number of sets is C/B/associativity and 2^I = C/B/associativity.
  – Fully associative (1 set): 0 Index bits!
  – Direct-mapped (associativity of 1): max Index bits.
  – Set associative: somewhere in between.
- Tag: the remaining identifier bits (T = A – I – O).
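The Index computation with associativity folded in — a sketch (name mine):

    import math

    def set_associative_tio(addr_bits, cache_bytes, block_bytes, n_ways):
        # 2^I = C / B / associativity: I now indexes a set, not a row
        o = int(math.log2(block_bytes))
        i = int(math.log2(cache_bytes // block_bytes // n_ways))
        t = addr_bits - i - o
        return t, i, o

    print(set_associative_tio(6, 16, 4, 1))  # direct-mapped     -> (2, 2, 2)
    print(set_associative_tio(6, 16, 4, 2))  # 2-way             -> (3, 1, 2)
    print(set_associative_tio(6, 16, 4, 4))  # fully associative -> (4, 0, 2)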
Effect of Associativity on TIO (2/2)
[Figure: the associativity spectrum, from direct mapped (only one way) through set associative to fully associative (only one set); associativity increases to the right.]
- For a fixed-size cache, each increase by a factor of two in associativity doubles the number of blocks per set (i.e. the number of slots) and halves the number of sets — decreasing the size of the Index by 1 bit and increasing the size of the Tag by 1 bit.
- Address fields: Tag (used for tag comparison) | Index (selects the set) | Block offset (selects the word in the block) | Byte offset.
Example
(1/2)Cache
parameters:6-bit
addresses,
block
size
of
1
word,cache
size
of
4
words,
2-way
set
associativeHow
many
sets?C/B/associativ
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 2024年甲乙雙方關(guān)于新一代智能電氣安裝工程全面合作合同
- 2024招投標(biāo)管理部門風(fēng)險防控及合同履行責(zé)任書3篇
- 浙江工商大學(xué)《地貌學(xué)》2023-2024學(xué)年第一學(xué)期期末試卷
- 2024蘇州二手房買賣與家居智能化改造服務(wù)合同3篇
- 貨代公司知識培訓(xùn)課件
- 商品基礎(chǔ)知識培訓(xùn)課件
- 稅務(wù)工作總結(jié)稅收違法違章行為查處整改
- 2024智能供應(yīng)鏈管理系統(tǒng)建設(shè)與運營合同
- 房屋租賃行業(yè)市場營銷策略總結(jié)
- 西南財經(jīng)大學(xué)《商務(wù)實踐活動一》2023-2024學(xué)年第一學(xué)期期末試卷
- 檢驗科lis系統(tǒng)需求
- 疏散樓梯安全要求全解析
- 汽車擾流板產(chǎn)品原材料供應(yīng)與需求分析
- 中東及非洲空氣制水機行業(yè)現(xiàn)狀及發(fā)展機遇分析2024-2030
- DL∕T 1631-2016 并網(wǎng)風(fēng)電場繼電保護配置及整定技術(shù)規(guī)范
- PLC控制系統(tǒng)合同(2024版)
- 煤礦立井井筒及硐室設(shè)計規(guī)范
- 房地產(chǎn)項目開發(fā)合作協(xié)議書
- JJG(交通) 171-2021 超聲式成孔質(zhì)量檢測儀檢定規(guī)程
- QCT457-2023救護車技術(shù)規(guī)范
- 《中國大熊貓》課件大綱
評論
0/150
提交評論