联系方式

  • QQ:99515681
  • 邮箱:99515681@qq.com
  • 工作时间:8:00-21:00
  • 微信:codinghelp

您当前位置:首页 >> Algorithm 算法作业Algorithm 算法作业

日期:2023-07-13 09:41

Initials:

1. Assembly and ISA! [25 points]

During the semester, we learn how x86 can be assembled and deassembled into assembly code and

binaries in assignment 4 task 2. In this question, one of your TAs would like to be cool and design

his own ISAs. Consider the following 16-bit MUIC-IS-COOL-ISA with the following features.

• The ISA is byte addressable and there are 8 16-bit registers from R0 to R7.

• The machine stop its execution whenever the decoder observes the instruction JMP 0, in which

case it finish all remaining instructions in the pipeline.

• There are 3 status bits: negative, zero, overflow. The negative bit is set to true if the

destination register yield a negative result, in which case value of the destination register is the

leftmost 16 bits. The zero bit is set to true if the destination register yield zero. The overflow

bit is set to true if the destination register overflows (i.e., result in a number higher than 16

bits, in which case the destination register stores the leftmost 16 bits value).

Instruction Type Opcode Op1 Op2 Op3 Unused

Bits location Higher bits <----------> Lower bits

Compute R Rs1, Rs2, Rd 4 bits 3 bits 3 bits 3 bits 3 bits

Compute I Rs1, Rd, IMM 4 bits 3 bits 3 bits 6 bits None

Memory Type 1 Rd, Rs, IMM 4 bits 3 bits 3 bits 6 bits None

Memory Type 2 Rd, IMM 4 bits 3 bits 9 bits None

Cond. Type 1 IMM 4 bits 12 bits None

Cond. Type 2 Rd 4 bits 3 bits 6 Bits

Cond. Type 3 Rs1, Rs2, Rd 4 bits 3 bits 3 bits 3 bits 3 Bits

Table 1: Bit pattern for each instruction types. The most significant bit is on the leftmost side and

the least significant bit is on the rightmost side.

2/15

Initials:

Instruction Opcode Op1 Op2 Op3 Description

ADD 0000 Rs1 Rs2 Rd Rd <= Rs1 + Rs2

ADDI 0001 Rs1 Rd IMM Rd <= Rs1 + IMM

SUB 0010 Rs1 Rs2 Rd Rd <= Rs1 − Rs2

SUBI 0011 Rs1 Rd IMM Rd <= Rs1 − IMM

AND 0100 Rs1 Rs2 Rd Rd <= Rs1andRs2

OR 0101 Rs1 Rs2 Rd Rd <= Rs1orRs2

XOR 0110 Rs1 Rs2 Rd Rd <= Rs1xorRs2

LD 0111 Rd Rs IMM Rd <= mem[Rs + IMM]

LDI 1000 Rd IMM Rd <= IMM

ST 1001 Rd Rs IMM Rs => mem[Rd + IMM]

JMP 1010 IMM P C <= IMM

JMPR 1011 Rd P C <= Rd

BLT 1100 Rs1 Rs2 Rd If Rs1 < Rs2 then P C <= Rd else

P C <= P C + 2

BGT 1101 Rs1 Rs2 Rd If Rs1 > Rs2 then P C <= Rd else

P C <= P C + 2

BNE 1110 Rs1 Rs2 Rd If Rs1 = Rs2 then P C <= Rd else

P C <= P C + 2

LDPC 1111 Rd Rd <= P C

Table 2: All instructions in the MUIC-IS-COOL-ISA. Rs1 is the input source 1, Rs2 is the input

source 2, Rd is the destination (output), and IMM is the immediate (constant value). Register bits

are denoted based on their register ID. For example, if Rs1 is R3, it will have the value equal to 3 in

the appropriate register field in the binary instruction.

(a) Now that we have establish the ISA specification. (15 points)

Assume PC starts at 0x30. What is the code (in MUIC-IS-COOL assembly) from the memory

snapshot below. Note that for this memory snapshot, the bits within the data word in

the table below is sorted using [highest bit – lowest bit] format (i.e., if the data word

is 0x1234, then the word is 0b’0001 0010 0011 0100).

Address Values (in hex) [Lowest address – Highest address]

0x00 00 01 02 03 04 05 06 07 08 09 0a 0b 0c 0d 0e 0f

0x10 00 01 02 03 04 05 06 07 08 09 0a 0b 0c 0d 0e 0f

0x20 00 01 02 03 04 05 06 07 08 09 0a 0b 0c 0d 0e 0f

0x30 80 00 8a 00 8c 14 fe f2 72 14 0a 6b 10 04 c1 ba

0x40 91 40 a0 00 e2 ff 01 0f ff 2e be ef 24 31 a0 00

0x50 19 15 12 0a 6b 3a 4b 12 91 ac ff fe 3c 3d 3e 4f

0x60 12 50 62 8a 5e 5f df ea 99 ac 74 6b 91 44 33 ef

0x70 70 71 72 73 74 75 76 77 78 79 7a 7b 7c 7d 7e 7f

0x80 80 81 82 83 84 85 86 87 88 89 8a 8b 8c 8d 8e 8f

0x90 90 91 92 93 94 95 96 97 98 99 9a 9b 9c 9d 9e 9f

0xa0 80 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 02 e1 ba

0xb0 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0xc0 80 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 01 e1 ba

0xd0 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0xe0 70 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 03 e1 ba

0xf0 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

3/15

Initials:

Instruction Bits Pattern

(b) What are the values of all the registers inside the register files after the program finishes? You

can put in XX for an unknown value. (10 points)

Address Values (in Decimal)

R0

R1

R2

R3

R4

R5

R6

R7

4/15

Initials:

2. Jump Table [20 points]

In this question, consider the following assembly codes below. Fill in the rest of the C code for each

of the switch cases. Write ”NOTHING HERE” if the space should be left blank or if that line of code

should not exist (i.e., the program does not support to modify result at that line).

Assume that a is in %rdi and b is in %rsi

quiz:

pushl %ebp

movl %esp, %ebp

movl %rdi, %edx

movl %rsi, %eax

cmpl $8, %edx

ja .L2

jmp *.L9(,%edx,4)

.section .rodata

.align 4

.align 4

.L9:

.long .L2

.long .L4

.long .L2

.long .L5

.long .L4

.long .L2

.long .L6

.long .L2

.long .L2

.text

.L4:

movl %edx, %eax

jmp .L6

.L5:

decl %eax

jmp .L3

.L6:

incl %eax

jmp .L3

.L2:

movl $1, %eax

.L3:

popl %ebp

ret

5/15

Initials:

In the space below, fill in the blank to reflect the assembly code above.

int quiz(int a, int b)

{

int result = ________;

int temp = ________; // Feel free to use this if you need to store

// any temp variable. Leave blank if not needed.

switch(________)

{

case ________:

case ________:

result = _________;

break;

// Feels free to use as many of these as you want,

// but the corrent answer does not use all of these cases.

case ________:

result = _________;

break;

case ________:

result = _________;

break;

case ________:

result = _________;

break;

case ________:

result = _________;

break;

case ________:

result = _________;

break;

default:

result = _________;

}

return result;

}

6/15

Initials:

3. Caching [20 points]

In this question, let’s assume that we have a 16-bit system with a single level 6-way set associative

cache with 8 sets, and a cache block size of 64 bytes.

How many bits are needed for the setID and the tags? Draw the breakdown of the tag/index/bytein-block bits.

What is the total size of this cache?

For the following program, assume that an integer is 4 bytes.

int i; // Assume these variables are stored in the registers.

int a[65536]; // Assume that a = 0x1000

int b[65536]; // Assume that b = 0x8000000

for(i=0;i<1024;i++)

a[i * 8 ] = i;

for(i=0;i<1024;i++)

b[i * 8 ] = a[i * 8 ]++;

Let’s asssume that the array is initialized to zero and that the variable i is already stored on the

register. What is my cache hit rate? Show your work.

7/15

Initials:

4. Cache Mystery [30 points]

A byte-addressable system with 16-bit addresses ships with a three-way set associative, write-back

cache (i.e., each block needs a dirty bit). The cache implements a true LRU replacement policy using

the minimum number of replacement policy bits necessary to implement it, which means it requires 3

bits per set. The tag store requires a total of 264 bits of storage. What is the block size of the cache?

(Hint: 264 = 28 + 23 and please also do not forget that aside from the tag itself, each block needs 1

valid bit, 1 dirty bit).

Answer:

Show all your work.

8/15

Initials:

5. Virtual Memory [30 points]

Let’s create a simple BIG endian machine that utilize two-level page table with a 4KB page size

(similar to what we learn in class), 4KB page table, and this processor also uses 32-bit address.

Assuming the following:

• Data in the memory and the page table root is at 0x10.

• The status bits in the PTE for both levels are 12 bits, and the page table entries is 32-bit long,

where the n most significant bits after the page offset are either used as the ID of the next page

(for the first level) or the physical page number (for the second level).

• To get the address of the first entry in the second level of the page table, our machine will take

the ID of the next page. Then, it appends this ID with m additional zero bits, where m is the

number of bits required for the page table size. For example, if your page table size is 64 bytes

and the ID is 5, m is 6. So, the next level page for this ID is at address 0x140.

Address Values (in hexadecimal) [Lowest byte – Highest byte]

0x00 00 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0

0x10 00 10 00 c0 14 15 16 17 18 19 1a 1b 08 00 00 00

0x20 19 15 12 0a 6b 3a 60 70 19 15 12 dd 6b d0 e0 f0

0x30 30 31 ee 33 34 35 36 37 00 10 0e aa 3c 3d 3e 3f

0x40 00 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0

0x50 80 00 00 0a 6b 3a 4b 12 91 ac ff fe 3c 3d 3e 4f

0x60 12 50 62 8a 5e 5f df ea 99 ac 74 6b 91 44 33 ef

0x70 70 71 72 73 74 75 76 77 78 79 7a 7b 7c 7d 7e 7f

0x80 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0x90 90 91 92 93 94 95 96 97 98 99 9a 9b 9c 9d 9e 9f

0xa0 80 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 02 e1 ba

0xb0 30 31 32 33 34 35 36 37 38 39 3a 3b 3c 3d 3e 3f

0xc0 80 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 01 e1 ba

0xd0 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0xe0 70 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 03 e1 ba

0xf0 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0x100000 00 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0

0x100010 10 11 12 13 14 15 16 17 18 19 1a 1b 1c 1d 1e 1f

0x100020 00 10 20 30 40 50 60 70 08 00 00 bc c0 d0 e0 f0

0x100030 30 31 32 33 34 35 36 37 38 39 3a 3b 3c 3d 3e 3f

0x100040 00 10 20 30 40 50 60 70 80 90 a0 b0 c0 d0 e0 f0

0x100050 19 15 12 0a 6b 3a 4b 12 91 ac ff fe 3c 3d 3e 4f

0x100060 12 50 62 8a 5e 5f df ea 99 ac 74 6b 91 44 33 ef

0x100070 70 71 72 73 74 75 76 77 78 79 7a 7b 7c 7d 7e 7f

0x8000000 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0x8000010 90 91 92 93 94 95 96 97 98 99 9a 9b 9c 9d 9e 9f

0x8000020 80 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 02 e1 ba

0x8000030 30 31 32 33 34 35 36 37 38 39 3a 3b 3c 3d 3e 3f

0x8000040 80 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 01 e1 ba

0x8000050 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

0x8000060 70 00 8a 00 8c 14 fe ff 72 14 0a 6b 10 03 e1 ba

0x8000070 91 40 8a 00 8c 14 fe ff 74 13 02 ba 6b 12 4b 31

9/15

Initials:

(a) What is the status bits for both page table levels for a virtual address 0x0000a000? Put in

Not enough information if the table does not provide enough information to get the physical

address.

(b) What is the physical address for a virtual address 0x04002fff? Put in Not enough information if the table does not provide enough information to get the physical address.

10/15

Initials:

(c) What is the physical address for a virtual address 0x0000beef if this system were to use a single

level 64KB page instead? Put in Not enough information if the table does not provide enough

information to get the physical address.

(d) Assuming that the memory access takes 100 cycles to access DRAM, the system has 4-level page

table (i.e., a page walk have to access the memory 4 times before it can access its data), an TLB

access takes 1 cycle, and a L1 cache access to the set takes 1 cycle and the tag comparison in the

L1 cache takes another 1 cycle. How long does it takes to load a data that has a TLB miss and a

L1 cache hit? Feels free to explain your answer.

11/15

Initials:

(e) Now let’s run this on a virtual machine (VM). Assuming that the memory access takes 100 cycles

to access DRAM, the system has 4-level page table (i.e., a page walk have to access the memory

4 times before it can access its data), assume we have a TLB-miss and a L1 cache hit. With

a virtual machine, each physical address on your VM’s Operating System is actually a virtual

address on the host machine. What is the number of cycles it would take to translate a virtual

address inside a VM on your host machine. Feels free to explain your answer.

12/15

Initials:

Log Table

N log2N

1 0

2 1

4 2

8 3

16 4

32 5

64 6

128 7

256 8

512 9

1024 (1k) 10

2048 (2k) 11

4096 (4k) 12

8192 (8k) 13

16384 (16k) 14

32768 (32k) 15

62236 (64k) 16

131072 (128k) 17

262144 (256k) 18

524288 (512k) 19

1048576 (1M) 20

2097152 (2M) 21

4194304 (4M) 22

8388608 (8M) 23

16777216 (16M) 24

13/15

Initials:

Stratchpad

14/15

Initials:

Stratchpad

15/15


相关文章

版权所有:编程辅导网 2021 All Rights Reserved 联系方式:QQ:99515681 微信:codinghelp 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。 站长地图

python代写
微信客服:codinghelp