Pregunta 1
Pregunta
In given two C modules which rule will Unix linker use to resolve multiple symbol definition?
Respuesta
-
Multiple strong symbols are not allowed
-
None of these
-
Given multiple weak symbols, chose any of the weak symbols
-
Given a strong symbol and multiple weak symbols, choose the strong symbol.
Pregunta 2
Pregunta
Suppose we attempt to compile and link the following two C modules. Linker will generate an error?
Pregunta 3
Pregunta
Suppose we attempt to compile and link the following two C modules. Linker will generate an error?
Pregunta 4
Pregunta
A branch instruction
Respuesta
-
Sets the program counter to one of two possible values
-
Increases the program counter by a fixed amount
-
Sets the program counter to one of many possible values
-
Unconditionally sets the program counter to its operand
Pregunta 5
Pregunta
In binary, the number 1/3 is 0.01010101 . . .. In the following, assume that x is an int and y is a short. (short)((0x5555 * (int)y)>>16) and y/3 could differ even when neither computation overflows
Pregunta 6
Pregunta
A callee-saved register should be saved before it is written to in a procedure
Pregunta 7
Pregunta
In binary, the number 1/3 is 0.01010101 . . .. In the following, assume that x is an int and y is a short. For each x there exists some integer w (which may be different for different xs) such that abs(x/3 - q) < 10, where q = (((0x55555555>>w)*x)>>(32-w)).
Pregunta 8
Pregunta
In the FDEMW five-stage pipeline we analyzed in class, if we want to stall memory how many pipeline registers should be given the "stall" signal?
Pregunta 9
Pregunta
A CPU register file is a small storage device that
Respuesta
-
Consists of a collection of word size registers
-
Houses a critical variable for the duration of the execution of a program
-
Records the results of periodic CPU diagnostics
-
Is automatically loaded when CPU instruction refers to a word of normal memory
Pregunta 10
Pregunta
In the FDEMW five-stage pipeline we analyzed in class, if we want to stall decode how many pipeline registers should be given the “normal operation” signal?
Pregunta 11
Pregunta
Using a base address [Eb] %edx= 0x1000, and index register [Ei] %ecx=0x02, compute the effective address for:
Respuesta
-
0x1064
-
0x1032
-
0x1016
-
0x1010
Pregunta 12
Pregunta
A memory leak is caused by a
Respuesta
-
Failure to free allocated memory
-
Function that allocates a large amount of memory from the heap
-
None of these
-
Bug in the memory allocator that fails to free memory
Pregunta 13
Pregunta
In the two's complement number representation, a negative number with more leading ones is larger (i.e. less negative) than a negative number with fewer leading ones.
Pregunta 14
Pregunta
When shifting a two’s complement number to the left, an overflow can only occur if the number is negative, since positive numbers have a zero in the leftmost bit
Pregunta 15
Pregunta
A program cannot accidentally use another program's memory space and "crash" a computer that uses (see the img)
1- virtual memory address
2- program relocation
3- separate instruction and data caches
Pregunta 16
Pregunta
In x86 assembly, the eax register can be used to hold integers but not addresses.
Pregunta 17
Pregunta
A program written directly in machine language (binary) would run faster than the same program written in assembly code and then assembled.
Pregunta 18
Pregunta
Initially byte i of memory is i + 0x20 (mod 256), %eax contains the number 3, and %ebx contains the number 5. What is in %ebx after running x86 movl (%eax), %ebx?
Respuesta
-
none of the above
-
3
-
0x25
-
0x23
Pregunta 19
Pregunta
Adding local variables is usually used to reduce what source of inefficiency?
Pregunta 20
Pregunta
Asserting a word line and asserting (to "high" voltage) a bit line writes a 1 to a DRAM cell)
Pregunta 21
Pregunta
Inline substitution (also called inlining) is usually used to reduce what source of inefficiency?
Pregunta 22
Pregunta
Using a base address [Eb] %edx=0x1000, and index register [Ei] %ecx=0x03, compute the effective address for:
Respuesta
-
0x1032
-
0x1014
-
0x1064
-
0x1016
Pregunta 23
Pregunta
Jump in the following Y86-64 program. How many times does the addq instruction execute?
Pregunta 24
Pregunta
Jump tables, such as are used for switch statements, require that the new PC be determined by a memory read. Suppose we added an assembly instruction memjump accepting a single memory address as its parameter, like memjump (%eax). In the five-stage example pipeline (F D E M W), for how many cycles will we need to bubble and/or stall stage F after executing a memjump?
Pregunta 25
Pregunta
Let f32 be the number of distinct numbers that can be represented in 4-byte IEEE-style floating point and u32 be the number of distinct numbers that can be represented as unsigned 4-byte integers. Which of the following is true?
Respuesta
-
f32 < u32−1
-
f32 > u32 +1
-
f32 = u32
-
f32 = u32+1
Pregunta 26
Pregunta
Let s8 be the number of distinct numbers that can be represented as signed (two's complement) 1-byte integers and u8 be the number of distinct numbers that can be represented as unsigned 1-byte integers. Which of the following is true?
Respuesta
-
s8 = u8
-
s8 > u8+1
-
s8 = u8 - 1
-
s8 < u8-1
Pregunta 27
Pregunta
Assume a set-associative and a direct-mapped cache both have the same address size, same data capacity, and same number of index bits. This means
Pregunta 28
Pregunta
Assume x is the most positive signed integer. What is x + 1?
Respuesta
-
none of the above
-
+∞
-
-1
-
-x
Pregunta 29
Pregunta
Logical address is calculated by: segment.base + eff_addr
Respuesta
-
Selector: eff_addr
-
Segment base + lin_addr
-
Selector: offset
-
Segment base + eff_addr
Pregunta 30
Pregunta
Assuming the standard IEEE-style bias, what is the smallest number of bits a float point number could have and represent the number −29/4 exactly?
Pregunta 31
Pregunta
Buses are typically designed to transfer fixed-sized chunks of bytes known as
Respuesta
-
words
-
frames
-
records
-
pages
Pregunta 32
Pregunta
Capacity misses are typically discussed in connection with?
Respuesta
-
fully-associative cache
-
direct-mapped caches
-
set-associative caches
-
all of the above
Pregunta 33
Pregunta
Check the following (listed in alphabetical order) in order of importance when optimizing code that operates on very large arrays: B = Body of loop optimization (function inlining, efficient math), C = Cache locality, O = big-O of algorithm used
Pregunta 34
Pregunta
Compare the size of int and int*
Pregunta 35
Pregunta
Loop blocking is usually used to reduce what source of inefficiency?
Respuesta
-
poor cache locality
-
data dependencies
-
poor branch prediction
-
procedure call overhead
Pregunta 36
Pregunta
Loop unrolling is usually used to remove what source of inefficiency?
Respuesta
-
poor cache locality
-
data dependencies
-
poor branch prediction
-
procedure call overhead
Pregunta 37
Pregunta
Programs with good ______ ______, tend to access a referenced memory location multiple times in the near future.
Respuesta
-
Temporal locality
-
Spatial locality
-
Repetitive locality
-
Associative locality
Pregunta 38
Pregunta
Putting \#include "foo.h" in your C file is the equivalent of typing the contents of foo.h into your file
Pregunta 39
Pregunta
Consider the following code fragment
Respuesta
-
The value of &y is closer to the value of &x than to the value &a
-
The value of *y is closer to the value of *x than to the value *a
-
The value of *a and *b are closer to each other than the values of *x and *y
-
The value of &a and &b are closer to each other than the values of &x and &y
Pregunta 40
Pregunta
Compared to dynamic RAM (SRAM), dynamic RAM (DRAM) is
Respuesta
-
a) DRAM is cheaper than SRAM
-
b) DRAM is faster than SRAM
-
c) DRAM is more likely to be used for main memory than for on-chip caches
-
d) SRAM is more likely to be used for main memory than for on-chip cache
-
a) DRAM uses more transistors per bit than SRAM
-
b) DRAM uses a capacitor to store a value
-
c) Typically a computer has more DRAM than SRA
Pregunta 41
Pregunta
Compiling C into x86 assembly comprises translating each C statement into a single assembly instruction.
Pregunta 42
Pregunta
Conflict misses are typically discussed in connection with?
Respuesta
-
direct-mapped caches
-
set-associative caches
-
fully-associative caches
-
all of the above
Pregunta 43
Pregunta
Rank the following in order of significance when writing code that deals with large arrays:
- Algorithmic efficiency (i.e., big-O),
- Computational efficiency (e.g., loop unrolling, inlining, etc),
- L cache Locality.
Pregunta 44
Pregunta
CPE (cycles per element, also called cycles per execution or cycles per instruction in other sources). If we have code with 20 CPE and run it on a problem where my algorithm executes on 100 elements; I should expect the runtime to be
Respuesta
-
50-220 cycles
-
0-50 cycles
-
220-1100 cycles
-
1100-2200 cycles
Pregunta 45
Pregunta
Reassociation of operators is usually used to reduce what source of inefficiency?
Pregunta 46
Pregunta
Register %edx is one of the "callee-save" registers. This means that if procedure baz calls procedure xyxxy
Respuesta
-
baz may assume xyxxy did not change %edx
-
xyxxy may communicate with baz through register %edx
-
baz must assume xyxxy could have changed %edx
-
baz may communicate with xyxxy through register %edx
Pregunta 47
Pregunta
Register %esi is one of the "caller-save" registers. This means that if procedure baz calls procedure xyxxy
Respuesta
-
baz must assume xyxxy could have changed %esi
-
xyxxy may communicate with baz through register %esi
-
baz may assume xyxxy did not change %esi
-
baz may communicate with xyxxy through register %esi
Pregunta 48
Pregunta
Dirty bits are associated with (pick the best answer)
Respuesta
-
write-back caches
-
write-through caches
-
set-associative caches
-
direct-mapped caches
Pregunta 49
Pregunta
Edsger Dijkstra wrote "Go To Statement Considered Harmful" in 1968 in which he argued (translated into current language) that code that uses goto becomes needlessly hard to read and maintain compared to only using if and while. In assembly, we use the goto-like jump instructions extensively; the main reason for this is?
Respuesta
-
goto is only bad when mixed with higher-level constructs
-
Dijkstra was wrong
-
no one is reading assembly anyway
-
there is no other alternative in assembly
Pregunta 50
Pregunta
Every procedure in an x86 assembly program must be declared using .globl
Pregunta 51
Pregunta
Which of the following is a good reason (are good reasons) to equip the CPU with small amounts of fast memory?
I. To make the design of the compiler simpler
II. To make some CPU instructions smaller
III. To make some CPU instructions faster
Respuesta
-
III only
-
II and III only
-
I, II, and III
-
II only
Pregunta 52
Pregunta
Given 6-bit numbers where 010001 + 001010 = 010110, which of the following describes the number representation used?
Respuesta
-
floating-point with 2 exponent bits
-
floating-point with 3 exponent bits
-
integer
-
floating-point with 4 exponent bits
Pregunta 53
Pregunta
Given 6-bit numbers where 010101 + 000001 = 010110, which of the following describes the number representation used?
Respuesta
-
integer
-
floating-point with 3 exponent bits
-
floating-point with 1 exponent bits
-
floating-point with 2 exponent bits
Pregunta 54
Pregunta
Good software design includes writing procedures for code you might otherwise repeat in-line. Pulling code into procedures can help some branch predictors; how else can it improve your program’s performance and/or your compiler’s ability to optimize your code?
Respuesta
-
more opportunities for loop unrolling
-
less chance of compiler having to worry about aliasing and side effects
-
more opportunities for pipeline-level parallelism
-
better instruction cache hit rate
Pregunta 55
Pregunta
Return from function issues a bubble in pipeline.
Pregunta 56
Pregunta
Good software design includes writing procedures for code you might otherwise repeat in-line. Pulling code into procedures involves call/return overhead; how else can it hurt your program's performance and/or your compiler's ability to optimize your code?
Respuesta
-
more chance of compiler having to worry about aliasing and side effects
-
fewer opportunities for pipeline-level parallelism
-
worse instruction cache hit rate
-
fewer opportunities for loop unrolling
Pregunta 57
Pregunta
SRAM is "static" in the sense that if the power is turned off, SRAM will continue to store data (e.g. as in flash memory in MP3 players or USB thumb drives).
Pregunta 58
Pregunta
Suppose addresses A and B have different tags but the same index (for those caches that have tags and indexes, that is). Assume the cache is empty prior to the accesses listed in the questions. In the access pattern "read A, then read B, then read A, then read B" the second read of A will be a miss for?
Pregunta 59
Pregunta
How does knowing you have a pipelined processor change the code you write?
Respuesta
-
it suggests optimizations like multiple accumulators
-
it suggests optimizations like adding local variables
-
it suggests optimizations like loop unrolling
-
it suggests optimizations like loop reordering
Pregunta 60
Pregunta
Suppose addresses A and B have different tags but the same index (for those caches that have tags and indexes, that is). Assume the cache is empty prior to the accesses listed in the questions. In the access pattern "read A, then read B" the read of B will be a miss for
Pregunta 61
Pregunta
How is "-1" (decimal) represented in an 8-bit 2's complement binary format?
Pregunta 62
Pregunta
Suppose page table entries are 8 bytes long and virtual addresses are 32 bytes long. If each page table is to fit on a single page, what is the smallest pages can be if we have a 2-level page table hierarchy?
Pregunta 63
Pregunta
How is 46 (decimal) represented in an 8-bit 2's complement binary format?
Respuesta
-
00101100
-
00101110
-
00011110
-
01000110
Pregunta 64
Pregunta
How is -10 (decimal) represented in an 8-bit 2's complement binary format?
Respuesta
-
11110110
-
10001010
-
11110101
-
11111010
Pregunta 65
Pregunta
Which of the following move operations is the following instruction an example of movl (%edx), (%eax)?
Pregunta 66
Pregunta
In the access pattern “write A, then write B, then write A”, to minimize
writes to the higher-level cache we should pick which of the following?
Respuesta
-
a write-through fully-associative cache
-
a write-back fully-associative cache
-
a write-through direct-mapped cache
-
a write-back direct-mapped cache
-
two or more of the above are equivalently the best option
Pregunta 67
Pregunta
How many hex digits are needed to represent the decimal number 67?
Pregunta 68
Pregunta
How many ints fit inside x if x is declared as int *x = malloc(8); ? Word size is 4.
Pregunta 69
Pregunta
Which of the following move operations is the following instruction an example of movl (%edx), %eax?
Pregunta 70
Pregunta
How many rules do Unix linkers use for dealing with multiply defined symbol
Pregunta 71
Pregunta
Suppose an array of char (8-bit values) {0x12, 0x34, 0x56, 0x78} is stored at address 0x24 of a little-endian computer. What byte is stored at address 0x26?
Pregunta 72
Pregunta
If a computer has 8GB memory, a single 1MB cache, and 4KB of registers, the total amount of data it can store is?
Respuesta
-
more than 8GB but less than 8GB + 1MB
-
more than 8GB + 1MB but less than 8GB + 1MB + 1KB
-
more than 8GB + 1MB + 4KB
-
8GB + 1MB + 4KB
Pregunta 73
Pregunta
Suppose the 32-bit value 0x12345678 is stored at address 0x24 of a big-endian computer. What byte is stored at address 0x23?
Pregunta 74
Pregunta
When the following code fragment is executed on a computer with 32-bit integers and a fully-associative cache with 16-byte cache lines, how many bytes of the array A[] will be fetched into the cache from main memory
Respuesta
-
At most 96
-
Exactly 17
-
Exactly 32
-
At most 80
Pregunta 75
Pregunta
If a wire carries a logical value of 0, its voltage level will be 0 Volts.
Pregunta 76
Pregunta
If we fail to find data in a cache because we have never accessed the data before, we call that a
Respuesta
-
cold miss
-
capacity miss
-
conflict miss
-
forced miss
Pregunta 77
Pregunta
If we fail to find data in a cache because we've read too much data since we last accessed that same data, we call that a
Respuesta
-
capacity miss
-
cold miss
-
conflict miss
-
forced miss
Pregunta 78
Pregunta
Suppose the 32-bit value 0x12345678 is stored at address 0x24 of a big-endian computer. What byte is stored at address 0x25?
Pregunta 79
Pregunta
If we were running a non-pipelined processor (like SEQ) with a real memory system, which of the following optimization strategies would no longer provide a speed benefit?
Pregunta 80
Pregunta
The .data section of an x86 assembly program is used to specify the parameters for each procedure.
Pregunta 81
Pregunta
In C, a pointer variable is used for storing an address
Pregunta 82
Pregunta
If x is the most negative signed integer, what is -x?
Pregunta 83
Pregunta
In a DRAM, all cells have to be periodically refreshed, not just the ones that have been read.
Pregunta 84
Pregunta
In a program with good ____________, if a memory location is referenced once, then the program is likely to access nearby memory locations in the near future
Respuesta
-
Spatial locality
-
Temporal locality
-
Repetitive locality
-
Associative locality
Pregunta 85
Pregunta
The ASCII code for the character 'B' is 0x42. What is the ASCII hex code for the character 'D'?
Pregunta 86
Pregunta
If we were running a non-pipelined processor (like SEQ) with a real memory system, which of the following optimization strategies would not longer provide a speed benefit?
Pregunta 87
Pregunta
In C, local variables allocated inside functions are allocated:
Respuesta
-
On the stack
-
In static storage
-
In virtual memory
-
In the heap
Pregunta 88
Pregunta
In C, using default floating point settings, a special value of "infinity" is computed when
Respuesta
-
A floating-point computation results in an overflow
-
Program execution is halted
-
An erroneous value is computed and execution continues
-
An exception is raised
Pregunta 89
Pregunta
The ASCII code for the character 'B' is 0x42. What is the ASCII hex code for the character ‘E’?
Pregunta 90
Pregunta
In given two C modules which rule will Unix linker use to resolve multiple symbol definition?
Respuesta
-
Given a strong symbol and multiple weak symbols, choose the strong symbol.
-
Given multiple weak symbols, chose any of the weak symbols
-
Multiple strong symbols are not allowed.
-
None of these
Pregunta 91
Pregunta
The assembly instruction int x used to make system calls only accepts a 1-byte (256-value) argument x, but Linux uses it to support over 300 system calls. How does it do that?
Respuesta
-
SRAM would lose its data, DRAM would keep its data
-
SRAM would keep its data, DRAM would keep its data
-
SRAM would keep its data, DRAM would lose its data
-
SRAM would lose its data, DRAM would lose its data
Pregunta 92
Pregunta
The contents of how many program registers are modified by an assembly push operation?
Pregunta 93
Pregunta
The difference between the SAR (arithmetic shift right) and SHR (logical shift right) operations is that SAR puts a one into the leftmost bit after a shift and SHR puts a zero.
Pregunta 94
Pregunta
The EBP is a callee-saved register.
Pregunta 95
Pregunta
The EFLAGS register contains the zero flag and the sign flag?
Pregunta 96
Pregunta
What is the value of the following C expression? x=0x1234 and y=0x3 ((x >> y) << y)
Respuesta
-
0x1230
-
0x1234
-
0x0246
-
0x1200
Pregunta 97
Pregunta
The hardware is aware of the program stack
Pregunta 98
Pregunta
The instruction addq %rax, %rbx changes what?
Respuesta
-
The contents of the rbx register
-
The contents of memory at an address found in the rax register
-
The contents of memory at an address found in the rbx register
-
The contents of the rax register
Pregunta 99
Pregunta
What is the value of the following C expression? x=0x00 and y=0xA5 x && ~y
Respuesta
-
0xFF50
-
0x0000
-
0x0001
-
0x11A5
Pregunta 100
Pregunta
The instruction cmp eax,ebx (or cmp %ebx,%eax in AT&T syntax) computes the value of EAX - EBX.
Pregunta 101
Pregunta
_______ - program transformation that reduces the number of iterations for a loop by increasing the number of elements computed on each iteration
Respuesta
-
Loop unrolling
-
Memory aliasing
-
Side effect
-
Code motion
Pregunta 102
Pregunta
The valid bit is associated with
Respuesta
-
fully-associative caches
-
write-through caches
-
set-associative caches
-
direct-mapped caches
Pregunta 103
Pregunta
It is possible to explicitly specify the number of threads to be created in the team, using the _____ attribute:
Respuesta
-
pragma omp
-
parallel
-
private
-
Num_threads
Pregunta 104
Pregunta
CPU optimized for throughput
Pregunta 105
Pregunta
The working set of a program is formed by
Respuesta
-
The pages that the program is actively using
-
The libraries required by the program
-
The memory of the program that is currently in physical memory
-
All the files the program needs to execute
Pregunta 106
Pregunta
This problem occurs when a larger type is converted into a smaller type or with the signed-unsigned operations?
Pregunta 107
Pregunta
Using multiple accumulators is usually used to reduce what source of inefficiency?
Pregunta 108
Pregunta
169. In C a single statement that performs the equivalent of y=x*100; but without using the multiply operator (*);
Respuesta
-
y = (x>>6)+(x<<5)+(x<<2)
-
y = (x<<6)+(x<<3)+(x>>2)
-
y = (x<<5)+(x<<5)+(x<<7)
-
y = (x<<6)+(x<<5)+(x<<2)
Pregunta 109
Pregunta
We can encode the C command a ? b : c, where all three variables are of type int,
Respuesta
-
((-!!a) & b) | ((-!a) & c)
-
((!!a) & b) | ((!a) & c)
-
Any of the above
-
(a & b) | ((!a) & c)
Pregunta 110
Pregunta
We say that pipelining generally increases latency. That statement means (select all that are part of the definition of "increases latency")
Respuesta
-
each instruction takes more time to complete
-
each instruction takes less time to complete
-
entire programs take more time to complete
-
entire programs take less time to complete
Pregunta 111
Pregunta
Two computers A and B with a cache in the CPU chip differ only in that A has an L2 cache and B does not. Which of the following are possible?
Respuesta
-
1 & 2 only
-
2 & 3 only
-
1 only
-
2
Pregunta 112
Pregunta
At which of the following times is a stack frame created?
Respuesta
-
II and III only
-
II only
-
III only
-
I and II only
Pregunta 113
Pregunta
The bitwise and operation (& in C) can be used to flip the bits of a word
Pregunta 114
Pregunta
Linking can be performed at
Respuesta
-
run time
-
compile time
-
all of the above
-
load time
Pregunta 115
Pregunta
Parallel processing mechanisms to achieve parallelism in uniprocessor system are:
Pregunta 116
Pregunta
This is an example of?
Pregunta 117
Pregunta
This is an example of?
Respuesta
-
None
-
Loop unrolling
-
Loop fusion
-
Loop fission
Pregunta 118
Pregunta
This is an example of?
Pregunta 119
Pregunta
This is an example of?
Pregunta 120
Pregunta
This is an example of?
Pregunta 121
Pregunta
This is an example of?
Pregunta 122
Pregunta
The _______ directive causes threads encountering the barrier to wait until all the other threads in the same team have encountered the barrier.
Respuesta
-
single
-
barrier
-
nowait
-
private
Pregunta 123
Pregunta
How is 148 (decimal) represented in a 16-bit 2’s complement binary format?
Respuesta
-
0000 0000 1101 0111
-
0000 0000 0010 1001
-
1111 1111 0010 1000
-
1111 1111 0110 1100
Pregunta 124
Pregunta
In a Compare the runtime of y = x * 5 and y = (x<<2) + x, measured by the time it takes for the resulting value of y to be available somewhere in the pipeline.
Pregunta 125
Pregunta
What is the difference between 15 and $15?
Respuesta
-
15 is an address, $15 is a value
-
15 is a value, $15 is an address
-
15 is legal in assembly, $15 is not
-
$15 is legal in assembly, 15 is not
Pregunta 126
Pregunta
What is the result of the following assembly code instruction?
Respuesta
-
Fetch the data from register %ecx and store in register %eax
-
Fetch the data from the register %ecx and store in the memory address indicated by (%eax)
-
Fetch the data from the register %eax and store in register %ecx
-
Fetch the data from the memory address indicated by (%eax) and store in the register %ecx
Pregunta 127
Pregunta
What is the value of the following C expression? x=0xA0 and y=0x0F ~x||~y
Respuesta
-
0х0001
-
0х6000
-
0xFF50
-
0xFFFF
Pregunta 128
Pregunta
What value ends up in EAX afer the following code is executed?
Respuesta
-
48 (decimal) or 00110000 (binary) or 0x30 (hex)
-
50 (decimal) or 00110010 (binary) or 0x32 (hex)
-
46 (decimal) or 00101110 (binary) or 0x2E (hex)
-
52 (decimal) or 00110100 (binary) or 0x34 (hex)
Pregunta 129
Pregunta
When shifting a two's complement number to the left, an overflow can only occur if the number is negative, since positive numbers have a zero in the leftmost bit.
Pregunta 130
Pregunta
Which of the five phases determines how many bytes long an instruction is?
Respuesta
-
Fetch
-
Decode
-
Writeback
-
Execute
Pregunta 131
Pregunta
Which of the five phases determines what the next PC is?
Respuesta
-
Fetch
-
Memory
-
Execute
-
Decode
Pregunta 132
Pregunta
Which of the five phases determines the destination register for instructions that write to registers?
Respuesta
-
Decode
-
Writeback
-
Fetch
-
Execute
Pregunta 133
Pregunta
Which of the following computations may be performed by exactly one CPU instruction? (img)
a = 5;
a = b + c * 5;
for (i = 0; i < 10; i += a[i++]);
Respuesta
-
I, II, and III
-
II only
-
I and II only
-
I only
Pregunta 134
Pregunta
Which of the following is true?
Respuesta
-
A direct-mapped cache is a set-associative cache with set size 1
-
A direct-mapped cache is a fully-associative cache with only one set
-
A direct-mapped cache is a fully-associative cache with set size 1
-
A direct-mapped cache is a set-associative cache with only one set
Pregunta 135
Pregunta
Which of the following manages the transfer of data between the cache and main memory
Respuesta
-
Operating system
-
Compiler
-
Hardware
-
Registry
Pregunta 136
Pregunta
Which of the following manages the transfer of data between the CPU registers and the cache
Respuesta
-
Operating system
-
Compiler
-
Hardware
-
Registry
Pregunta 137
Pregunta
Which of the following registers is the lower order 16 bit register of register %ecx?
Pregunta 138
Pregunta
Which of the following statements about alignment within C struct's is true?
Respuesta
-
A constant offset associated with the member is added to the address
-
The struct consists of an array of pointers to the elements of the struct
-
The element name is looked up in a symbol table
-
A linear search is made from the base address of the struct
Pregunta 139
Pregunta
Which of the following statements about floating-point numbers in C is true?
Respuesta
-
II only
-
III only
-
I
-
I and II
Pregunta 140
Pregunta
Which one of the following IA32 registers is typically used for stack management?
Pregunta 141
Pregunta
Which one of the following optimizations does not provide any speed benefit for a non-pipelined processor, such as SEQ?
Pregunta 142
Pregunta
Which of the following levels of a typical memory hierarchy transfers data in chunks of smallest size?
Pregunta 143
Pregunta
In C, a pointer variable is used for storing an address
Pregunta 144
Pregunta
If we cut power for one millisecond, which of the following is true?
Respuesta
-
SRAM would keep its data, DRAM would lose its data
-
SRAM would lose its data, DRAM would keep its data
-
SRAM would lose its data, DRAM would lose its data
-
SRAM would keep its data, DRAM would keep its data
Pregunta 145
Pregunta
If we cut power for one minute, which of the following is true?
Respuesta
-
SRAM would lose its data, DRAM would lose its data
-
SRAM would lose its data, DRAM would keep its data
-
SRAM would keep its data, DRAM would keep its data
-
SRAM would keep its data, DRAM would lose its data
Pregunta 146
Pregunta
Select the most correct statement from the following as applied over the last two decades:
Respuesta
-
Processors get faster but memory does not
-
Processors and memory both get faster, but memory gets faster faster that processors do
-
Processors and memory both get faster, but processors get faster than memory does
-
Memory gets faster but processor does not
Pregunta 147
Pregunta
Which one of the following optimizations does not provide any speed benefit for a
non-pipelined processor, such as SEQ?
Respuesta
-
Using more efficient algorithms
-
Invocation inlining
-
Reassociation
-
Storing reused, computed values in variables instead of recomputing them
Pregunta 148
Pregunta
Consider the following code. Assume the compiler is performing no optimization. Which of the following strategies would improve the speed of this code the most in the case where it returns true?
Respuesta
-
Unroll the loop
-
Pull the (x % i)==0 into a function isDivisibleBy
-
Declare int sx = (int)sqrt(x) and change i < sqrt(x) to i < sx
-
Move the check for (x % 2) != 0 to before the loop
Pregunta 149
Pregunta
Consider the following code. This code would most likely benefit from which of the following optimization strategies
Pregunta 150
Pregunta
What does the following C code print?
Respuesta
-
P = 103, c = 101
-
P = 104, c = 101
-
P = 105, c = 103
-
P = 104, c = 102
Pregunta 151
Pregunta
The bitwise and operation (& in C) can be used to flip the bits of a word
Pregunta 152
Pregunta
The bitwise xor operation (^ in C) can be used to flip the bits of a word
Pregunta 153
Pregunta
Consider a direct-mapped cache with 256 sets and 16 byte blocks. In this cache the address 0x12345 maps to the same set as which of the following addresses?
Respuesta
-
0x02345
-
0x22244
-
0x12040
-
0x12354
Pregunta 154
Pregunta
In C, on a 34-bit x86 machine, the expression (1 << 31) results in a negative integer
Pregunta 155
Pregunta
Consider the following code. Which of the following changes would make the largest positive impact on its cache locality?
Respuesta
-
Put the for-i loop inside of the for-j loop instead of the other way around
-
Store the result of (i*101)%n outside the inner loop
-
Block the accesses by nesting three or four for loops instead of just two
-
Change int** to int*
Pregunta 156
Pregunta
Consider the following code. Assume the compiler is performing no optimizations and that isPrime is the exact code presented in another question on this exam. Which of the following strategies would improve the speed of this code the most?
Respuesta
-
Inline the isPrime call
-
Add a second ans accumulator
-
Swap the order of the if statements
-
Change the nested if statements to a single if with a &&
Pregunta 157
Pregunta
Consider the following two versions of a C function: (Assume len is larger than 10)
Respuesta
-
sum1
-
None of them
-
sum2
-
Both of them
Pregunta 158
Pregunta
Which of the following is necessary true regarding the following code fragment?
Respuesta
-
It exhibits locality of reference but only when a == b
-
It exhibits locality of reference because the variables are allocated near each other
-
It exhibit no locality of reference
-
It exhibits locality of reference no matter where the variables are allocated
Pregunta 159
Pregunta
A & B is the bitwise version of A && B; A | B is the bitwise version of A || B; which of the following is the bitwise version of A == B?
Respuesta
-
~(a = b)
-
a = b
-
~ (a ^ b)
-
a ^ b
Pregunta 160
Pregunta
The unary & operator returns (in C language)
Respuesta
-
The logical “and” of two integer operands
-
The logical “or” of two integer operands
-
The value of a data object
-
The address of a data object
Pregunta 161
Pregunta
In C a single statement that performs the equivalent of y = x *100, but without using the multiply operator (*)
Respuesta
-
y = (x<<6) + (x<<5) + (x<<2)
-
y = (x>>6) + (x<<5) + (x<<2)
-
y = (x<<6) + (x<<3) + (x>>2)
-
y = (x<<5) + (x<<5) + (x<<7)
Pregunta 162
Pregunta
Adding local variable is usually used to reduce what source inefficiency?
Pregunta 163
Pregunta
If, in C, an array is declared by “int a[10];”, then accessing a[11] during execution will generate an error message.
Pregunta 164
Pregunta
In C, if x is an integer variable, the expression “x << 3” computes x * 8 but does not change the value of x.
Pregunta 165
Pregunta
Which of the following code snippets is fastest? Assume n is very large (more than ten thousand).
Respuesta
-
for(i=0;i<n;i+=1) for(j=0;j<n;j+=1) a[i][j] = b[j][i];
-
for(j=0;j<n;j+=1) for(i=0;i<n;i+=1) a[i][j] = b[j][i];
-
-
two or more of the above are equivalently the fastest
Pregunta 166
Pregunta
Which of the following code snippets is fastest? Assume n is very large (more than ten thousand).
Respuesta
-
a) for(k=0; k<n; k+=16) for(l=0; l<n; l+=16)
for(j=0;j<16;j+=1) for(i=0;i<16;i+=1)
a[i+l][j+k] = b[i+l][j+k];
-
for(i=0;i<n;i+=1) for(j=0;j<n;j+=1) a[i][j] = b[i][j];
-
for(j=0;j<n;j+=1) for(i=0;i<n;i+=1) a[i][j] = b[i][j];
-
two or more of the above are equivalently the fastest
Pregunta 167
Pregunta
In binary, the number 1/3 is 0.01010101 . . .. In the following, assume that x is an int and y is a short. (short)((0x5555 * (int)y)>>16) and y/3 could differ because of overflow
Pregunta 168
Pregunta
How is 148 (decimal) represented in a 16-bit 2’s complement binary format?
Respuesta
-
0000 0000 1101 0111
-
0000 0000 0010 1001
-
1111 1111 0010 1000
-
1111 1111 0110 1100
Pregunta 169
Pregunta
Which of the following code snippets is fastest? Assume n is very large (more than ten thousand).
Pregunta 170
Pregunta
In a Compare the runtime of y = x * 5 and y = (x<<2) + x, measured by the time it takes for the resulting value of y to be available somewhere in the pipeline.
Pregunta 171
Pregunta
The program counter contains:
Respuesta
-
The address of the CPU instruction that is about to be executed
-
The number of programs currently running
-
The number of CPU instructions a program has executed so far
-
The number of processes currently running
Pregunta 172
Pregunta
A memory leak is caused by a:
Respuesta
-
Bug in the memory allocator that fails to free memory
-
Failure to free allocated memory
-
None of these
-
Function that allocates a large amount of memory from the heap
Pregunta 173
Pregunta
Registers and cache memory are the same thing
Pregunta 174
Pregunta
In a computer in which both addresses and integers are 32 bits wide, how many bytes of memory will the compiler allocate for the following code fragment?
Pregunta 175
Pregunta
In C, local variables allocated inside functions are allocated:
Respuesta
-
On the stack
-
In the heap
-
In virtual memory
-
In static storage
Pregunta 176
Pregunta
What is the value of the following C expression? x=0xA0 and y=0x0F ~x || ~y
Respuesta
-
0x0000
-
0xFF50
-
0xFFFF
-
0x0001
Pregunta 177
Pregunta
What is an example of?
Respuesta
-
Loop unrolling
-
Loop fission
-
none
-
Loop fusion
Pregunta 178
Pregunta
How is 215 (decimal) represented in a 16-bit 2's complement binary format?
Respuesta
-
1111 1111 0010 1001
-
0000 0000 1101 0111
-
0000 0000 0010 1001
-
1111 1111 0010 1000
Pregunta 179
Pregunta
This code indicates that Work2 and Work3 must be run in _______ :
Respuesta
-
none
-
parallel
-
sequence
-
both
Pregunta 180
Pregunta
____ involve identifying a computation that is performed multiple times (e.g., within a loop), but such that the result of the computation will not change.
Respuesta
-
Code motion
-
Side effect
-
Loop unrolling
-
Memory aliasing
Pregunta 181
Pregunta
In C, for the expression (x | THE_MASK), where THE_MASK has at least one bit that is not zero, the result will be zero if all the bits of x are zero.
Pregunta 182
Pregunta
What is the value of the following C expression? x = 0x128 and y = 0x2 ~x << y (in hex)
Respuesta
-
0x000004A0
-
0x000FFB5C
-
0xFFFFFB5C
-
0x00FF04A0
Pregunta 183
Pregunta
In C, using default floating point settings, a special value of "infinity" is computed when:
Respuesta
-
A floating-point computation results in an overflow
-
Program execution is halted
-
An exception is raised
-
An erroneous value is computed and execution continues
Pregunta 184
Pregunta
GPU optimized for latency.
Pregunta 185
Pregunta
If you don’t explicitly specify the number of threads (OpenMP) how many thread will be executed?
Pregunta 186
Pregunta
CPU optimized for latency
Pregunta 187
Pregunta
Program take as input a collection of relocatable object files and command-line arguments and generate as output a fully linked executable object file that can be loaded and run:
Respuesta
-
Static linker
-
Dynamic linker
-
Both
-
none
Pregunta 188
Pregunta
If we find the data we want in a cache, we call that a
Respuesta
-
Cache find
-
Cache hit
-
Cache miss
-
Cache success
Pregunta 189
Pregunta
The___construct forms a region that contains one or more structured blocks. Each block of a_____ directive executed once by one of the threads (any one) in the team?
Respuesta
-
parallel
-
master
-
section
-
schedule
Pregunta 190
Pregunta
Pick a value for A and for B which will cause the two procedures below to return different results.
Respuesta
-
a=7; b=1;
-
a=1; b=2;
-
a=1; b=5;
-
a=1; b=1;
Pregunta 191
Pregunta
Pick a value for A and for B which will cause the two procedures below to return the same results.
Pregunta 192
Pregunta
A is the 4-bit value 0011 and B is the 4-bit value 0110. What is A^B?
Pregunta 193
Pregunta
Consider the following code. This code would most likely benefit from which of the following optimization strategies.
Respuesta
-
improving locality
-
loop unrolling
-
using multiple
-
adding local variables
Pregunta 194
Pregunta
Conflict misses are typically discussed in connection with?
Respuesta
-
all of the above
-
fully-associative caches
-
set-associative caches
-
direct-mapped caches
Pregunta 195
Pregunta
What is -5 as a six bit two’s complement number? Answer in binary
Respuesta
-
101111
-
111011
-
000001
-
011011
Pregunta 196
Pregunta
By default, OpenMp _____ assigns loop iterations to threads. When the parallel for block is entered, it assigns each thread the set of loop iterations it is to execute?
Respuesta
-
static
-
runtime
-
dynamic
-
auto
Pregunta 197
Pregunta
In general, which of the following is slowest?
Pregunta 198
Pregunta
How is 28 (decimal) represented in an 8-bit 2’s complement binary format?
Respuesta
-
0001 1100
-
1110 0100
-
1100 1011
-
1110 0011
Pregunta 199
Pregunta
The ASCII code for the character ‘b’ is 98. How can the character ‘b’ be represented in binary using ASCII codes?
Respuesta
-
1011 1111
-
1010 1111
-
0110 0010
-
1000 0001
Pregunta 200
Pregunta
This code indicates that any of the tasks Work1, Work2 + Work3 and Work4 may run in _____________:
Respuesta
-
parallel
-
sequence
-
both
-
none