Question 1
Question
Which of the following are sequence elements that algorithms can exploit to search for genes in a prokaryotic genome?
Answer
-
TFIIB recognition element
-
TATA box at -10
-
ATG start codon
-
STOP codon
-
downstream core promoter element at +30
-
initiator element around transcription start site
Question 2
Question
[blank_start]Sanger sequencing[blank_end] is an example of a first generation sequencing technology
Question 3
Question
Sanger sequencing has been automated by fluorescent labelling
Question 4
Question
Which of the following are advantages of sanger sequencing?
Answer
-
high accuracy
-
good for short sequences
-
high throughput
-
cheap
-
long read length
Question 5
Question
select the technologies that are second generation sequencing methods
Answer
-
Sanger
-
454 pyrosequencing
-
Ilumina
-
Ion torrent
-
nanopore
-
PacBio
Question 6
Question
Which of the following are limitations of 454 pyrosequencing?
Question 7
Question
A homopolymer error is a problem with base calling which there are multiple bases in a row as the signal does not increase with linearity
Question 8
Question
454 pyrosequencing and ion torrent use solid-phase bridge PCR
Question 9
Question
Ion torrent detects the incorporation of a base based on [blank_start]light[blank_end] whereas 454 pyrosequencing detects the incorporation of a base based on [blank_start]pH[blank_end]
Question 10
Question
What are the advantages of third generation sequencing technologies?
Question 11
Question
[blank_start]human[blank_end] [blank_start]genome[blank_end] [blank_start]project[blank_end] [blank_start]encode[blank_end] and [blank_start]1000[blank_end] [blank_start]genomes[blank_end] [blank_start]project[blank_end] are all examples of large scale genome sequencing projects
Answer
-
human
-
genome
-
project
-
encode
-
1000
-
genomes
-
project
Question 12
Question
[blank_start]shotgun[blank_end] [blank_start]sequencing[blank_end] is the most common sequencing approach for whole genomes
Question 13
Question
a [blank_start]contig[blank_end] is a set of overlapping DNA fragments that together represent a consensus region of DNA
Answer
-
contig
-
scaffold
-
read
-
coverage
Question 14
Question
the de bruijn graph method is a greedy method of assembly
Question 15
Question
[blank_start]k[blank_end] is the parameter used in the de bruijn graph assembly algorithm
Question 16
Question
sequence assembly can be...
Answer
-
ab initio
-
de novo
-
read mapping
Question 17
Question
Which of the following are de bruijn graph sequence assemblers?
Answer
-
Celera
-
GigAssembler
-
Velvet
-
SPAdes
Question 18
Question
Genomes always need to be finished
Question 19
Question
hybrid sequencing is an effective way of closing gaps in genome assembly as different technologies are biased in sequencing in different ways
Question 20
Question
in the equation N = (a x g) / L
N is the [blank_start]reads[blank_end] a is the [blank_start]coverage[blank_end] g is the genome length and L is the read length
Answer
-
reads
-
coverage
-
genome length
-
read length
-
coverage
-
reads
-
genome length
-
read length
Question 21
Question
Which of the following are examples of challenges faced during sequence assembly?
Question 22
Question
Why can't BLAST be used for short read mapping to assemble our reads using a reference genome?
Question 23
Question
when might short-read mapping be beneficial to use?
Answer
-
for RNA-sequencing experiments
-
for chipping experiments
-
to assemble a whole genome
-
to find open reading frames
Question 24
Question
[blank_start]Burrows[blank_end]-[blank_start]wheeler[blank_end] is the name of the algorithm which is used by mapping alignment packages such as Bowtie in order to convert the genome into a different format so matches can be easily found
Question 25
Question
We always need to assemble the genome in metagenomics experiments
Question 26
Question
raw sequencing data from sequencing experiments are saved in the sequence read archive
Question 27
Question
annotated sequence data from sequencing experiments are saved in GenBank and EMBL
Question 28
Question
Which of the following are legitimate methods of assessing a sequence assembly?
Question 29
Question
the N50 statistic is the length of the smallest contig in the set that contains the fewest contigs whose combined length represents 50% of the assembly
Question 30
Question
sequence annotation involves identifying...
Answer
-
read lengths
-
coverage
-
CDSs
-
promoters
-
ribosome binding sites
-
introns
-
exons
Question 31
Question
gene prediction involves finding UTRs and alternative splice isoforms
Question 32
Question
what are the 2 major approaches for gene finding?
Answer
-
ab initio
-
comparative proteomics
-
comparative genomics
-
de novo
Question 33
Question
ab initio gene finding approaches are more accurate for eukaryotes than prokaryotes
Question 34
Question
the gene finding tools Glimmer and GeneScan use [blank_start]hidden[blank_end] [blank_start]markov[blank_end] models
Question 35
Question
which of the following make eukaryotic gene finding more difficult than prokaryotic gene finding?
Answer
-
high number of repeats
-
introns
-
exons
-
highly compact
-
alternative splicing
Question 36
Question
What measures can be used to assess gene prediction?
Answer
-
sensitivity
-
specificity
-
accuracy
-
N50 statistic
Question 37
Question
There is a trade-off when it comes to the specificity and sensitivity of gene prediction tools
Question 38
Question
[blank_start]prokka[blank_end] is a genome annotation pipeline good for prokaryotes and small eukaryotes
Answer
-
prokka
-
genescan
-
glimmer
-
genie
Question 39
Question
order the types of mutation in terms of relative frequency:
1. [blank_start]point[blank_end]
2. [blank_start]deletion[blank_end]
3. [blank_start]duplication[blank_end]
4. [blank_start]inversion[blank_end]
5. [blank_start]insertion[blank_end]
6. [blank_start]translocation[blank_end]
Answer
-
point
-
deletion
-
inversion
-
insertion
-
translocation
-
duplication
-
deletion
-
point
-
insertion
-
inversion
-
duplication
-
translocation
-
duplication
-
point
-
deletion
-
inversion
-
insertion
-
translocation
-
inversion
-
insertion
-
point
-
deletion
-
translocation
-
duplication
-
insertion
-
inversion
-
translocation
-
duplication
-
point
-
deletion
-
translocation
-
inversion
-
insertion
-
duplication
-
point
-
deletion
Question 40
Question
silent, missense and nonsense are all types of [blank_start]point[blank_end] mutation
Question 41
Question
nonsense mutations can be conservative or non-conservative (similar AA or not)
Question 42
Question
introns, intergenic regions and pseudogenes are highly conserved and intolerant to change
Question 43
Question
Gene duplicates experience relaxed evolutionary constraints
Question 44
Question
when does gene duplication occur in bacteria?
Answer
-
in response to favourable conditions
-
in response to stress
-
in response to an internal stimulus
-
linearly over evolutionary time
Question 45
Question
[blank_start]duplication[blank_end] is an essential mutation for evolutionary change to occur in eukaryotes
Answer
-
duplication
-
point mutation
-
inversion
-
insertion
-
deletion
Question 46
Question
gene duplication can lead to [blank_start]nonfunctionalisation[blank_end] [blank_start]neofunctionalisation[blank_end] or [blank_start]subfunctionalisation[blank_end]
Answer
-
nonfunctionalisation
-
neofunctionalisation
-
subfunctionalisation
Question 47
Question
which of the following are sources of variation in prokaryotes?
Answer
-
lateral gene transfer
-
endosymbiosis
-
mutations
Question 48
Question
genes that share a common ancestor are said to be what?
Answer
-
homologs
-
paralogs
-
orthologs
-
xenologs
Question 49
Question
genes that have diverged as a result of speciation are said to be what?
Answer
-
homologs
-
orthologs
-
paralogs
-
xenologs
Question 50
Question
genes within the same genome created as a result of gene duplication are said to be what?
Answer
-
homologs
-
orthologs
-
paralogs
-
xenologs
Question 51
Question
homology is a measure of similarity
Question 52
Question
which of the following are simplistic measure of similarity when it comes to measuring sequence similarity?
Answer
-
hamming distance
-
sequence identity
-
levenshtein distance
-
PAM250
-
BLOSUM62
Question 53
Question
what kind of mutations are more common?
Answer
-
point mutations
-
translocation mutations
-
amino acid substitutions tend to be conservative
-
single nucleotide or amino acid deletions
-
successive deletions of bases or amino acids
-
transversion mutations
-
transition mutations
Question 54
Question
PAM and BLOSUM are example of [blank_start]substitution[blank_end] [blank_start]matrices[blank_end]
Question 55
Question
1 PAM is 1% similarity
Question 56
Question
PAM is better for [blank_start]global[blank_end] alignments whilst BLOSUM is better for [blank_start]local[blank_end] alignments
Question 57
Question
BLOSUM matrices are derived from the [blank_start]BLOCKS[blank_end] database
Question 58
Question
A higher PAM matrix will find weaker, longer alignments and a BLOSUM matrix with a higher number are better for similar sequences
Question 59
Question
A local alignment tries to align all the residues in a sequence
Question 60
Question
Dynamic programming is used for [blank_start]exact[blank_end] alignment methods
Question 61
Question
Needleman-Wunsch is a [blank_start]global[blank_end] alignment algorithm
Question 62
Question
Smith-waterman is a local alignment algorithm
Question 63
Question
The trajectory refers to the traceback arrows in a trajectory table
Question 64
Question
BLAST and FASTA are examples of [blank_start]heuristic[blank_end] alignment methods
Question 65
Question
Exact alignment methods are not guaranteed to find an optimal solution
Question 66
Question
K-tuple alignment methods are a family of approximate alignment methods, and BLAST is part of the family
Question 67
Question
a [blank_start]heuristic[blank_end] approach is taken with multiple sequence alignment because an exact approach has complexity O(L^N)
Question 68
Question
progressive, iterative and statistical are all approaches used for [blank_start]MSA[blank_end]
Question 69
Question
Which of the following are examples of progressive alignment algorithms?
Answer
-
T-coffee
-
Clustal omega
-
Clustal W
-
Muscle
Question 70
Question
Which of the following algorithms takes a hybrid approach for multiple sequence alignment?
Answer
-
T-coffee
-
Muscle
-
Clustal omega
-
Clustal W
Question 71
Question
A [blank_start]motif[blank_end] is part of a protein sequence associated with a particular biological function
Question 72
Question
A [blank_start]pattern[blank_end] is a qualitative description of a motif
A [blank_start]profile[blank_end] is a quantitative description of a motif
Answer
-
profile
-
pattern
-
pattern
-
profile
Question 73
Question
Which of the following databases describe motifs in terms of pattern and profile?
Answer
-
Pfam
-
PROSITE
-
InterPro
-
GeneBank
-
EMBL
-
BLOCKS
Question 74
Question
PSI-BLAST is more powerful than BLAST for picking up distant relationships between sequences
Question 75
Question
in phylogenetics, masking an alignment involved looking for regions or conservation and removing data that does not appear homologous
Question 76
Question
Which of the following are examples of distance-based tree building methods?
Answer
-
Maximum likelihood
-
Maximum parsimony
-
UPGMA
-
WPGMA
-
Bayesian inference
Question 77
Question
[blank_start]Bootstrap[blank_end] [blank_start]values[blank_end] can be added to branches in phylogenetic trees to summarise the degree of certainty for a given branching
Question 78
Question
[blank_start]WPGMA[blank_end] uses a flat average whilst UPGMA uses a weighted average that takes into account the number of taxa in a group
Question 79
Question
microarrays and RNA-sequencing are examples of what kind of experiments?
Answer
-
genomics
-
transcriptomics
-
proteomics
-
phylogenetics
Question 80
Question
[blank_start]normalisation[blank_end] aims to remove technical variation existing in microarray experiments
Question 81
Question
Which of the following are methods for quality control to remove outliers from microarray experiments?
Question 82
Question
following a microarray experiment, probeset QC removes noise and uninformative data points (i.e close to the background level of detection)
Question 83
Question
[blank_start]Benjamin[blank_end]-[blank_start]Hochberg[blank_end] [blank_start]FDR[blank_end] is the most common multiple testing correction used in microarray, RNA-seq and proteomics experiments
Question 84
Question
Benjamin-Hochberg FDR modifies [blank_start]P[blank_end]-values
Question 85
Question
Which of the following are not advantages for RNA-seq experiments over microarrays?
Answer
-
can search for unknown genes
-
can detect very scarce transcripts
-
lower technical variation
-
lower background noise
-
can sequence whole proteome
Question 86
Question
[blank_start]Poly[blank_end]-[blank_start]A[blank_end] [blank_start]selection[blank_end] gets rid of uninteresting, abundant RNA such as rRNA and haemoglobin RNA in blood samples in preparation for RNA-seq experiment
Question 87
Question
RNA-sequencing relies on reverse transcription
Question 88
Question
RNA-sequencing experiments are quantifiable - the sequencing reads in the library are proportional to the abundance of RNA
Question 89
Question
RPKM and FPKM are examples of [blank_start]normalisation[blank_end] tools used following an RNA-sequencing experiment
Question 90
Question
T-tests can be used to analyse microarray and RNA-seq data as both are continuous
Question 91
Question
microarrays can be used to discover novel transcripts
Question 92
Question
transcriptomics is used instead of proteomics as the transcript level always correlates to the protein abundance
Question 93
Question
the two main approaches in expression proteomics experiments are [blank_start]bottom[blank_end] up and [blank_start]top[blank_end] down experiments
Question 94
Question
Which of the following are experimental strategies used in proteomics?
Question 95
Question
Which of the following are disadvantages of 2DGE?
Answer
-
expensive
-
time-consuming
-
limited sensitivity
-
limited resolution
-
low reproducibility
Question 96
Question
[blank_start]DIGE[blank_end] is a variation of 2DGE whereby multiple samples are ran on one gel but are differentially labelled to eliminate running difference between gels
Question 97
Question
Technical variation is higher in microarrays and RNA-seq than 2DGE and liquid chromatography tandem MS
Question 98
Question
in 2DGE, proteins are separated based first on [blank_start]charge[blank_end] then on [blank_start]size[blank_end]
Question 99
Question
progenesis is a software used in [blank_start]2DGE[blank_end] experiments
Answer
-
2DGE
-
microarray
-
RNA-seq
-
HPLC
Question 100
Question
[blank_start]peptide[blank_end]-[blank_start]mass[blank_end] [blank_start]fingerprinting[blank_end] is used to identify which proteins are contained within spots on a gel from a 2DGE experiment
Answer
-
peptide
-
mass
-
fingerprinting
Question 101
Question
2DGE can be used to identify membrane proteins
Question 102
Question
2DGE cannot be used to show post-translational modifications
Question 103
Question
in a proteomics experiment, proteins are first isolated then digested using an enzyme such as [blank_start]trypsin[blank_end] as it cuts in a predictable ways
Question 104
Question
in a peptide-mass fingerprinting experiment, resulting peak-lists can be the same for very similar proteins
Question 105
Question
in tandem MS, when fragments are introduced they are broken up by argon gas, which preferentially breaks peptide bonds
Question 106
Question
Which of the following databases of hypothetical spectra is used to identify peptides from an MS experiment?
Answer
-
Genescan
-
InterPro
-
MASCOT
-
BLOCKS
-
PRINTS
-
iTRAQ
Question 107
Question
the intensity of peaks in MS can be used to quantify proteins
Question 108
Question
[blank_start]hydrophobicity[blank_end] is the main driving force of protein folding process
Question 109
Question
secondary structure refers to global interactions within a protein
Question 110
Question
[blank_start]alpha[blank_end] helix, [blank_start]beta[blank_end] sheet and [blank_start]coil[blank_end] are the 3 secondary structure states
Question 111
Question
protein [blank_start]domains[blank_end] are subunits within a protein with quasi-independent folding stability
Question 112
Question
the [blank_start]quaternary[blank_end] structure refers to proteins formed from several subunits or monomers
Question 113
Question
protein structures solved by NMR or crystallography are saved as [blank_start]PBD[blank_end] files
Question 114
Question
a [blank_start]Ramachandran[blank_end] [blank_start]plot[blank_end] visualises and clusters residues of an amino acid sequence based on psi and phi angles of the residue backbone
Question 115
Question
CATH, SCOP and FSSP/DDD are all examples of what?
Answer
-
tertiary structure classification methods
-
protein structure prediction assessment
-
databases containing sequence information
-
protein data banks
Question 116
Question
the levels of hierarchy in the CATH system to catalogue proteins are ordered from bottom to top as follows:
1. [blank_start]class[blank_end]
2. [blank_start]architecture[blank_end]
3. [blank_start]fold[blank_end]
4. [blank_start]superfamily[blank_end]
5. [blank_start]domain[blank_end]
Answer
-
class
-
domain
-
architecture
-
superfamily
-
fold
-
architecture
-
class
-
domain
-
fold
-
superfamily
-
fold
-
domain
-
class
-
architecture
-
superfamily
-
superfamily
-
architecture
-
domain
-
class
-
fold
-
domain
-
class
-
architecture
-
fold
-
superfamily
Question 117
Question
mainly alpha and mainly beta are examples of CATH folds
Question 118
Question
3D protein structure prediction is treated as a machine learning problem
Question 119
Question
machine learning in the context of protein structure prediction aims to minimise the energy function
Question 120
Question
Dynamic programming is an optimisation method
Question 121
Question
Which of the following are types of machine learning?
Answer
-
Hidden markov models
-
artificial neural networks
-
rule learning
-
position specific scoring
-
multiple testing correction
Question 122
Question
a [blank_start]PSSM[blank_end] is similar to a substitution matrix but specifically tailored to the sequence being aligned
Question 123
Question
[blank_start]PSIPRED[blank_end] is the most popular secondary structure prediction software
Question 124
Question
PSIPRED uses hidden markov models
Question 125
Question
[blank_start]contact[blank_end] [blank_start]number[blank_end] is the number of connections a residue in a protein has
Question 126
Question
[blank_start]solvent[blank_end] [blank_start]accessibility[blank_end] is the amount of surface exposed of each residue
Question 127
Question
which of the following are the broad approaches for 3D PSP?
Answer
-
De novo
-
Ab initio
-
template-based
-
machine learning
Question 128
Question
which 3 ways can a template by identified for 3D PSP?
Answer
-
homology modelling
-
profile-base methods
-
machine learning
-
threading
-
ab initio modelling
Question 129
Question
Fold recognition is used to identify a template with high structural similarity but low sequence identity with the target protein, when homology modelling is not an option
Question 130
Question
in 3D PSP, profile-based methods make profiles for residues in a sequence based on...
Answer
-
secondary structure
-
hydrophobicity
-
acidity
-
solvent accessibility
-
tertiary structure
Question 131
Question
in 3D PSP, fragment assembly combines [blank_start]homology[blank_end] [blank_start]modelling[blank_end] with [blank_start]ab[blank_end] [blank_start]initio[blank_end] methods
Answer
-
homology
-
modelling
-
ab
-
initio
Question 132
Question
in fragment assembly, [blank_start]decoys[blank_end] are candidate structure generated from all the possible combinations of fragments. They energy minimisation process is applied to them and they are clustered. The final models are selected from the centre of this cluster,
Question 133
Question
I-Tasser is a [blank_start]pipeline[blank_end] used for protein structure prediction
Question 134
Question
a network is a graph consisting of a series of [blank_start]nodes[blank_end] connect by [blank_start]edges[blank_end]
Question 135
Question
in a biological network, genes, proteins and cell types can be depicted as [blank_start]nodes[blank_end]
Question 136
Question
in a network, sink nodes have high in degree and sources have a high out degree
Question 137
Question
Which of the following is not a type of degree distribution in a network?
Answer
-
constant
-
scale-free
-
random
-
betweenness
Question 138
Question
In a network, the distance can be defined by Pajek or Watts
Question 139
Question
The longest shortest path between all pairs of nodes is...
Answer
-
Pajeks diameter
-
Pajeks distance
-
Watts diameter
-
Watts distance
Question 140
Question
the [blank_start]density[blank_end] is defined by the number of edges as a fraction of the number of possible edges
Question 141
Question
Which of the following are measures of centrality of a network?
Answer
-
degree
-
betweenness
-
closeness
-
distance
-
diameter
Question 142
Question
The betweenness centrality is a fraction of the shortest paths of the network for which a certain node is a member of
Question 143
Question
[blank_start]closeness[blank_end] [blank_start]centrality[blank_end] rewards nodes from which within a few edges, any node can be accessed
Question 144
Question
a random Boolean network is undirected
Question 145
Question
A random Boolean network can be used to study dynamic processes such as gene expression
Question 146
Question
an [blank_start]integrated[blank_end] network uses data from high-quality databases such as BioGrid as well as our own experimental data
Question 147
Question
Gene co-expression networks are built using [blank_start]guilt[blank_end] by [blank_start]association[blank_end]
Question 148
Question
In gene co-expression networks, similarity in expression across samples is usually computed by
Question 149
Question
A gene co-prediction network relies on a set of rules and an edge connects genes that co-predict with high frequency
Question 150
Question
PathExpand and TopoGSA are examples of network [blank_start]refinement[blank_end] packages
Question 151
Question
force, arc, circular and hive are all examples of network [blank_start]layout[blank_end]
Question 152
Question
An Arc network is more scalable than a Hive network
Question 153
Question
community detection is also known as [blank_start]clustering[blank_end]
Question 154
Question
[blank_start]clustering[blank_end] identifies sub-parts of a network with many connections and often reflect meaningful modules within the network organisation i.e cellular machinery or biological processes
Question 155
Question
[blank_start]ontologies[blank_end] represent relationships in a computationally amenable way by providing controlled vocabulary of terms
Question 156
Question
Which of the following are ontologies used by GO to describe the associations of gene products
Answer
-
biological processes
-
cellular components
-
3D structure
-
interaction partners
-
molecular functions
Question 157
Question
there are [blank_start]20[blank_end] amino acids used in biological systems
Question 158
Question
Which of the following is not commonly used to assess sequencing methods?
Answer
-
read length
-
throughput
-
cost per base
-
cost of the machine
-
sample size
Question 159
Question
Which of the following is not a database combined in the INSDC major collection point for sequencing data?
Answer
-
EMBL-EBI
-
NCBI
-
NIG
-
GenBank
Question 160
Question
Sanger, 454, ion torrent and ilumina sequencing all sequence by [blank_start]synthesis[blank_end]
Question 161
Question
Third generation sequencing involves a PCR step
Question 162
Question
the current gold-standard for shotgun sequencing assembly is a [blank_start]100[blank_end]-fold coverage
Question 163
Question
Which of the following is not a reason for making sequence assembly difficult?
Question 164
Question
coverage assumes that DNA is randomly fragmented and all DNA is able to be sequenced.
Question 165
Question
the coverage equation often underestimates the number of reads necessary
Question 166
Question
silent mutations usually occur in the [blank_start]3rd[blank_end] base of a [blank_start]codon[blank_end]
Question 167
Question
[blank_start]xenologous[blank_end] genes are those which are homologous and have been gained via horizontal gene transfer
Question 168
Question
in sequence alignments a [blank_start]:[blank_end] represents a perfect match, a [blank_start].[blank_end] represents a similar AA and a blank space represents a larger AA change
Question 169
Question
Heuristic alignment methods are better when computational power is not a problem or for a small number of sequences
Question 170
Question
in a BLAST search, the number of hits one can expect to see by chance when searching a database of a particular size is defined by the [blank_start]E[blank_end]-[blank_start]value[blank_end]
Question 171
Question
in a MSA, the alignment table can be summarised in a single line, a pseudo sequence called the [blank_start]consensus[blank_end]
Question 172
Question
A MSA algorithm which starts with a complete MSA, makes changes, computes score, keeps the MSA if the score is better then repeats is known as an [blank_start]iterative[blank_end] method
Question 173
Question
In a progression MSA, the original mapping can be changed
Question 174
Question
progressive multiple sequence alignment strategies use pairwise alignments
Question 175
Question
the muscle MSA alignment method uses the [blank_start]kimura[blank_end] [blank_start]distance[blank_end] matrix to make a global alignment during the improved progressive alignment
Question 176
Question
muscle uses WPGMA to make alignments
Question 177
Question
a [blank_start]profile[blank_end] can be incorporated into MSA and PSP algorithms to give better results
Question 178
Question
PSI-BLAST uses a position-specific scoring matrix
Question 179
Question
UPGMA can be fitted with an evolutionary model
Question 180
Question
microarrays assay gene expression by quantification of mRNA using hybridisation
Question 181
Question
[blank_start]quantile[blank_end] normalisation is a method of normalisation which ranks data, then takes the median value for each rank and replace the original values with the ranked averages
Question 182
Question
principle component analysis reduces multi-dimensional data down to [blank_start]2[blank_end] dimensions
Question 183
Question
when analysing microarray data, multiple testing correction controls for the error rate due to false positives being produced by multiple T-tests
Question 184
Question
which of the following does not encompass the same methods between microarrays and RNA-seq?
Answer
-
normalisation
-
quality control
-
statistical analysis
Question 185
Question
when analysing data from an RNA-seq experiment, DE-seq assumes a [blank_start]negative[blank_end] [blank_start]binomial[blank_end] distribution
Question 186
Question
organisms have 1 genome and 1 proteome
Question 187
Question
in 2DGE, there is a pH gradient running left to right. Where a protein is positioned depends on its [blank_start]isoelectric[blank_end] [blank_start]point[blank_end]
Question 188
Question
in 2DGE, it is valid to compare spots between gels if the spot is absent on one of the gels
Question 189
Question
Sensitivity is good in 2DGE as the dye is linearly incorporated
Question 190
Question
LC-MS can be multidimensional, separating proteins based on more than 2 physiochemical properties
Question 191
Question
iTRAQ is used to label samples in order to quantify them. Tags are made up of an [blank_start]ester[blank_end] group to tag to the protein, a [blank_start]reporter[blank_end] of varying sizes and a [blank_start]balancer[blank_end] to balance the mass
Question 192
Question
when using iTRAQ to quantify proteins during LC-MS, the balancer moiety is measured - when there is a more balancer moiety, there is a higher peak and therefore more peptide.
Question 193
Question
iTRAQ is a relative quantification method in LC-MS
Question 194
Question
data from LC-MS experiments have been locked in [blank_start]proprietary[blank_end] [blank_start]boxes[blank_end] up until recently, meaning that specialist software was required to view and analyse data depending on the technology used.
Question 195
Question
spot profiles for LC-MS data can be clustered [blank_start]statistically[blank_end] based on how similar their expression profiles are or [blank_start]functionally[blank_end] based on how similar their function are
Answer
-
statistically
-
functionally
Question 196
Question
the function of a protein depends on its [blank_start]structure[blank_end]
Question 197
Question
a beta hairpin is an example of a [blank_start]supersecondary[blank_end] structure
Answer
-
supersecondary
-
secondary
-
tertiary
-
CATH
-
primary
-
domain
Question 198
Question
which of the following is not an example of a structural property of an individual residue that can be predicted
Question 199
Question
predicting structural aspects of protein residues is generally treated as an optimisation problem
Question 200
Question
[blank_start]scaffolding[blank_end] is a technique used to link together non-contiguous series of genomic DNA