Bioinformatics

Pregunta 1

Pregunta

Which of the following are sequence elements that algorithms can exploit to search for genes in a prokaryotic genome?

Respuesta

TFIIB recognition element
TATA box at -10
ATG start codon
STOP codon
downstream core promoter element at +30
initiator element around transcription start site

Pregunta 2

Pregunta

[blank_start]Sanger sequencing[blank_end] is an example of a first generation sequencing technology

Respuesta

Sanger sequencing

Pregunta 3

Pregunta

Sanger sequencing has been automated by fluorescent labelling

Respuesta

True
False

Pregunta 4

Pregunta

Which of the following are advantages of sanger sequencing?

Respuesta

high accuracy
good for short sequences
high throughput
cheap
long read length

Pregunta 5

Pregunta

select the technologies that are second generation sequencing methods

Respuesta

Sanger
454 pyrosequencing
Ilumina
Ion torrent
nanopore
PacBio

Pregunta 6

Pregunta

Which of the following are limitations of 454 pyrosequencing?

Respuesta

slow sample preparation
lower throughput than sanger
shorter read lengths than sanger
homopolymer errors

Pregunta 7

Pregunta

A homopolymer error is a problem with base calling which there are multiple bases in a row as the signal does not increase with linearity

Respuesta

True
False

Pregunta 8

Pregunta

454 pyrosequencing and ion torrent use solid-phase bridge PCR

Respuesta

True
False

Pregunta 9

Pregunta

Ion torrent detects the incorporation of a base based on [blank_start]light[blank_end] whereas 454 pyrosequencing detects the incorporation of a base based on [blank_start]pH[blank_end]

Respuesta

pH
light

Pregunta 10

Pregunta

What are the advantages of third generation sequencing technologies?

Respuesta

High accuracy
high throughput
longer read length
Low cost
minimal sample preparation

Pregunta 11

Pregunta

[blank_start]human[blank_end] [blank_start]genome[blank_end] [blank_start]project[blank_end] [blank_start]encode[blank_end] and [blank_start]1000[blank_end] [blank_start]genomes[blank_end] [blank_start]project[blank_end] are all examples of large scale genome sequencing projects

Respuesta

human
genome
project
encode
1000
genomes
project

Pregunta 12

Pregunta

[blank_start]shotgun[blank_end] [blank_start]sequencing[blank_end] is the most common sequencing approach for whole genomes

Respuesta

shotgun
sequencing

Pregunta 13

Pregunta

a [blank_start]contig[blank_end] is a set of overlapping DNA fragments that together represent a consensus region of DNA

Respuesta

contig
scaffold
read
coverage

Pregunta 14

Pregunta

the de bruijn graph method is a greedy method of assembly

Respuesta

True
False

Pregunta 15

Pregunta

[blank_start]k[blank_end] is the parameter used in the de bruijn graph assembly algorithm

Respuesta

Pregunta 16

Pregunta

sequence assembly can be...

Respuesta

ab initio
de novo
read mapping

Pregunta 17

Pregunta

Which of the following are de bruijn graph sequence assemblers?

Respuesta

Celera
GigAssembler
Velvet
SPAdes

Pregunta 18

Pregunta

Genomes always need to be finished

Respuesta

True
False

Pregunta 19

Pregunta

hybrid sequencing is an effective way of closing gaps in genome assembly as different technologies are biased in sequencing in different ways

Respuesta

True
False

Pregunta 20

Pregunta

in the equation N = (a x g) / L N is the [blank_start]reads[blank_end] a is the [blank_start]coverage[blank_end] g is the genome length and L is the read length

Respuesta

reads
coverage
genome length
read length
coverage
reads
genome length
read length

Pregunta 21

Pregunta

Which of the following are examples of challenges faced during sequence assembly?

Respuesta

sequencing errors
shotgun fragmenting is not random
repeated regions
computational power
throughput
cost

Pregunta 22

Pregunta

Why can't BLAST be used for short read mapping to assemble our reads using a reference genome?

Respuesta

it costs too much and is highly inaccurate
it is not compatible
it takes too long and is not good at finding close matches

Pregunta 23

Pregunta

when might short-read mapping be beneficial to use?

Respuesta

for RNA-sequencing experiments
for chipping experiments
to assemble a whole genome
to find open reading frames

Pregunta 24

Pregunta

[blank_start]Burrows[blank_end]-[blank_start]wheeler[blank_end] is the name of the algorithm which is used by mapping alignment packages such as Bowtie in order to convert the genome into a different format so matches can be easily found

Respuesta

Burrows
wheeler

Pregunta 25

Pregunta

We always need to assemble the genome in metagenomics experiments

Respuesta

True
False

Pregunta 26

Pregunta

raw sequencing data from sequencing experiments are saved in the sequence read archive

Respuesta

True
False

Pregunta 27

Pregunta

annotated sequence data from sequencing experiments are saved in GenBank and EMBL

Respuesta

True
False

Pregunta 28

Pregunta

Which of the following are legitimate methods of assessing a sequence assembly?

Respuesta

the N50 statistic
principle component analysis
average gap size
average number of gaps per scaffold
coverage
hierarchical clusterin

Pregunta 29

Pregunta

the N50 statistic is the length of the smallest contig in the set that contains the fewest contigs whose combined length represents 50% of the assembly

Respuesta

True
False

Pregunta 30

Pregunta

sequence annotation involves identifying...

Respuesta

read lengths
coverage
CDSs
promoters
ribosome binding sites
introns
exons

Pregunta 31

Pregunta

gene prediction involves finding UTRs and alternative splice isoforms

Respuesta

True
False

Pregunta 32

Pregunta

what are the 2 major approaches for gene finding?

Respuesta

ab initio
comparative proteomics
comparative genomics
de novo

Pregunta 33

Pregunta

ab initio gene finding approaches are more accurate for eukaryotes than prokaryotes

Respuesta

True
False

Pregunta 34

Pregunta

the gene finding tools Glimmer and GeneScan use [blank_start]hidden[blank_end] [blank_start]markov[blank_end] models

Respuesta

hidden
markov

Pregunta 35

Pregunta

which of the following make eukaryotic gene finding more difficult than prokaryotic gene finding?

Respuesta

high number of repeats
introns
exons
highly compact
alternative splicing

Pregunta 36

Pregunta

What measures can be used to assess gene prediction?

Respuesta

sensitivity
specificity
accuracy
N50 statistic

Pregunta 37

Pregunta

There is a trade-off when it comes to the specificity and sensitivity of gene prediction tools

Respuesta

True
False

Pregunta 38

Pregunta

[blank_start]prokka[blank_end] is a genome annotation pipeline good for prokaryotes and small eukaryotes

Respuesta

prokka
genescan
glimmer
genie

Pregunta 39

Pregunta

order the types of mutation in terms of relative frequency: 1. [blank_start]point[blank_end] 2. [blank_start]deletion[blank_end] 3. [blank_start]duplication[blank_end] 4. [blank_start]inversion[blank_end] 5. [blank_start]insertion[blank_end] 6. [blank_start]translocation[blank_end]

Respuesta

point
deletion
inversion
insertion
translocation
duplication
deletion
point
insertion
inversion
duplication
translocation
duplication
point
deletion
inversion
insertion
translocation
inversion
insertion
point
deletion
translocation
duplication
insertion
inversion
translocation
duplication
point
deletion
translocation
inversion
insertion
duplication
point
deletion

Pregunta 40

Pregunta

silent, missense and nonsense are all types of [blank_start]point[blank_end] mutation

Respuesta

point

Pregunta 41

Pregunta

nonsense mutations can be conservative or non-conservative (similar AA or not)

Respuesta

True
False

Pregunta 42

Pregunta

introns, intergenic regions and pseudogenes are highly conserved and intolerant to change

Respuesta

True
False

Pregunta 43

Pregunta

Gene duplicates experience relaxed evolutionary constraints

Respuesta

True
False

Pregunta 44

Pregunta

when does gene duplication occur in bacteria?

Respuesta

in response to favourable conditions
in response to stress
in response to an internal stimulus
linearly over evolutionary time

Pregunta 45

Pregunta

[blank_start]duplication[blank_end] is an essential mutation for evolutionary change to occur in eukaryotes

Respuesta

duplication
point mutation
inversion
insertion
deletion

Pregunta 46

Pregunta

gene duplication can lead to [blank_start]nonfunctionalisation[blank_end] [blank_start]neofunctionalisation[blank_end] or [blank_start]subfunctionalisation[blank_end]

Respuesta

nonfunctionalisation
neofunctionalisation
subfunctionalisation

Pregunta 47

Pregunta

which of the following are sources of variation in prokaryotes?

Respuesta

lateral gene transfer
endosymbiosis
mutations

Pregunta 48

Pregunta

genes that share a common ancestor are said to be what?

Respuesta

homologs
paralogs
orthologs
xenologs

Pregunta 49

Pregunta

genes that have diverged as a result of speciation are said to be what?

Respuesta

homologs
orthologs
paralogs
xenologs

Pregunta 50

Pregunta

genes within the same genome created as a result of gene duplication are said to be what?

Respuesta

homologs
orthologs
paralogs
xenologs

Pregunta 51

Pregunta

homology is a measure of similarity

Respuesta

True
False

Pregunta 52

Pregunta

which of the following are simplistic measure of similarity when it comes to measuring sequence similarity?

Respuesta

hamming distance
sequence identity
levenshtein distance
PAM250
BLOSUM62

Pregunta 53

Pregunta

what kind of mutations are more common?

Respuesta

point mutations
translocation mutations
amino acid substitutions tend to be conservative
single nucleotide or amino acid deletions
successive deletions of bases or amino acids
transversion mutations
transition mutations

Pregunta 54

Pregunta

PAM and BLOSUM are example of [blank_start]substitution[blank_end] [blank_start]matrices[blank_end]

Respuesta

substitution
matrices

Pregunta 55

Pregunta

1 PAM is 1% similarity

Respuesta

True
False

Pregunta 56

Pregunta

PAM is better for [blank_start]global[blank_end] alignments whilst BLOSUM is better for [blank_start]local[blank_end] alignments

Respuesta

global
local

Pregunta 57

Pregunta

BLOSUM matrices are derived from the [blank_start]BLOCKS[blank_end] database

Respuesta

BLOCKS

Pregunta 58

Pregunta

A higher PAM matrix will find weaker, longer alignments and a BLOSUM matrix with a higher number are better for similar sequences

Respuesta

True
False

Pregunta 59

Pregunta

A local alignment tries to align all the residues in a sequence

Respuesta

True
False

Pregunta 60

Pregunta

Dynamic programming is used for [blank_start]exact[blank_end] alignment methods

Respuesta

exact

Pregunta 61

Pregunta

Needleman-Wunsch is a [blank_start]global[blank_end] alignment algorithm

Respuesta

global

Pregunta 62

Pregunta

Smith-waterman is a local alignment algorithm

Respuesta

True
False

Pregunta 63

Pregunta

The trajectory refers to the traceback arrows in a trajectory table

Respuesta

True
False

Pregunta 64

Pregunta

BLAST and FASTA are examples of [blank_start]heuristic[blank_end] alignment methods

Respuesta

heuristic

Pregunta 65

Pregunta

Exact alignment methods are not guaranteed to find an optimal solution

Respuesta

True
False

Pregunta 66

Pregunta

K-tuple alignment methods are a family of approximate alignment methods, and BLAST is part of the family

Respuesta

True
False

Pregunta 67

Pregunta

a [blank_start]heuristic[blank_end] approach is taken with multiple sequence alignment because an exact approach has complexity O(L^N)

Respuesta

heuristic

Pregunta 68

Pregunta

progressive, iterative and statistical are all approaches used for [blank_start]MSA[blank_end]

Respuesta

Pregunta 69

Pregunta

Which of the following are examples of progressive alignment algorithms?

Respuesta

T-coffee
Clustal omega
Clustal W
Muscle

Pregunta 70

Pregunta

Which of the following algorithms takes a hybrid approach for multiple sequence alignment?

Respuesta

T-coffee
Muscle
Clustal omega
Clustal W

Pregunta 71

Pregunta

A [blank_start]motif[blank_end] is part of a protein sequence associated with a particular biological function

Respuesta

motif

Pregunta 72

Pregunta

A [blank_start]pattern[blank_end] is a qualitative description of a motif A [blank_start]profile[blank_end] is a quantitative description of a motif

Respuesta

profile
pattern
pattern
profile

Pregunta 73

Pregunta

Which of the following databases describe motifs in terms of pattern and profile?

Respuesta

Pfam
PROSITE
InterPro
GeneBank
EMBL
BLOCKS

Pregunta 74

Pregunta

PSI-BLAST is more powerful than BLAST for picking up distant relationships between sequences

Respuesta

True
False

Pregunta 75

Pregunta

in phylogenetics, masking an alignment involved looking for regions or conservation and removing data that does not appear homologous

Respuesta

True
False

Pregunta 76

Pregunta

Which of the following are examples of distance-based tree building methods?

Respuesta

Maximum likelihood
Maximum parsimony
UPGMA
WPGMA
Bayesian inference

Pregunta 77

Pregunta

[blank_start]Bootstrap[blank_end] [blank_start]values[blank_end] can be added to branches in phylogenetic trees to summarise the degree of certainty for a given branching

Respuesta

Bootstrap
values

Pregunta 78

Pregunta

[blank_start]WPGMA[blank_end] uses a flat average whilst UPGMA uses a weighted average that takes into account the number of taxa in a group

Respuesta

WPGMA

Pregunta 79

Pregunta

microarrays and RNA-sequencing are examples of what kind of experiments?

Respuesta

genomics
transcriptomics
proteomics
phylogenetics

Pregunta 80

Pregunta

[blank_start]normalisation[blank_end] aims to remove technical variation existing in microarray experiments

Respuesta

normalisation

Pregunta 81

Pregunta

Which of the following are methods for quality control to remove outliers from microarray experiments?

Respuesta

N50 statistic
hierarchical clustering
normalisation
principle component analysis
probeset QC
multiple testing correction

Pregunta 82

Pregunta

following a microarray experiment, probeset QC removes noise and uninformative data points (i.e close to the background level of detection)

Respuesta

True
False

Pregunta 83

Pregunta

[blank_start]Benjamin[blank_end]-[blank_start]Hochberg[blank_end] [blank_start]FDR[blank_end] is the most common multiple testing correction used in microarray, RNA-seq and proteomics experiments

Respuesta

Benjamin
Hochberg
FDR

Pregunta 84

Pregunta

Benjamin-Hochberg FDR modifies [blank_start]P[blank_end]-values

Respuesta

Pregunta 85

Pregunta

Which of the following are not advantages for RNA-seq experiments over microarrays?

Respuesta

can search for unknown genes
can detect very scarce transcripts
lower technical variation
lower background noise
can sequence whole proteome

Pregunta 86

Pregunta

[blank_start]Poly[blank_end]-[blank_start]A[blank_end] [blank_start]selection[blank_end] gets rid of uninteresting, abundant RNA such as rRNA and haemoglobin RNA in blood samples in preparation for RNA-seq experiment

Respuesta

Poly
A
selection

Pregunta 87

Pregunta

RNA-sequencing relies on reverse transcription

Respuesta

True
False

Pregunta 88

Pregunta

RNA-sequencing experiments are quantifiable - the sequencing reads in the library are proportional to the abundance of RNA

Respuesta

True
False

Pregunta 89

Pregunta

RPKM and FPKM are examples of [blank_start]normalisation[blank_end] tools used following an RNA-sequencing experiment

Respuesta

normalisation

Pregunta 90

Pregunta

T-tests can be used to analyse microarray and RNA-seq data as both are continuous

Respuesta

True
False

Pregunta 91

Pregunta

microarrays can be used to discover novel transcripts

Respuesta

True
False

Pregunta 92

Pregunta

transcriptomics is used instead of proteomics as the transcript level always correlates to the protein abundance

Respuesta

True
False

Pregunta 93

Pregunta

the two main approaches in expression proteomics experiments are [blank_start]bottom[blank_end] up and [blank_start]top[blank_end] down experiments

Respuesta

bottom
top

Pregunta 94

Pregunta

Which of the following are experimental strategies used in proteomics?

Respuesta

liquid chromatography tandem MS
2DGE
Microarrays
RNA-sequencing

Pregunta 95

Pregunta

Which of the following are disadvantages of 2DGE?

Respuesta

expensive
time-consuming
limited sensitivity
limited resolution
low reproducibility

Pregunta 96

Pregunta

[blank_start]DIGE[blank_end] is a variation of 2DGE whereby multiple samples are ran on one gel but are differentially labelled to eliminate running difference between gels

Respuesta

DIGE

Pregunta 97

Pregunta

Technical variation is higher in microarrays and RNA-seq than 2DGE and liquid chromatography tandem MS

Respuesta

True
False

Pregunta 98

Pregunta

in 2DGE, proteins are separated based first on [blank_start]charge[blank_end] then on [blank_start]size[blank_end]

Respuesta

charge
size

Pregunta 99

Pregunta

progenesis is a software used in [blank_start]2DGE[blank_end] experiments

Respuesta

2DGE
microarray
RNA-seq
HPLC

Pregunta 100

Pregunta

[blank_start]peptide[blank_end]-[blank_start]mass[blank_end] [blank_start]fingerprinting[blank_end] is used to identify which proteins are contained within spots on a gel from a 2DGE experiment

Respuesta

peptide
mass
fingerprinting

Pregunta 101

Pregunta

2DGE can be used to identify membrane proteins

Respuesta

True
False

Pregunta 102

Pregunta

2DGE cannot be used to show post-translational modifications

Respuesta

True
False

Pregunta 103

Pregunta

in a proteomics experiment, proteins are first isolated then digested using an enzyme such as [blank_start]trypsin[blank_end] as it cuts in a predictable ways

Respuesta

trypsin

Pregunta 104

Pregunta

in a peptide-mass fingerprinting experiment, resulting peak-lists can be the same for very similar proteins

Respuesta

True
False

Pregunta 105

Pregunta

in tandem MS, when fragments are introduced they are broken up by argon gas, which preferentially breaks peptide bonds

Respuesta

True
False

Pregunta 106

Pregunta

Which of the following databases of hypothetical spectra is used to identify peptides from an MS experiment?

Respuesta

Genescan
InterPro
MASCOT
BLOCKS
PRINTS
iTRAQ

Pregunta 107

Pregunta

the intensity of peaks in MS can be used to quantify proteins

Respuesta

True
False

Pregunta 108

Pregunta

[blank_start]hydrophobicity[blank_end] is the main driving force of protein folding process

Respuesta

hydrophobicity

Pregunta 109

Pregunta

secondary structure refers to global interactions within a protein

Respuesta

True
False

Pregunta 110

Pregunta

[blank_start]alpha[blank_end] helix, [blank_start]beta[blank_end] sheet and [blank_start]coil[blank_end] are the 3 secondary structure states

Respuesta

coil
alpha
beta

Pregunta 111

Pregunta

protein [blank_start]domains[blank_end] are subunits within a protein with quasi-independent folding stability

Respuesta

domains

Pregunta 112

Pregunta

the [blank_start]quaternary[blank_end] structure refers to proteins formed from several subunits or monomers

Respuesta

quaternary

Pregunta 113

Pregunta

protein structures solved by NMR or crystallography are saved as [blank_start]PBD[blank_end] files

Respuesta

Pregunta 114

Pregunta

a [blank_start]Ramachandran[blank_end] [blank_start]plot[blank_end] visualises and clusters residues of an amino acid sequence based on psi and phi angles of the residue backbone

Respuesta

Ramachandran
plot

Pregunta 115

Pregunta

CATH, SCOP and FSSP/DDD are all examples of what?

Respuesta

tertiary structure classification methods
protein structure prediction assessment
databases containing sequence information
protein data banks

Pregunta 116

Pregunta

the levels of hierarchy in the CATH system to catalogue proteins are ordered from bottom to top as follows: 1. [blank_start]class[blank_end] 2. [blank_start]architecture[blank_end] 3. [blank_start]fold[blank_end] 4. [blank_start]superfamily[blank_end] 5. [blank_start]domain[blank_end]

Respuesta

class
domain
architecture
superfamily
fold
architecture
class
domain
fold
superfamily
fold
domain
class
architecture
superfamily
superfamily
architecture
domain
class
fold
domain
class
architecture
fold
superfamily

Pregunta 117

Pregunta

mainly alpha and mainly beta are examples of CATH folds

Respuesta

True
False

Pregunta 118

Pregunta

3D protein structure prediction is treated as a machine learning problem

Respuesta

True
False

Pregunta 119

Pregunta

machine learning in the context of protein structure prediction aims to minimise the energy function

Respuesta

True
False

Pregunta 120

Pregunta

Dynamic programming is an optimisation method

Respuesta

True
False

Pregunta 121

Pregunta

Which of the following are types of machine learning?

Respuesta

Hidden markov models
artificial neural networks
rule learning
position specific scoring
multiple testing correction

Pregunta 122

Pregunta

a [blank_start]PSSM[blank_end] is similar to a substitution matrix but specifically tailored to the sequence being aligned

Respuesta

PSSM

Pregunta 123

Pregunta

[blank_start]PSIPRED[blank_end] is the most popular secondary structure prediction software

Respuesta

PSIPRED

Pregunta 124

Pregunta

PSIPRED uses hidden markov models

Respuesta

True
False

Pregunta 125

Pregunta

[blank_start]contact[blank_end] [blank_start]number[blank_end] is the number of connections a residue in a protein has

Respuesta

contact
number

Pregunta 126

Pregunta

[blank_start]solvent[blank_end] [blank_start]accessibility[blank_end] is the amount of surface exposed of each residue

Respuesta

solvent
accessibility

Pregunta 127

Pregunta

which of the following are the broad approaches for 3D PSP?

Respuesta

De novo
Ab initio
template-based
machine learning

Pregunta 128

Pregunta

which 3 ways can a template by identified for 3D PSP?

Respuesta

homology modelling
profile-base methods
machine learning
threading
ab initio modelling

Pregunta 129

Pregunta

Fold recognition is used to identify a template with high structural similarity but low sequence identity with the target protein, when homology modelling is not an option

Respuesta

True
False

Pregunta 130

Pregunta

in 3D PSP, profile-based methods make profiles for residues in a sequence based on...

Respuesta

secondary structure
hydrophobicity
acidity
solvent accessibility
tertiary structure

Pregunta 131

Pregunta

in 3D PSP, fragment assembly combines [blank_start]homology[blank_end] [blank_start]modelling[blank_end] with [blank_start]ab[blank_end] [blank_start]initio[blank_end] methods

Respuesta

homology
modelling
ab
initio

Pregunta 132

Pregunta

in fragment assembly, [blank_start]decoys[blank_end] are candidate structure generated from all the possible combinations of fragments. They energy minimisation process is applied to them and they are clustered. The final models are selected from the centre of this cluster,

Respuesta

decoys

Pregunta 133

Pregunta

I-Tasser is a [blank_start]pipeline[blank_end] used for protein structure prediction

Respuesta

pipeline

Pregunta 134

Pregunta

a network is a graph consisting of a series of [blank_start]nodes[blank_end] connect by [blank_start]edges[blank_end]

Respuesta

nodes
edges

Pregunta 135

Pregunta

in a biological network, genes, proteins and cell types can be depicted as [blank_start]nodes[blank_end]

Respuesta

nodes

Pregunta 136

Pregunta

in a network, sink nodes have high in degree and sources have a high out degree

Respuesta

True
False

Pregunta 137

Pregunta

Which of the following is not a type of degree distribution in a network?

Respuesta

constant
scale-free
random
betweenness

Pregunta 138

Pregunta

In a network, the distance can be defined by Pajek or Watts

Respuesta

True
False

Pregunta 139

Pregunta

The longest shortest path between all pairs of nodes is...

Respuesta

Pajeks diameter
Pajeks distance
Watts diameter
Watts distance

Pregunta 140

Pregunta

the [blank_start]density[blank_end] is defined by the number of edges as a fraction of the number of possible edges

Respuesta

density

Pregunta 141

Pregunta

Which of the following are measures of centrality of a network?

Respuesta

degree
betweenness
closeness
distance
diameter

Pregunta 142

Pregunta

The betweenness centrality is a fraction of the shortest paths of the network for which a certain node is a member of

Respuesta

True
False

Pregunta 143

Pregunta

[blank_start]closeness[blank_end] [blank_start]centrality[blank_end] rewards nodes from which within a few edges, any node can be accessed

Respuesta

closeness
centrality

Pregunta 144

Pregunta

a random Boolean network is undirected

Respuesta

True
False

Pregunta 145

Pregunta

A random Boolean network can be used to study dynamic processes such as gene expression

Respuesta

True
False

Pregunta 146

Pregunta

an [blank_start]integrated[blank_end] network uses data from high-quality databases such as BioGrid as well as our own experimental data

Respuesta

integrated

Pregunta 147

Pregunta

Gene co-expression networks are built using [blank_start]guilt[blank_end] by [blank_start]association[blank_end]

Respuesta

guilt
association

Pregunta 148

Pregunta

In gene co-expression networks, similarity in expression across samples is usually computed by

Respuesta

pearson's correlation
principle component analysis
guilt-by-association
force

Pregunta 149

Pregunta

A gene co-prediction network relies on a set of rules and an edge connects genes that co-predict with high frequency

Respuesta

True
False

Pregunta 150

Pregunta

PathExpand and TopoGSA are examples of network [blank_start]refinement[blank_end] packages

Respuesta

refinement

Pregunta 151

Pregunta

force, arc, circular and hive are all examples of network [blank_start]layout[blank_end]

Respuesta

layout

Pregunta 152

Pregunta

An Arc network is more scalable than a Hive network

Respuesta

True
False

Pregunta 153

Pregunta

community detection is also known as [blank_start]clustering[blank_end]

Respuesta

clustering

Pregunta 154

Pregunta

[blank_start]clustering[blank_end] identifies sub-parts of a network with many connections and often reflect meaningful modules within the network organisation i.e cellular machinery or biological processes

Respuesta

clustering

Pregunta 155

Pregunta

[blank_start]ontologies[blank_end] represent relationships in a computationally amenable way by providing controlled vocabulary of terms

Respuesta

ontologies

Pregunta 156

Pregunta

Which of the following are ontologies used by GO to describe the associations of gene products

Respuesta

biological processes
cellular components
3D structure
interaction partners
molecular functions

Pregunta 157

Pregunta

there are [blank_start]20[blank_end] amino acids used in biological systems

Respuesta

Pregunta 158

Pregunta

Which of the following is not commonly used to assess sequencing methods?

Respuesta

read length
throughput
cost per base
cost of the machine
sample size

Pregunta 159

Pregunta

Which of the following is not a database combined in the INSDC major collection point for sequencing data?

Respuesta

EMBL-EBI
NCBI
NIG
GenBank

Pregunta 160

Pregunta

Sanger, 454, ion torrent and ilumina sequencing all sequence by [blank_start]synthesis[blank_end]

Respuesta

synthesis

Pregunta 161

Pregunta

Third generation sequencing involves a PCR step

Respuesta

True
False

Pregunta 162

Pregunta

the current gold-standard for shotgun sequencing assembly is a [blank_start]100[blank_end]-fold coverage

Respuesta

Pregunta 163

Pregunta

Which of the following is not a reason for making sequence assembly difficult?

Respuesta

biased sequence composition
homopolymers
repeats
long reads

Pregunta 164

Pregunta

coverage assumes that DNA is randomly fragmented and all DNA is able to be sequenced.

Respuesta

True
False

Pregunta 165

Pregunta

the coverage equation often underestimates the number of reads necessary

Respuesta

True
False

Pregunta 166

Pregunta

silent mutations usually occur in the [blank_start]3rd[blank_end] base of a [blank_start]codon[blank_end]

Respuesta

3rd
codon

Pregunta 167

Pregunta

[blank_start]xenologous[blank_end] genes are those which are homologous and have been gained via horizontal gene transfer

Respuesta

xenologous

Pregunta 168

Pregunta

in sequence alignments a [blank_start]:[blank_end] represents a perfect match, a [blank_start].[blank_end] represents a similar AA and a blank space represents a larger AA change

Respuesta

Pregunta 169

Pregunta

Heuristic alignment methods are better when computational power is not a problem or for a small number of sequences

Respuesta

True
False

Pregunta 170

Pregunta

in a BLAST search, the number of hits one can expect to see by chance when searching a database of a particular size is defined by the [blank_start]E[blank_end]-[blank_start]value[blank_end]

Respuesta

E
value

Pregunta 171

Pregunta

in a MSA, the alignment table can be summarised in a single line, a pseudo sequence called the [blank_start]consensus[blank_end]

Respuesta

consensus

Pregunta 172

Pregunta

A MSA algorithm which starts with a complete MSA, makes changes, computes score, keeps the MSA if the score is better then repeats is known as an [blank_start]iterative[blank_end] method

Respuesta

iterative

Pregunta 173

Pregunta

In a progression MSA, the original mapping can be changed

Respuesta

True
False

Pregunta 174

Pregunta

progressive multiple sequence alignment strategies use pairwise alignments

Respuesta

True
False

Pregunta 175

Pregunta

the muscle MSA alignment method uses the [blank_start]kimura[blank_end] [blank_start]distance[blank_end] matrix to make a global alignment during the improved progressive alignment

Respuesta

kimura
distance

Pregunta 176

Pregunta

muscle uses WPGMA to make alignments

Respuesta

True
False

Pregunta 177

Pregunta

a [blank_start]profile[blank_end] can be incorporated into MSA and PSP algorithms to give better results

Respuesta

profile

Pregunta 178

Pregunta

PSI-BLAST uses a position-specific scoring matrix

Respuesta

True
False

Pregunta 179

Pregunta

UPGMA can be fitted with an evolutionary model

Respuesta

True
False

Pregunta 180

Pregunta

microarrays assay gene expression by quantification of mRNA using hybridisation

Respuesta

True
False

Pregunta 181

Pregunta

[blank_start]quantile[blank_end] normalisation is a method of normalisation which ranks data, then takes the median value for each rank and replace the original values with the ranked averages

Respuesta

quantile

Pregunta 182

Pregunta

principle component analysis reduces multi-dimensional data down to [blank_start]2[blank_end] dimensions

Respuesta

Pregunta 183

Pregunta

when analysing microarray data, multiple testing correction controls for the error rate due to false positives being produced by multiple T-tests

Respuesta

True
False

Pregunta 184

Pregunta

which of the following does not encompass the same methods between microarrays and RNA-seq?

Respuesta

normalisation
quality control
statistical analysis

Pregunta 185

Pregunta

when analysing data from an RNA-seq experiment, DE-seq assumes a [blank_start]negative[blank_end] [blank_start]binomial[blank_end] distribution

Respuesta

negative
binomial

Pregunta 186

Pregunta

organisms have 1 genome and 1 proteome

Respuesta

True
False

Pregunta 187

Pregunta

in 2DGE, there is a pH gradient running left to right. Where a protein is positioned depends on its [blank_start]isoelectric[blank_end] [blank_start]point[blank_end]

Respuesta

isoelectric
point

Pregunta 188

Pregunta

in 2DGE, it is valid to compare spots between gels if the spot is absent on one of the gels

Respuesta

True
False

Pregunta 189

Pregunta

Sensitivity is good in 2DGE as the dye is linearly incorporated

Respuesta

True
False

Pregunta 190

Pregunta

LC-MS can be multidimensional, separating proteins based on more than 2 physiochemical properties

Respuesta

True
False

Pregunta 191

Pregunta

iTRAQ is used to label samples in order to quantify them. Tags are made up of an [blank_start]ester[blank_end] group to tag to the protein, a [blank_start]reporter[blank_end] of varying sizes and a [blank_start]balancer[blank_end] to balance the mass

Respuesta

ester
reporter
balancer

Pregunta 192

Pregunta

when using iTRAQ to quantify proteins during LC-MS, the balancer moiety is measured - when there is a more balancer moiety, there is a higher peak and therefore more peptide.

Respuesta

True
False

Pregunta 193

Pregunta

iTRAQ is a relative quantification method in LC-MS

Respuesta

True
False

Pregunta 194

Pregunta

data from LC-MS experiments have been locked in [blank_start]proprietary[blank_end] [blank_start]boxes[blank_end] up until recently, meaning that specialist software was required to view and analyse data depending on the technology used.

Respuesta

proprietary
boxes

Pregunta 195

Pregunta

spot profiles for LC-MS data can be clustered [blank_start]statistically[blank_end] based on how similar their expression profiles are or [blank_start]functionally[blank_end] based on how similar their function are

Respuesta

statistically
functionally

Pregunta 196

Pregunta

the function of a protein depends on its [blank_start]structure[blank_end]

Respuesta

structure

Pregunta 197

Pregunta

a beta hairpin is an example of a [blank_start]supersecondary[blank_end] structure

Respuesta

supersecondary
secondary
tertiary
CATH
primary
domain

Pregunta 198

Pregunta

which of the following is not an example of a structural property of an individual residue that can be predicted

Respuesta

supersecondary structure
secondary structure
solvent accessibility
contact number
whether it is exposed on the surface

Pregunta 199

Pregunta

predicting structural aspects of protein residues is generally treated as an optimisation problem

Respuesta

True
False

Pregunta 200

Pregunta

[blank_start]scaffolding[blank_end] is a technique used to link together non-contiguous series of genomic DNA

Respuesta

scaffolding

	Creado por lauren beck hace casi 5 años

Siguiente

Bioinformatics

Descripción

Resumen del Recurso

Pregunta 1

Pregunta 2

Pregunta 3

Pregunta 4

Pregunta 5

Pregunta 6

Pregunta 7

Pregunta 8

Pregunta 9

Pregunta 10

Pregunta 11

Pregunta 12

Pregunta 13

Pregunta 14

Pregunta 15

Pregunta 16

Pregunta 17

Pregunta 18

Pregunta 19

Pregunta 20

Pregunta 21

Pregunta 22

Pregunta 23

Pregunta 24

Pregunta 25

Pregunta 26

Pregunta 27

Pregunta 28

Pregunta 29

Pregunta 30

Pregunta 31

Pregunta 32

Pregunta 33

Pregunta 34

Pregunta 35

Pregunta 36

Pregunta 37

Pregunta 38

Pregunta 39

Pregunta 40

Pregunta 41

Pregunta 42

Pregunta 43

Pregunta 44

Pregunta 45

Pregunta 46

Pregunta 47

Pregunta 48

Pregunta 49

Pregunta 50

Pregunta 51

Pregunta 52

Pregunta 53

Pregunta 54

Pregunta 55

Pregunta 56

Pregunta 57

Pregunta 58

Pregunta 59

Pregunta 60

Pregunta 61

Pregunta 62

Pregunta 63

Pregunta 64

Pregunta 65

Pregunta 66

Pregunta 67

Pregunta 68

Pregunta 69

Pregunta 70

Pregunta 71

Pregunta 72

Pregunta 73

Pregunta 74

Pregunta 75

Pregunta 76