University of Ghana http://ugspace.ug.edu.gh THE DETECTION AND MOLECULAR CHARACTERISATION OF BACTERIAL SYMBIONTS OF ANOPHELES GAMBIAE S.L. by CHARLES ADDOQUAYE BROWN This thesis is submitted to the University of Ghana in partial fulfilment of the requirements for the award of Ph.D Biochemistry degree. SEPTEMBER, 2003 . University of Ghana http://ugspace.ug.edu.gh DECLARATION This thetis is the ~suh of research work undertaken by CharleJ A. Brown in the Department of Bk>chemiSlry. Uni ... mity of Gtwla. under the supervision of Prof. M.D. Wilson and Prof. F. N. Gyang. (C harlesA.Brown) ~ I (Prof. M.D. Wilson) University of Ghana http://ugspace.ug.edu.gh DEDICATION This thesIS IS dcdic:ared to Joaa, for aU thai she went throusJ1 to enable me complete: II, C110ch and alJ theNii "Atians". University of Ghana http://ugspace.ug.edu.gh ACKNOWLEDGEMENTS 1 am glad 10 IYve this opponuaily of exprcsiin& my deepest gratilude to my supervisors. Prof. M.D. Wilson and Prof. F.N. Gyana. who provided paidance and encouragement tbrou&bout the: period of thi:; work. neir invaluable commcolS greatly helped to improve lb. quo1ily or .... work Spec.a! 1lwIk. go to Dr D.A. Boakye (Noguc:hi), Prof. T. Unnuc::b and Dr Tariq Higazl (both of the Unh'CfSllY of Alabama at Binningham, USA) for tbeir divene contributions. My sincere thanks goes to Nkem Okoyc, Mrs. Anitli Ghansah. Mrs Bridgette-Marian Ogoe, Nancy Duah. Janet Midega. Mrs Benedicta Kuivi. Fred Aboagye--Antwi, S. Dadzie, Enns Glah, Helena Baidoo. Naiki Puplampu, Awo Osafo-Addo, F.e. Mills-Robertson. Hury A>mah. Shelly, Tol., L)'Ii. . Abena, BB IIId MIs S .... Ad. .A mankwab whose: support kept me Fin&- I am equally grateful to Mr Nana M. A. Appawu. Dr K. Boaompan IOd Dr. K. Koram for their encouragement. A word of appreciation goa to all manbcrs of the PansilOIogy Unit (NMIMR) and lecturer.; of the BiochemiJtty Department for their oontinucd interest and encouragement. Finally, but by no means the least, I thank my ... cry good friendl Hasl80 H855m. David Mmab,. Akosua Boafo. 5.B. orea, Hetty and Esi Colecraft fOf their CDCOUI8.JCIl1COI, moral support and prayersdunng timcsoffrustration. God Bias You All. flus work wa, supported in .,.n with a grant from the UNDPlWorld BanklWHO Special Pmgnmmc for Rcscan:h and Training in Tropical Diseases (TOR) University of Ghana http://ugspace.ug.edu.gh ABSTRACT The aim of the prcseot study was to identify and characterise bacterial symbionts in An. g"m/JiM s.l. mosquitoes. and ultimately select one that occurs in all its life stases. Mosquito larvae and pupae samples were collected &om six locations in the GlUIer Accra region of GbaAa and some reared 10 adults in the laboratory. Wild Mlult All, ,iUllbUle mosquitoes from Navronso and Dodowa (Ghana) and Jaribuni (Kilifi District, Kenya) and laboratory colonies of adult An. gambiae mosquitoes from KihmanJaro (Tanzania), Suakoto (Ubcria) and Kisumu (Kenya) and An. lU'abknsu (Wqcningen SU'ain) were alw studied. Each specimen was surface sterilized before DNA extraction was carried out The peR method of Scott el al. (1993) was used on each 1p«lmcn to determine the species oflhe An. gambiae complex. PCRs usin, univawl cubactcria 16S and 2JS rONA oligonucleolide pnmers were first used to detect the presence of the microorganism's DNA sequences in lhe mosquito. then peRs using WOLB 16SF liWOLB16SR I aJXJ /tsZI !ft~Z1 primers were carried out on posItive reactions to detamine wbecber or not they were Wolbachia sp. /" sllico (computational mole-cular biology) analyses of DNA sequences of Esderlclria coli and PallloQ(" agglo",uatu were perfonned WlinS DIGEST software and the results compared with those obtained by restriction analysis of the amplified 16S and 23S rONAl with six enzymes. The peR products were cloned and sequenced using an ABI )77 automated sequencer. Two conJCnsus DNA scqucm:n were generated from Che sequence data by pajrwise sequence similarity and the phylogram methods. Pbyloametic relatIOnships of the consensus sequcnca to homololOus acquenc:cs in (he databases were inferred using nelghbour·joining (NJ). muimum likelihood (Ml), and maximum pvsimony (MP) methods. Of Ihe 4)2 molquilO spa:imcns studied, 37) (94.4°·.) WC1'C idC1'lhfied IS An. gambiae s.s., 20 (5.''''0) U All. lU"dbiellJis. and one (0.2SV.) each of All. ",enu and All. melas_ 295 An. gambiae (74.7".) adults. 29 (7.3%) University of Ghana http://ugspace.ug.edu.gh pupae and 71 (18%) larvae were studied. DNA fragmenls of Ihe predicted sizes were successfully amplified usiPl the 16S rONA md 23S rONA primers in 85.1% (3361395) and 795% (314/395) of the specimens. respeaively. Bactcnal DNA sequences were amplified from aU the sibling species. which consisted of 54 (76.1 %) larvae. 26 (89.70/.) pup« and 256 (86.r...-) adults from both wild and laboratory reared specimens, lIT~vcofthe geographical origin. Out of281 specimens, peR positives for 16S and 23S rONA primers. 94 (33.S%) were positive (or the WOLl6S rONA primers but none for the ft;sZ primers. Failure of PCR with /t3Z primers indicated _ possible absence of Wolbachia infection. The restriction studies revealed that none of the amplified peR products could be either E. col; or P. agglom~raflS. Seven sequences. namely AgAl (AY24716S), AgA2 (AYJ2S810), AsU (AY247160), AgL2 (AY247161), AgLJ (AY247162), AgL4 (AY24716J) and AgLS (AY247164) were obtained, four and two with high homology to each other and one, a stand alone. CONSEN4 and CONSEN2 were hishly homoloSOUS to 16S rONA sequences of Par/JCOCCUJ and Rhodobact~r species. The sequence AgLS was highly homologous to the 16S rONA sequences of the Cylophaga~Flexibacter~Bacteroidcs group (CFB) group. The phylogenetic trees constructed. indicated that CONSEN4 and CONSEN2 probably belong to the a subdivision of the Proccobeclcria whilst AgLS is _ member of the CFB group. Since PIllDCOCC14 and RJtodoboct~r 11'. are Gram-negalive aquuic bacteria and members of the CFB group are _]so aqualic bacteria. it illikcly Ihal they occur in the breeding habitats of DX*Iuitocs. Further research is nccdcd; (I) 10 ddenninc if lhcy are in the breeding habitats and are inaatcd by larvae. (2) 10 cultuR and isolale them. and (3) 10 decermme which part of the adult IlU*plitothcy reside in. University of Ghana http://ugspace.ug.edu.gh TABLE OF CONTENTS DECiARAT ION DEDICATION AKNOWJ.EDGEMENTS ABSTRACT TABLE OF CONTENTS vii LIST OF TABLES x; LIST OF FIGURES ]l;ii ABBREVIA nONS C'IIAPTER ONE 1.0INTRotn'(TION 1.1 Objectives JO CHAPTER TWO LITERATURE REVIEW II 21 MosquitoCl II 2.1.1 GeneraJintroduction 2.1.2 Classificationandidcntification II 2.1.3 Spccicscomplael 16 2.1.4 MethodJ for diltinpishing between cryptic species 17 2.1.-4.1 'Traditional tools' 18 i) Morpbologica1identification 18 Ii) Rcproductiveincompalibility 19 iii) Cytogcnelicsludies 20 iv)CuticuJarhydrocarbonanalysis 21 v) Allozymcfisozytne anaI)"is 22 vi) Hybridizationauays 23 2.1.4.2 Classicalgeoeticmarkers 24 i) Restriction &agment length polymorphisms (RfLPs) 24 ii) MicochondrialDNAanalysil 25 iii) The polymcrue chaia reaction IIpplied to nuclear DNA 26 (nONA) 2.1.4.3 Hishly polymorphiC marker.; 28 i) Miaosatdlite DNAs 28 ;;) RlUIdomly ..p hficd polymorphic DNA (RAPOs) 30 2.I.S Distribulionandccology JI 2.1.6 Mc:dicalimportance 32 2.2 The Genus Anopheles 34 2.2.1 The life cycle of Anopheles mosquitoes 14 2.2.2 ARopIwlrs species of medical importance 40 2.J 1bc """"MIn gaM_ Gila Complex. University of Ghana http://ugspace.ug.edu.gh 2.3.1 DistnbutlOn and importance or members of the Anopltelu 41 gaINbUJe sibling species complex 2.3.1.1 Anopheles lnwnrtbae 47 2.3.1.2 AnopltelesqlladriQlfnlltahlS 49 2.3.1.3 Anopltele3lMlas 50 2.3.1.4 AftOplwks m~1i45 51 2.3.1.5 ArtOpIre/~s orab"~1lSis 52 2.3.1.6 Al'tOf'Iwles gombioe sensu stricto 2.4 Symbiotic Associations 58 2.4.1 Type$ ofsymbiOlic associations 59 2.4.1.1 Phoresis 59 2.4.1.2 Commensalism 59 2.4.1.3 MutuaJism 60 2.4.1.4 Parasitism 60 2.4.2 Modes of bow symbionts get together 60 2.4.3 Batterialsymbionts 62 2.5 Molecular Genetic Analysis 64 2.S.1 Extraction and purifiCilion of genomic DNA 64 2.5.2 Restrictionendonuclcasc:s 66 2.5.3 Restriction n.1l"''''' Imath pol)1Jlorphism (RFLP) 68 2.5.4 AgaroscgeJclcclrophorcsis 70 2.S.S Determination of ONA frlgment sizes 73 2.5.6 Pol)1Jlcras< Choin R_on (PCR) 74 2.6 MolecularPhylopneticAnaiysis 80 2.6.1 Molcculorphylogmymclltods 81 2.6.1.IPhylogenctict:rec:!. 81 2.6.1.2 Searchalgoritluns 83 2.6.1.3 Heuristics 85 2.6.2 Tree eonsttuctioa methods 86 2.6.2.1 Discretccharactcrmctbods 87 i) Muimum ~imony (MP) and weishled par!>imony (WP) 87 ii) Maximwn likclihood (ML) 89 2.6.2.2 Distancematrixmethods 90 i) Transformation of sequence data to distances 90 ii) Unweightcd pUr group method with arithmetic mean 92 (UPGMA) 2.6.2.3 Trwnsfonncddistancemethods 92 i) Neighbourrdalionmethods 93 ii) Fi,ch ond M_liuh (FM) method 94 iii) Distance Wagner method (OW) 94 iv) Minimwn evolution (ME) 95 University of Ghana http://ugspace.ug.edu.gh 2_6_3 AccountIng for superimposed events (nucleotide substitution in 9S a DNA sequence) 1.6.4 AJ,scssing the reliabilityofa tree (confidence in phylogenetic 97 estimate<;) CIIAPTER THREE 99 MATERIALS AND METHODS 99 3.1 Chemicals. Reaacnts, Eql. .. pmenl and Software 99 3.2 Biological Specimeol aod Sample Collection 99 3.3 I..aboraWry Rearing of Mosquitoes 103 3." IdcntificationofAnophelesspecies lOS 3.4.1. Morpholoaica1 identification lOS 3.4.2. Molecular identification of sibling species oftbe AnopIwlu lOS gambiae complex 3.4.2.1 Genomic ONAexlr8Ction 107 3.4.2.2 peR amplification 107 3.4.2.3 AnaiysisofPCRproducts 108 3.S Dctectioo of Bacterial Symbionts Using PCR 109 3_5.1 EWmationoftheooncentrationofPCRproduCIS 112 3.6 Amplified Ribosomal DNA Restriction Analysis (ARDRA) 113 3.6.lln$ilk:o(computItionalmolecularbiology)rcsbictionanalysis 113 3.6.2 Restriction endonuclease digestion and analysis 114 3.7 CIonin& of Amplified Bactaial Sequences 115 3.7.1 dAtoilinaofPCRprodueu 116 3.7.2 Ligation 116 3.7.3 Transfonnationexperiments 117 3.7.4 Plasmid DNA ilOlltion 118 3.8 IdcntificatiOllof Amplified Bactaial Symbiont Sequences and 120 Phy~cAnalysis 3.8.1 Sequencing ofamplificd bacterial DNA sequences 120 3.8.1.1 Scq""""cditing 120 3.8.1.2 VecSerccnandehimcradetcction 122 3.8.1.3 ScquencesimiiariryandDNAdarabasesc811:hcs 123 123 124 128 128 129 SequcnccAa:cssionNumbcrs 130 III Identification III University of Ghana http://ugspace.ug.edu.gh 4.2 PCR Dc:tectioa of Bactcri. DNA5Cqueoces inAn. galllbiM s.J 13S 4.3 Amplified Ribosomal DNA Restriction AnaJysis (ARDRA) 142 4.4-.1Scqucoc:ghbour-joining) oflhe CONSEN4 and selected 174 membcn of the Q_Proleobacteria infared from 16S rONA sequcnceoompansons. Fig",2R PhytoglWll (maxlffiwn pllS1mony) of the CON5EN4 and 175 selected members of the Q-Proteobacteria inferred from 16S rONA scqucncc comparisons. FIs-4.Z9 Phytogram (maximum likelibood) of the CONSEN4 and 176 selected members of the a-Proteobacteria inferred from 165 rONA sequence oompari!OOS. fl3.4.JO Phylo. .... (oeighbout-joining) of the CONSEN2 and selected 179 membcn of the a_Pro/~cteT;a inferred from 16S rONA sequenceoompansons Fie. 4.31 Phytognun (maximum parsimony) of the CONSEN2 and 180 selected members of the a-Proleobacteria inferred from 16S rONA sequence comparisons Fig. 4.32 Phytogram (maximum likelihood) of the CONSEN2 and 181 selected members of the a-Proteobaclerla infelTctI from 16S rONA sequeoce comparisons. F1c.4.33 Phytosram (neighbour-joining) of AgLS and .mainly. selected 184 memben of the CFB group infemd from 165 rONA sequence compc;tonS fl" 4.34 Phylogram (m&Ximwn parsimony) of AgL5 and, mainly, 185 xlccted membcn of the CFB group inferred from 16S rONA scquencccompan5OfU. Fig.4.JS Phylogram (maximum hkelihood) of AgL5 and, mainly, 186 selected members of the CFB group inferred from 165 rONA scquc:noec:ompll13OflS. University of Ghana http://ugspace.ug.edu.gh ABBREVIATIONS bp b.:lScpalt"S dATP dcoxyadenosine triphosphate dCTP deoxycytidinetriphosphate de-ionizeddistilledwlter dGTP dcoxyguanosinetriphosphate DNA dcoxyribonucleicacld dNTP dcoxyribonw::leotidephosphate cbDNA double-strandcdDNA .nTP dc:oxythymidinetripoosphlte EDTA etbylenediaminetetraacetale. 2HlO EtRr ethidiumbrotnide ethanol GPS g1obalpositioningsyslcm. HBI hwnanbloodindex H,O kb kilobasc KOH potISSiumhydroxide l.B Luria Bcrtani broth molar (moles per litre) "I miaolitre I'M micromolar ML Maximumlikc:lihood millilitre Maximum panlmony mRNA messcngcrRNA Mw molccularwClghl N.OH sodium hydroxIde Neighbour-JoIning openllonalwooomic urul \0 University of Ghana http://ugspace.ug.edu.gh PAUP Phylogenetic AnalYSIS USing Parsimony PeR polymerase cham reaction pll hydrogen-ion exponent PHYllP pHYlogeny lnference Package rONA ribosomal DNA RNA ribonucleic ecid nbonuclease revolutionsperminutc ribosomal RNA sddHlO stenle double distilled water 5.1. $~nsulalo ssDNA single-stramiod DNA TAE Tris·ac:eta1eEDTA mchingtemperature 2 -amino-2-(hydroxymethyl)-t,3 propanediol University of Ghana http://ugspace.ug.edu.gh If you would see 01/ of Nature gathered up at one point, in all of her Iotleliness. and her skill, and her deadliness, and her sex, where would you lind a more exqUfSf te symbol than the Mosquito? . HAVE LOC K tiLlS, 1920 L_ ____ _ University of Ghana http://ugspace.ug.edu.gh CHAPTER ONE INTRODUCTION Vector-borne diseases are major causc!> uf death and morbidity worldwide. Not only do they craie publtc:: health problems. they also preSl.'nt serious obstacles for socia-economic developmc:ntin tropicaJ countries. Of the disease-causing H:Clors, mosquitoes transmit some of the world's most life- threatenin& and debilitating parasitic and viral diseases. including malaria. Venezuelan equint' encephalitis. yellow fever. filariasis and dengue/dengue haemorrhagic fever (Miller. 1992; Priest. 1992; Monath. 1994). Of these, malaria is by far the most pn:\'alcnt uopicaIVCClor-bomcdisease. Malaria. the vector of which is the female Anophe/e$ mosquito, is estimated to rep~SC'nt 2.)% of the overall global disease burden and 9% of Africa's; il ranks third after pnaunoc:tXCaiacutere!ipiratory tract infections (3.5o/e) and lubcreulosis(2.8".) (WHO. 1999). An estimated 2.3 billion people. almost oncHhird of the world's population. are at risk of infection with the malaria parasite (WHO, 1999). The annUIII incidence of the dlscase is estimated to be 300- SOO million clinical cases with mortality accounting for between 1·3 million deaths among children under five years. In sub.Saharan Africa it accounlS for more than ~/. of the total global incidence: and mortality (Breman, 2001; W}10fUNICEF.2003). h is a major cause of dealh in pn=gnant women (Lindsay el aI. • 2000) and also an imponant calIX of sti ll births and low binh weights (Menendez, 1995; Stcketee ~, ai. • 2001). These figures are probably undereS1imates a' the statistics used in their colkction may vary by a fxwr of tJuec dependi"K on the method of estimation (WIIO. 1999). The 21 million n:ported cases of malaria in Africa an: believed 10 University of Ghana http://ugspace.ug.edu.gh represent only 5-10% of the total malaria incidence on the continent (Hamoudi & Sachs. 1999). Globally. the nwnbcrs of malaria cases ace increasing and the rate of increase is accelerntina. This pattern is illustrated by multi fold increases in malaria rales since 1979 in Soulh America (Roberts el aI.. 1997) accompanied by a rise in the proportion of populations at high risk to the disease. Malaria is nrpearing in urban areas and in rountries lhal had previously succeeded in eradicating the disease. for example:. in urben arasofthe Amazon Basin (Roberts & laughlin. 1(99), South and North Korea (Feighner el at.. 1998). ,\nnerua. Azerbaijan. and Tajikistan (Curtis, 1999). In Africa. the pn:vaIencc or malaria has been escalating al an alarming rate. within the last decade. Between 1994 and 1999.ma1ariaepidemicsin 14 sub-Saharan Africa countries caused an unacceptable high number of deaths. many in areas previously free of the disease (OAU. 1997). Additionally. the increase in cases and the altered geographic distribution of malaria is underestimated beausc: acCUnltc information on global incidence is difficult to obWn, since trplrts arc gcncnlly fragmentary and in'Cgular (Roberts el 01 •• 2000). Air travel bas brought the threat of the disease to the doorsteps of industrialised countries with an increasing incidence of imported cases and deaths of visitors to endemic areas. IleaJth authorities in many countries are bewming mucasingly concerned about the poICntiaily dc8dIy risks of malaria auricd inaothcirterrito!')' by "jet-setting" mosquitoes that bavcl on international Righls and spread the disease (Gratz et aI., 2000). Between 1969 and t 999. 12 counbies reported a total of 81 cases of m.laria in people livins near • airport. France headcd the Jig, with 26ca'iC5. foliowedby Belgium, 16. and the United Klngdum. 14 cases (WHO. 2(){K)). TIlt occurrmcc of lhe relatively large number of cases of airport malaria in Pwis and Brussels reflects the large number of Rights arriving from University of Ghana http://ugspace.ug.edu.gh Central and WcstAfrica. lbe3e-airportmalaria" CIlSCS, occurring in or near airports. are distinguished from other ca5eS of imported malaria among persons who contJ'Kt the infection during a stay in a malarious area :md ~\lhscqucntJy faU ill (Lusina el aI., 2000; Gratz el aI., 2000). The DlIjority of the cases were caused by P. /o/clptrUM. At least five deaths ha\"C rcsuhcd; all cateS occurred among non-immune individuals, accounting for • relatively hish mortality of 6% (Gratz et aI., 2000). The prevalence of malaria bas an enormous impact on a country's economy incapKitaIinR part of the labour force and thus affecting productivity. The number of DAL Ys (disability adjusted life YCIIS) lost in 2000 to malaria has been estimated to be 42. 280. 000 (WHOIUNICEF. 2003). II is also estimaled thal a bout of malaria. depending OD the severity, typic:aIly entails a loss of four or more ",orking days, followed by additional days with reduced work capacity (Shepard tlal., 1991; Picard &. Mills. 1992; Hempel A Najera, 1996). In Africa, where malaria IK:Counts for up to • third of all bospital admissions. this costs. sum equivalent 10 over 10 working dlys, adding to the continent's economic burden. Malaria attacks are also • major cause of school abJenteeism and appear to rteplive!y impact on long tenn leaming capacity (McLlonald, 1950; Wemsdorfer &: McGregor, 1')8)(; Holding & Snow, 2001). Endemic malaria also reduces the growth potentiaJ for some industries. notably lOw-ism and transportacion. and sharply raises the cost ofinfrutructure projec:ts and Olhcr coll«tive enlcrpriscs(Gallup& s.:hs, 2001; s.m. & Ma1aaey. 2002). Some raeaR:hers have eslimated the economic burden of malaria .. 0.6-1.0% of GOP in Africa. although recent reports indialle tbal the economic impact of the disease on national income is likely to be much hi&her (Branan, 20(H; Sachs & Malaney. 2002). The economic loss due 10 malaria in Africa is in excess of two bilhon llS dollan per year (Okenu. 1999). Malaria thus has ~iaI consequences and isaheavyburdcnoneronomicJe\'dopment University of Ghana http://ugspace.ug.edu.gh o;_~ "" .. __ lOconuoI malari .. ThoCOOlrOI methods include lilt usc of inicctic::idts. chemotBcnpy and rNftIIICIIlCnt of.he environment for more th&n 10 years, ins«tK:ides. most notably dictdorodiphcnyltricbJorocthane (DOn. have been onc of the prim.")' ~ of controlling imect-bome diseases_ However. resistance to imectK:~s has appeared in the major insect vectors from evcr)' genus (Brogdon &: McAllister. 1998). According 10 the World Health Organization approximuely 125 Irtbropod species an: resistant to at least one, and often two or more, insecticides (WHO. 1992). Rcsiscance has also developed to every chaaic:aJ class of ins.«cicide. includina microbiAl dn"gs and iMCCt growth regulators (Brogdon &: McAllister, 1998). In many parts of lhc wortd where insect-borne di:IeUCS cause illness and death. insecticides are 3\"ailable. However. the panial wjlhdnwal of available effective ituecticida due to resiSWlCe in target species, IDd the non-renewal or the registration or SOrM insecticides (l..aecy • Orr. 1994), coupled with environmcnlal safety issues and the rugh cost a.xia&ed willi heavy iuecticide usage (Gcorghiou.. 1986), make sustainin8 &ona term tmccticidc wmor control. extrcmdy CWIIy aDd pmcticaUy UNChic"aWe. Drup 10 tre.t malari. have heen used ror thousands of years and WHO and govenunents of malarial c:ounmes have turned IOward dru& treatment strategies 10 reduce malarial incidence rates. Owing the carly 19005. doc&on used quirune for rnaIariIi thenp:y. The use ofthr drug was short· lived and wa, rq:Uced wilh chloroquine, a che.p. sate, and effectiw anti·m.JariaJ th8t gMned wideIprcad Kcepwx:e among doclOrs in eodtmk: countries durins the t9SOs and f'e"$ulleci in an enormous decline ill maa.;. incidenc:c rales (WHO. 1997). But.fter years of usc. becau.sc it 'A'ti cilhcr misused or abused. chloroquine-resistant mata.;. ...... ba\"C cvoh'Cd (NIAID, 2000). From Ihe 19505 10 the present., c:hl~uinc resdIMcc .. ptu,aJ1)" spmtd 10 nearly a.l1/okipanun malaria endmttc: tqtons (WIIO, 1997; NJAID, 2000). University of Ghana http://ugspace.ug.edu.gh Newer drug therapin. unfortunllfely. have not been effective .pinst drug resistant stnUDS of the malaria paras.ite. The drug mcfloquillC' was introduced in ~ Asit in the mid.I98Os. bul complete drug resistance was observed only four years thereafter (Ba5U. 2000) and is DOw almost aalhe same level as chloroquine (Nchinda. 1998). Resistance 10 a more recent drug ato\'aquone developed SO quickJy that resistant strains were detce-Ied during clinical trials (Strobel, 1999). Artemisinin and its derivatives (artemcther. arttet.her. and artesunate). which were developed as a result of !he more serious dNa rWstaoce situation in Asia. are currendy the most effective of all anti-malarial drugs. However, it bas been shown in laboratory studies that malaria parasites can also become resismnt to artemisinin (MAID. 2000). Moreover. according to scientists studying drug ~slstance.themalari"purasile frequently mUlate3ancicanlhtreforeevolveresislanc:elo ".arly any drug (Basu, 2000) The ~ a1tached to disease control, especially of P. /alciparum malaria.. for example, bas moUvalCd retc::an:b ineo the development of efficient vaccines. Vac:cinalion apinst P. /oJciparum and P. vivar is seen as the mcthod of intervention with the ~atest poteoti&l to tnlu.;e the morbidity and mortality associated with severe malaria in areas of intense tnnsmiSliion (Miller a. Hoffman. 1998) and even contribute to eradicating the disea:(Greenwood,I997). Forthesc rea5OnS. considerahle effort and severa.l hundred million dollan have been ~J'l'tnt 0\'1:1" the last )0 yean 10 develop vaccines (Collins & l·a~k.ew11Z, 1995). UnfmtutLaldy. YJlCcine devek»pment is associated with mumrous Jlllicultio: such as antigenic divCr.iIlY and immune evasion of the paruitened cae of foreign DNA introduction into Aft gombiae genome in 1987 (Miller ~I aI., 1987). These were intensified after the !UCCeSSful development of routine: u.nsfonnalion teetw.iques usina the Mino$ lransposable element in the Mediterranean fruitfly CUalil" cop4la1Q (Loukeris ~I aJ., 1995) and the Morlner and Nermes elements in Aer/u tU1JIPI/ (Coeles ~I aI., 1998; Juinskicne ~I uf.. 1998). The ability of the MiltO$ traasposabJc eLemcnt to fwlc1ionasau-ansform2tion vector in anopbelim-mosquitoc:s has n:cmlly bttn demonstrated (Canerucia ~I aI. . 2000) 'The potential of using N[ur3Uy occurring symhiotic bacteria that can be geocltcally erlginecn.-d (or ~ control of mosquitoes., on the other hand. has re-c::ei~ much less artenlJOn. \\"uh thiS 3fJprur;x:h. the arthropod is not transformed. but the symbi~ bacteria ------------------------------------------~8 University of Ghana http://ugspace.ug.edu.gh tha' it harbours are (Beard elol., 1993.). Such uthropods an: called paratraJ\SgeDic:. This approach is guided by the (o!towing observations and basic concepts: 1) Throughout the cnlin.! devck>pmenaal cycle many arthropods. especially tOOK thai feed on restricted food sources such as blood. cellulose. phloem and stored grains. harboW' bacterial symbionts; 2) Some of these symbionts can be cultured and genetically transformed to express 8 gene whose product kills a pathogen that the arthropod transmits; 1) Normal arthropod symbtoota CaD be replaced with genetically modified symbionts, resuhing in a population of arthropod 'oUtOfl that can no longer transmit diseasc (13card f!laJ .. 1993a). A wKk variety of t.cleri.l symbionts have been idenlified in practicaJly every <>rder of thclnsectaandinticks. In those insects lhat feed on f]uid m41lcrial such as plantjuioesor baood. the symbionts appear to provide metabolic products that supplement the host Wec:t'. outritioo (Beard et aI., 1993.). Exampln of such symbionts include WolhDchia In drosophiHds Ind p3taSilic ""-asps (O'Neil tl aI .• 1992; Stouthamer el 01.. 1993). Rickmsia-like orpnisms (RiO) from IsclSC nies Glossl"o spp.(Beard tl al., I 993b) and ladybttdes (WcrTen el 01. 1994) and Rhodococcu.r rhodnii in the vector of Chaps di9C*te. Rhodnius prolixus (Beard dol. . 1993a). for mosquilOeS. Wolbochio has been reported in Ihe Aslan tiler mosquilO, Aedes olboplclU$ (Graig el al., J 994) and in Culex plplnu (O'NeiI el aI., 1992). Ho-.-.ner, there have been very few reports of bacteria symbionts in Culicids and there is no reason for symbionls not to occur in An. gambior If symbiotic bacteria occur in An. gambiae. then it becomes possible 10 exploit them as vchtcies for the introduction of foreign genes tNt can express anti-Plmmodlum proteins. t'1te feasibility of using transgenk cndus~ mbionts 10 alter the disease carrying ~ity of • d.isea9t wctor has beta demonstrated in It pro/aus (Duravasula el 01 .• 1997). Morc:owr,usinc bKl.c:ti.IlbIthavcspecialisedand5fX'(iflCsymbtot,cl.5.SOCiationwith insect hosts 10 spread ~ greatly mtuca the chance of unwanted gene spl'Qd. University of Ghana http://ugspace.ug.edu.gh Ikfol"C' lhe development of assays utilizing mo~ular techniques. the quantiUltive docwnentation of the presence of bacteria in blood-sucking arthropods such IS lic;ks, mites and mosquitoes. depended upon specific slaining techniqucsand serologK:a1-based assays. However. biological staining may be unreliable whilst serological.based assays require species-specific antisera (Higgins & Azad. 1995). Moreover, very few symbionts have been cultured outside their host in cell free systems. thereby preventing even a nditional bacterial taxonomic placement (O'Neil, 1997), though in a few cues. symbionts have ICtUaJly been isolated and grown in artificial culture medium (Welbum ('I ul.. 1987: W.1bum &; Dal •• 1997). Atthoush &he flSlidious nature of symbionts makes their isolation and in vilw cultivation diffICult, the advent of polymerase chain reaction (peR) has greatly aided efforts to characterise these symbionts at the molecular lC\el, and also 10 monitor changes that occur in vector populations over a period. Ribosomal gene primers specific for WO!. pIpi~tl/i.J haw: been used to detect lhis symbiont in insects belonging to several different Orden (O'Neil el al.. 1992). Roussel el at. (1992) described the use of 16S and 23S ribosomal RNA (rRNA) primers to amplify segments of these gcnes from Wolbochiu in terrestriaJ isopods, IS ...., "ell a..s insects. There arc several published oligonucleotide primer seb for amplif'ying sequences of Ri~kClIsiae-)lkt: symbionts (O'Neil et aI., 1992; Roussel rl uf. 1992a.; Higgins and Azad, 1995) and Eubacteria (Wcisburg elol. . 1991) in several insccls. These can be used 10 screen initially for symbionts in An. gambl«. For Idcntification purposes and 10 investigatetbe phylogenetic rclalionshipsoflhescbacleria, lhe sequences of the peR amplified DNA fragments can be compared to those lhal have been published ant.! depo .. iled in DNA databanks !luch as the GENBANK. Such phy1ogcnc1ic shidlCC'i can ousist in dcvclopins successful culture conditions in v;Iro as well as in Kkntifying .. tJlI.;shk- pla."imid transformation \'eClors. University of Ghana http://ugspace.ug.edu.gh 1.1 Objectives The uJbmate goal of this study is to identify a microbial symhiont that is universal in All gamb;~ s.l. and which can potentially be Benetically manipulated to express anti- PlaIIItOdilllllproteinsmadultmosquitoes. rhespet;ificobjectivc~are: I To use published oligonucleotide primer sequences to amplify by peR, the DNA sequences of eubac:teria and rickettsiae-like organisms (RLO) of An. gumbiae. 2 To confirm the identicy of the symbiont's amplified DNA sequence using restriction analysis. 3. To done and sequence the amplified DNA fragments. 4. To idcalify the species of the symbionts by conductinB bio-informatic search for homologous sequences in gene database banks 5 To determine lhe symbiont', phylogenetic relationships with related orgllflisms. University of Ghana http://ugspace.ug.edu.gh CHAPTER 2 LITERATURE REVIEW 2.1 Mosquitoes 2.1.1 General int roductio n Mosquitoes arc perhaps the muse Familiar of all blood·sucking insects. 'They are of tremendous significance to man from both the economic and health point of vteW, bc.:a~ of their role as para.. lte vectors . 1ney include the only organisms able to transmit human malaria. and, apart from carrying this and other diseases. are almost unrivalled as initatingbillngpests Aristotle was the first to chronicle mosquitoes, which he rt:fernd to as "cropis" in. his "Historia Animalwn" in 300B.C in which he documented their life cycle and metamorphic abilities (Floore. 2000). Although it has bc-en c:Iaimcd that the name ""kt!oquito" is eitha a Spanish or. Portuguese word meaning "little fly", Floore (2001) postulltes thai lbe. word is ~lly North American, and dates back to about 1583. In Europe. mosquitoes were caJled -musketas" by the Spanish. "gnats" by the British. "Lcs moucherons" or"Les cousins" by the French, and "Stc:chmuc:ken" or "Schnaekc" by the 2.1.2 ClOlSsificatlon and identification \.1·' ..... 1IJil,~ belong to the order Diplera. sub-order Nemaloccrca ofttle class Insecta wbi<:b " 1M most dominant group of the Phylum Arthropoda. Mosquitoes are readily discin~uished from ot11<.., similar-lookmg fli~ in the sub-order (Nematocerca) by their con:,plclIUUS forwardl) prujttting proboscis, scaJC$ on the thorax, legs. abdomen and "'In~ \CUb, and fringe or tealcs alona the posterior margin of the "ings (Service. \91)). KCllk. 1(95), AU mosquilOc:s b..:long to the famil) Culicidae. which is divided into three ,---------------------------------------------.12 - University of Ghana http://ugspace.ug.edu.gh sub--families Cuilcinae, Anophdiue aod Toxorhynchites. and JS genera (While. 1996). The CUIH::lnac (culicines) and me AnopheJinae (anophelines) both (onwn blood fcedinS man-biling species that are Unportant disease vectors, and it is important to be able to dislinguish betwem these sub-families. The Fi&W'C 2.1 illustrates the criteria which are uswIlyutedtodlstinguishthcm. The3e two mosquito subfamilies ruso differ significantly in their genomic structures. Anophc:li,. have heteromorphic sex chromosomes, a small genome size, and repetitive C'lcrntnlS dull Il'C' distributed in _ long-period interspersion pattern. In contrast. Culicinae have bomomorpbic sex chromosomes, and repetitive DNA tlw is organi.led in a short- period inlerspenion panem (Rai & Black. 1999). The Anophelinae contain about 450 sp«ies in three genera. Bironella (which is Aumalasian in distribution. ocxwrillg mainly in the Papunn subregion). Chogruio (which is NtotrOpicaJ) and AnopJwl,s (wbich cootairu: by far the largest number of species. 437. and ls newty lA'Orldwidc in distribution) [Lehane, 1991 ; Sallwn et aI., 2000j. 'The phylogenetic relationships among Anopheles. Bironefla. and Chagasla were evaluated by lIarbl:lI.:h and Kitching (1998) using morphological characters. The Anophelinae were fuund by the authors to be: monophyletic:. "ith Chagoslo occupying a basal position withm the subramily. and AnopIwlts sharing. sister-group relationship with Biro,,~l/a The Culic:iMt. the largest sub-ramily, is divided into two tribes the Culicini and Sabethini (Servicr, 1993). with almost 2500 spe'Cic5 in over )0 genera (White, 1996). The main medically important genera an: Atdts. Cwu and Mwuomo species ~ ____________________________________- '3~ University of Ghana http://ugspace.ug.edu.gh ANOPHELINES I CULICINES ~ffr~f~ ;'~.:~~~:, ~ egg' ·~ · II~~ .... IA edes Culex MIJn$onla ~~o,~~ I Aed.s Culex Mo,,,onl. 1 (7 po~~e ~~ tfj) AnoPheles IA edes .. Culex ~son;. fi&- 2.1 Chart showing the main differences between mOSQuitoes of the subfamilies Anophclinae (len) and Culicinae (right), (From Service, 1993). The main characttristics of anopheJines which distinguish them from culicines arc: adult anopbelines rest with !heir bodies II an angle of 30450 to the surface with the head, thorax and abdomm in a straight line,have datk and pa1c scalcs on the veins of the winas arTangN in distinct bklcks and palps ,bout as long IS the proboscls in both 5eXCl; CIIt him: flouts and are laid singly on the surface of water; larvae lack a siphon and re .. with their bodtcs par.allel to the water stuface. below the swface lilm. and pupae have bn:attunc uumpclS wttic:h arc short with a broad openina. In conlrUl, adult culicines rest with tbeir bodies roughly parnllello the surf.:e. have tc.aIes on the wina veins - but not ananacd an blocks and palps which are much ~horttt than the proboscis in females. Eggs are~ptOvidcdwithnoat.sandan:eithc:rhIltJsepa.rateI)'(Aetks).orJtackedin.floetin& ~n U 'wu) or in a ~ on ~ling \'egcl~Jllnn (Mamvm); larvae have • lana or ~hort Siphon and rest with then bochCl> at !lit angle to the "'Utc' surface: and PupK baYc: mort or long bn:alhinl:.\ trumpecs ""jlh a natT'O"" opening. University of Ghana http://ugspace.ug.edu.gh A high proportion of mosquitoes belOlll to the genus AeMs in which there are mote than 1000 species. clAssified as 40 subgenera. AMes ~,,-,,(ICS lft dislributcd throughout the world, especially in temperate COWltries (While, 1996). M..ny _ vector.> ofarboviruses. inctudintJ; SC\cral of the most important mosquito-borne human viral diseases (e.g. yellow fcver,J~eoccph.1litis.dengucfcver,RiftvaHeyfeverandWestNilefever). Representatives of three subgenera of AeMs Dre naluml veetors of BrUI(IU malay; and Wuchereria bancrofti that cause human disease (White. 1996). The subgenus. Slfgo. .y ia. is probably of the grealest medical interest. Aedes (Slegomyia) aegypli is the most widespread and most dangerous species in this sul-ogenus. It is the principal vcclOr of ehikuogunya and dengue viruses in almost evcry outb~ak. Urban ycllow feycr is also mainly transmiued by Ae. a~ygpIi in Central and South America and in WcsIAfrica(While.l996), About&OOspccicsofCuJuareknown.htingclassifiedin2Isubgencra,withrTW'lyspccies acting as me vecLon of enzootic arboviruses. protozoa and filariae ('Mlite. 1<)81). 1(96) 1Dc rypiul subacnus Culex contains the majority of species. Japanese cncephalitis virus is transmitted mainly by Culex spp. in \be Oriental region. espcclal.ly by the followlI1& species which bn:ed prolifically in peddy and swamps: ex 1';la~niOThyrtcltll$, ex, fl~lldll.J and ex, v;shltll; (Vanna, 1989} Culex lh~Jlerl in southern Africa and membtts of the ex, piPI~lLf complex in Eg) pi are important vectors of Rift Valley fever virus transmitted from livestock to man. In Australia, ex. annulir(JJlr;s plays a simil. role In thc= rptdcmiology of \1urray Valley encephalitis (Whilc. 1996). The Cul~x pip;ens complex comprises xvmtl species. subspccic!\ and forms., with ~(lrescntatives in .U pans III' the world (Scrvke. 1993). Typical ex. pipiens occurs in temperate c;ountnes of the nonhnn hem.isphe~. spading through temperate highlands 10 southem Africa. In Egypt. lhe: l.x. pIP'~ns complex is responsibk for roncwl\ian m.nasis transmis:sion University of Ghana http://ugspace.ug.edu.gh (Southgate. 1919) and has been implicated as the main vector in outbreaks of Rill VaUey fevc=T. CtJu quinqueja.JclallU is the main man·biting tropical member of the ex. pipl~fIS complc:x, distributed up to 38"N in USA. lOON in Asia, but only 24"N in Africa. Bancroftian filariasis is largely maintained in tropical villag~ and towns by ex. qumqlle/tudtJllLf alone. although West African strains of W. boncrofti do not develop in Ihls species of mosquito (White. 1996). I he: ~cnll' Mansonia has only a dozen American specics. notably the widespread pest M. III1JJmu "hich brced~ in swamps and marshes with floating water lettuce (Pislia) and WIllet hyacinth (Elcltornia) or rooced Pon/~Mriu (White, 1996). Mansonia liritlans is an important \'«10(" ofVenc.ruclan equine encephalitis and may contribute to transmission of If bancrojil. As with other troublesome MansOnia and some Coquillellidla, .\1 IIIII/ans disperses far from the breeding sites in search of hosts to bite. Subgenus MansonioiMs abo h.as a dot.tn 5peCies., endtmic to the Old World tropics. btiRg pests arising from breedina sites in water ..-.ilh plenty of Eichornio, Purla, rooted Isachne (swamp grass), ZIIZtJIUa IIbd other suitable "egelation (White, 1996). Some atboviruses are transmitted "} Mansonioilks spp .• but these species of mosquito are generally refractory to W. bancroft'- The bites are more painful than with most other mosquitoes. Mansonioitks spp. are uf particular ~itologjcal interest because. together with Anopheles (AtrOpMJ~J) 5pp., they are the \I«lors ofbrugian fi lariasis (White. 1989). In South-East A~il'l. sv.amJ>-forest populations of M unnuluru, .1.1. uniform/.! and especially the siblill8 species M. hunftelJl! and .\/ Jiws share the transmission of zoonotic subperiodic 8. 1IftJIayt, wtUch they ttansmitto man from Icafmonkeysan:Sochc:r wild animals (Wharton, 1962) The third su.b-family, the Toxorh)"ochiliR3(. has unly one genu .. the Toxot'Jtyncltilu, :mJ contains the world's largest mosquitoes lScnoicc. 1993) lhere llf'r some 76 mainl~ University of Ghana http://ugspace.ug.edu.gh b'opical ipCCies of ToxorJrynch/(t.'J bue • few occur in Japan. the maritime atC'a of the far eastern pans of Russia and in the eastern USA northwards towards Canada (Service. (993). Adults are readily recognised b), the large size (up &0 19 rnm long. and 24 mm wing spread), rneuIlic colouration and prominent curved probo-'Cis. Females are i~)eofb)oodfcedingandbothsexcl> sockrteCtarandothcrnatutall)'occurring supry sub5t&nces. ToxOl'hynchil~J is consequently not involved in disease transmission (WhiI<.I996). Approximatel)' 3300 spc:cies of Culicidae have been described. more being recognised each year (While, 1996). The most important man-biting species of mosquitoes belong 10 tht ~nera AnoplwleJ, Cu/~x, Aedes. MaflSon;a, HlNmu~(lglIs. Sabethes and PsorQP#tora (Servke.1980). 2.1.3 Species complexes ~ mu~uitoes occur as sibling spc:cies (or cryptic species) complexes, pall's or ¥roups ofckllc:ly ~1a1ed species lhat are morphologiCliI indistinguishable (isomorphic) but rc:producti~l)'isoltllcd.andwb.ich frt!qut!nlly live in lhe same area (Le. aresympatric) [Senic:c.1993). me first to be recognised as such is the A.noplw/~,t lltaCuli~n"u complex. (Service, 1993). Simia.ty. All gambioe was aJJO later shown to be a species complex. (eoluzzi. 1966), and m:tny olha Amlpheln species can now be considered as rcpre.seotiat COItlplcxcs, for example, An Jirus (Green el al., 1992), An 1,.·IJ"cifocj~J (Supna~/aJ.,19U)andAn. qlJadrimaculatlJs(Narang~/aJ., 1989a). In many complexes. species-spccilic differences in ecology and behaviour can sub5tantiaU)' a!Tect both disc:a~ trarumission and the success of rDClhods IiSCd for coouol. Fnrexamrle. II difTCTC'ntial response Iu Lnsecticides has bttn noted in the All University of Ghana http://ugspace.ug.edu.gh cwicifQC;~~ complex in India. In this case. DDT resistance is mainly associated with species B. which is. very poor vector of malaria (Subbarao d 0/ .• 1988). 2.1.4 Methods for distinguishing between cryptic species Sptcics Vo llhm a l:oOlplex by definition cannol be distingu l~hed morphologically, or only wilh great difficulry . In mOSI cases the benchmark definition o f cryptic s ibling spec ies has been mating incompatibility (Collins & Paskewitz, 1996). Mating compatibiJity studies were later replaced by polytene chromosome examinations, whieh wert amonc: the flfSt to provide researchers with langtble genetic markers that could be uttd to differentiate between spc:ctes (within cryptic complexes) and chromosomal forms (Coluzzi & Sabatini, 1968; unui el aI. . 1977; Coluzzi d 0/., 1979; Toure tI aI., 1998). For routine species identification however, chromosomal markers have major limitatioM. A number of alternative 5pC'Cies-diagno~tic procedlUcs h.m: IhcTtfnre hem developed, including idenlifiCltion by electrophoretic IJUItysis of gene- etv.ytne systems (species-specific allozymes) [Mabon~' 01 .• 1976; Miles. 1978; Narana el al. • 1911. 1919a,b; lanzaro el oJ .• 1995). hybridisation with DNA probes (Colhns el oJ., 1987; Gille & Crampton, 1917; Co..:kbum &. Seawright. 198&~ Panyim el of., 1988) and quantitative differences in the cuticular hydrocarbon profiles (Carlson & Service, 1979, 1980, Ilamilton& Service. 1983; Phillips&. Milligan, 1986: Phillips~'al., I'JM8) From these: ·traditional tools ' of cytological eXllmination of polytene chromosomes. namdy. genetic compatibility, immunolosical and hybridisation techniqUH and isozyme an31 ~~ is (Fa"ja ~I 01. . 199411; MunstCTTT\alUl & COM, 1997), identification tools have expanded 10 include a \ 'as( amy (If moleculllr milrkers. 11lc:sc conlcmporvy rn.rt.ers range frum what are no\\ rdc:rrcd to IS 'classicaJ gemtic mutte rs' such as mitochondrial University of Ghana http://ugspace.ug.edu.gh DNAs (Caccone elm. • 1996: Besaasky elal. • 1997; Lehmann dol. • 1997; Thclwell elal., 2(00). spccieHpeciflC restriction fragment length polymorphism (RFLP) analysis of ri~ UNA (Collins ~I ul . 1988; YasothornsrikuJ el al., 1988: Torres el al., 2000; FlviaetaJ. . 2001). methodsusedtodetectandidentifysinglenucleoctdepolymorphisms (SNP,) [De Merida " oJ .• I999J and fmally 10 highly polymorphic nwkers such as random ampluaed polymorpruc DNAs (RAPDs) [Wilkerson d al. • 1993: Favia et al. • 1994aJ. microsatellite ONAs (Zbeng e, 01 .• 1996. 1997; Lanzaro et al., 1998; Conn" m., 2001; Shankov II ul .. 200la), and amplified fragment length polymorphisms (AFLPs) (Bl.ack & Lan2aro, 2001). M~ recently, sequence-tagged amplified RAPDs and sing1e~ copy markers (Mukaba)"irc I!/ al. • 2001) have alto been used 10 distinguish between Onc of the areatcst advantagcs of this wide variety ofgcnelic J'TUU"kers is that researchers RlIY c:booee to IAilisc any combination of marfc;crs or techniq..an to addrcs.s muJtifacetcd quaciooIrclatinatnmalana Iransmissionbyanophclinemo5quilocs(Taylorelal. . 2001). 1bex mol~ul;u markers have proven useful in a wide variety of applications including molccul;u taxonomy, evolutionary systematics. population genetics. genetic mapping, and I variety of molecular diagnostics. I) "orphotogicatldentification The analysis oi morphological features is silli the must widely used technique for taxonomic and sysccmatic studies uf anopheline mo~uiIOCs. and lbe intemal dauiflC&llon of the genus Anophtle~ is based largely on morphological character!> (H,..b.ch. 1994). The available morphological di.qnostic ctw.cttf'S. altholJlh OOt relLablc, an: of ddinile laXonomic \ "Iue fOf the distinction of the: saltwater and the fresh University of Ghana http://ugspace.ug.edu.gh WIlier sibling species of the An. Kambi(N complex. The number of sensilll coeloconica. the value oflhe palpal index and the shape ofthc eggs can separate these two groups (Scf'\·kc, 1993). However, none of these characters appears to be completely discnmlllani as tht= morphological characters of adults are variable and overlap in many inslal1c~(Coluz.zi.I964). In tM field. morphology still remains a very useful tool for identifying WU'Clated species. and cnnt'lk .. the preliminary sorting of material prior to the applicatti"oo. 1991), The rONA cistron cumprilCS the 288, S.SS and ISS coding ~gion~ (hat arc University of Ghana http://ugspace.ug.edu.gh interspersed with inle:maJ traMCribed sp.:er rcgioos (ITSI and ITS2, either side of the S.SS gene) (Hillis&: Dixon. 1991). These modWesue then repealed in tandem linked by the inlcrgmic spacG' region (lGS) (Avise. 1(94). Thc:re arescvcral different sites with.in the rONA for wruc:b universal PCR primers have been construCted (Black. 1991). HO"'-'evc:I', cODSCl"Y'Cd sequences flaok regions of bypc:rvariability making lhis mmer versatile at di{f~nt levels of taxonomy reflecting different rates of evolution (JTS2- .:!ss. Campbell ~r aI. • 1993; ITSI - ITS2. PaskewilZer al. • 1993; lIS, Pashley ~l ell .• 1993). Many invc5tigators have used thc:3c regions to distinguish between members of spedes populations and complexes of medically important Vtt10rs of disease (Paskewitz d oJ .• 1993: TOWlUOn.t: Onapa. 1994; Collins cl Paslcewitz, 1996). Currmlly. the most widely used method in the identification of member species of the An. gambiae complex is besed OQ lIJrCies-specific nucleotide sequences in the ribo!Omal DNA (rONA) inlergeRie spacer rcgKxu (Scott et DI. • 1993). Thjs may be used 10 identify both species and inlcrspedcs of bybrids, reprdJess of life stage:. using either exlr1lctcd DNA or fragments of. sp«illlt!n. In usina this method for the Ktentification oC mosquitoes. intact portions of a specimen as small as an eea or the segment of one leg. may be placed directly into the PCR mixlUre' for amplirtCatioo and PnIlysis (Scott d aJ. • (993). This t~ p.! l·f d;:agnnstic tool has abo been extensively used for differentiation of species -.ithin the All {UIWJIW group (Hacken el a'-. 2000). All dtrus (Xu ~I al. • 1998), An. jllWkl1i1i.f (Marn:mmani el aJ .• 20011. and the An quodrimucululLU complex (Comel el al. • 1996). Such srudies have readil) identified the lUoCfulnc:s.s of this molecular marker for difTcrmc~ at the I~el of sub-species. races and strains. Ho"c\cr. continuous \'lIri"ti~ of the! ln1crl::eRlc 'racer (lOS) and multiple copies of the gent within I sin~k I~I\ "jual can confound analyst,>. PCR amplification and sequencing results. This is "-ell University of Ghana http://ugspace.ug.edu.gh demonstraltd by (GS variability within an indIvidual which has been {0W'd to be as greaa as thai across a sample population of the aphid, Schizqphis grammum Rondani (JiemipCera: AphJdidat) at various spatial scales (Shufran el 01 .. 1991). The costs involved with this marker are similar to any PCR based technique. In gmeral. problems with muhiplC' copies mHll that lhis technique, unlike others. should be used with careful consideration. 10 some studies to date. contaminants have ~n amplified along with the ~udy $OUtce (Fenton el m.. P)()4); in this particular case, a novel fungal organism was identified. Apart from rONA Benes. there are a few other nuclear Benes mat have been widely applied to phylogcnelK: investigations. However, Friedlander (I 01. (1992) have reported 14 nuclear lenes thai may prove useful for highcr level phylogenetic analysis over a wide range of taxa, e .g ENOL. enolase; G6PO. glucose-6-phosphate dehydrogenase and SODPUMP.Na'/K··ATI»uc. Further sites for short sequences thai are proving useful in \·;ariow taxonomic and population genetic SNdies have recently been investigated in mlrons (Lc~...a. &:. Applebaum. 1993; Palwnbi &. Baker. 1994; He & Haymer, 1997), non· coding regulatory regjonsclosely a.s.soc;iated with genes (Slade elul .. 1993). 2.1.4.3 Highly polymorphic markers I) MlcrowteNlteONA$ Simple tandem repetitive DNA, more commonly known as microsalcllile DNA. has become a popular 1001 for ~rnctic studies ofanophdines. Micros.lelliles are described as DNA fragments consiscini; of simple short sequences woolly of 2~ nuckotide!l (nl). tandanly fq)C31C'd in man: or less unifonn tracts up to approximarcly 10' nllon, (TaulJ'~ 1993; Chamben &. MacAvoy, 2(00). MicrosateUiles art' found in the I~nomes of just ilboul (vcry known organism and ~Ue. In most eukaryotic organisms.. University of Ghana http://ugspace.ug.edu.gh micro5PItellites are disperxd throughoul the genome (Hamada el al .. 1982) and can occur as frequently as evt:r) 10 kb (T4Utz. 1989). Polymorphism in miccosatclliles leads 10 aD tncru.sed probability of finding heterozygous indi\'iduaJs. Mkrosattllitc kx:i are liso frequently hy~rvariabk. (Le. they possess several alleles 11 ~lIlively high frequency, improving lMlr versalillt)' and hence are tdell tools for mokcular charactcrwtk>n of individu.aJsandilUdiesofint.raspecific ... ariatton(Tautz, 1993;L.anzaroetai. . I 99S}. £Xploiwton of micros.atellite kxi has pro\'ided biologists with 8 sel of molecular tools with un)U~stdversatilily.Thckeytothisversatilitylicsinthehighlcvelsof\'ariability, which arecharactcristically(oundatsuchloc i.coupledwiththe speedandreltabilitywith which this in(ormo1tton can be accessed in the labol1lloT) (Chambers &: Mac:Avoy, 2000). Their applications range (rom estimation o(the spatial relationships between chromosome segments 10 the elucidation of temporal ~Ialion!ohip.s bet. .... een oricins of species and genera Mlcros.tellite loci ...... eallO been described ti ideal mark~f!lfotmc:asuringpopuillion le .... el phenomena iUCh &5 population structun: due to their high polymorphi.vn, c:odominance, abundanl preaeN:e throughout the genome and relalive ease in sceKing (Bowcock et al.. 1994 ; Butt.nan ~taJ. . 1994: Scribneret ul. . 1994: lanzaro et 01.. 1995). OtMr applicatKMu include: ux in the analysis of lIboratory :mJ agricullul1ll organisnu and human genome div~rs ity projecu 10 mIp single gene trailS rapidly (Hcamc et 01.. 1992), DNA profiling (Gill ~t al. • 1994) and .150 in fou;lIsic .... ork (Chambers ~I al .. 1997). MicrosalCliites art almost perfect tools for application In dcwnnining the ,,"em of ~lationship:. bctv.ecn individuals because !hey are capable: o(hiply discriminatinl biperentally inherited co-dominant markers. In the analysis of An gu",""", popubtions., microsarcllile DNA _lysis has been used 10 generate. genctK: map of the mosquito (Zhcng ~I aI. . 19(3), in the Ilenet ..; difftrenlillt~ of populations (Lanzaro~ttJl.. 1995) and in the studyofpopulatton Mructurc (Donnelly etal. . 1999). University of Ghana http://ugspace.ug.edu.gh The advantage of this approach is me ability to de~t greater levels of genetic variability as many microsatellite loci, often with numerous aJle&es. can potentially be screened for ecologjealuse (i::vans, 1993). Individuala1le~canbesc;orcdatpatticularlociand provide good Mrodclian markers. Tri· and tetra·nuclcotidc repealS provide much better signals. i.e. reduced stuIIert.nds (caused by mispriming inducing templates of smaller size 10 the ori&inal t.nd) and are more easily recorded for size variations (Queller tf al., 1993). However. the disadvantages include the I't'quirementlo sen.:·cn ~veralloci for adequate population infonnation (a minimum of four polymorphic loci in clonal orpnisms and ten and above forsexuaJ populations). thus increasing development timc lOCI costs pcr sample. Although micro-satellite peR primers are designed to be species specific., cross species allele amplification his been noted (Queller ~I oJ .• 1993). III Randomly ampllfled polymorphle DNA (RAPO) RaDdomIy amplified polymocphic DNA (RAPO) marker:! ka\'e been extensively used 10 distinguish berween members of cryptic species (Williams el al .. 1990). The technique is fast. lechnicaJly C81)'. aDd requires little material. Mo!rt importantly. no prt'viow; nucleotide SC'qlkncc iDfonnallon is needed for the construction of primers. Many markers can be readily identifted for a variety of laXOnOmic: kvels and, in comparison with UNA sequmcina. the effort and cost are modest so Ihat many individuals can be assayed. This method, whtch I'C'veals great genetic variabilily due to the regHms in which amplification takesplace(BlackelaJ.,1992),iSU5C'ful in differealiatinl closcly-related indi\'iduals, and there are numerous commcTCially auilable primer kits which can be used to ICTttn popalatiom.. The __y Iis of RAPDs provides a novel and etrccth.. c mtthod for di:Miapishing AttopJwles specin and other organisms accon1inC 10 the bMding panerns ofthrir DNA. as wet! as providing a ncw means of obtaining gcnctK: markers (Iledriek. 1992) University of Ghana http://ugspace.ug.edu.gh However. theae markers have se~raJ dnwbecks: (i) they often reveal continuous variation betwem sample populations: (ii) primer libraries need screening to identify suitable primers and stable genetic polymmphisms, both of wbich are time (:onsummg and costly; (iii) the dominant natw"C of the markers makes it impossible (0 diS1inguish between homu.lygous and heterozygous alleles (Carlson et aL. 1991); (iv) additional tedwques arc n:quim1 for useful Mendelian data to be: obtained (Vaughn Ie. Antolin. l 1998); and (v) numerous factors such as DNA qualiry and quantity. Mg ' concentrations.. Toq souree etc. can affect the reproducibility and standardization of reactions (Black. \993). 2.1.50istributionMMl-=oI09V Mosquitoes havc an almost \\orldwidedistribution.. being found throughout the tropics and temperate f'C'gioll.'l and even well beyond the Arctic Circle; they are Ibsent onJy from Antarctici and a few islands (Service. 1993). They ate found at clcvations of 5.500 m and in mines at a depth of 1.2S0 m below sea level (Service. 1980). Their great diversity of habitats and IifC'-hi"ory strategies has allowed them 10 colonize many contrd.Sting environments. For example, mosquito larvae arc founc1 in ponds, swamps. sa]HYaler manbc:s,pools,tree-ho~,poliuted""'8tC1ofSCJ>'ictanks..ricefields.discardeddomcstic conlainers., rock pools, plant axils and pilcher plunts. and in a variety of other aquatic habitats. Adults are encountered in almost all types of ecological zones. from equatorial rain rorests. urbanised conglomerates, cuitivalCd lands ttl ~cmi-arid areas (Service, t 993 J Some gcr'lC13.. bo¥l'n'a. tuve a rnuiC1Cd distribution and may be confined to certain areas of the world. lbe gCncr:l IJolu"otfJgut and Su~thts. for example. are round in only Central and South Amenca. Some mosquitoes may occur in only • few counTries or localities. for example An. hwambtN, .....i lcreas oth..-n such as~. ,wnqueJafciOlu.f and At. De/OlPti arc widcspn:8d in the troptcaI region .. ofthc ¥lurid (Service. 1980) University of Ghana http://ugspace.ug.edu.gh Most mosquiloes probabl) dispenc only a few hWldred meU'CS or SO from their emergence sites; for example A~des Qltgypll usually proMbly files only 25-100 m or so. Nunnally Anopht/~J do not fly more than 2 kIn, but in certain circwnstances they can rqulatly fly l·S km (Scrvic:e. \993). The distance mosquiloes fly is determined largely by the enviroomenl: if suitable hosts and breeding places are nearby, mosquitoes do nol have 10 di!'pcrse far, but if one or both are 1(. di~tam; ('. sreater dispersal will be 2.1.6 Hedicallmportance Mosquilocs cause more bwnan sUITcring than any other organism, with over one million people dying from mosquito-borne diseases every year. Mosquito-bome dixases continue 10 cau-.e significant human health problems.. lugely in the sUbtropics and tropics, and lheit inCldc:m:e h. .b increased significantly ....i thin the last 2 dcc:1des(World Bank Repott, 2001). Not only can mo~ulloeS carry diSCl.X'S that affitcl humans, but they also [nnSmit ",veral diseases and parasites that dogs and horses arc: very susceptible to. These include dog heartworm, Wesl Nile virus (WN) and Eastern equine encephalitis (EEE) LWbil~· . 19961. In addition. mosquito bites can cause severe sk.in irritalion through an ,Ilk/gil.: n.:~llun to the mosquito's saliva causing a red bump and itching. Mosquito \ ••: t'lore,j d.~3$C's include prolOllOaR di.seucs, i.e. , malaria., filarial diseases such as dog hc:artwonn.and arthropod-bome viruses (1lJ"tKwiruses) such udenaue, enccphalilisand yellow fever. Estimales from the: World Health Orpnization indicate that three mosquilo-bome dilcascs an-: among the leading causes of morbidilY and mortality in developing countries around the world (WHO, !OOI,. Nearly 500 million clinical cases of ma.1aria caused by University of Ghana http://ugspace.ug.edu.gh mfection wtthPltuwtOdi. ... ,.....,itesoccurcach yc&r, resulling in 2.7 million deaths. mainly in children. Malana is exclusively transmitted by AttOph~/eJ mosquitoes Lymphatic filariasis is caused by parasitic nematodes and is the second leading cause of pc1l1lIllCnt and long-term disability worldwide. with 120 million people presenting chmcal morbidIty (WHO, 200(8). More than half of the world' s burden of lymphatic filariasis (LF) is transnutted by ex. qlllnquefasciafllS and other man-biting mosquitoes of the Cx. p'pirtU complex which are responsible for Bancroftian filariasis transmission in thc:Americas. Egypt. urban East Africa, the Indian subcontinent, Indonesia and SOUthC4St Asia (WHO. 2000). In about 40 countries in the African regaon and Papuan sub-region. W. bancrofli is largely transmitted by AnoplteleJ mosquitoes that also vector mafari. in rural areas.. In most Pacific countries. W. bancrofti is vectored by aedine mosquitoes (Aedes and Od/rl'OlDtu..f) that also transmit arboviruses. notably dengue. Srugian filiUla.. . IS, tnnslTullcd by Man.wtlia and AMplw/eJ. i.J now limited to only 8 orientaJ countries (While, 1996) Dengue fever virus, particularly Its haemorrhagic form, is. threat to >2.5 billion people. with an annual incidence in the tens of millions and 324,000 death.! per )'Car (WHO, 2002b). Dengue vit\I5CS are transmitted to bumms through the biles of infective female A.:Ja mosquitoes. Many mosquito spc:cu:s are abo vectors of otht:r arboviruses. including sevaal of the MO!illmportant mOSQuito-borne hwnan diseases., West Nile ViruS (WN). ca:>tem cqumc: encephalitis (EEE). western equine encephalitis (WEE). S1. Louis encephalitis (SlE), La Crosse (LAC) cnc:ephaIitis and Jap.anc::sc: c:nccphalitis (JE) lWh.ite. 1996]. The: arbovl.rusc:s are the mMt divc:ne, nlJmcroUJ and senous dlsca.tel transmitted to SUSC\.-p(lblc vcnebrate hosts by mosquitoes and other blood· reeding arthropods. University of Ghana http://ugspace.ug.edu.gh 2.2 The Genus Anopheles Nocablc d~in&uiwog ClW'a..:teristics of anopheline mosquitoes (from other mOSQuitoes) include the long pe.lps tbat are present on both males and femaJes and the chatacterislic pattern of blocks of dark and pale scales on tht.: Wln~ veins. especially along the CO$I.II (top partoflhe w;ng) [Fig. 2.2]. Chromosomal data suggest that Anoplwlu is "primilh'C''' within the rllffiil~ CulieK:iae (Besansky & Collins. 1992). Evidc:nc.c. incluJinl significantly smaller chromosomes and lower nutle.r DNA conlcnl (Rae> & R.i, 1990), and their unique possession of dimorphic sex chromosomes and long-period interspersion of repetitive sequences in the genome (Black &. Rai, JI)8KJ. support the extensive divergence of AnoPM/~s from the other mosquitoes. "The genus AltOpIt~/es Meigen consists cunmtiy of 437 recognised species which divide into six su!>-gcncra; Anopheles (185 species). Celllll (200 speei~), LophopodomyJa (6 species). K~r/~zjQ (12 species), Nyssorhynchus (29 species), and SJelhomyia (5 species) [SaIlwne/ul. . 2000). 2.2.1 The life c:ycle of Anopheles mosquitoes The AnoPM/~s mosquito goes through four separale and distinct stage. of its life cycle; egg. larva. pupa. and adult (Fig. 2.3). Each of these scaaes (lUI t't easily recognized by its spec ..' appcaranc(. The immahate scaaes of Anophtl('s are aquatic, "hilst Ihe aduJt is tenatrW. Anophc:linelarvac:cxisclnawidc\·arielyofdiffcrf:nlhabitats(Scrvi«.1993). University of Ghana http://ugspace.ug.edu.gh Flr.l.l Art IIDOphelme wmg which 15 idMfied by i1s c~ ~ 11wn are blocks ofpale aad dad scales (red arrows) on the vein. especial Iv along me costa (tIw lOp ... of"~. Then is also • Iiiap of rwrow. outsa.tdina SQ].. . (peM arrows) alona1hebonomoftbcwma 36 University of Ghana http://ugspace.ug.edu.gh P'4'a "1.1..1 Li:fe~eofAllop#wlesmosqui1oes 'Iben 1ft r~dillinc.1 ~ of development (completcmetamorpbOSlsl: ega. lana. pupa. and adult. the eggs are LMiD ..... lDdhawepalredlalenlaar-ftlledfloll$,lJOb..a ..... _lMYhllehin 1-3d.,.. lhelro"ali_.,...&&eItodlewttersurf'aceandhlsconspltuousmouth brushes 'Mae pupa IS collllDHhlped. bas afuscd head aad thor-.. The pupil SfaBe IS • ";'ar:l\·el~' short (two to line dayt), aonf-'ins. a.sIIJonaI .... 11'1 which the adult ~trilhiBtbepupaicovennsBolhedullrmllllesandmaJesdnnknec:tlr1nd 01Nrplad:Jwcesasenll'gysources : ~onl)' ranakllClb.bkIod"". tdizin8the ...--_ .. pmd. ... - University of Ghana http://ugspace.ug.edu.gh Some species, such as All. ps~ucJopunclipeltlfU Theobald. prefer pools of water in drying streams in relatively dry. sunny areas, while Qthas prefer inundated an:as such as ric:e fields. AnDjJMles minimus in 'I118iland and the Philippines lives amongst bamboo roots in relalivcly swift·moving water. Anopheles gambiae sensu lato often lives in puddles of water that accwnulale in lyre ruLs. and An. stephens; may live in rnatrmade conlainers Anoplwles l.rvae as a whole lead 10 prefer clean. rather than polluted water (Service, 1993). The A1IOphelesegg is difficult to sec .....- ith the nak.ed eye, being quite small about O.S x 0.1 mm in length (Service, 1993). It as boaI·shapcd and. has two laleral floalS which are filled with air. The shape ofthesc floats. as well as the chorion ornamentation may be used to distinguish the subgenera and sometimes the species of IlllOphelines (Hervy el aI., 1998). The eggs are laid singly on the swf8Ce or on the edge of water and are attached side·to· side or end·to--cnd. but never md·to--side. The eggs arc white when newly laid but tum brown or ~k within [·2 hours. In tropical temperatures, ens hatch in 2·3 days to produce t.vac. These eggs cannot ""ithstantl desiccation (Service, 1993). Some 100·200 per batch are laid every 2-4 days. Survival and egg development are mainly dependent on temperature and relative humidity; Wlder extreme climatic conditions, mosquitoes may go into hibmYtion or aestivation, which allows the survival of the species through Ihe winler in temperate climates. or long dry !Casons in tropical arid areas (Clements, 1992). l'helarvaewhichhatch from the floating egg have ahead, a thorax und nine abdominal segments. The larvae abo show numerous morphological pcculiaritirs., the most striking ofth~ lxing the IKk of a siphon on the segment vln. the presence of palmalc setae on the lil'!it <;c\cn abdominal segmenLs. and Ihc shape of the cephalic setae (Hervy el al. 1M). Anopheline I. ..... arc filter (ecdcn; of paniculales at the surface of water. The University of Ghana http://ugspace.ug.edu.gh buccaJ appatalUS is SllJ1'OUI1ded by bruYles which create a "''llter flow that brings food f*1K:les to the mouth (Service. 1993). A conspicuous distinctive eharnctcr of AnopMleJ is the \\ay the larval body positions ilSel(parallc:! to the water surface as against the obIiqoou:s st)'k exhibited by Atlks and Clilu. This position might be related to the abscnce. in lheA1tOpIteltsWva. ofLhc respinilOry siphoa which inCulicioelarvaebreUs the surflCe Of\\13Icr and takes in air into tbe trac:beae. The dumion of the 1Qf'\"al stage ranges between S and II days dunng which the larvae uoderso 3 JUCCessive moults. during which they maintain the same morphology, but tnC~asc in sitt from less than 1 mm for the I Sf. stage larvae to more than 0.5 em for the 4" inscar(Service. 1993). Rates of larval growth are influenced by co\'ironmental facton: aach IS warer temperatwl:. photoperiodicity, food supply, degree of overcrowding and the 5pC'Cies(Scrvicc, 1993: White, 1996). Tht fourth instar larva moults into a pupa, in which lhe hc.:t and thorax are (wed to form 11 common en,'elope whicb bears the respimtol) lrWnpelS. Thepu~lsarelativelyshor1stage,usuaIly2-4d.)·,.durin&wtUchildoesnot feed. The actual eclosion takes approximately 12·15 minutes (~ervice. 1993) and for most species emergence occurs in the early evening or late at nighl. After~cnce,Mluitfemales5ecksbelleramongstvegeta1ionunliJtheyarereadyfor mating (!xt\\lren a few bouts 10 2 days) wtuk the mouthparts harden 10 permit ~kln pc:nclntion and f«ding. Males arc unable to mate until their genitalia have rotated through Un." of the male's acceuory ,t.nds. CffC"Cli\e1y blocks sperm University of Ghana http://ugspace.ug.edu.gh (rom subsequent copulations from raching the spennatheca (Gillies & Chir t 956; GigHoti &; Ma-.on 1966; Kettle. 1995). Female anopheline mosquitoes obtain energy from sugar meals for metabolism and for night Flight is Deeded for mating and for finding a ho51 that will provide II blood meal SOW'CC: (appc=titive night). The blood meal is • protein·rich diet lhal the mosquito SWTOWlds, after ingestion. with the peritrophic matrix (PM). a thin structure containing chitin and proteins. Digestion requires secreted proteases that penmate the PM (Clemenl5.. 1992). The smaller digestion JWOducts are hydrolyzed by microvilli-bound enzymes before absorption by the midgut cells. lbe blood meal-derived nutrients are processed by the in5eC1 fat body (eqw\,it!entto the ]i\'C~r and Mfipox tissue in vertebrates) intoeu proteins (vitellogeruns) and vanous lipicis associated with tipoproteins. These are then exported through the haemolymph to the insect ovaries. The egg development process takes 1 to 3 d.ys, and no further food intake i! needed until after oviposition. when a new cye~ of active host finding and blood fceding. digestion, and egg devdopmenl begins (Clements. 1992) Inc adult female may live from a few days 10 wdl nvcr • month. going tlvough several cycle .. til' hload feeds and egg-laying. '] hI.' Juralion of the whole process. from o\ipnsilion In adull eclosion. illCmpcrature-uccflm A.(A.)Jinell.fn A. (C .)jtyporieruis A.(C.)ptJtlOIti A. lA )b.l<:r,,/11I A . (C.)kUl"WUFI A.(C)/f"Qulilypel A. (C.) loliensi$ A. (C)/.'lJllllfJ'IJe 2 A.(C.)plllfcluinllU A. lC.)h;/Ii A. (C.)suhpic/w University of Ghana http://ugspace.ug.edu.gh --------------------------------------~4,__ University of Ghana http://ugspace.ug.edu.gh In most JWlS of Africa, several vectors tranSDlit maJana \0 each location, in some cases at the same lime and in other eues during different ~asons, as for example in iJietmo, Senegal (Fonlcnille tH oJ .• 1997). Much variation can be observed between villages 3 few kilometres apart . Ln Central Africa. it is not rare to capture four different vectors (e.g. An. gombioe. An. . junes/us. An. "iii and An. mouchen) during Lhc same nighl (Fonlenille cl Locho. ..... I999). Anophelt'.f gambiDI sensu slriclo and An. arobiensis, the two most anthropophilic species, an: responsible for more than 3/4 of Ihe world Plasmodium !alciparum inoculations (Favia & Louis, 1999) IU1d an: thus the most efficient vectors in sub-Saharan Africa. However. they differ in the degree of anlhrupophilly and endophily. with An. gwnbiae S.s. sbowina an iocft:ucd. propensity towards endophily, endophagy, and anthropophagy eompued 10 An. orabiensis. Anop~/es gambiae 5.5, the nominal species, is thus consigicaidependellCehasevolvedtosuchdegrecthat if the tWO organisms are separated, neilher will survive (Roberts &. Janovy. 1996). Macinnis (1976) defines mutualism as when each ofthc interacting species functions as bothahoscandaparasile. Tennites,and lheceliuJose-digcsting prOlozoansor bacteria tMI Jive in their inkSlines. are an excellent example of mutualism; neither organism can swvivc:wiitMlucthcother. P..sililm is an association in which an organism (parasite) living in or on another living orgarusm (host). obtains from the host part or all of its organic nutriment. usually to the detriment aCme host (Roberts &. Janovy, 1996). Parasite! may CIUse: rnechaniea.l injury • .sucba boring a bole into lhehostordiga.ing inlo its skin or other tissues. by stimulating a d.mqing iJlf1ammatory or immune resporuc. or simply by robbing the host of nutrients (RobcrIs & Janovy, 1996). The relationship may be permanent, as in the case of lIpCwonns found in the intestines of manunals, or very temporary. as during the feeding ofmosquilOes. leeches. and ticks on their hosts' blood. Pa.rusilism is said to be obligatory bccaux the parasite cannot normaJl) survive if it is preven1ed from makina contact with th.host(Chcna.1971). 2.4.2 Modes of how symblonts get together All symbiotic auociations mu.'il meet the challetlge of maintainin. their rartncrships over SUCCCS.SIVC rentnbons. In The \o\cll-known associations bctv.(",;n bacteria and metazoans., 61 University of Ghana http://ugspace.ug.edu.gh the bacterial symbiont must 5IOftId1ow become intimately as.soeialed with its metazoan host There are three major modes of getting this accomplished . ("he firsc mode involves external transfer. Here, the metazoan host is born free of becterial symbiorns and becomes infected from the mvironmcnt . This appears 10 be the case of the tquid infections with VibrlofiscMTI . lbesquid is bom without mest OOclcri3 and IICquires them from the water (Nuholm el aI.. 2000). The 5econd method is horizontaJ (in1ertaxon) transfer. Here. the hosts are born frce of symbionts. but they atqui~ them (rom members of the pamlW generation. The third method. venieal lrVISfer, involves gametes (most commooly, the oocyte) carrying the symbionts 10 the nexlgod, lissue), yet gentle enough 10 preserve lhe target nucleic acid. Commoa lysis procedures include: mecharUca1 disruption. chtmicAl lrtaunent and enzyrnatie:digc:stioo. Mechanica.l dLomJpthlR methods include grinding. hypotortic lysis and freeze-thawing. Examples of chemical treatment methods include detergent lysis. usc of strona cbaotropic agents such as guanjdiniwn thiocyanate and cesiwn Irinuoroacctatc and thioJ reduction. Enzymatic digestion methods include treabnenl with either Proceinue K or Prona.tc. Cell membrane disruption and inactivation of intracellular nucleaes may be combined (Roche Applied Science, 2002). For instance. a single solution may contain detergents such as sodiwn dodccyl sulphate (SOS) to solubilize cell membranrs and strona chaotropic salts (for example. ethylene diamine letr:l IIcetate. EDT A) 10 inactivate intracellular enzymes. After cell lysis and nuclease inactivation, ccllullrdebris rnly easily bc removed by filtrationorprecipilalion. Methods fot purifying nucleic acids from cell extracts are often combinations of curoctiotvpreeipitation, chromatography. ctntrifuption. electrophoresis. and affinity ~paration, Solvent extraction is often used In eliminate contaminant!!; from nucleic acids. "ficr cell lysis. the cell debris and proteins are removed by denaturation and centrifugatiOll. T1w:re ~ sevcraJ mc1hods for deproteinilina the lysed cell suspension ;md ~ include shaking with a mixture of chlorofonnfiJOam),lak:ohol (Mannur. 1963), all)'mlltie cSt:p.btion of the proteins with Prona..-ce Of Proteinase K (flotta &. ~1. I96S).and shakinl with phc-nol (Kirby, 1968), The~noldisruPlScC'lIularinlegrit)'and dcna,u~, ptUteins. and the final extraction with chlorofonn remo\es traces of phenol Protein in tht I)'sa&e can also be so!ubili,cd by treabnmt '-"ith SOS, Degradation b~ ---------------------------------------.~--- University of Ghana http://ugspace.ug.edu.gh DNAse and divalent meta! ion contamination is prevented by the presence of cbclaling lIIeotsand by the actionofSDS (Mannur, 1963). The DNA is sel«lively precipitated by the addition of ethanol or isopropanol. Precipitation with aJcohol tervcs to concentrate the high mo~uJar 'Neight DNA whilst removing the small oligonucleotide! of DNA and RNA, dctcrgcnl. and the organic solvents used in the removal of proteins (Rodriguez &. Tail. 1983). A subsequent wash with 70% ethanol, followed by brief centrifugation. removes residual salt and moislun= . To further purify the DNA, the enzyme ribonuclease IRNa.c) may be used to digest !.he contaminating RNA (Marmur. 1963; Rodriguez &. lad. 1983). I[the amount of target nucieic acid is low. an inert carrier such as glycogen caabeaddedlOlhc:mixluretoincreascprecipitalionefficicncy 2.5.2 Restriction endonucleases The class of enzymes known as restnction endonucleases have played a key role in the developmtnt of recombinant DNA technology. These bacterial enrymcs possess an c.ndoauckuc activity which is directed to a specifw: sequence of bases in double-stranded DNA. In IWure, they serve to protect bacteria from the possible incorporation of foreisn DNA into their genomes by digesting such material (Arber &. Duissoix, 1962). The battnium's o~n DNA is protected by being methylated on A or C residues. whk:h mtden il UI\I\'ait.ble for digestion by its own enzymes (Smith & Nathans. 1973~ Watson~' "I . 1992). It has been sugested that these enzymes may also playa role in promoting "site- spccifk iIIeg,itimatc m:ombiR3lion", allowinB incoming DNA to be delved mel incorpon&edinaoiliectwmosome(Rodriguez&' ·fait. 1983). The ItTm MrestrictK>n- arose beaux it wa originally found that certain bacleriophqes "uuld not grow on certain biKlcnal "trains; hence they were -.aid to be restricted. Invcstigation of this phenomenon ~\"C'aled lhat it wa due to the actIOn of this class of enzymes (Arbcr, 1979). University of Ghana http://ugspace.ug.edu.gh ReMc,ion enzymes have been classified inlOthree groups. Types I and III enzymescany modification (i.e . methylation) MId ATP-dcpendent restriction (cleavage) activities in the same procein (Sambrook el 0/. . 1989). Type III enzymes cut the DNA at the m:ognition site and then dissociate from the substrate; \\hile Type 1 enzymes bind to the recognition xquenc:~ hut cleave at random siles when the DNA loops bock to the bound enzyme. Neither Type I nor III restriction enzymes are widely used for mokc ular biology studies. Type II restriction/modification systems are birwy systems consi sting of • restriction endonuclease that: cleavcs a specific sequence of nucleotides. and a separate methylase that modifies the same recognition sequence (Sambrook el oJ. , 1989). These enzymes recogniu: specific sequences of four to eight base pairs in length (Watson ~t lIl., 1992), The: 5a:jucnca in the ty,'() strands of DNA that are recognized by the enzymes possess • Iwo·rold axis. of symmetry (Rodriguez 4 Tail, 1983; Sambrook et 0/. . 1989), The locatioaofcLeavage sites ....i trua the wsofa dyad syrnrnrtry differs from enzyme to enzyme. Some enzymes make cuts which ate exactly opposite in the Iwo DNA strnnds. • .,1hIllbe ends are said 10 be 'blunt', Otberscleavc each strand at similar locations on opposilc sides of Ihe axis of symmetry. creating fragments of DNA that carry protruding single-stranded termini (Rodriguez &:. Tail. 1983; Sambrook et aJ. 1999). Restriction enzymes thIt cut specifIC sequences havc been is.olaled (rom several hundred bacterial strains. and O\'cr 150 different specifIC cleavage sites havc been found (Watson ~l oJ. . t 992). 1.0 order to simplify the naming of these enzymes, a nonk:nclature has been dcwlopcd that is b~d on an abbrni.ation of the RiIlIlC of the organism from Which the \'n~'mc was isolated (Smith &. Nathans. 1973: Smith, 1979). The first initial of the genus ~ thc first two iMia1s of the spcciC$ form the basic name This may be foUowed by a ~n designation. when the enT.) me is prescfU in a .specific strain, or. Roman numeral (0 --------------------------------------.68 University of Ghana http://ugspace.ug.edu.gh diff'emttialc ~ from the same source. For example, HM II is one of the three enzymes purified from the 5b'ain H«mophtllU MgypllclU and //inf I is the enzyme purified from lloemophilus influelfZQlt strain f (Rodriguez 8t Tait, 1983). In general. diffcrrnl restriction enzymes recognize different sequences. Ilowever, there a.re many C'xamples of mzymes isolated from different sources that cleave wilhln the same target Kqucnces. T'he:te an: k.no\\ n as isoschizomeT'S (Rodriguez & Tail. 1983; S3mbrook el 01 .. 1989). Examples of such enzymes are Hind Ul and Hsu I (Rodriguez &; Tail. 1983). 2.5 .3 Restrict60n fragment length polymorphis m (RFlP) Polymorphi:w of nucleotide scqucncclii in the DNA of SCH.:ral organisms has proven useful In distinguishing between strains and determining then relaredness to one another (pancrson.tHyypia.1985). t 'ntil recently. polymorphisms could be detected only if they were expresxd by difTel'C'ras in w behaviour of • prOle in, for example, by differenc~ in enzymatic Ktivicy IX electrophon!tic mobi lity. This situation changed dramalicaJly with lhe realisation that sites recognil£d by restriction endonuckascs could be polymorphic (WIllSon ttl u1 .. 1992), This is because mutations would haw caused me loss of sites at which. panicular restriction enzyme can act. Since these sites au only present in the genome of certain species, they an: polymorphic. 1'(l I~m. . )rphi$rhS dc1ccted in this way are known as restriction fragment length JXlI~ moqlhi!>lnS. RflPs can be u.'w.'d to characterize an organism. levels of lenotypic dl\<'nll" and phylogenetic rtla tlllmtups. They can arise from point mUlation.s leading to ('Ilhcr a loss or I pin of. site at which a n:-striction mciofll.acleasc acts, In addition. --------------------------------------.~--- University of Ghana http://ugspace.ug.edu.gh deleliom or insertions rcsuJtina: in variations in the nwnber o f tandemJy repeated ONA sequences can aller the k'ngth of fragments bctwttn two endonuclease sites (Watson" ai , 1992). Since DNA polymorphismscan occur in any type orONA sequence. it means thal an aJteration producing a resuiction frag.ment polymorphism JeQIIh can occur wilhin a coding ~uence of a gcne, ooncoding sequences (inlrons), sequences between genes, and e\lcn DNA with no known function. such as repetitive DNA. ONA polymorphisms can .ffect any pu1 of the genome and are thm:fore extremely valuable markers tRothwell.198S) Restriction fi'agmcnt length polymorphism (RFI .P) is most suited for taxonomic studies at the inlnlSpeCific kvel or among closely rdated tlUa (Avisc el al .• 1989). Prc-scncC' andlor abscnceoffrqmc:nts resulting from changes inrttognilion silesare used in identifying speclCS and in characterising poplllations . For example. if the total genomic DNA is digestcdtocompJetionwith8highf~u.:ncyrestriclionendonudcasc,anextremelylargC' number of fragments are produced that can be SCJ*IIled by electrophoresis and will show up as eilher weak or strong bands. When DNA (rom different species are analysed this ~")' the pettem of blinds will bediffC!renl. These restriction spectra will be !'Ipn:ific. not ONY (()f' individual restriction endonucleases but liso. (or the genomc:. ofthc different specic:s (A\'i.se ef aI. • 1989). Thus. RfLPs arc used fO measure gcnetic divergence between different populations or related !'Ipecics. The restriction-sire difference is etTectivdy a DNA difTcmtCc. so I measure o( the tolli l nwnber of RFLP differences repraents. mcasureofgencllc differences., and an: therefon: important inevolulionary 5IUdics(Griffilhs<,oI. . I9'l9) 70 University of Ghana http://ugspace.ug.edu.gh 2.5.4 Agarose gel electrophoresi5 Elccuophorcsis bas abo played an cS~'utial role in the elucidation of the structure, !OCqucnce and function of DNA. Electrophoresis through as.rose or polyacrylamide gels is lhe standard method used to separate, identify and to purify DNA fragments (Sam brook et oJ .• 1989). The: method is used for DNA fragments generated after endonuclease digestion. before or after enzymatic modification., ligation with other fragments. or after sequendns (PerbaJ. 1988). The technique is simple, rapid and c.pable of resolving DNA that cannot be separated adequately by other procedures, such IS density gradient cCl'llnfugalion (Sambrook el 01 .• 1989) A~. which is extrKted from seaweed, is. linear polysaccharide lSamnrook tl 01 .. 19&9; Pbannacia, 1989), It has larae pores and aJlows rapid run times. (t is easy to prt'PIItetndc.n be set into a variety o(shapes, sizes and porosities, and can be run ina number of different configurations. The choices \.\ithin these paramc1tn depend primarily on the sizes ofthc DNA fragments fa be separated. Although agarosc gels have lowtTI'CI01vina power than polYKr)'lamideacLs, tbey have a greater rangc of separation. DNA fragmrnts from 70 bp to approximately tHJO kb in length can be separated on qarosc gels of various concentrations (Fangman, 1978). The location of DNA within the gel an be dctcnnmed direetly by staining with low cOrl\:cntratitln~ of the fluorescent intercalating dye. ethidiwn bromide. Band~ containin, as linle as I-lOng of DNA can be Cd. the mobiliry ofhitth·moJecular-\\,cight fragmcnu of DNA incrnses differentially --------------------------------------~72 --- University of Ghana http://ugspace.ug.edu.gh Thus, the effedi~ range of separaliuo in agarose gels decreases as the , 'ollage applied is intreased (S::unbrook etQ/., 1989). A baiancc has to be struck berween resolution and SoCpanition. Low.molecular-weight fragments diffuse and arc thus best sC'J*'8lcd at fairly hig.h-,"oltage gradients. Large fragments. however, diffuse vcry slowly, and lhc best resolution IS achieved by using low-voltage gradients and running fot long times (McDonell elw. • 1971; Pha.macia, 1989) DNA molecules of larger than 50-1 00 kb in length migrate through agarose gels at the same rMc: if the dlrccrion of the electric field remains constant (Smith &. Cantor. 11)87: Sambrook et aI. . 1989). HOWf::vcr, because of Ihe sieving eff«t oflhe gel mltrix. if the direction of the electric field is .1tered periodicaHy. the DNA molecules are forced 10 change coone. The lime it takes for a molec:ule 10 reorient itself in the new electric field depends on its length (Smith ~ Cantor. 1978). Larger moh:culcs take longer to orient 10 !.he new ditec.tion of the facld: as a result pulse-field gel electrophoresis is uxd to fractionMe extremely large molecules of DNA, up 10 about 10,000 kb (Smith &: Canlor, 1987, Son>brook"aJ. . 1989). The behaviour or DNA in agarose gels. in contrast 10 polyacrylamide gels (Allet et oJ. • 1973). .. nol significandy affected by either the base composilinn of the DNA (Thomas &. Davis. 1975) or by the temperature at which the gel is run (Sambrook ef oJ,. 1989). Mosl q,&I'O$C gels arc run al room Icmperalure. because the rehuive clectrophorc:lic mobilities or DNA (rtgmcnts of di(ferenl si:t.cs do not vary between 4'C and JO'c. However,eels cQ,uainin& less than O. .sV. aprose and low-melling Icmpcralun: agarose eels are rather ru-y. and.llt: heSI run at 4'C, \o\o"herc they are stronscr {Samhrool ('I dl. , 1919) ----------------------------------------~73 University of Ghana http://ugspace.ug.edu.gh Ethidtum bromMk. a (luoresc;:eat dye that is used to detect DNA In agaro:st and polyacrylamlGc gels, reduces the electrophoretic mobility of linear DNA by about I(). 15% (PerbaJ. 1988; Sambrook eI al.. 1989). The dye intercalates between Slacked base peirs. extending the length of linear and nicked circular DNA and making them RlOf'e risid (Slmbroolt tt aI. . 1989). TbecompositionandionicstrmgthofeJecrrophoresisbuffendoaJso.ffecl the mobility of UNA (Sambrook el QI.. 1989). In the absence of ions, electrical conductance is minimal and the DNA migraaes slowly, if at all. In buffers of high ionic strength. electrical conductance i$ vny effiaem sod significant amounts of ml are gcncrukd. If O\.erhc:ating sboukI occur. the DNA bands will be distoned, the DNA denatured or the eel melts (Sambrook l!1 m.. 1989). Seven! different buffers are available for e lectrophoresis of double-stranded DNA. These contain EO J A (PH 8.0) with either Tris-acctate (1 AE). or Tris-borate (TBf-:), or Tris-phospbate (TPE) at a finaJ concentration of approximately 50 mM(PH 7.5-7.8)(Sunbrook ttal. . 1989). 2.5.5 Determination of DNA fragment sizes ~imalion of molecular Size of nucleic acids in either acrylam.ide or .garose gels after electrophoresis is important for the identifiulion and the dwacterisalion of restriction fragmen15and for studies of native and denaturcd DNA. To estimate the size of DNA fngmenlS from lhc-ir mobilitjl:S in gel electrophoresis. a rct.tionship is ntatKished between the mobilitil"S and the lengths of S1andard fraammlS (SouIhnn, 1979; Duggleby rf 01.. 1981; Elder & Southern. 1983). This ff:lationship is then used to c.llcWate the lengths of unknown frd.gl1lC11ts from their mobilities. The .xtU'K)' with whM:h fr.J.gmcnt lengths can be estimated dqJends on the lCCutacy of the --------------------------------------~74 University of Ghana http://ugspace.ug.edu.gh chosen relationship between mobility and length (Dugglcby ~I tJi" 1981; Elder " Southem, 1983: Rocbelleetal. . t985; Oertcr~tol. . 1990). Numerous methods have been proposed for grnphical and computer analyse! of the rclatiooships between mobiliry and length (Duggleby et 01.. 1981 ; Elder & Southern. 1913; Rocbcll el aI. . 1985; Oerter ef 01 .. 1990). The most commonly used method is the ploning of the logarithm of the molecuJar weight against the mobility of standard frq.mcnts. and e~tlmating the lengths of unknown fragments from the resulting graph (Duukby ~I uf . 1981: Schaeffer & Sederoff. 1981 ; Elder &. Southern. 1983), "The st.andud ctJrVe:oo ohtaincd from such plots oRcn show pronounced curvulUre which may introduce significant subjectivity into the interpolation (Duggleby el al. . 1981: Elder & Southern. 1983). As a result. linear models have been developed that more or less tit with expenmental data (Aaij & Borst. 1972; Duggleby el 01. • 1981 ; Schaeffer &: Sederoff 19S1 ; Eidu '" Southern. 1983). 2.5.6 Polymerase Chain Reaction (PCR) Jne pol) mcra...: chain reaction (peR) is an In l·lIm tcchni411~ for the enzymatic ,Implificatum of. specirtC DNA fragmmt of interest from small amounts of a longer mulccule, 1be technique was invented by KaIy Mullis (Mullis el 01., 1986: Mullis & Faloona, 19K7, ..1 Od was originally med to amplify human (Il)~globin DNA and for prerwaldia£l14.hlsofsidde-ce1l anaemia {Saiki flat .• 1'185; Saiki el al., 1')8h; Embwy et uJ .• 1987). A typical amplification reaction medium includes a thmnostal"lle UNA polyrrcrasc. oIiJOOuclrotidc: primm.. deox)nuclrolidc triphosphates {dNII)>J. rt"aClion buffer. "..~sium and optK:w.J additi\CS 3.nd a tl!mplate DNA. ~ reaction is earned OUI usins 75-- University of Ghana http://ugspace.ug.edu.gh an automated thennal qclcr (reactor). which takes the reaction through series of different &empetarurt's for varying time dunttioas. Each peR cycle thromically doubles the amount of wgeled template sequence (amplicoo) in the reaction: 10 cycles theoreticafly multiply theampHcOD by a f&clarofabout I thousand; 20 cycles. by a factor of more than InUllioninam.:ltterofhours(WhiletloJ. • 1989). Each cycle of PCR amplification consists of three phases (Fig. 2.5). The first phase is the dcnatucatioo or separation of the two strands of the duplex DNA molecule. This is ,l("l.'tlmrh!'hed by brieny healing the raction mix 10 temperatures of about 92·9SoC. Each ~mmd serves as a template on which a new strand is built. The temperature is reduced during tlk: s«onJ phasco so that the primer1 which are oriented with their 3' ends pointing towards each other can anneal to the template (White el 01 .• 1989). lbc temperature is then raised dwina the third phase to the optimum for the DNA polymerase to .dd nl)tleolides onto thI: mds of the anneaiC'd primers (cxlen.-;ion). The time or incubllion at Ihi:llcmperalute varies according 10 the kogth ortuget being amplified (Saiki, 1989). At the end oCme cyc~. which lasts ror a few minutes, the temperature is raised to 92-95°C for dc:nalunlion 10 commence another cycle. A typical peR of usually 25 to 30 cycles produces 8 sufficient amooot of DNA for further experimental procedures e.,. cloning. A major problem encountered in the original PCR procedure was thai the DNA poIymensc: (Uclwrich,o coli DNA Polymerase I, Kle:now f1ll8Jlleftt) had to be tqIIenished after every cycle becawe it is not stable at the high tcmpetlturc:s needed for denalur.uion (Saiki ~I oJ.. 11)85. ~ulli, &: Faloona. 1987). This problem was sol-.'ed in --------------------76 University of Ghana http://ugspace.ug.edu.gh ::"~ y.~~~'~.,~==T=====~- ..... - ~9~§~-~...f. . ---~ ,..,.._+--N tOil· 2.S S<:hemalic diagram of the PCR process (Protocols and Application Guide, lid Edllton. Promep Corporation. Madison. USA). University of Ghana http://ugspace.ug.edu.gh 1917 with the diJCOVC1)' or. heat-sc.ble DNA polymerase called Tuq (Saiki el al. • 1(88); an enzyme isolated (rom the thc:rmophiHc bacterium. Th~"n&U aquulicUJ. which inhabits hot springs (Chien el oJ .. 1976). Taq DNA JXllymerase has an optimal c xteos;on rate (polymerisation rme) of 35·100 nucleOl:ides per second at between 70-lifC (Newton & Graham. 1997). The higher temperature optimum for the Taq polymerase extension allo"'Cd the use of higher temperatures for primer annealing and extension. thereby incmtSlrtl the ovenll stringency of the peR reaction and minimising the extension of primers lhaI were mismatched with the template, The increase in the specificity of Toq polymerase t'eR1Jts in an improved yield of amplified target fragment by reducing the competition by nonwgct products for enzyme and primer.; (While ~, ul. 1989). Apart from Taq DNA polymerase, its cloned and modified version, Tth DNA polymerase from TlwrmtlS Ih~r1ltOpirilw. Pfo DNA polymerase from Pyroc(xCIJS ju,iosus and a (ew others can also be used for PeR (Newton & Graham, 1997). The annealina seep is an imponanl parameter in enhancing the specificity of a PCR reaction , USI.I&Ily. the temperalure chosen for annealing of the primer is a compromise (Saiki ~I 0/. • 19&&). lbis is because at lower temperatures anncaJing is more efficienl but there is • significantly increased amount of mispriming. At higher tcmpenuun:s, however. there is lncreased sp«ifieity but the overall efficiency of amptiftCation is ~d. Under standard conditions. the: annealing tempcr.llurc in a reaction should be S·C loWC( than the: mtlling tempnalUres (T,.) of the primers (Innis &. Gelfand. 1990). To tktcnnine the aprtu.'IOimale melting lemperatures of the primer-DNA duplex, the equation T", ... 4(G +C)+2(A+n (",,"",U " dUTP. C - dCTP. A' dATP md T· dTTP) con be used (W;nnac:k ... 1917). More CUC1 numbm; cun be calculated hy applying algorithms (Rychlik & Rhoads. 1919) rbli take inlO ac:tourI( the OC:cum:nce tlf IRtramolccular IRteraotfion. o;uch as. Mirpin University of Ghana http://ugspace.ug.edu.gh str'UC'tUr'eS. Normally, a series of reactions are set up 10 determine the opcimal lI'UlIeaIing tempcralun:. A number of factors influence the specificity of the amplification reactions. The SIJi.n8cncY of the annealing step can be controlled lD some extent by adjusting the .nnealina Icmpcruture. Minimizing the incubelion lime during the annealing and extmsion steps will limit the opponunitics for mispriming and extension by molecules of otherwise idle polymerase (Saiki, 1989). Reducing primer and enzyme concentrations also servcs 10 limit mispriming. particularly tbe type that leads to dimerization. Finally. changing the: concentration I~ls of MgCh can further improve specificity. either by increasing (be stringmcy of the reaction or by direct effects on the polymerase itself (s.;ki.1989). The PeR method can be used with a complex template such IU genomic DNA and can amplify a singlt-copy ge:oecontained therein. It is aJ50Clp.lble of amplifying a single moleculeoft.:u-gct ina complex mixture of ON As and RNAs (Saiki erat., 1988) and can. undeT cena.in conditions. produce fragments up to )5-42 kbp long (Barnes. 1994; Cheng rt aI., 1994). The starting material, the template DNA may be single strandctJ or double wanded DNA. The DNA to be amplified should be free of contaminanls or non.target materials 10 avoid their amplification ...t her ttw. the target DNA. Nonnally, subnanogram quantities of the template DNA arc used for PCR. The PeR is '*'" in a wide "'ariety or fields including : molecular bioLoa;y. environmental Ktcnce. formsic: 5Cien~. medical Kienee. binh:\.'hnulogy, microbiology. the rood indu.\'t). di3.¥J'OSl.ic science. epidemiology, ~C1lC'tics. gene ck>nina. and many more. II h.u been I.l'C'd In the diagnosis of l!CI'ICIK: disorders (Saiki tl 01.. 1985; 1981). the analysis -----------------------------,"9 --- University of Ghana http://ugspace.ug.edu.gh of .UelM: xquence variation (Sambrook el aI., 1919), and analysis of mutation or any research that invoh'CS the rapid cloning and ~uencing of homologous DNA fragments (Gyllensten, I <189). II bas also been used for the analysis of individual identity in forensic samples by the amplifKalion of highly polymorphic DNA regions (Higuchi el 0/., 1988), and the examination of nucleotide sequences from ancienl preserved specimens (Paabo el aI., 1988: Pabo. 1990). Th: PCR has been applied to difficuh problems in dtvcloprnmtaJ biology. For nampk, peR of eDNA has been UKd 10 study .V-J regjon combinations in the T ~C'II rcreplor 5-ctWn (Loh tl 01 .• 1989) and the ('xamination of the mRNAs for growth rectors in smaJl nwnbns of mocrophagcs isolated from wounds aclivelyundetgoingbeaiing(Rappoleeeloi., 1988). University of Ghana http://ugspace.ug.edu.gh 2.6 Mole cular Phyloge netic Ana lysis \iolo.:w.r phylOKeoy is me study of evolutionary relationships among organisms by using tcchniques of mokcular biology. It began at the twn of the century, even before Mendel's I.lWS "en: disco\"cred in 1900. ImmwlOchcmicai studies had shown that serologIcal cross-reactions were stronger for closely relaled organisms than for distant!)' relakdones. The evolutionary implications of these fIndings wen: used by Nultal (1904) to infer phyLogenetic relationships among various groups of animals. In the 1960sa 1970s the study ofmolecularphyJogeny , us ing prolcin sc-qucnce data. prosres$Cd Il"mlCndously. Less expensive and more expedient methods, such 15 protein electrophoresis., DNA-DNA hybridization, and immunological methods. though less accurate than protein sequencing, -were used to study the: phylogenetic relationships amana populations or dO!Jety related ~ics. The applacation of these melhods stinaalalQi the dtveiopmcnt of measute!J of gC1"'lCtic distance and tree-making methods (Fil<:h'\ Morioliash. 1961; Nei. 1915; Fel>enstein. 1988) The rapid accumulation of DNA sequence data from the late 197~ has had a grrat impact on molecular phylott'ny" DNA sequence data are not only more abundant, but also easier to analyze than protein sequence data. Thus, they have ix'Cn used in a wide ranac of applatiOlll. R.-consttuc1ing the ancestral gene sequences from which c:..tanl genes are derivul; srudying the origin and epidemioJoKY of human dixaxs; inferring the evolution ofccoiogic.al and behavioral trait' through time; C'Stimating hiStorical biogeographic rd,1llonships; prioritizing the conscn:al lonofendangered spccies; and reconstructing the: historical relatW>nsIUps acrouaJl of life (Wocse. 1987; Hillis, 1(97). --------------------------------------,81 University of Ghana http://ugspace.ug.edu.gh Whatever the application. all phyloge:ndic analySts require a crilerion (known as an optimaJity criterion) for assessing how welllhe d8Ia fit candidate trees, a method for searcbing among possible solutions for the nee with the best fit to the data. and a method for assessing confidcnce in lhe results (Hillis., 1997). 2.6.1 Molecular phylogeny methods MoilxuJar phylogeny methods allow. from a given set of aligned sequences.. the suggl:Stlono(phy~ctrtts~nichaim'lrttonstructinilhehistoryorsuccessive di\'crgmccs which lOOk place during the evolution betW!!I!" the considered sequences and 1.6.1.1'hy~enetk:trees All life forms. both extant and extioct. have I common orisin somcy,hcre in the past. (rom whieh they C'\'olvcd inlO \he plethora of spec ies found today. ThUs. all animals. plants. and baclcriaarc rcllted by dcscent to cacb other. C losely rclalcd spe:cies share a fI'IOfe ft'Ccnt common ancestor than distantly related species. The objectives of ph)"logcnc:tic Sludiesare to: I) ftt(lnstruct the correct genea.logical tics bet:wecn uq;anisms, and Ii) estimate the time of diveraence ~l"'ft1l organisms since lhcy last shared a common ances.tor(Li&Gnaur.1991). The evolutionary ~lationships between species can be represmted by • phylogenetic tree. Thi.s is • gnph COI1lpOSC.'d of nodes and branches in which onl), one branch Conntf;U any .tj.:cm nodes. The nodes rqnscnt taxonomic units, while the branches connttlin~ thcm. refkct thrit R'lationships in terms of descenl and ancestry. A phylogenetic tree is chuxkriscd by its topology (form) and its k:ngth (sum of its branch lengths) Iti a: 82 University of Ghana http://ugspace.ug.edu.gh Graut. 19911. The topology is the pattern of branches found in a tree. The branching. pmtem (often caJled branching order) shows lbe ,('ncalagy of the organisms. That is. it shows which species !bare more common ancCSIfy with which others The branch length is commoa.Iy used to indkatc some ronn of evolutionary distance represented by thli brtilCh. The BCluat still existing taxonomic units are often called operational taxonomic Wlics (OTUs); a generic lenn that can represent many types of comparable taxa. for c,\;unple .• family of organisms. individuals of a single spec~s. a set of related genes or even gene rqjons. are represented by nodes on the lips ofthe' branches.., called external nodes (Wcillcr el m.. 1995). The other nodes arc called internal nodes. Inlemal nodes may be called hypotheTical taxonomic units (HTU) (0 emphasise that they are the hypothetical progC'nilOf1 orOTUs (Weiller tl ai. • 1995). Phylogenelic trees can be either rooted or unrooted. A tree where 8 special node indicating the common anceS10r to all oros is present is called a rooted tree. An untOOted tree le.ves the position of the common a.nl,;l,:~IOf unspecified The number of bir..c.t:insrootcdtrces(.Va)fornOTIJsisgivcnby ror,,~ 2(Li&Graur, 1(91). The number of blfun:ating unmuled tn:c:s (Nu) for tr ~ 3 is . (20-5) .~IJ= 2- ' (n_3)! {PeM} dul .. 1982: li & Graur. 1991) -------------------83-- University of Ghana http://ugspace.ug.edu.gh A SfWcie$ rue is s phylogenetic tm: reprcxoting the cvolutionary pathway of. aroup of sptCte3. When a tree is construCted from one gene from each species. the inferred tree is somcti~ called a J1:nW trn (Nei, 1987). These two arc not necessarily the same. as the divergmcc of the gene can .ctually predate the divergence of the species. This is not a problem when k»ng-term evolution is studied. Another problem is that the topology of the gme ttee does not nccessarily COlnclde wilh that of the species (li & Graur, 1991). 2.6.1.2 $earchalgorithms ()OI;cacriterionhasbcmselectcd for evaluating the filofdalatQtrces. it is necessary 10 scatth among the universe of possible trees for the: optimal solution. A number of a1gorilhms for ~1111311Og phylogeny from DNA sequences exist, and it not aJv.ays clear what the: sumgths and weaknesses of each method are, or which should be preferred in a given situation (Kuhner & felsentein. 11)')4). For a small number of taxa, there are few po$$.ib~ &rec topologies (branching arrangements), but the number of distinct tn:n lncrcaes rapidl)' as • fWX:llon of the number of wa. There art only three branch.ing OfJCI'l for four Wta, but there are more than two million ditTertni trees for 10 taxa. and mo~ than 2 x IOIIl different tr~es Cor 100 taxa (Hillis. 1997). For smaller data sets. there are two computOltional procedures. or algorithma, that are guaranteed to find the optimal1ltt(s) for a particular criterion , One is simply AD ex.baustive evaluation oCall distinct ~ (Saitou 4 Imanishi, IIIIN). but thi, can be \'ery timc-consumin& and is f"II'Cly feasible for more than about 12 taxa (Hillis. 1997). The ' __O Dd·bound' a1gorilhm (Hendy &< Penny. 1982) is • -.mil algorithm dUll will fitld the exact solul ion(.) (or 20 or mure c.ua, dependina on the computational ~un::n and Ihc '1'IOt51ncM; uf the 1 in the C&K of very small data sets. In addition, point-estimation mtthods pro\'idc no Uldication or how many other trees (Of Yo'bicb trea) may be JUSl IU good. or c\'cn bencr. esUmlittsorthepbylogcny r h~' t~'O moll widely used point estimation methods are stepw15C addition and neighbor- I'llnlng (llillis., 1997). The former method adds taxa one by one 10 • vowing tree, ""hcrn.s the lifter method starts with aU taxa in an unresolved 'sw' and adds the interNl hr.and-=s to the tree in a stepwise fashion. Both methods find Jimilat tc .. ults (or .. mall trtts. but Slepwilr addition usually finds better solutMxl.s (and is computatlonall) .. Ii~tly ------------------------------------,~ University of Ghana http://ugspace.ug.edu.gh sJower) for large data SICU. Once. poi.nl-estimale of a tree has been fotmd. the estimate usually can be impro\-ed in a procedure knoull as branch-swapping (Hillis.. 1997). There are several different types of branch s. ... appang. bul all of them involve rearranging the branches on .... initial tree: 10 look for IOPOIogtca1Jy related tn.~s thai areas good or bctterphylogenetic solutions. If branch swappina fUlds an equally good or better tree. then the bnnch swapping is conlinuedonthenewtrec. 2.6.2 Tree construction methods Numerous methods for constructing phylogenetic trees have been proposed (Sneath IL SoUl, 1913; Fclscnlein. 1981, Nei. 1987; Swofford &: Olsen, 1990). These methods can bcclassifiedinlotwomaintypes:distancemalflxmcthodsanddiscretc-characlcrmcthods (Li" Grauer, 1991), In distance malrix methods, evolutionary distances (usually the number of nucleoeidc or amino acid substitutions separating two taxonomic units) are comput.:d for all pairs of data, and a phylogenetic tree is constructed by using an al.orithm based on some functional relationships among the distance values. In dixrete- cblnctCT mdhods. chanel" stales (e,g .• the nucleotide and amino acid at a site) are used. and tbC' shortest ,.lhWll)' leading 10 these character stales is ch05en as the phylogenetic tree. Allct.raccenan:analyscd sepa ...t elyand usually independently from eKh other (Weillereta/"I99S) Some types of molecular data (t'.g. , DNA hybridiZilion data) exist only as dislanC'e data; therefore phy~nctic trtts for these data can be consuucted only by dist1ln\:c methods, nycontrast.di~-characterdatacanusuaJlybecoD\·('ncdil1l{)diSlanccdalallndthU5 Ih ...~ can he analyzrd either by distance mdhods or by discft:tc-char.Krcr methods. !C, University of Ghana http://ugspace.ug.edu.gh There are two mljor groups of discrete character metbods. namely. maximum plU'SlPlOny mclhods and maximum likelihood methods. Two typeS of errors can occur in the construction of phy10genetic trees: topological errors and branch-length errors (TaleJlo ~I .1,1982). 2.6.2.1 Discrete character methods I) M._lmum ~rslmony (MP) and weighted parsimony (WP ) The theoreltcal blSis of the MP method is William of Ockham's philosophical idea that the best hypothesis 10 explain "processis meont that requires the snmllcst number of assumplions (Sober. 1988). The method was first proposed by Camin and Sakal (1965) as a method for lttonstructing phylogenetic trees from morphological data. Later modiflCatioos edjust.ed the MP method for amino acid (Eck &:. Dayhoff. 1966) and nuclcocide sequences (Fitch.. 1971). The latest modification utilises dynamically modified chancter weightina (Williams &: Fitch, 1990) and is caUed 'y,.-cighlcd parsimony' (WP). A maximum parsimony tree is !.he tree lha1 requires the smallest numl'>cr of evolutionary cha.nies to result in the: lei OTUs under study (Li & Graur. 1991; Hillis. 1997). It is often possible to find more than one ~ with the same minimum number of changes.. so that no tJruqUC lrtt can be inferred. Parsimony considers al l sites. but nol all sites convey iofOl'lD8tion regarding the most parsimonious tree. Only sites thai favour some topologies ova-othcnare informative and. consequeotly, onlythC3C silesarc used in the ealcu1ations. For molecular sequence data. sites are informlfive only when [here ~ al least two different k.inds of residues at the site, each of which is reprexntcd in at least two of the srquc:nc:cs under study (li & Graur. 1991; WeHler ~I al., 1995). This usually I"eICI'icbdralicaUywnumbcrofposilionsuscdiothelUlaJysis University of Ghana http://ugspace.ug.edu.gh InOC'der 10 fiDCl the most parsimonious tree. first . all the informative sites are identified. Nexl, Iht minimum number of substitutions al each informative sile is calcuJated for ali possible tree topologies. In the final step, the number of changes over all the informative 5ites for each tree is summed up and me ~ associaled with the smaJlesl nwnber of 5Ubstitutiooslschosen(Li&Graur, 1991 ) The tWKIard MP method weights all changes equally although for sequence data this is not al"'..,.5 desirable (We-iller el 01 •• 1995). Firstly it is sometimes desirable to weight different sequence positions differentiaUy in order, for example, to emphasise the few changesatconscrved positions rather than the several at variable positions. Secondly it it often advisable to weighl differenl types of changes differently, forexamplctransversions are usually morc siarurlCatlt than transition. . , and certain amino acid substitutions mon: tt.n otbm. Recendy. Williams and Fitch (1990) introduced the dynamically weighted parsimony method (WP) which uses an initial tree (seed lree) 10 assign different weights to different Iypes of substitution and/or sequence positions. It then uses this weighting scheme to generate a new tree. This process is repeated until the same best tree is obttinedintwocon.sc-cutiveruns. The: nwnber of possible topOlogies increases very sharply with the number of taxa, making computer time a limiting factor for calculations on more than • few sequences. The "branch and bound" method (Hendy and PCMY. 1982) can limit Ihe number of topoloa.ies that have to t'C tested. and still find the optimal tree. Even with this optimisation. the method is limited to a small number of sequences. For larger sets., hl:uristic methods must be uscd. but these cannot {!uarantee 10 find the most parsimonious tree. Panimony is also very sensitive 10 tDeqUal ~ of n'olution in different bnncbes University of Ghana http://ugspace.ug.edu.gh in the (Fclxnstcln. 1978) Since it is not possiblc to corrcct for superimposed lrtt cvolutionar)' cvents (where a given nucleotide position bas changed more than once since thc lWO sequences diverged), tbe prescnce of distantly related lineages can cause distonion. The cffecu of the W"lCqual rates of evolution should be solved in the cvolutionary paDimony md.hod proposed by Lake (1987). but this method has been criticized for depending on a number of unreaIistie assumptions (Jin &. Nei, 1990) and for underestimating branch lengths (Olsen, 1991). Information on evolutionary processes may be incorporalCd by weighting character.;: differentially (such as frrst versus third positions of codons), or by weighting character state changes differentially (such as tr1IDSitions~1rIDSYersions)[HjJlis,1997J. II) Mulmum m'~ljhOocl (ML) The maximwn-likelihood method for recooslrUCling phylogenetic relationships has become wry popular because of its main advantage of application of a well-defined n 'olutionmodeltogj'tlendataset(Felsenstein, 1988). The MI. method was originally proposed by Cavalli-Sforza and Edwards (1967) for gene frequency clara. Later. Felsenstein (1981) developed 8 ML algorithm for nucleotide sc:qucnocs. Mnximwn-likelihood is similar to the MP method in that it cxamines every reMOnIIbletlttlOpOlogyande\'alualesthesupportforeachbyexamlningc\,eryscquence posItion Phylogenetic tree iafemx:e baed lin the method or maximum likelihood is appealing from both bio1ocaJ and statistIcal pcn;pecti\'es. The maximum likelihood method tries University of Ghana http://ugspace.ug.edu.gh 10 infer thc topOJogy that is most c:onsistent with a sci of observed dala (FeI$CnS1Cln. 1981; Strimmn '" H~lcr, 1996). The possibility that a liven lopolo&y will produce the observed sequences is calculated for aU or many possible topologies (Swofford It Olsen. 1990. 1996). In order to calculate these possibilities, a concrete model ofthc evolutionary proccss that converts one sequence into another must be spcciried. For example, all nuckotides are assumed to be equally frequent and the probability of change of any nudeotide '0 any other nuclC'Olide is a.c;sumed 10 be the same in the Jukes·Canlor model (Mount. 1999). For each possible tree. the likelihood of finding the actual sequence changes ", each. column in the aligned sequences is calculated. The probabilities for each aligned position ate then muhiplied 10 provide 8 likelihood for each tree. The tree which provicks the: maximum likelihood vaJue is the most probable tree. Just as for lhe parsimony approach. this method requires much wmputer lime and can only be used on • limited nwnber ofscqucnces. However, il is possible to speed up calculations by paraUelizing the aJgorithm by using approximative techniques (Adachi &. HllSC'gawa.IQ94 ; OI~tla/. • 1994) I) Transrormatlonofsequencedatatodlstances Although st'quences are fWldamcnlally charKter data, phylolenc1ic analysis based on sequences IS often prtCCded by a prior reductton 10 distance values relating all pairs of .sequc:nce5. Many fast methods arc available '0 construct evolutionary trees from distance dala (Nci, 19&7; Gojobori ~I 01., 1m). Moreover, the usc of distances easily allows tOrm;:IH)ns for multiple mutations at the same: Mle. which are mone difficult to take into accowllwhen using character data (De Rijk., 1995). ---------------------------------------91 --- University of Ghana http://ugspace.ug.edu.gh la the distance method. a matrix of distance scores among all of the sequences is first drawn. ~ distance score is a measure of dissimilarity between the sequences. so that the: less similar the sequences, the higher the distance score between them (De Rijk. 1995). Another Vo'3.y of Lhinking about genetic distance is that it represents the minimum numberofcMnaes. including substitutions. insertions and deletions lhal are required to convert one scqucnce inlO the other. One of the most common ways of swnmarizing the relationship of t wO sequences [0 a number is their fractional similarity. In its simples. form. st!qmm('(~ similarity is the number of alignment positions containing identical re3idues divided by the number of alignment positions compared (De Rijk, 1995). ACcoWlting for gaps is the biggest problem in !.he caJcwation of similarity. Sequence comparison analyses can provide either .:Iimilwity or distance scores, depending on the scoring ~'ll'm used (Smith &. W;ucnnan. 1981; von Heijnc, 1987). SimHarity scores also may be converted to distance scores, baed upon assumed models of evolutionary change in sequences (Swofford & Olsen. 1990. 1996) Adcfitive distances can bc: fitted to an unrooledtree such that.1I pairwise distances are equailO the sum of the branch lengths that connect lhc OTUs (Dc Rijk, 1995; WeHler Itt oJ., 1995). Uhrametric distances arc the most constrained. They will precisely lit a tree so that the dislanc~ between any two taxa is equal to the sum of the: branches joining them. and can bc: rooted so that all taxauec:quidistant from thc rool (De Rijk. 1995). It) Unweighted paIr group method wtth arithmetic me,." (UPGMA) Ttk: Wlweighl",1 p.ur group method with arithmetic mean (UPGMA) is the simplest method ror In.''!: n=construclion. 11 .... '&5 originally developed for constructing taxOnomic ------------------92 - University of Ghana http://ugspace.ug.edu.gh phrnograms. i.e .• U'CeS that reflect the phenotypic similaritieS between OTU, (Sakal It Michener, 1958), but it can also be used to construct phylogenetic trtt'S if the rates of evolution are approxiRllltdy constant among the different lineages, so that an approximately linear relation exists ~ evolutionary distance and divergence time (Ne;. 1975). For such a relation 10 hold. linear distance measures such as the number of nuclcotidc {or amino ac:id) subSlitutions shouJd be used. The trtt is conslructl-d by a sequential clustering algorithm (Sneath cl Sokal, 1973). Within (be matrix. the two OTUs with the smallest distance (i.e .• most similar) are clustered inlo a composite oru. which will then be treated as a new single OTU. A new matrix is creased, where the distances of other orus to the composite OTU are c3kulalN. This process is repeated until only two orus remain. These are cluslered togcthcr,andthcrootisplaccd.thalfthccalcuJateddistan<"cbc:twemthetwoc!usteB The UPGMA caJculales the dislatlces to a composite 0111 IS the arithmetic mean of the distances between the constiruent orus in each compo~te oru. If a simple average of the dislancC$ hctwecn the composiCe on;s is used, the method is called ~ig.hted pair group TIklhod with arithmetic mean (WPGMA) [Sneath & Sokal. 1978J. If the: assumption of I"IIC constancy amoag liocagcs does not hold, UPGMA may give an erroneous topology. The topological errors might be remedied. howevcr, by using a correction called the t~nsformt:d distance method (Farris. 1m; Klot~ 1I 01. • 1919; li, 1981). "lhismethodusesanoutgroupasref~ lofMkecorrutioasforuncqual rates and then applies the UPGMA to the new distance INtnx 10 infer the topology or the tree. T1lc outgroup is an OTIJ which. hued on C'Jf.tcmaJ knowledge such a.. . tnonomic or 93 University of Ghana http://ugspace.ug.edu.gh paleontologic informMion.. dearly diverges before the common ancestor 10 all other ow, (De Rijk. 1995~ 11 is oftcn difficuh 10 decide which of the WI is an outgroup. To overc:omc this problem. Li (1981) proposed. two--stagc approach which first infers the root using UPGMA, and then UICS the taxa on either side of the rool LS an oulgroup to correct the distances among lincagcsonthcothcrsidc. I) Heighbour,"'tionmettlOdS The neighbour-joining (NJ) method (SaiIOU &. Nei. 1987; Studier & Keppler, 1988) constructs the tree by sequentially finding pairs of neighbours. which are the pairs of OTUs connected by a s;ngle interior node. 1be method is conceptually related to chISler analysis. but mnG\'C"5 the assumption IhII data are ultrametric (De Rijk, 1995). In other words. il doe:I not requin: that all lineages have diverged by equal amounts. As it does not ;Jnmlpt to clusttr the most closely related Ol1Js, but rather minimises the length of aJl intemal brlJlChes and. thus, the length of the entirc tree, it can be regarded as parsimony applied to distance data (WciUer tlol., 1995). lnoontrasl tocluslcranalysis. ncighbow-joining keeps track ofnodesoflhc tree rather thin taxa or clusters of tau. The NJ algoritJun SIartS by assuming a bush like Irec thai has no iDtemaI branches. In the fn step, it introduces the first internal branch and calcubles the length of the rrsulling tree. The algorithm sequetltiaJly connecb every possible OllJ. pair and finally joins theOTIJ·pairtlw:yiekbtheshortesl trce(WciliereloJ. • 1995). The length of a brancb joining a ".u of neighbours, X and Y. 10 their adjacent node, is the aVttagC distance ~ all OTIJs and. X as well as Y, Icsslhc: avcrqe distances of all 94 University of Ghana http://ugspace.ug.edu.gh remaining OTU-pairs. This process is then repealed, always joining two OTUs (nc;ghboun) by ;ntrodudng the shortest possible internal branch. N) F'rtc:h and Hilrgoliash (FM) method The FM a1gonlhm (Fikh 4. Margoliash, 1967) initially uses the same clustering method as UPGMA and thus the initial topology is aI~ the same &S in UPGMA. but the branch lengths art calculated differently. AU trees with closely related topologies are then C'xplottd and compM:d in Irons of the so caJJed 'percent sumdard deviation', This is essnui.Uy • IDeIlStft that assesses how well the distances in the matrix correlate with the branch length obtained in the tree (palristic: di.stanc:cs). The tree wilh the smallest 'standard deviation' is chosen. III) Olst:lncf! Wagner method (OW) As the network reconstnJ(:ted from character state data is often called a Wagner network. Farris (1972) caned nis method for reconstructing WlI'OOted rrecs from distance data "Oisunce Wqner". Like the FM method.. the OW method ~s a diS1ance matrix to analytc: many possible trees. Like many clUSIcrin, algorilhm.s., the OW method firstly combines the two mOil closely rtlated 011Js. As the network grows, it tries to fil all remaining oros in turn into any of the possible edaes. choosing the one that can be connetted with the shortest branch. This method minimises the total length of each subtree u it is buill (W~iller el ai., 1995). As it depencb ~ry much on correct estimation of e\'c:ry bnneh length. it is vel') suscC:plible to SllDplint errors, Improvements to the .i.lSorilhrn have beco suqcsted and these yield the modified Farris method (MF) [Talmo ""'. • 1982]. --------------------------------------,~ University of Ghana http://ugspace.ug.edu.gh Iv) Minimum evolution (ME) The nuwmum evolution (ME) method ( Edwards &. Cavalli-Sforza. 1963; Saitou & lmaniw. 1989; Rzhc1sky &: Nei. 1992; Edwards. 1996) is a diSlatKe-ba.sed algorithm. The method rust computeS each branch length ofa given tree topology by using Fitch and Margoliasb's (1967) procedure for branch-length estimation, and the tree that shows the smallc:st sumofbranch lengrns is chosen as the: best o nc. lbusthecri1erion used in this method is the sum of branch lengths (SBL). Rzhelsky and Nei (1993) showed thai the expected valuc of SBL ls smaJlesI for the true topology when unbiased estimates of pUwix distamC5 U'C Wtd. HO'A.'CVCT.lhis result does not mean that the topology with the smaticstSBL value is the most probable tree (Nei, 1996). 1be pnnciple of minimum (',olulion, which implies search for a tree with minimum mutations to explain difTerences observed between sequences, was proposed by Cavalli- Sfona and Edwards (1961), who eonsideRd II Steiner tree. With the use of Filch and Margoliash's prcxedwc for branch-length eslimalion. compulation becomes much simpler lhan the c.vaJ1i-Sforza and Edwtnls method. The ME method also seems to be relalcd to o.yhoff's (197&) method (Blanken el al., 1982). Although Ihis principle bears some ~Ianoe 10 mal of maximum parsimony, in lhal it seek.!: the: tree with the lowest overall change in chanclers, it differs from parsimony in that 'change' is adjusted 10 accoWlt for inferred superimposed events, using Il model ofevolulion (Hillis, 1997). The method ls similar 10 the NJ method bccau:te the principle of minimum evolution is also adopced in the NJ _ (Sail,," .It Nei. 19M7). ond because both the ME and NJ methodsaredLst:.ccmcthods. Simulalion results have sho""n that Ihe ME method and the NJ method an: indeed quite similar. However,lhc ME method is an exhaustive-KarCh method and examines.1I possible trees to choose the: best one . In contrast. the NJ method --------------------------------------'96 -- University of Ghana http://ugspace.ug.edu.gh is a scepwise clustering method, and the computational time is much shorter in this lIlCtbod than in the ME mcthod 2.6.3 Accounting for superimposed events (Nucleotide substitution in a DNA sequence) A ta\ic process in the evolution of DNA sequences is the change in nucleotides wilh IUnc . However. as the process of nucleotide substitutions is extremely slow, to detect evolutiooary changes in a DNA sequence, compAmive methods whereby a given sequence with which it shared a common ancestry in the past are used (Li & GraUl. 1991). To study the dynamics of nucleotide substitution, several assumptions regarding the probability of substitution of one nucleotide by another are made. Numerous such mathcmeticalschemes have been proposed (Li&.Graur. 1991). The subllitution scheme of Jukes and Cantor's (1969) model is onc of the most frequently often uxd ones to correct for multiple mutations per site. This model swts from the assumptions thai all subsl:itutions are independent. that aJi 5CqU(ncc po~ilions are equally subject to clwlge, that substitutions occur randomly among the four types of nucleotides, and that no iluertions ordelctions have occurred . For example. iflhe nucleotide under considention is a. it will change to T. C. or G with equal probability . Since the model involves only one paramder. it is also called the one-parameter model (L i &. Graur, 1991). ~ on Ihese pre-assumptions, the authors derived an flIU3tion for estimating evolutionary distances from observed dissimilarity. Sncnl otbtt equations for the estimation of evolutionary distances have been proposed. For e:umpit. K1mur. (19&0) has provided a method for inferring eVOlutionary distances ------------------------------------~97--- University of Ghana http://ugspace.ug.edu.gh based on a model of evolution in which uansitions and transvenions may occur al ditTerentrat(..'S. ltisthtreforeof'tenca1ledthe'twoparametcrmodel'. Oo)ding (1983) has shown that applying ~. Jukes-Cantor correction 10 sequences composed of sites with unequal evolutionary rates leads to an WlCiercstimalion of large evolutionary distances. As a result, distant species may seem closer than they actually Ite aod this can cause artifICial clustering of long branches (Olsen. 1987). This artefact can be avoided by con\'crtingdissimilarities intoevolurionary distances according 10 Jin and Nei (1990). 'IlleY asswne thai there is a gamma distribution of substitution rates over the sequence po5ilioM. A general equation for applying this idea in the correction of distances was proposed by Rzetsky and Nei (1994). This equation conlains a parameter a which depends on thcdalaset understudy. OlbcTcquationsan:bascdClnsubstitulton models in which the four different nucleolidcs art no( uscc:I: in equal proportions (e.g. Tajima &. Nei. 1984). or where a bias in the dim:tiooofchmgc is accounted for (e.g. Tamura&. Nei.I993; Zharkikh. 1994), An imponant drawback of most ofthesc models is that lbey do not consider dilTereocl!" in substitution rat~ among the sites of a molecule. 2.6.4 Auessing the reliability of II tree (confidence In phylogenetic estimates) All methods oftrtt construction make: assumptions. \\-hen these a.'.sumptions are not satisfied. systematic: errors can be introduced. h'm if .U twumptions arc satisf.ed, random erTOrS may occur due to the use of. limited data~I , Methods are thcref<>f'r: ne-edcdfortestine:lhestatistkalsipi6ca:oc:eofanygiventree(Hillis. I997), --------------------------------------98 -- University of Ghana http://ugspace.ug.edu.gh 1berc are two different types of methods for testing the reliability of. tree constructed. One is latest the topological difference bern'een the tree and its ciolc:ly related tree by using certain qlWltiticS such as the likelihood value in the maximum likelihood method (Kishino &, H~gawa, 1989) and the swn of all branch lengths in the minimum evolution method (R2hetsky &: Nei. 1992), This type: of lest is supposed 10 examine the reliability ofevcry inleriorbranch of the tree. and it isgenernJly • conservative test. The pcocedure of the lCSI is I1SU&Jly quite complicated and requires considerably more computation. The other type of test is to examine the reliability of each interior branch whether it is significantly different from zero or DOl. If a particular interior branch is not significantly different (mmura. the possibility of trifurcation of the branchcsassociated or even tbe other'typesofbifuccatingtrees that can be generuted by changing the splining order of the tIuu branchel involved cannot be excluded (KWT\II el oJ., 1993). There are rwo different "',,),s of tcstiD& the reliability of an interior branch. One way is to compute the standard error of the interior branch and lest the deviation of the hranch length from 7.cro, and the othC'CiJlOusethebootrtraptest(Efron,1912;}'elsenstein, 1985) When the sequences do oot provide enough phylogenetic information (e. g. sequences are too short or IKking in variation), no algorithm will prOOlK:C sensible answers, One way loevaltmtewhethcrtbett is enough phylogenclic sianal in the data is 10 apply tests like ';':kknifing or bootstrapping (Efron, 1981, I-dsenslein. 1985; li &: Gouy, 19(0), Both methods indicate \o\oht:ther smaller sample sizes would result in the same tree. -------------------------------------.99---- University of Ghana http://ugspace.ug.edu.gh CHAPTER THREE MATERIALS AND METHODS 3.1 Che miCAl 's, Ru ge nt5, Equipment and Software The sources and/or manufacturers of the chemicals and rt'tlgcnts. software and amine analysis servers. and equipment used for the study arc listed in Appendix I. T'he various bufTasand solutions used were prepared as described in Appendix II. 3.2 Biological Specime ns and Sample Collection Mosquito larvae and pupae samples were collected from six locations, separated by • few kitometre5, in the Greater Accra region of Ghana. The areas were as fo llows: Madina (S'4Q'N, O'9'W), Adm. . (S'42'N, 0'9'W), Dzorwulu (S'J7'N, 0'12'W), Ach;mola (S-3S'N. OOI4'W), Legon (S039'N, Oog'W) and Airport Residential Area (S036'N, O"II'W). The &lot.J. positioning system (GPS) was used to determine the geographical coordillAtesofeachbm:dinisite. Anophr/u moequico breeding sites ",ne identified by sampling of small pools of stagnant water in the open and in sutters. exposed CO sunlight (Figs. l. la, b. c and d), The Dz:orwulu and Airpon Residential sites were characterized by small shallow pools of SUIgJlant rtinwalCr. At the Adenta. Madina and East Legan, the sites ..... ert mainly open drains containing stagnant waler exposed to sunlight. and these contained Il mixtur~ of bach Anopheles and Culex pre-adult stqes. The site at Ac:himota was water flowing from a leak.ina water supply pipc and exposed. to suaJight. University of Ghana http://ugspace.ug.edu.glOhll ftl. 3.la Mosqw.to collection sne at Ad~ (opm drain wllh ~ waaer ...,...t., ....i sJa~ from 101 University of Ghana http://ugspace.ug.edu.gh n,. 3.1(' Mosquito · coiltA;:ti~n site .. Ac:himot. (~ strdch or water fJOWIng fromabrokenwUerpipe) ne.3.14I Mosqwto t.ollecbon Sl1e aI Dzonrrulu (snWllhIllow pool of,..1ftI University of Ghana http://ugspace.ug.edu.gh TIle Anophele$larvae were identified by their characteristic resting position, with the body parallel to the water surface and just be~w the surface film. The larvae and pupae wt'T'C carefully collected into small plastic containers by scooping gently wiili a JSO mJ dipper to avoid injwXs. The samples were transported to the laborntOC}' in 2-liue jars that were loosely capped to allow adequate ventilation. Wild adult An. gambiae mosquitoes were also obtained from NaVTongo (IOo30'N, I"OO' W) and Dodowa (soSI'N. 004 ' W) both in Ghana and Jaribuni (3°00'5, 3~OO'E) in the- Kilifi District. Kenya. I.aboratory colonies of adult An. Rambiae from Kilimanjaro (TIUlT..ania). Suakoko (Liberia) [which is the strain used as the: standard reference for An 80mb/De 5.5,1 and Kisumu (Kenya) and An. arabiemu (Wageningcn strain) were obtained by courtesy ofM.D Wilson. University of Ghana http://ugspace.ug.edu.gh 3.3 Laboratory Rear ing of Mosqui toes In tilt' laboratory. the Iaf"\'ac and pupae samples were transferred into \4 x 24 " S.S em plastk basins (Fig.l.la). In cases where larvae of Culex and/or Aed~$ species were pre~-nt they wne identified by their angular position on the waler surface: and were removed. Using a plastic Pasteur pipelle. a few Anophelu larvae and pupae were tranSferred and plxc:d in 1.5 ml cppendorflubes containing I mI ofoopropanoJ and The remaining pupae were similarly transferred into small plu.<:lic cups and placed in labelled 25 cm cubic cages (Fig. 3.2b) for adull emergence. The remaining Iwvae were maintained in the bastns filled with waleT. from the collcction sile, to a depth oflbout 2 em. The larvae were fed once on finely ground Nutrafin Boldfish food (Rolf Hagen, USA). The larvae and pupae were reared to adults under conditions of 27·3s Thennal Cycler (flyba;d Lid., UK). -------------------'112 University of Ghana http://ugspace.ug.edu.gh For the WOl.816SFI I WOlBI6SRI primer!. the kmpcratlR profile was as follows: an initLaI denalur.ltionat 94"C for 3 min, followed by 35 cyclcsof94I1C for60s. 6(/C (or 60 s aDd 7'rC for 2 nUn for 3S cyclcs.. plus one oddilional cycle with • finaJ 10 min chain dongJtion step. The same reaction conditions were used for the ftsZll/ilZl primers, cxccpt for tbe anneaJing k'mpen.ture which was sctat 55°C. Template DNA from locally caught brown-banded cockroach Supello long/po/po (Blattaria : 8lauellidae) nymphs was used as positive control for the presence of Wolbach,." Negative controls. containing only double-distilled water, were abo included to check r.,r contamination. lbe peR products were electrophoresed in D. 1.00/. agacosc gel as described under section 3.4.2.3. The sizes of PCR produclS were estimated by comparison with the mobility ofastandard I kbladder 3.5.1 EstimOiltion ot the concentration of peR products Five microli1tes of PCR product was diluted to 95 ",I with nuclease-free water and the absorbance reading al 260 nm wavclcngth (A2*) was m:ol'dcd. TIle conct:ntrntion of PeR product (X tAg/tAl) was then estitnlted by using the formula (Wilfinger el al. • 1997) below, Al6O X dilutionfactorxSOpglJ,ll- X tAg/p.1 -------------------'113 University of Ghana http://ugspace.ug.edu.gh 3.6 Amplified Ribosomal DNA Restriction Analysis (ARDRA) Reruiction analysis of enzymaticaJly amplified 16S rONA ..:an be used to identify the species of many bacteria genera. This exploits the differences in the DNA sequences that resulrsindifTt."1cntrestrictionsiles. lntbis variationofribotyping, the ribosomal DNA is amplifted and the product digested with • restriction enzyme. and the diagnostic DNA fi1IImenlprofilesare vi5U&l.ized following separation by gel electrophoresis. avoidinglhe need ror Southern blotting which involves cwnbersome blolling techniques. This method has been used to differentiate bacterial species (Vaneechoufle, 1996; Dijkshoom d al. • 1998). 3.6.1 In silica (computationill molecular bloJogy) restriction analysis Tbe most predominant bacteria species detected in An. gamblae have been E$cherichia coli and Punt(HJe agglomerans (Slraif et oJ. . 1998). In order to identify whether M)' of dleamplicons might bceitberoftbese microotganisnu. virtual restriction analyses were perfonned on their rONA nucleotide 5CqUCTICes. The complete 16S rONA scqu~nces of £ coli and P uKiJlomrruns Md the complete 23S rONA sequence of E.. col. were retrieved from the Ribosomal Database Project II (Cole et aI .• 2(03) and copied into the software "Jellyfish" version I.l (biowire.cOm). The FIND rWlction of Ibis programme was used to align the primer pairs used for the peR amplir.cations to the homologous rONA sequences, and the region thai they flanked was copied, saved as a new file and then imported into the software programme DIGEST vers.ion 1.0 (Nak..isa, 1992). InsJlico anatysis WaJ pnfonned with the DIGEST software using the enzymes in its database WISCONSI.920. The output included the positions of the DNA sequcocc:s thai each enzyme cut. the number of fragments and their slzJ:s. It -------------------n~--- University of Ghana http://ugspace.ug.edu.gh also listed all enzymes that do not restrict The results w~re compared to those obtained by the experimental restriction p8I1tms of the amplified 165 .nd 23$ rDNAs. 3.6. 2 Rutriction endonuclease digestion and analysis hom the results obtained from the in silieD analyses, the ..:nzymes that cut at less than six sites, and some which had no sites. were selected for this anaIyis. Dral. EcoRl. Hindlll. hut. Soli and XbaJ were selected for the analysis offDl/rDI. fDJ/rPI. fd2/rP2 and 238 rRNA FIIRI amplicons and Apol. £CoRl. Hindill. &01. Nspl. Hinjl and BamHI for the WOlBI6SFII WOLBI6SRJ amplicons. (be digestions were carried out iii recommended by the manuf.cruten (Sigma-Aldrich, USA). The final reaction volume of 20 j.iJ contained 2-6 ~1 of the amplified products. The incubation was carried out at 37'C for 2 hours using a heat block. The products were electrophoresed inethidium bromide- ained 2% .garose leis using IX TAB buffer and the results were visualised by lrarWllwninaiion. A standard 100 bp ladder was used as roolecular weight marker. The sizes of the die;ested products were calculated u.slna the computer software DNAFRAG version 1.03 (Nash. 1992). 115 University of Ghana http://ugspace.ug.edu.gh 3.7 Cloning of Amplified Bacterial Sequences I. ciani", of PCR products allows the genention of relatively large amouftlS of the MIpIified teSion. which can then be con\'CflieOlly uxd. v.rt.encver needed. for experiments such AS sequcncina and hybridisation (Newton & Grah;un, 1997). The cloning process involves \be JipUon. using DNA ligase. of the amplifltd DNA into I suilab&c plasmid vC'Ctor to fOrm • recombinInC molecuJc. followed by transfonnation of Ihc recombinant molecule inlo suitable competent bacteria cell. for replication and multiplication. To select for transformants. "blue-white" colour scrtening, a well- c:Slablished means for identifyins a ligation product or indk:atina the presenc:e of a DNA insert. is employed. The mdbod is based upon the abilily of P-plKtosida.te to hydrolyse S.tJromo-4..chloro-3-indolyIIJ-D-galactopyranoside (X-pi). resultina in the characteristic blue ....... of. coIooy .. phose pIoque ida0e 8<"". Successful liption of a DNA fr.tgment ink> the multiple cJonina site disrupts ~ galactosidase expression; without insertioo. expression 110 unincmuplCd. Thus. when the f." mli host, ell.prcuina the carhoxylenn.inaJ portion of fS-g.daclosidase, is transformed with I p&a.nid vector or infected with a phaae withom I DNA insert, a "blue" colony or plaque wiD result (Ullmann el U/., 1967). Transformation Of infection of I vector with DNA fraament inxrtion results in I "white" or clear colony or pl-tue. For this study, the TA eloaiac method was used 10 clone tbe Implified bacteriaJ !llC'Qucrxa. The TA cloninS method takesadvantaccoflhc1erminal transrCTduc1 (CIaot, I9IS, HII, 199)), _ makes il -------------------116-- University of Ghana http://ugspace.ug.edu.gh possible to clone it dltectly ineo a linearized cloning vector with single, )'.T overhangs eTA cloning). DNA polyrncrases with proofreading activity, such as Pfo polymerase, cannot be usN bcal.l3C they provide bluol-cnded peR products. 3.7.1 dA tailing of peR products To enable a hlah efficiency of cloning of the PCR product in the TA Cloning system (Invitrogen, USA), the addition of J'A-overbangs post amplification was carried out. Twmty·five micmlilrt' reactions each consisting of 19.75 ~I of diluta:l peR product ( 10 in 9.75 ~I oeWaltr), 2.5 ,.d lOX PCR buffer C (from the Invitrogen PCR Optimizer Kit), 2.5 "at 2 mM dA TP and 0.25 "I Toq were set up. Each reaction mix was vQrtcxed, briefly centrifuged and incubated aI noc for 10 minutes in a thennal cycler. The products were used immtdiately for the ligation reaction 3.7.2 Ligation To cstilTUllc the amount of PCR product needed to ligate with SO ng (20 fmoles) of pCR~.1 veClor, the formula (TA Cloning Manual. Invitrogen, USA) below was used: (V bp peR productXSO iii pCR"2.1 v2.1. The: sequence of the mulliple cloning lite is shown willi • P('R product inserted by TA Clorung.&\ E,'oR I sites flank the insetted peR product on each side. The arrow indicates the start of transcription for the 17 RNA polymerase. ------------------------------------'122 University of Ghana http://ugspace.ug.edu.gh option W'Kkr the CONnG manqcr function of lhc software package An overtop minimum six of 20 bp and • minimum matchins percentage of 6OI'A. wen: selected. 3 .•• 1.2 VecSc,een and chi ...... detectiotI Prior 10 sequcnce analysis.. sequences ~ finl saunrd to revcalllnd remo~ any o(the v~tOfS, using the VecStteeD programme from the BLAST webpa&e of the National Center for 8iotechnolo&y Information (NCBl) websile (Altschul el aI., 1990. 1997). VecScreen searches a quay for segments that match any s:equence in a specialized non- redundant \'eelOr' dltabue. The search UICS BLAST paramcltrs preset for optimal omallJara_ Projcclll (Maidak .,01 .. 2001) _ ...... This PfOII'WiDI c:u help lO decmnifte if.!Cq\ICDCe is composed or two rragments that are ------------------123-- University of Ghana http://ugspace.ug.edu.gh similar 10 cJcarly difTereot sequences from thedatabasc and uploaded user sequences, i.e. if~ sequence is of chimeric origin. The default settings for the program wc~ u.ted. 3.8.1.3 Seiluenc. similarity and DNA databas. searches After editing. the Sl!qlkncCS WeTe submined to the standard nuclernidc-nucleotide BLAST dal.l~ snrch programme (blastn 2.2.1) of the NeBI website (AJtschul el al .• 1990; Altschul el at.. 1991) to search for closely related sequences in the non-redundant GenBank database. The default parameter settings were used. Related sequences were acquired by using the: Batch Entrez progrvnme (NCBI). Percent similarity between the sequcnc:~ obtained was determined by using the maximum matching option under the compare function in the DNASIS software package. This function compares {W(l DNA/amino acid sequencn and Illigns them for maximwn homology. Homology piau were alJo constructed using the: Homology option Wlder the compce fwx:tion of the software 3.8.1.4 Construction of consensus sequence Based on the resullS of the bomulogy plots. sequences showing o\"cr 80% similarity to oneanolhtrweregroupedandalisnedlogcneralcaconscnsussequenceusinglhe multiple alignmtnl funcLion in DNASIS. The automatic Higgins option was used with the following multiple alignment parameters: gap penalty of 300, fixed gap penally of 20, fbting gap pcnalty of 20, k-tuple vaJue of 2. windowsiz.e valuc of 5 and number of top diagonal value of S. The alignments were compldcd manually and the: consensus sequences 8ft'MIlCd subjected to BLAST sean;hes as described prcvtoudy. 124 University of Ghana http://ugspace.ug.edu.gh 3.8.2 Phylogeny con5truction Sequences homotogous to the consensus sequences .... erc retrieved from the ONA cbtabanks for comparisons. For sequences of groups of nearly identical groups only single representatives ~re used for phylogenetic anaJysis. Reference 165 rONAs used in the analyses wen: obtained from the GcnBank and are shown in Tables J.3a, 3.3b and J.3c. Inc sequence data were aligned using the CLUSTAL W package (Thom~on el oJ., 1994) integrated into the MacVector 7.1 software suite (Accelerys, Pharmacopeia Inc .• USA). For both pairwise and multiple alignmeots. the open gap pcnaJty of 30 and extended gap pmalty of 10 were applied. A slow alignment mode was selected for pairwise alisnmmts. For multiple a!ignments, I delay divcrt;cnce of 60% was selected and the trvlsitionswere ....' Cighted The re,ioru of the DNA tequmces that aligned ambiguously (base positions which ..... ere of indeterminate identity, insertions and deletions, and aJigrunl!nl gaps). most of which com:sponded to variab&e regions of SSU rRNA, ....' Cre not used for the subsequent dclenninalion of phylogenetic relationships. A number of algorithms, including neighbour-joining. maximum likelihood and maximwn parsimony. for inferring phylogeny were used to evaluate the sequmce relationships. 125 University of Ghana http://ugspace.ug.edu.gh Tabk J.Ja Baclttia species and strains used for the phylogenetic analy~is of CONSEN4. Organism accts5ionllumber X78717 Purple bacterium (wwuned) AB041770 Paroco«w kuwasaki~nsis YI6930 P, de"itriftcQIIS 016429 Rhodobocterblastica YI2703 p, mDrcusJl All006899 P. carOl;nifoc;~ru Af365994 Marine alpha proteobactcrium BBATJ X53855 R. sphaeroicks ABO I 7797 Rlwdobacter sp. TCRI S 016427 R. cupsu/orns AYOl4179 P. ),u;;slramGJ06O 070847 R.azotojormuflj' 032238 P. ulkaliphifus AYOO5463 Ruegeriosp 032241 P. Weurll Af245634 UncuJturedRO$toboclersp X78315 R. aJgocolus 088523 Agrobaclerium gelollnovorum 032240 P. aminm'Oram -------------------126 University of Ghana http://ugspace.ug.edu.gh Yabk J.3b Bacteria species and strains used for the phylogenetic analysis of CONSEN2. UenBonk Organism accessioflflllllfber AJ44075 1.1 Gram-negative bacterium MM I AF502217.1 Uncultured bactenum clone HPIB64 X78717.1 Purple bacteriwn (unnamed) str3in SW:! Y16930.1 ParacoccusQeflilTijiCQrIS AF229874.\ Parococc:ussp. 4FB8 032238.\ P.aJkallphlius AYO\4176. \ P.mrrinophilw AF527586.1 Uncultured bacterium clone LPB54 AF1l6850.1 Kelogulooogemum robuslum AYOI417J.\ P ~ei;strainGt212 AF445668.\ Uncultured alpha proleobal:tcnum done SMICll AFl68\83.1 Uncultured Rhodobacl~r group bacterium clone SBRTlS5 AB0I7799. \ Rhodobocler sp. TCRJ 127 University of Ghana http://ugspace.ug.edu.gh Table 3.3(' Bacteria ~rccies and strains used for the phylogenetic anaJysis of AgLS. Organism a('cessionnumber VIS83) lIymenobaclerroseosalivarius AY279405 Endosymbiont of Aphytis sp AY271)402 Endosymbiont of Aspldiocu.s "..,ii AY2741J9.1 Uncultured bacterium clone 0 113 AFJ82107.1 Uncultured bactenum clone ZA2626c AF3J6J56.1 UacteriumWuba47 D12658.1 CYlophogo auran/laea AYOJ8780.1 Uncu1tured CFB group bacterium clone TAF-B66 AJ244689.1 Cyclobocler;um sp. V4.MSJ2 AFJ61200.1 Unculrured Cytophoga/es bacterium clone 30 AJOl1917 F7«,obacillussp. AFJJ788J .2 Uncultured gold mine bacterium 028 AY279404 Endosymbiont of Aphyfis lingnanensfs Af408175.1 Unculnued Hy",enobacler sp. clone KL-2- 4-9 Al400J40.1 Uncultured marine bacterium Z00203 YI88Jl Tateeoboc'"ocellatus 128 University of Ghana http://ugspace.ug.edu.gh ].8.2.1 Neighbour-Joining (NJ) analvsis The neighbour joining (N/) programme in the PHYLlP package version 3.Se (Felsenstein. 1993) (or inferring phylogenelic relationships. also contains prog.rturulcarrying oul other rtlatcd wks U5ing different algorithms on difTerent kinds of data. SEQBOOT was flfSt used to bootstrap the sequences with 1000 replicates (Felsenstein, 1985). DNADIST was then used to calculate evolutionary distances from the bootstrapped OUlput data with the Kimura two-parametcr model for nucleotide change (Kimura, 1980) using a transition-tnlnSVersion ratio. estimated from the data with the software PUl.ZlE 4.0.2 (Strimmer & von Haese)cr, 1996) and one category of subslitution rates. Phylogenetic trees were constructed from the cvolutioD8l}' sequence data usmC NeiGHBOR (Sajtou and Net. 1987). A conxnsus tree was constructed using CON SENSE with the trttfile from NEIGHBOR. 3 ••. 2.2 Maximum IUceUhood iiln.alys's (ML) Phyk>tmetic lItt'5 wen: alto deriv~ from the molecular sequence data by the maximwn likelihood (ML) method carried out using the quartet puzzling (QP) analysis with the programme PUULE 4.0.2. PUZZLE is ANSI C compliant and a PHYUP compatible program that implements the quartet puzzlm,. method for reconstructing tree topologies from character state ciatL Quanet: puzzlina is. method that applies maximum likelihood 1m: rec:onstrUCtion to all possible quan.ets of taxa and subsequently tries fA) combine most of tbt fow-taxa maximum likelihood trees to construct aD overull In:C. " 1000 puzzling slt:r~ U~II" tbt Ha.sc:sa .... a, Kishino. and Vano (HKY) substitution mode:! (Hascp ...... rl 01 . IQX5) with an estimated lransition-~rsion ratio. and site-to-site substitution rate varialion modelLed on I gamma disbibuLioo with four categories and the 129 University of Ghana http://ugspace.ug.edu.gh shape pwunctc1 wen: estimated from the data. For paramctC't estimation. quarkt sampling (Cor suhstirution. process) plus NJ tree (for rW variation) WCfe used. 3.8.2.3 JoIIaximum parsimony. analysis (HP) A MP tree requires the smaJlest number of ewlutio,*>, cblnges 10 result in the set of oro. under the study (Hillis. 1997). for each tree 1o be evalua&cd. the minimum possible number of chances for each 'CMrac\et' (nucleotide position or morphotoaical nit) is caJc:v1aIed, tad the minimum number of chartJcs across all characters are totalled, 10 obc3in the parsimony score. The best tree is the one thai requires the fewest changes KfOSi all charatter$ (Hillis, 1997). Infonnation on evolutionary processes may be UlcOlpO;.ltcd by ~ightina characters differentially (such as first versus third positions of c:odoas).orby wcichtinacbarader state changesdifferentiaJly (such utransitions versus nnsvmions) Maximwn parsimony (MP) analysis was accomplished with the Phytoaenetic analysis w.ina: parsimony (PAUP) version 3.0r propun (Swofford, 1991). A slepmatrix. was used to give transvet'Sions twic:e the weight of tnnsitions. and pps were lreaIed as missing da&I. Two heuristiC -=-reh 5Irarcgies ~ UIed for each IMlysis, with ac!cheq • simple. and lht ICCOftd with IIddJeq • random and 10 rcplicaacs. Branch swappiOi was by tree bixction-reconnection. Consensus (50% majority rule) trees 1.Io-en: construttcd by usinS wx::orrecled NJ distances with 1,OOObootstnpreplicatcs. Iheseueesexc:ludedaroupings thatocc:urredinleathln 5O'/.o(tbereplKaiesMdncgath·cbtanch k:nachswereprohibiled. 1 30~ University of Ghana http://ugspace.ug.edu.gh 3.B.3 Nucleotide Sequence Accession Numbers r The :unplified 165 rONA sequences of the bac1eria from the mosquitoes. after !; iden'iticataoo, were submiued to the ONA Databank of Japan (OOJB), European Mol«ular Biology Laboratory (EMBL) and GenBank (at NCBI) nucleotide sequence University of Ghana http://ugspace.ug.edu.gh CHAPTER FOUR RESULTS 4.1 Mosquito Species Identification Gmunllc DNAs for peR were extracted from 432 mosquito specimens comprising of91 IItrVK, 37 puper and 304 adult femaJes (Fig 4. 1) and these were ustd for the srudies. Out of the 432 mosqWIO specimens processed, PCR amplification for species idcntiticatton was successful (or 395 (91.40,..) while amplification failed for 37 (8.6%). Of the 395 peR positive specimens, 373 (94.4%) were identified as An KIJ,"b;n~ S.S., 20 (5.1%) as An. arab/ens/s, 1 (0.25%) as An. merus and I (0.25%) An. me/as (Fig. 4.2 and Table 4.1). The 395 mosqwto specimens comprised 295 (74.1%) adults, 29 (7.3%) pupae and 71 (IS-/.) larvae. The details of the species identifted and of the life-stage forms are abogivco in Table 4.1. -----------------~Ilz_____ University of Ghana http://ugspace.ug.edu.gh extrw:wd from AIL ga",biae 5.1. mosqui&oes. Lane \1 .}., DNA· Hind III diant; lane I - oduU, _2-.14-1arwo;_3 o _ University of Ghana http://ugspace.ug.edu.gh 600bp 400bp 200bp Fic 4.2 Elhidi ... bromido-!iIaincd 2.0 % agam. . gel ekclrophorId;I_6-XOQ[ 144 University of Ghana http://ugspace.ug.edu.gh I.Skb- O.75tb - O.5 kb -to fil 4.8 Ethidiwn bromide.scained 2.U'Y. aprose gel clectrophorcs,ram of restriction enzyme dilC"StS of 23S rONA PCR products. Lae M - I kb molecular weieht marker (MBI Fermentas, Germany); _ I - Oral; I. .. 2 - Pv.i ; lane } - XI>oi; lane 4 • £CoR!; lane\ = 1Ihtd111; 1ane6-Sa/I 145 University of Ghana http://ugspace.ug.edu.gh Tabk-"'" Obsen'cd, and expected fragment sizes for amplicon., E. coli and P. UUlonwrans rDNAs after digestion with the various restriction enzymes ( - denotes no restriction) Rcstric:lioD fragmeat sizn (bp) for 16S RCSlric:tion fra281c., rONA primers sius (bp) for 23S Restriction rONA primers Amplicon, P.agglomf!rllns* Amplkont E.coli* n..1 1020 1080 S40 &oR! 880 870 836 840 680 .70 619 670 890 l70 80 SDiI 820 806 1260 1340 670 636 340 160 lO l3 Xbo( *Restriction fragments sizes sho'An are those obtained from the In 3t11"' analyses using the software DIGEST 146 University of Ghana http://ugspace.ug.edu.gh AJt the amplic:ons obl.&ined using the WOLI6S primers were digested wilh Apol. EcoRI. HindlH, R.JoI. Nspl . Hinjl and BamHI. None ofthe ampltc:oos was cleaved by the Hindfll restriction enL}'me (Fig. 4 .9). NspJ RsQI and Apol , each cut once 10 produce the same two fra&meolS in all the amplicons analysed. The restrfClion fl"l!llgmcnlS sizes were estimated .. be 800 and 180 bp for Apol (Fig. 4.10), 524 and 418 bp for //sQ1 (Fig. ' . 11) and 585 andJ3J bpfoc Nspl (Fig.'. II). Restriction with EcoRJ , BamHI and Hinjl gave multiple banding patterns. Complex patterns. with the sum of the restriction bands being greater than the Si7.e of undigested PeR.amplified WOL16S rONAs wen: obtained with the EcoRJ and Hinjl . The £CoRl digestion (Fig. 4.12) yielded 3 fragments of sizes 1039. 590 and 436 bp and 4 fragments of sizr:s 1039. 736. 590 and 436 bp. The Hinjl digcstion (Fig. 4.13) yielded rragments of 171 bp, 871 and 693 bpand 800 bp in sizes. For the BamHI restriction. 3 distinct reMriction pcltterns with estimated fragment sizes of 746, 663. and 693 bp rcspeetively 'tII1"'t'Teobsttved (Fig. 4.IJ) 147 University of Ghana http://ugspace.ug.edu.gh 1.00b O.6kb F.4.9 t::lhidium bromide-stainrd 2.0-/0 .garosc SCi electrophorcgram of 1Jina111 mcrictioa eazymc digesuof WOLI6S rONA PCR producu. lAne M· 100 bp motcc. ,«iaM nwkt'f (Sigma-Aldrich. USA); lanes I - 4 - digeskd bacterial peR products &om diffmnlnkl~_5-undigc:stedPCRproduct. 148 University of Ghana http://ugspace.ug.edu.gh IOObp 600bp 400bp Fit 4. • ' Ethidium bromKie-stained 2.0% aaarox gel e'"tropbort'gram of Apo( __ enzyme digests of WOLl6S rONA PCR products. lane: M. 100 bp mol