MXPA01005378A - Homologous 28-kilodalton immunodominant protein genes of ehrlichia canis - Google Patents
Homologous 28-kilodalton immunodominant protein genes of ehrlichia canisInfo
- Publication number
- MXPA01005378A MXPA01005378A MXPA/A/2001/005378A MXPA01005378A MXPA01005378A MX PA01005378 A MXPA01005378 A MX PA01005378A MX PA01005378 A MXPA01005378 A MX PA01005378A MX PA01005378 A MXPA01005378 A MX PA01005378A
- Authority
- MX
- Mexico
- Prior art keywords
- seq
- ser
- gly
- phe
- asn
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 196
- 241000605312 Ehrlichia canis Species 0.000 title claims abstract description 85
- 229940051998 ehrlichia canis Drugs 0.000 title claims abstract description 38
- 230000014509 gene expression Effects 0.000 claims abstract description 23
- 102000004169 proteins and genes Human genes 0.000 claims description 85
- 108020004414 DNA Proteins 0.000 claims description 68
- 241000282465 Canis Species 0.000 claims description 48
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 33
- 238000000034 method Methods 0.000 claims description 30
- 150000007523 nucleic acids Chemical class 0.000 claims description 28
- 239000013598 vector Substances 0.000 claims description 23
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 19
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 17
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 17
- 239000000203 mixture Substances 0.000 claims description 13
- 108020004707 nucleic acids Proteins 0.000 claims description 13
- 102000039446 nucleic acids Human genes 0.000 claims description 13
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 11
- 239000013604 expression vector Substances 0.000 claims description 10
- 208000015181 infectious disease Diseases 0.000 claims description 9
- 229920001184 polypeptide Polymers 0.000 claims description 8
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 6
- 101710127328 28 kDa antigen Proteins 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 5
- 230000002401 inhibitory effect Effects 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 6
- 239000003937 drug carrier Substances 0.000 claims 1
- 101710149506 28 kDa protein Proteins 0.000 abstract description 42
- 101710137943 Complement control protein C3 Proteins 0.000 abstract description 42
- 101710152003 Suppressor of silencing P0 Proteins 0.000 abstract description 42
- 238000010367 cloning Methods 0.000 abstract description 8
- 238000012163 sequencing technique Methods 0.000 abstract description 7
- 150000001413 amino acids Chemical class 0.000 description 69
- 241000605310 Ehrlichia chaffeensis Species 0.000 description 40
- 210000004027 cell Anatomy 0.000 description 37
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 28
- 239000002773 nucleotide Substances 0.000 description 26
- 125000003729 nucleotide group Chemical group 0.000 description 25
- 238000003752 polymerase chain reaction Methods 0.000 description 25
- 108010050848 glycylleucine Proteins 0.000 description 17
- 239000012634 fragment Substances 0.000 description 15
- 101710139711 IkB-like protein Proteins 0.000 description 12
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 11
- 230000000890 antigenic effect Effects 0.000 description 11
- 101100205189 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-5 gene Proteins 0.000 description 10
- 239000000427 antigen Substances 0.000 description 10
- 102000036639 antigens Human genes 0.000 description 10
- 108091007433 antigens Proteins 0.000 description 10
- 238000009396 hybridization Methods 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 241000282472 Canis lupus familiaris Species 0.000 description 9
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- 108010078144 glutaminyl-glycine Proteins 0.000 description 9
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 9
- 230000036961 partial effect Effects 0.000 description 9
- 108010048818 seryl-histidine Proteins 0.000 description 9
- 238000003776 cleavage reaction Methods 0.000 description 8
- 230000000295 complement effect Effects 0.000 description 8
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 108010071207 serylmethionine Proteins 0.000 description 8
- 229940094937 thioredoxin Drugs 0.000 description 8
- 241000605317 Anaplasmataceae Species 0.000 description 7
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 7
- 102100026061 Mannan-binding lectin serine protease 1 Human genes 0.000 description 7
- 101710161855 Methionine aminopeptidase 1 Proteins 0.000 description 7
- 101710103983 Modulator of apoptosis 1 Proteins 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 7
- 239000008186 active pharmaceutical agent Substances 0.000 description 7
- 108010009297 diglycyl-histidine Proteins 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 108010079317 prolyl-tyrosine Proteins 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- LHRCZIRWNFRIRG-SRVKXCTJSA-N Cys-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O LHRCZIRWNFRIRG-SRVKXCTJSA-N 0.000 description 6
- 241000605314 Ehrlichia Species 0.000 description 6
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 6
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 6
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 6
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 6
- 101150064138 MAP1 gene Proteins 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 101710138624 Otolith matrix protein 1 Proteins 0.000 description 6
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 6
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 6
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 6
- 102000002933 Thioredoxin Human genes 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010092114 histidylphenylalanine Proteins 0.000 description 6
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 150000003839 salts Chemical class 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 108060008226 thioredoxin Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- 108091035707 Consensus sequence Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 5
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 5
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 5
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- 108091092724 Noncoding DNA Proteins 0.000 description 5
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 5
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 5
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 5
- 238000012512 characterization method Methods 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 208000000292 ehrlichiosis Diseases 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 4
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 4
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 4
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 4
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 4
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 4
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 4
- 108091029795 Intergenic region Proteins 0.000 description 4
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 4
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 4
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 4
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 4
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 4
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 4
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 4
- YZJKNDCEPDDIDA-BZSNNMDCSA-N Phe-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 YZJKNDCEPDDIDA-BZSNNMDCSA-N 0.000 description 4
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 4
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 4
- 244000292604 Salvia columbariae Species 0.000 description 4
- 235000012377 Salvia columbariae var. columbariae Nutrition 0.000 description 4
- 235000001498 Salvia hispanica Nutrition 0.000 description 4
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 4
- 210000001744 T-lymphocyte Anatomy 0.000 description 4
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 4
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 4
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 4
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 230000001154 acute effect Effects 0.000 description 4
- 239000002671 adjuvant Substances 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 235000014167 chia Nutrition 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 4
- 230000028993 immune response Effects 0.000 description 4
- 230000002163 immunogen Effects 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000007363 ring formation reaction Methods 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- -1 ß-D-glucosidase Proteins 0.000 description 4
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 3
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 3
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 3
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 3
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 3
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 3
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 3
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 3
- HKRYNJSKVLZIFP-IHRRRGAJSA-N Met-Asn-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HKRYNJSKVLZIFP-IHRRRGAJSA-N 0.000 description 3
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 3
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 3
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 3
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 3
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 3
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 3
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 3
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 3
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 3
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 3
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 238000002105 Southern blotting Methods 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 3
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 3
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 3
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 3
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 230000002688 persistence Effects 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 108010061238 threonyl-glycine Proteins 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 101150084750 1 gene Proteins 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 2
- JSLGXODUIAFWCF-WDSKDSINSA-N Arg-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O JSLGXODUIAFWCF-WDSKDSINSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 2
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 2
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 2
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 2
- 101150013191 E gene Proteins 0.000 description 2
- 241000606675 Ehrlichia ruminantium Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical class O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 206010020429 Human ehrlichiosis Diseases 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- 101150017040 I gene Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 2
- GHQFLTYXGUETFD-UFYCRDLUSA-N Met-Tyr-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N GHQFLTYXGUETFD-UFYCRDLUSA-N 0.000 description 2
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 2
- 101710159910 Movement protein Proteins 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 101710116435 Outer membrane protein Proteins 0.000 description 2
- 102000003992 Peroxidases Human genes 0.000 description 2
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 2
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- GVUVRRPYYDHHGK-VQVTYTSYSA-N Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GVUVRRPYYDHHGK-VQVTYTSYSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 2
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfisoxazole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 2
- 230000024932 T cell mediated immunity Effects 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 208000022531 anorexia Diseases 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 150000001718 carbodiimides Chemical class 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000001684 chronic effect Effects 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 206010061428 decreased appetite Diseases 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 201000009162 human monocytic ehrlichiosis Diseases 0.000 description 2
- 230000002480 immunoprotective effect Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- MIKKOBKEXMRYFQ-WZTVWXICSA-N meglumine amidotrizoate Chemical compound C[NH2+]C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO.CC(=O)NC1=C(I)C(NC(C)=O)=C(I)C(C([O-])=O)=C1I MIKKOBKEXMRYFQ-WZTVWXICSA-N 0.000 description 2
- 230000036651 mood Effects 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 108040007629 peroxidase activity proteins Proteins 0.000 description 2
- 235000020030 perry Nutrition 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000002285 radioactive effect Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 206010043554 thrombocytopenia Diseases 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- GETQZCLCWQTVFV-UHFFFAOYSA-N trimethylamine Chemical compound CN(C)C GETQZCLCWQTVFV-UHFFFAOYSA-N 0.000 description 2
- 108010087967 type I signal peptidase Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- LLXVXPPXELIDGQ-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-(2,5-dioxopyrrol-1-yl)benzoate Chemical compound C=1C=CC(N2C(C=CC2=O)=O)=CC=1C(=O)ON1C(=O)CCC1=O LLXVXPPXELIDGQ-UHFFFAOYSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- 101710091601 21 kDa protein Proteins 0.000 description 1
- 101710106459 29 kDa protein Proteins 0.000 description 1
- QRXMUCSWCMTJGU-UHFFFAOYSA-N 5-bromo-4-chloro-3-indolyl phosphate Chemical compound C1=C(Br)C(Cl)=C2C(OP(O)(=O)O)=CNC2=C1 QRXMUCSWCMTJGU-UHFFFAOYSA-N 0.000 description 1
- 241000238876 Acari Species 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- OMSMPWHEGLNQOD-UWVGGRQHSA-N Asn-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMSMPWHEGLNQOD-UWVGGRQHSA-N 0.000 description 1
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 1
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 1
- GHAHOJDCBRXAKC-IHPCNDPISA-N Asp-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N GHAHOJDCBRXAKC-IHPCNDPISA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 101150111062 C gene Proteins 0.000 description 1
- 206010006895 Cachexia Diseases 0.000 description 1
- 101100024439 Caenorhabditis elegans msp-3 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- OWAFTBLVZNSIFO-SRVKXCTJSA-N Cys-His-His Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OWAFTBLVZNSIFO-SRVKXCTJSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- DSTWKJOBKSMVCV-UWVGGRQHSA-N Cys-Tyr Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DSTWKJOBKSMVCV-UWVGGRQHSA-N 0.000 description 1
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 1
- YVGGHNCTFXOJCH-UHFFFAOYSA-N DDT Chemical compound C1=CC(Cl)=CC=C1C(C(Cl)(Cl)Cl)C1=CC=C(Cl)C=C1 YVGGHNCTFXOJCH-UHFFFAOYSA-N 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 108091065810 E family Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 101710204837 Envelope small membrane protein Proteins 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- UKKNTTCNGZLJEX-WHFBIAKZSA-N Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UKKNTTCNGZLJEX-WHFBIAKZSA-N 0.000 description 1
- 101100024440 Globodera rostochiensis MSP-3 gene Proteins 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- XMBSYZWANAQXEV-QWRGUYRKSA-N Glu-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-QWRGUYRKSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010015776 Glucose oxidase Proteins 0.000 description 1
- 239000004366 Glucose oxidase Substances 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 208000032843 Hemorrhage Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- FAQYEASGXHQQAA-XIRDDKMYSA-N His-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CN=CN3)N FAQYEASGXHQQAA-XIRDDKMYSA-N 0.000 description 1
- VHOLZZKNEBBHTH-YUMQZZPRSA-N His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 VHOLZZKNEBBHTH-YUMQZZPRSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- LQGCNWWLGGMTJO-ULQDDVLXSA-N His-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N LQGCNWWLGGMTJO-ULQDDVLXSA-N 0.000 description 1
- VUUFXXGKMPLKNH-BZSNNMDCSA-N His-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N VUUFXXGKMPLKNH-BZSNNMDCSA-N 0.000 description 1
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 206010020633 Hyperglobulinaemia Diseases 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 208000008771 Lymphadenopathy Diseases 0.000 description 1
- 108010074338 Lymphokines Proteins 0.000 description 1
- 102000008072 Lymphokines Human genes 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- KVNLHIXLLZBAFQ-RWMBFGLXSA-N Lys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N KVNLHIXLLZBAFQ-RWMBFGLXSA-N 0.000 description 1
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- 101710145006 Lysis protein Proteins 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 1
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 1
- 101710151833 Movement protein TGBp3 Proteins 0.000 description 1
- 101100084404 Mus musculus Prodh gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 206010033661 Pancytopenia Diseases 0.000 description 1
- 208000037581 Persistent Infection Diseases 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- PYOHODCEOHCZBM-RYUDHWBXSA-N Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 PYOHODCEOHCZBM-RYUDHWBXSA-N 0.000 description 1
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 1
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 1
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 101800001016 Picornain 3C-like protease Proteins 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- 241000238076 Procaris Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 101100173636 Rattus norvegicus Fhl2 gene Proteins 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000282849 Ruminantia Species 0.000 description 1
- 229930190657 Rupin Natural products 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- BDYBHQWMHYDRKJ-UNQGMJICSA-N Thr-Phe-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N)O BDYBHQWMHYDRKJ-UNQGMJICSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- SMLCYZYQFRTLCO-UWJYBYFXSA-N Tyr-Cys-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O SMLCYZYQFRTLCO-UWJYBYFXSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- PDSLRCZINIDLMU-QWRGUYRKSA-N Tyr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PDSLRCZINIDLMU-QWRGUYRKSA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108010046334 Urease Proteins 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000000172 allergic effect Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 230000027645 antigenic variation Effects 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010036999 aspartyl-alanyl-histidyl-lysine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 208000010668 atopic eczema Diseases 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- HFACYLZERDEVSX-UHFFFAOYSA-N benzidine Chemical compound C1=CC(N)=CC=C1C1=CC=C(N)C=C1 HFACYLZERDEVSX-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- WZHCOOQXZCIUNC-UHFFFAOYSA-N cyclandelate Chemical compound C1C(C)(C)CC(C)CC1OC(=O)C(O)C1=CC=CC=C1 WZHCOOQXZCIUNC-UHFFFAOYSA-N 0.000 description 1
- 230000002380 cytological effect Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 125000005442 diisocyanate group Chemical group 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 208000026500 emaciation Diseases 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 208000001780 epistaxis Diseases 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229940116332 glucose oxidase Drugs 0.000 description 1
- 235000019420 glucose oxidase Nutrition 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 239000011544 gradient gel Substances 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 150000004679 hydroxides Chemical class 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical compound CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000002045 lasting effect Effects 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000006194 liquid suspension Substances 0.000 description 1
- 208000018555 lymphatic system disease Diseases 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 230000011278 mitosis Effects 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 210000002864 mononuclear phagocyte Anatomy 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 210000000472 morula Anatomy 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 108010009920 neokyotorphin (1-4) Proteins 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- JPXMTWWFLBLUCD-UHFFFAOYSA-N nitro blue tetrazolium(2+) Chemical compound COC1=CC(C=2C=C(OC)C(=CC=2)[N+]=2N(N=C(N=2)C=2C=CC=CC=2)C=2C=CC(=CC=2)[N+]([O-])=O)=CC=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=C([N+]([O-])=O)C=C1 JPXMTWWFLBLUCD-UHFFFAOYSA-N 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 238000013081 phylogenetic analysis Methods 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- MFDFERRIHVXMIY-UHFFFAOYSA-N procaine Chemical compound CCN(CC)CCOC(=O)C1=CC=C(N)C=C1 MFDFERRIHVXMIY-UHFFFAOYSA-N 0.000 description 1
- 229960004919 procaine Drugs 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 108010071967 protein K Proteins 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 229940098362 serratia marcescens Drugs 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 230000010415 tropism Effects 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Abstract
The present invention is directed to the cloning, sequencing and expression of homologous immnunoreactive 28-kDa protein genes, ECa28-1 and ECa28SA3, from a polymorphic multiple gene family of Ehrlichia canis. A complete sequence of another 28-kDa protein gene, ECaSA2, is also provided. Further disclosed is a multigene locus encoding all five homologous 28-kDa protein genes of Ehrlichia canis. Recombinant Ehrlichia canis 28-kDa proteins react with convalescent phase antiserum from an E. canis-infected dog.
Description
GENES OF THE IMMUNODOMINANT PROTEIN OF 28-KILODALTOMS, HOMOLOGA, OF EHRLICHIA CANIS AND USE OF THEM
BACKGROUND OF THE INVENTION Field of the Invention The present invention relates generally to the field of molecular biology. More specifically, the present invention relates to the molecular cloning and characterization of the 28-kDa homologous protein genes in Ehrli chla canis and a multi-gene site encoding the 28-kDa homologous proteins of Ehrlichia cani s and uses of them. Description of Related Art Canine ehrlichiosis, also known as canine tropical pancytopenia, is a condition caused by rickettsia that carry ticks in dogs, first described in Africa in 1935 and in the United States in 1963 (Donatien and Lestoquard, 1935 Ewing, 1963). The condition is best recognized after an epizootic epidemic has occurred in United States military dogs during the Vietnam War (Walker et al., 1970). The etiological agent of canine ehrlichiosis is Ehrlichia cani s, a small intracellular bacterium.
REF: 129792 gram-negative parasite that exhibits tropism for mononuclear phagocytes (Nyindo et al., 1971) and is transmitted by the dog brown tick, Rhipi c pha l us sanguineus (Groves et al., 1975). The progression of canine ehrlichiosis occurs in three phases; acute, subclinical and chronic. The acute phase is characterized by the presence of fever, anorexia, depression, lymphadenopathy, and mild thrombocytopenia (Troy and Forrester, 1990). Typically, dogs recover from the acute phase, but they become carriers of the organism persistently infected without clinical symptoms of the disease for months or even years (Harrus et al., 1998). A chronic phase develops in some cases, which is characterized by thrombocytopenia, hyperglobulinemia, anorexia, emaciation, and hemorrhage, particularly epistaxis, followed by death (Troy and Forrester, 1990). The taxonomic analysis based on the 16S rRNA gene has determined that E. Canis and E. Chaffeensis, the etiological agent of human monocytic ehrlichiosis (HME), are closely related (Anderson et al., 1991, Anderson et al., Dawson et al., 1991; Chen et al., 1994). A considerable cross-reactivity of the 64, 47, 40, 30, 29 and 23 kDa antigens has been reported between E. Cani \ s and E.
homologs according to the analysis by peripheral blood PCR of 5 days after the immunogenic test (Ohashi et al., 1998). Molecular cloning of similar but not identical 28-kDa genes placed in tandem of E. canis homologous to the E. chaffensis omp-l gene family and the C. rumanin ti um map-1 gene (Reddy et al. al., 1998). The previous technique is deficient due to the absence of cloning and characterization of new genes of the immunoreactive 28-kDa homologous protein of Ehrli chia canis and a single multi-gene site containing the genes of the homologous 28-kDa protein. In addition, the prior art is deficient due to the absence or lack of recombinant proteins of such Ehrlichia canis immunoreactive genes. The present invention fulfills this need for a long time and desire in the art.
BRIEF DESCRIPTION OF THE INVENTION The present invention describes the molecular cloning, sequencing, characterization, and expression of the genes of the mature immunoreactive 28-kDa protein homologous of Ehrlí chia canis (termed Eca28-1, ECa28SA3 and
ECa28SA2), and the identification of a single place (5.592-kb)
^ ygfcs | ? gj containing five 28-kDa protein genes from Ehrli chia canis (ECa28SAl, ECa28SA2, ECa28SA3, Eca28-1 and ECa28-2). The comparison of E. Chaffeensis between the genes of the 28-kDa protein of E. Canis revealed that ECa28-l shares the highest amino acid homology with the multiple gene family of E. Chaffensis omp-l and is highly conserved between E. Canis isolated. It was predicted that the five 28-kDa proteins have signal peptides that result in mature proteins, and have amino acid homology in the range of 51 to 72%. The analysis of intergenic regions revealed hypothetical promoter regions for each gene, suggesting that these genes can be expressed independently and differentially. The non-coding intergenic regions vary in size from 299 to 355-bp, and are from 48 to 71 homologs. In one embodiment of the present invention, DNA sequences encoding an immunoreactive 30-kDa protein from Ehrlichia canis are provided. Preferably, the protein has an amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID NO. 4 and SEQ ID No. 6, and the gene has a nucleic acid sequence selected from the group consisting of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5 and is a member of a
family of multiple polymorphic genes. Generally, the protein has an N-terminal signal sequence that was cleaved after the post-translational process resulting in the production of a mature 28-kDa protein. Still, preferably the DNAs encoding the 28-kDa proteins are contained in a single multi-gene site, which have the size of 5,592 kb and which encode all five proteins of 28-kDa homologous Ehrlichia canis. In another embodiment of the present invention, there is provided an expression vector comprising a gene encoding a 28-kDa immunoreactive protein of
Ehrlichia canis, capable of expressing the gene when the vector is introduced into the cell. In yet another embodiment of the present invention, there is provided a recombinant protein comprising an amino acid sequence selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID No. 6. Preferably, the sequence of amino acid is encoded by the amino acid sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3 and SEQ ID NO: 5. Preferably, the recombinant protein comprises four variable regions which are exposed in the
MtefctMÜM-Éa ^ BaMaA? Atfkri? surface, hydrophilic and antigenic. The recombinant protein can be useful as an antigen. In yet another embodiment of the present invention, there is provided a method for producing the recombinant protein, comprising the steps of obtaining a vector
comprising an expression region comprising a sequence encoding the amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID No. 4 and SEQ ID No. 6 operatively linked to a
promoter; transfecting the vector in a cell; and culturing the cell under conditions effective for the expression of the expression region. The invention can also be described in certain embodiments as a method of inhibiting the infection of
Ehrli chía canis in a subject, which includes the steps of: identifying a subject suspected of being exposed or infected with Ehrli chia canis; and administering a composition comprising a 28-kDa antigen of Ehrlichia canis in i i an amount effective to inhibit an infection of
Ehrlichia canis. Inhibition can occur through any means such as, for example, the stimulation of subjects' mood or cellular immune responses,
normal antigen of 28-kDa, or even compete with the antigen for interaction with some agents in the body of the subjects. Other aspects, features and additional advantages of the present invention will be apparent from the following description of the present preferred embodiments of the invention given for the purpose of the description. BRIEF DESCRIPTION OF THE DRAWINGS 10 So that the subject in which the characteristics, advantages and aforementioned objects of the invention, as well as others that are clear, will be reached and can be understood in detail; furthermore the particular descriptions of the invention briefly summarized above can be made by reference to certain embodiments thereof which are illustrated in the appended drawings. These drawings are part of the specification. It will be noted, however, that the appended drawings illustrate the preferred embodiments of the invention 20 and are therefore not considered to limit its scope. Figure 1 shows the nucleic acid sequence (SEQ ID NO: 1) and the deduced amino acid sequence (SEQ ID No.2) of the Eca28-1 gene which includes the
* «Fc -» »~ *. ~ *. * m * «??? t * ~« *, ^ É ^ encoding 5 'and 3' adjacent. The ATG start codon and the TAA termination are shown in bold, and the leader signal sequence of 23 amino acids is underlined. Figure 2 shows the SDS-PAGE of the expressed 50-kDa recombinant Eca28-l-thioredoxin fusion protein (lane 1), arrow) and the 16-kDa thioredoxin control (lane 2, arrow), and the corresponding immunoblotting of the recombinant Eca28-l-thioredoxin fusion protein recognized by the canine antiserum of the convalescent E. canis phase (lane 3). ). Thyroredoxin control was not detected by E. canis antiserum (not shown). Figure 3 shows the alignment of the amino acid sequences of the protein Eca28-1 (SEQ ID NO: 2), and ECa28SA2 (partial sequence, SEQ ID NO: 7) and ECa28SAl (SEQ ID NO: 8), E. chaffeensi s P28 (SEQ ID NO.9), family E. chaffensis OMP-1 (SEQ ID Nos: 10-14) and C. Ruminantiu MAP-1
(SEQ ID No. 15). The amino acid sequence ECa28-l is presented as the consensus sequence. The amino acids not shown are identical for Eca28-1 and are represented by a dot. Divergent amino acids are shown with the corresponding abbreviation of a letter. The intervals entered for the maximum alignment of the. Amino acid sequences are denoted by a hyphen. The
jjgjj'J'jj ^ variable regions are underlined and denoted (VR1, VR2, VR3, and VR4). The arrows indicate the cleavage site of the signal peptidase predicted by the signal peptide. Figure 4 shows the phylogenetic relationship of E. canis Eca28-1 with Eca28SA2 (partial sequence) and Eca28SAl, the 6 members of the multiple gene family of E. chaffeensis omp-l, and the amino acid sequences deduced from C. Rumaninti um map-1 using the construction of an unbalanced tree. The length of each pair of each branch represents the distance between the amino acid sequence of the pairs. The scale measures the distance between the sequences. Figure 5 shows Southern blot analysis of E. cani s genomic DNA completely digested with the six individual restriction enzymes and hybridized with a probe labeled with DIG Eca28-1 (lanes 2-7); the molecular weight markers labeled with DIG (lanes 1 and 8). Figure 6 shows the comparison of characteristics of the predicted protein of ECa28-l (Jake strains) and E. chaffeensis P28 (Arkansas strains). The surface probability predicts surface residues using a hexapeptide window. A surface residue is
^^^^ j ^ j ^ any residue with an accessible surface area of water > 2.0 n mz. A hexapeptide with a value higher than 1 is considered as a surface region. The antigenic index predicts potential antigenic determinants. Regions with a value above zero are potential antigenic determinants. Portions of T cells localize the antigenic determinants of potential T cells using a 5 amino acid portion with a 1-glycine or polar residue, 2-hydrophobic residue, residue
3-h? Drofóbico, 4-hydrophobic residue or proline, and 5- polar residue or glycine. The scale indicates the positions of the amino acids. Figure 7 shows the nucleic acid sequences and amino acid sequences deduced from the genes
Eca28SA2 of the E canis protein of 28-kDa (nucleotide 1-849: SEQ ID No. 3, amino acid sequence: SEQ ID No. 4) and ECa28SA2 (nucleotide 1195-2031: SEQ ID NO: 5) : SEQ ID No. 6) including non-coding intergenic sequences (NC2, nucleotide 850-1194: SEC
ID No. 31). The ATG start codon and the stop codons are shown in black. I Figure 8 shows schematically the place of the gene of the five E canis proteins of 28-kDa (5,592-Kb)
indicating the genomic orientation and the non-coding intergenic regions (28NC1-4). The 28-kDa protein genes showing sites 1 and 2 (shading) have been described (McBride et al., 1999, Reddy et al., 1998, Ohashi et al., 1998). The complete sequence of ECaSA2 and a new designated 28-kDa protein gene (ECa28SA3- not shaded) were sequenced. The non-coding intergenic regions (28NC2-3) between ECaSA2, ECa28SA3 and ECa28-l were supplemented by joining the previously unbound site 1 and 2. Figure 9 shows the phylogenetic relationship of the five members of the E canis protein of 28- kDa based on amino acid sequences that use the construction of an unbalanced tree. The length of each pair of branches represents the distance between the pairs of amino acids. The scale measures the distance between the sequences. Figure 10 shows the alignment of the intergenic non-coding nucleic acid sequences of the gene of the 28-kDa protein of E. canis (SEQ ID No. 30-33). The nucleic acids not shown, denoted by a dot (.), Are identical for the noncoding region 1 (28NC1). The divergence is shown with the corresponding abbreviation of a letter. The intervals
- '* * - *' "- *** • '- - * - introduced for the maximum alignment of the amino acid sequences are denoted by a dash (-). The putative transcriptional promoter regions (-10 and -35) and the ribosome-binding site (RBS) are enclosed in a box.
DETAILED DESCRIPTION OF THE INVENTION The present invention describes the cloning,
Sequencing and expression of homologous genes that encode a 30-kilodalton (kDa) protein of Ehrli chia canis. We also performed a comparative molecular analysis of homologous genes among seven isolated E. canis and the multiple gene family E. chaffeensis omp-l. Two new genes of the 28-kDa protein were identified, ECa28-l and ECa28SA3. The ECa28-l has an open reading structure of 834-bp that encodes a protein of 278 amino acids (SEQ ID No. 2) with a predicted molecular mass of 30.5-kDa. An N-terminal signal sequence was identified suggesting that the protein is post-translationally modified to a mature 27.7-kDa protein. The ECa28SA3 has an open reading structure of 840-bp that encodes a protein of 280 amino acids (SEQ ID No. 6). I
encodes a 28-kDa immunoreactive protein from Ehrl i chia
canis and that is able to express the gene when the veator is
enter the cell. Still, in another embodiment of the present invention, a recombinant protein comprising a
amino acid sequence selected from the group consisting of SEQ ID No. 2, SEQ ID No. 4, and SEQ ID No. 6. Preferably, the amino acid sequence is encoded by a nucleic acid sequence selected from the group consisting of
consists of SEQ ID No. 1, SEQ ID No. 3 and SEQ ID No. 5. Preferably, the recombinant protein comprises four variable regions that are exposed on the hydrophilic and antigenic surface. Even preferably, the recombinant protein is an antigen. In still another embodiment of the present invention, there is provided a method for producing the recombinant protein, comprising the steps of obtaining a vector comprising an expression region comprising a sequence encoding the amino acid sequence selected from the group consisting of of SEQ ID No. 2, SEQ ID No. 4 and SEQ ID No. 6 operatively linked to a promoter; transfect the vector to a cell; and cultivate
cell under effective conditions for the expression of
region of expression. The invention can also be described in certain
embodiments as a method of inhibiting the infection of Ehrlichia canis in a subject, comprising the steps of: identifying a subject suspected of being exposed or infected with Ehrlichia canis; and administer a composition
comprising a 28-kDa antigen of Ehrlichia canis in an effective amount to inhibit an infection by Ehrl i chia canis. Inhibition can occur through any means such as for example; the stimulation of a person's mood or cellular immune responses, or by other means such as the inhibition of the normal function of the 28-kDa antigen, or even by competition with the antigen by interaction with some agent in the subject's body In accordance with the present invention, conventional molecular biology, I microbiology, and recombinant DNA techniques can be employed within the state of the art. Such techniques are explained
II (D. N. Glover ed. 1985); "Oligonucleotide Synthesis"! (M.J. I Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription and Translation"
[B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell
Culture "[RI Freshney, ed (1986)]," Immobilized Cells And Enzymes "[IRL Press, (1986)], B. Perbal," A Practical Guide To Molecular Cloning "(1984) .Therefore, if they appear here the following
terms will have the definitions set forth above. A "replicon" is any genetic element (eg, plasmid, chromosome, virus) that functions as an autonomous unit of DNA replication in vivo; that is, capable of replication under its own control. A "vector" is a replicon, such as a plasmid, phage or cosmid, to which another DNA segment can adhere in order to bring about replication of the attached segment. i "A DNA molecule" refers to the polymeric form of the deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in their form of either single-stranded or double-stranded helix. This term refers only to the primary and secondary structure of the molecule, and does not limit it to any of the particular tertiary forms. Thus, this term includes the double-stranded DNA found, inter. Alia, in DNA molecules (for example, restriction fragments), viruses, plasmids, and chromosomes. In the discussion of the structure of the present, according to the normal convention of providing only the sequence in the 3 'to 5' direction along the non-transcribed DNA strand (ie, the strand has a homologous sequence). for mRNA). A "sequence that encodes" DNA is a sequence
the 3 'direction for the coding sequence.
The translational and i i transcriptional control sequences are DNA regulatory sequences, i such as promoters, enhancers, polyadenylation signals, and terminators, and the like, which provide for the expression of i I a sequence encoding a host cell. A "promoter sequence" is a DNA regulatory region capable of binding the RNA polymerase in a cell and initiating the transcription of a sequence encoding the i direction 3 '. For purposes of defining the present invention, the promoter sequence is linked to its 3 'terminus via the transcription initiation site and extends in the 5' direction to include the minimum number of bases or elements necessary to initiate transcription at detectable levels mentioned above. Within the promoter sequence will be the transcription initiation site, as well as the protein binding domains (consensus sequences) responsible for the RNA polymerase binding. Frequently, but not always, eukaryotic promoters contain "TATA" boxes and tables and "CAT". Prokaryotic promoters contain sec?
Shine-Dalgarno in addition to the consensus sequences -10 ¡and -35. A "control expression sequence" is a DNA sequence that controls and regulates the transcription and translation of another DNA sequence. A sequence that
codifies this "under the control" of transcriptory and translational control sequences in a cell when 1 RNA polymerase transcribes the sequence encoding the mRNA, which is then translated into the protein encoded by the
sequence that encodes. A "signal sequence" can be included near the sequence it encodes. This sequence encodes a signal peptide, N-terminal for the polypeptide, which
I The term "primer" as used herein refers to an oligonucleotide, whether it occurs naturally
m ** m *? m ^ ííMk The primers here are selected to be "substantially" complementary to different strands of a particular target DNA sequence. This means that the primers can be sufficiently complementary to hybridize with their respective strands. Therefore, the sequence of the primer does not need to reflect the sequence
^ Lyg & exact template. For example, a non-complementary nucleotide fragment may be linked in the final 5 'direction of the primer, with the remainder of the primer sequence being complementary to the strand. Alternatively, the non-complementary bases or longer sequences may be interspersed in the primer, provided that the primer sequence has sufficient complementarity with the sequence or is hydrolyzed therein and thus forms the template for the synthesis of the product. of extension. i i I A cell has "transformed" DNA mediantje! heterologous or exogenous when such DNA has been introduced into the interior of the cell. The transforming DNA may or may not be integrated (covalently linked) into the cell genome. In prokaryotes, yeasts, and mammalian cells for example, the transforming DNA can be maintained in an episomal element such as a plasmid. With
, > M- 'ú, «.- r-« Mfflfc »a» cells or clones that comprise a population of cells and daughters that contain the transforming DNA. A "clone" is a population of cells derived from a single cell or I progenitor by mitosis. A "cell line" is a clone i of a primary cell that is capable of establishing I I growth in vi tro by various generations. i i i Two DNA sequences are "substantially I I homologous" when at least approximately j 75% i
(preferably at least about 80% and more preferably at least about 90% or 95%) of the nucleotides are paired at the defined length of the DNA sequences. Sequences that are substantially homologous can be identified by comparing the sequences using standard count programs available in sequence data banks, or by low Southern hydridization experiments eg strict conditions defined by the particular system. Defining the appropriate hybridization conditions within those skilled in the art. See, for example, Maniatis et al., Supra; DNA Cloning, Volumes I & II, supra; Nucleic Acid Hybridization, supra.
A "heterologous" region of the DNA construct is an identifiable segment of DNA within a larger DNA molecule that is not found in association with the largest molecule in nature. Thus, when the region proteins can also be labeled with a radioactive element or with an enzyme. The radioactive label can be detected by any of the commonly available procedures. The proto-isotope can be selected from H, C, 32P, 35S ,,? 36C, l,
51Cr, 57Co, 58Co, 59Fe, 90Y, 125I, 131 I, and 186Re. Label enzymes are equally useful, and can be detected by any of the colorimetric, spectrophotometric, fluorospectrophotometric, amperometric, or gasometric techniques currently used. The enzyme is conjugated to the selected particle by reaction with cross-molecules such as carbodiimides, diisocyanates, glutaraldehydes and the like. Various enzymes that can be used in these procedures are known and can be
used. Preferred are: peroxidase, β-glucuronidase,
ß-D-glucosidase, ß-D-galactosidase, urease, glucose oxidase
plus peroxidase and alkaline phosphatase. The U.S. Patent Nos. 3,654,090, 3,850,752 and 4,016,043 are referred to by way of example for their description of alternative labeling methods and materials. ii
As used here, the term "host" includes not only prokaryotes but also eukaryotes such as
* i i,? mÉ .- *. »--_ ¿jfaMfc. Not even me? b? nt ^ a ^ yeasts, plant and animal cells. A molecule of i I
DNA or gene encoding an immunoreactive protein of 28-kDa I I of Ehrlichia canis of the present invention can be used to transform a host using any of the techniques commonly known to those skilled in the art. Especially preferred is the use of a vector containing i sequences encoding a gene encoding I I a 28-kDa immunoreactive protein of Ehrlichia cafois of the present invention for transformation and prokaryotic purposes. ¡Prokaryotic guests can include E. coli, S. tymphimuri um, Serra tia marcescens and Ba cil l us subtilis. Eukaryotic hosts include yeasts such as Pichia pastoris, mammalian cells and insect cells. In general, expression vectors containing promoter sequences that facilitate efficient transcription of the inserted DNA fragment are used in connection with the host. The expression vector typically contains an origin of replication, promoter (s), terminator (s), as well as specific genes that are capable of providing phenotypic selection in transformed i cells. The transformed hosts may be 3 or SEQ ID No. 5. The protein encoded by the DNA of this invention may share at least 80% of the sequence identity (preferably 85%, and more preferably 90%, and more preferably 95%) with the amino acids listed in SEQ ID No. 2 or SEQ ID No. 4 or SEQ ID No. 6. More preferably, the DNA includes the coding sequence of the nucleotides of SEQ ID. 1,
from the same. Such a probe is useful for detecting the expression of the 28-kDa immunoreactive proteins of Ehrlichia canis in a human cell by a method that includes the steps of (a) contacting the mRNA obtained from the cell with the labeled hybridization test.; and (b) I detect the hybridization of the test with the mRNA. The invention also includes a substantially pure DNA containing a sequence of at least 15 consecutive nucleotides (preferably 20, more
preferably 30, even more preferably 50, and most preferably all) of the region of the nucleotides listed in SEQ ID No. 1, SEQ ID No. 3 or SEQ ID No.
'High severity' refers to DNA hybridization and washing conditions characterized by high temperature and low salt concentration, eg, washing conditions of 65 ° C at a salt concentration of approximately 0.1 x SSC, or the functional equivalent thereof. For example, high severity conditions may include I hybridization at about 42 ° C in the presence of about 50% formamide; a first wash to
contains 1% SDS; followed by a second wash of approximately 65 ° C with approximately 0.1 x SSC. By "substantially pure DNA" is meant DNA that is not part of a medium in which DNA occurs naturally, by virtue of the separation (partial or partial purification) of some or all of the molecules in the environment, or by virtue of alteration of sequences flanking the claimed DNA. The term therefore includes, for example, a recombinant DNA that is incorporated into a vector, into a virus or plasmid that replicates autonomously, or into the genomic DNA of a procari.ote or eukaryote; or that it exists as a separate molecule (eg, a genomic cDNA or a cDNA fragment produced by the polymerase chain reaction (PCR) or restriction of the endonuclease digestion process) independent of other sequences. It also includes a DNA
The DNA can have at least about 70% of
the sequence identity for the coding sequence of the nucleotides listed in SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No. 5 preferably at least 75% (eg, at least 80%); and more preferably at least 90%. The identity between the two sequences is a function directed to the number of identical matings or positions. When a subunit position in both of the two sequences is occupied by the same monomeric subunit, for example, if a given position is occupied by an adenine in each of the
two DNA molecules, then these are identical to the position. For example, if 7 positions in a sequence of 10 nucleotides in their length are identical to the corresponding positions in a second sequence of 10 nucleotides, then, the two sequences have 70% sequence identity. The length of the comparison sequences will generally be at least 50 nucleotides, and preferably at least 60 nucleotides, more preferably at least 75 nucleotides, and more preferably 100 nucleotides. Identity is typically measured using a sequence analysis (eg, Software Package of the Genetics Computer Group, Uniyersity
of Wisconsin Biotechnology Center, 1710 University Avenue,
Madison, Wl 53705). The present invention comprises a vector comprising a DNA sequence coding for a gene! encoding a 28-kDa immunoreactive protein of Ehrli chia canis and said vector is capable of replicating in a host comprising, in an operable union: a) an origin of replication; b) a promoter; and c) a sequence of
DNA coding for said protein. Preferably, the vector of the present invention contains a portion of the DNA sequence shown in SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No. 5. A "vector" can be defined as an acid term replicable nucleic acid, eg, a plasmid or viral nucleic acid. The vectors can be used to amplify and / or express nucleic acids encoding a 28-kDa immunoreactive protein of Ehrlichia canis. An expression vector is a replicable term in which the amino acid sequence encoding a polypeptide is operably linked to the appropriate control sequences I capable of effecting the expression of the Jen polypeptide in the cell. The need for such control sequences will vary depending on the cell selected the selected transformation method. Generally, the control sequences include a transcriptional promoter and / or
enhancers, suitable for the ribosomal binding site mRNA,
the sequences that control the termination of transcription and translation. Methods that are well known to those skilled in the art and can be used to construct expression vectors that contain appropriate transcriptional and transcriptional control signals. See, for example, the techniques described in Sambrook et al., 1989, Molecular Cloning: A labora tory Manual (2nd Ed.), Cold Spring Harbor Press, N. Y. A gene and its transcriptional control sequences are defined as "operably linked" if the transcriptional control sequences effectively control the transcription of the I gene. Vectors of the invention include, but are not limited to, vectors of plasmids and viral vectors. The preferred viral vectors of the invention are those! derivatives of retroviruses, adenoviruses, adeno-associated viruses, SV40 virus or herpes virus. By a "substantially pure protein" is meant a protein that has been separated from at least some of those components that naturally accompany these. Typically, the protein is substantially pure
j ^^ jj when at least 60% by weight, is free of proteins and
weight. A substantially pure 28-kDa immunoreactive protein of Ehrlichia canis can be obtained, e.g., by extraction of a natural source; by means of the
A protein is substantially released from the naturally associated components when these are separated from at least some of those contaminants that accompany these in their natural state. Thus, a protein that is synthesized I chemically or produced in a cellular system different from I
naturally associated Accordingly, substantially pure proteins include eukaryotic proteins synthesized in E. coli, other prokaryotes, or any other organism in which they do not occur naturally. In addition to the full length or full length proteins, the invention also includes fragments (eg, antigenic fragments) of the 2!
immunoreactive kDa of Ehrlichia canis (SEQ ID No. 2 or ISEC ID I No. 4 or SEQ ID No. 6). As used herein, "fragmented" as applied to the polypeptides will ordinarily be at least 10 residues, more typically at least 20 residues, and preferably at least 30 (eg, 50 residues) in length, but less than the intact sequence. total. Fragments of the immunoreactive 28-kDa protein of Ehrlichia canis can be generated by methods known to those skilled in the art, for example, by the enzymatic digestion process of the recombinant Ehrlichia canis immunoreactive 28-kDa protein or that occurs naturally, by recombinant DNA techniques using an expression vector encoding a defined fragment of an immunoreactive 28-kDa protein from Ehrl i chia canis, or by chemical synthesis. The ability of a candidate fragment to exhibit a characteristic of
the immunoreactive 28-kDa protein of Ehrlichia canis (for example, binding to an antibody specific for the immunoreactive 28-kDa protein of Ehrlichia canis) can be evaluated by methods described herein. The immunoreactive 28-kDa protein of purified Ehrlichia canis or antigenic frgments of the 128-kDa immunoreactive protein of Ehrlichia canis can be used to generate new antibodies or to test existing antibodies (eg, as positive controls in a test of diagnosis) using standard protocols by those skilled in the art. Included in this invention is the polyclonal antiserum generated by the use of the immunoreactive 28-kDa Ehrli chia canis protein or a fragment of the immunoreactive 28-kDa protein of Ehrlichia canis as the immunogen in e.g., cqnejos. Standard protocols for the production of the monoclonal and polyclonal antibody known to those skilled in the art are employed. The monoclonal antibodies i generated by this method can be protected by the ability to identify recombinant Ehrlichia canis clones, and to distinguish them from known cDNA clones.
"..i,. , t__J_¿t ____ > tl iriiii f ^ ^ t¡¿¿ß Additionally included in this invention are
fragments of the immunoreactive 28-kDa protein of Ehrlichia canis that are encoded at least in part by portions of SEQ ID No. 1 or SEQ ID No. 3 or SEQ ID No. I 5, eg, splicing products from! Alternative mRNA 1 or events that process alternative protein, or in which a section of the sequence has been deleted.
The fragment, or the immunoreactive 28-kDa protein of
Ehrli chia canis intact can be covalently linked to another polypeptide, for example., Which act as a
label, a ligand or a means of increasing antigenicity. The phrase "pharmaceutically acceptable" refers to molecular entities and compositions that do not produce an allergic or similar adverse reaction when administered to a human. The preparation of an aqueous composition containing a protein as an active ingredient is well understood in the art. Typically, such compositions i are prepared as injectables, either as liquid solutions or suspensions; Suitable solid forms for the solution in, or suspension in, may also be liquid preparations prior to injection. The preparation can also be emulsified.
A protein can be formulated in a
composition of a salt or neutral form. Pharmaceutically acceptable salts include the addition of acid salts (formed with the free amino groups of the protein) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acid, or organic acids such as acetic, oxalic, tartaric, mandelic , and similar. Salts formed with free carboxyl groups can also be derived from inorganic bases such as for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. After the formulation, the solutions will be
administered in a manner compatible with the formulation of the dose and in therapeutically effective amounts. The formulations are easily administered in a variety of dosage forms such as injectable solutions. For administration other than parenterally in an aqueous solution, for example, the solution should be adequately buffered if necessary and the diluent liquid first supplied isotonic with sufficient saline or glucose. These rich solutions
^ J * »particular are especially suitable for
intravenous, intramuscular, subcutaneous and intraperitoneal administrations. In connection with this, the sterile aqueous medium that can be employed will be known to those skilled in the art in the clarity of the description.
I presented. For example, a dose could be dissolved in 1 ml of an isotonic NaCl solution and either 1000 ml added! of fluid hypodermoclysis or injected into the planned site of preparation, (see for example, "Remington's I Pharmaceutical Sciences" Fifteenth edition, pages 1035-1038 and 1570-1580). Any variation in the dose will necessarily be presented depending on the condition of the subject in question. The person responsible for the administration will depend on any event to determine the appropriate dose for a particular subject. As is well known in the art, a given polypeptide is often immunogenic (by invention) with preferred are albumin of sue a variety of lymphokines and adjuvants such as IL2, 114,
IL8 and others. Means for conjugating a polypeptide to a carrier protein are well known in the art and include glutaraldehyde, m-maleimidobenzoyl-N-hydroxysuccinimide ester, carbo-diimide and bis-biazotized benzidine. It is also understood that the peptide can be conjugated to a protein by genetic engineering techniques that are well known in the art. ! As is also well known in the art, the immunogenicity for a particular immunogen can be improved by the use of non-specific stimulators of the immune response known as adjuvants. Examples of preferred adjuvants include complete BCG, Detox, (RIBI, Immunochem Research Inc.) ISCOMS and aluminum hydroxide adjuvant (Superphos, Biosector). As used herein, the term "complement" is used to define the nucleic acid strand that will hybridize to the first nucleic acid sequence to form a double-stranded molecule under stringent conditions. The stringent conditions are those that allow hybridization between the two nucleic acid sequences with a high degree of homology, but not including
The temperature and ionic strength of a desired severity are understood to be applicable to the lengths of the
particular tests, for the base content and sequence sequence and for the presence of formamide in the hybridization mixture. As used herein, the term "genetically engineered" or "recombinant" cells is intended to refer to a cell in which the recombinant gene, i has been introduced such as a gene encoding an antigen of
Ehrli chia chaffeensis. Therefore, cells generated by genetic engineering are distinguished from naturally occurring cells which do not contain, a gene introduced recombinantly. Thus, the cells generated by genetic engineering are cells that have a gene or genes introduced either in the form of a gene of i cDNA, a copy of a genomic gene, or will include genes positioned adjacent to a naturally associated non-associated promoter. with the gene introduced particularly. In addition, the recombinant gene can be integrated into the genqma of the
MMéak-M-bÜH-My host, or it can be contained in a vector, or in a bacterial genome transfected in the host cell. The following examples are given for the purpose of illustrating various embodiments of the invention and are not intended to limit the present invention in any style.
EXAMPLE 1
Ehrlichiae and i Purification! ' ! The Ehrli chia canis (strain Florida and isolated Demon, i DJ, Jake, and Fuzzy) are provided by Dr. Edward
Breitschwerdt, (College of Veterinary Medicine, 'North
Carolina State University, Raleigh, NC). E. canis * (strain
Louisiana) was provided by Dr. Richard E. Córstvet
(School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA) and £. canis (Oklahoma strain) was provided by Dr. Jacqueline Dawson (Centers for Disease Control and Prevention, Atlanta, GA). Propagation of I ehrlichiae was performed in DH82 cells with DMEM supplemented with 10% bovine calf serum and 2 mM 1-glutamine at 37 ° C. The intracellular growth in DH28 cells was monitored by the presence of a morula of
£. canis using general methods of cytological staining.
The cells were harvested when 100% of the cells were
~ ^ * ^ were infected with ehrlichiae and pellets were then formed in a centrifuge at 17,000 x g for 20 minutes. The cell pellets were destabilized with a sonicator
Braun-Sonic 2000 twice at 40W for 30 seconds on ice. The Ehrlichiae was purified as previously described (Weiss et al., 1975). The lysate was loaded on discontinuous gradients of 42% -36% -30% renografin, and centrifuged at 80,000 x g for 1 hour. The heavy and light bands containing ehrlichiae were harvested and washed with sucrose-phosphate-glutamate buffer (SPG, 218 mM sucrose, 3.8 mM KH2P04, 7.2 mM K2HP04, 4.9 mM glutamate, pH 7.0) and pellets were formed by centrifugation.
I Nucleic Acid Preparation L Ehrlichia canis DNA was prepared by resuspension of purified ehrlichiae with renografin in 600
μl of 10 mM Tris-HCl buffer (pH 7.5) with 1%
sodium dodecyl sulfate (SDS, weight / volume) and 100 ng / ml protein K as previously described (McBride et al., 1996). This mixture was incubated for 1 hour at 56 ° C, and the nucleic acids were extracted twice with a mixture of phenol / chloroform / isoamyl alcohol (24: 24: 1). The DNA is
It was pelleted by absolute ethanol precipitation, washed once with 70% ethanol, dried and resuspended in lOmM Tris (pH 7.5). The plasmid DNA was purified using a High Isolation Plasmid kit.
Purity (Boehringer Mannheim, Indianapolis, IN), and the! PCR products were purified using a QIAquick PCR Purification Kit (Qiagen, Santa Clarita, CA).
EXAMPLE 3
PCR amplification of the 28-kba E protein genes. ¡I canis i The regions of 1 gene of E. canis ECa28-l
Jotun-Hein of E. p28 chaffeensis and the Cowdria ruminant tí um map-1 genes. The forward primer 793 ((5- I
GCAGGAGCTGTTGGTTACTC-3 ') (SEQ ID No. 16) and reverse primer
1330 (5'-CCTTCCTCCAAGTTCTATGCC-3 ') (SEQ ID No. 17) corresponding to nucleotides 313-332 and 823-843 < of C.! Ruminate ti um MAP-1 and 307-326 and 834-814 of E. chaffeensis P28. 1 E. canis (a North Carolina isolator, Jake) the DNA was amplified with primers 793 and 1330 with a thermal cyclization profile of 95 ° C for 2 minutes, and subjected to 30 cycles of 95 ° C for 30 seconds, 62 C for 1 minute, 72CC for 2 minutes followed by an extension of 72 ° C for 10 minutes and 4 ° C. The PCR products were analyzed in 1% agarose gels. The amplified PCR product was sequenced directly with primers 793 and 1330. i The primers specific for the ECa28SA2 gene designated 46f (5'-ATATACTTCCTACCTAATGTCTCA-3 ', SEQ ID No. 18) and primer 1330 (SEQ ID NO. 17) were used to amplify the target region. The amplified product was gel purified and cloned into a TA I cloning vector (Invitrogen, Santa Clarita, CA). The clone was bidirectionally sequenced i with primers: inverse M13 of the vector, 46f, Eca28SA2 (5'- AGTGCAGAGTCTTCGGTTTC-3 ', SEQ ID No. 19) ECa5. 3 (5'-GTTACTTGCGGAGGACAT-3 ', SEQ ID No. 20). The DNA was amplified with a subject thermal cyclization profile 95 ° C for 2 minutes, and 30 cycles of 95 ° C for 30 seconds, 48 ° C for 1 minute, 72 ° C for 1 minute
followed by an extension of 72 ° C for 10 minutes and 4 ° C.
EXAMPLE 4 Unknown Sequencing of the 3 'and 5"Regions of the Eca28-1 Gene
The total length of the sequence of Eca28-1 was determined using a Universal Genome Walker device (CLONTECH, Palo Alto, CA) according to the protocol supplied by the manufacturer. The genomic DNA of E. canis (Jake isolate) was digested completely with five restriction enzymes (Dral, EcoRV, PvuII, Seal, Stul) that produces DNA finished in edge. A power adapter (API) in the equipment was ligated to the end of the E DNA. canis. Genomic libraries were used as templates to find the known DNA sequence of the ECa28-l gene by PCR using a primer complementary to a known portion of the ECa28-l sequence and a specific primer for the API adapter. Primers specific for ECa28-l used for the walking genome were designed from the known DNA sequence derived from PCR amplification of ECa28-l with primers 793 (SEQ ID No. 16) and 1330 (SEQ ID NO. 17). The primers 394 (5'-GCATTTCCACAGGATCATAGGTAA-3 '; nucleotides 687-710, SEQ ID No. 21) and 394C (5'-TTACCTATGATCCTGT GGAAATGC-3; NUCLEOTIDES 710-687, SEQ ID No API provided known from the corresponding gene to the 5 'region of the amplified ECa28-l gene
with primers 394C and API (2000-bp) were sequenced
unidirectionally with the 793C primer
(5'GAGTAACCAACAGCTCCTGC-3 ', SEQ ID No. 23). A PCR i i product corresponding to the 3 'region of the ECa28-l gene amplified! with primers 394 and API (580-bp) were sequenced i I bidirectionally with the same primers. The non-coding regions ii in the adjacent 3 'and 5' regions and the open reading frame were sequenced, and the primers EC280M-F (5'- TCTACTTTGCACTTCC ACTATTGT-3 ', SEQ ID No. I 24) AND EC280M-R (5'-ATTCTTTTGCCACTATTT TTCTTT-3 ', SEQ ID I I No. 25) complementary to the regions were designed to amplify the entire ECa28-l gene.
EXAMPLE 5, ~ - ^ - ~~~ - I I Sequencing of E. canis isolates The DNA was sequenced with an ABI Prism 377 DNA sequencer (Perkin-Elmer Applied Biosystems, Foster City, CA). The whole genes of seven isolated ECa28-l (four from North Carolina, and each from Oklahoma, Florida and Louisiana) were amplified by PCR with primers EC280M-F (SEQ ID No. 24) and EC280M-R (SEQ. ID No. 25) with a thermal cyclization profile of 95 ° C for 5 minutes, and 30 cycles of 95 ° C for 30 seconds, 62 ° C for 1 minute, and 72 ° C
for 2 minutes and an extension of 72 ° e for 10 minutes. The resulting PCR products are bidirectionally with the same primers.
EXAMPLE 6
Cloning and Expression of E, canis ECa28-l i The E gene. canis ECa28-l integrated was amplified with PCR with primers EC280M-F and EC280M-R and cloned into a cloning vector pCR2.1-TOPO TA to obtain the desired set II of the restriction enzyme cleavage sites (Invitrogen, Carlsbad, CA). The insert was excised from pCR2.1-TOPO with BstX 1 and ligated into the eukaryotic DNAse 3.1 expression vector (Invitrogen, Carlsbad, CA) designed for cDNA 3.1 / EC28 for subsequent studies. Plasmid DNApc 3.1 / EC28 was amplified, and cleaved with a double digestion of Kpnl-Xbal and a prokaryotic expression vector pThioHis (Invitrogen,! Carlsbad, CA) was ligated directionally into I. The clone (designed pThioHis / EC28) produced a recombinant thioredoxin fusion protein in Escherichia coli BL21. the recombinant fusion protein was crudely purified in the insoluble phase by centrifugation. The control thioredoxin fusion protein was purified from cell phones under native conditions using
turning columns of NTA with nickel (Qiagen, Santa Cljarita, I
CA). ! i I
I
I
I
EXAMPLE 7 Western Immunoassay Assay The recombinant fusion protein E. caní s \ ECa28-
1 was subjected to SDS polyacrylamide gel electrophoresis
(SDS-PAGE) in 4-15% gradient gels Tris-HCl (Bi-o-Rad, i
Hercules, CA) and was transferred to pure I nitrocellulose i (Schleicher &Schuell, Keene, NH) using a cell from
semi-dry transfer (Bio-Rad, Hercules, CA). The membrane
was incubated with the antiserum of the convalescent phase of a
Poerro infected with E. canine diluted 1: 5000 for 1 hour, washed, and then incubated with a secondary antibody from
purified affinity alkaline conjugated phosphatase (H & L) anti-canine IgC 1 at 1: 1000 for 1 hour (Kirkegaard &Perry
Laboratories, Gaithersburg, MD). The bound antibody is
visualized with substrate 5-bromo-4-chloro-3-yl phosphate / nitro blue tetrazolium (BCIP / NBT) (Kirkegaard &Perry i i Laboratories, Gaithersburg, MD). 1 I
I
EXAMPLE 8 i Southern blot analysis
a - Éi -_--- i-IÍ_áu < ... * Í- **. *. *. -., * > ,. *, * > . . «. IX.A To determine the multiple homologous genes for the Eca28-1 gene that were presented in the E. canis genome, a genomic Southern blot analysis was performed using a standard procedure (Sambrook et al., 1989). The genomic DNA of E. canis was completely digested with each of the restriction enzymes Bail, EcoRV, Hael l, Kpnl and? Spel, I that does not excise with the ECa28-l gene, and Asel that digests the i I nucleotides ECa28-l 34, 43 and 656. The test was analyzed ii by PCR amplification with primers EC280M-F and I 10 EC28M-R and deoxynucleotide triphosphates labeled with (DIG) II digoxigen (dNTPs) (Boehringer Mannheim, Indianapolis,! IN) and i digested with Asel. The digested test (566-bp) was separated by agarose gel electrophoresis, purifixed gel and then used by hybridization. The DNA of E. canis
completely digested was electrophoresed and transferred to a nylon membrane (Boehringer Mannheim, Indianapolis, IN) and hybridized at 40 ° C for 16 hours with the DIG-labeled test of the ECa28-l gene in Hyb Easy i DIG buffer according to the manufacturing protocols (Boehringer
Mannheim, Indianapolis, IN). The bound test was detected with an anti-DI3 anti-alkaline phosphatase antibody and a luminescent substrate (Boehringer Mannheim, Indianapolis,
- * "... .... **. Í? U¿ ?. ". *.« **. i - ^ .. ^^.,. .,,. ^^ g ^^ ¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡¡) and was exposed to a scientific image film iBioMax (Eastman Kodak, Rochester, NY).
EXAMPLE 9
Sequence Analysis and Comparison 'I The DNA sequences of C. ruminan ti um map-1 and E. i chaffeensis p28 were obtained from the National Center l of I
I Information on Biotechnology (NCBI) (World Wide Web site at URL: http: // ww. Ncbi. Nlm. Nih. Gov / Entrez). The nucleotide and I amino acid sequences, the phylogenetic analysis and the protein i were performed with a LASERGENE computer program
(DNASTAR, Inc., Madison, Wl). The analysis of the translational post-I process was carried out using the McGeoch and von Heijne method by means of signal sequence recognition using the SPORT program (Mcgeoch, 1985, von Heijne, 1986) (World Wide Web site at URL: PRÍVATE HREF = "http // www imcb osaka-u.ac.jp/nakai/form.htm", MACROBUTTON HtmlResAnchor http: // www. Í cb. Osaka-u.ac.p / nakai / form. Htm). I The access numbers for the nucleic acid and I amino acid sequences of the ECa28-l genes of E. 'Canis describe in their study: Jake, AF082744; siana
AF082745; Oklahoma, AF082746; Demon, AF082747; DJ, 82748; Fuzzy, AF082749; Florida, AFO82750.
* ¡? A? *? * A * m * - ^^ mm ^^ * * a ^ * lláß ^ áamlm The sequence analysis of ECa28-l of | seven different strains of E. canis were made with primers designed to extend the whole gene. The analysis revealed that the sequence of this gene was conserved between the insulators of North Carolina (four), Louisiana, Florida and Oklahoma.
EXAMPLE 10
Amplification by PCR, Cloning, Sequencing and Expression of Eca28-1 Alignment of the nucleic acid sequences of E. chaffeensis p28 and Cowdpa ruminantium map-1 using the Jotun-Hein algorithm produced a consensus sequence with I regions of high homology (>90%). These hoological regions (nucleotides 313-326 and 814-834 of E. chaffeensis p28) were the abject sites as hybridization sites by heating and cooling of primers for PCR amplification. PCR amplification of the genes E. I canis ECa28-l and E. chaffeensis p28 was completed with I I primers 793 and 1330, resulting in a PCR product of 518-bp. The nucleic acid sequence of the PCR product, from E. i canis was obtained by sequencing the pipeline directly with primers 793 and 1330. The analysis of the sequence revealed an open reading structure that
- * ^^ encodes a protein of 170 amino acids, and the alignment of! the sequence of 518-bp obtained from the PCR amplification of E. canis with the DNA sequence of the gene E. chaffeensis p28 revealed a greater similarity than 70%, indicating that the genes are homologous. The PCR adapter with the primers 394 and 793C were made to determine the sequence of the i I segments 5 'and 3' of the integral gene. Primer 394 produced four PCR products (3-kb, 2-kb, 1-kb, and 0.8-kb),! and the 0.8-bp product was sequenced bidirectionally using primers 394 and API. The deduced sequence overlapped j with the 3 'end of the product 518-bp, extending the open reading structure of 12-bp to a termination codon. A 1 sequence that does not encode additional 625-bp in the final 3'j of the ECa28-l gene was also sequenced. Primer 394C was used to extend the final 5 'of the ECa28-l gene with the API supplied primer. Amplification with these primers resulted in three PCR products (3.3, 3-kb, and 2-kjb). The 2-kb fragment was sequenced unidirectionally with primer 793C. The sequence provided the putative start codon of the ECa28-l gene and was completed by open reading 834-bp encoding a 278 amino acid. An additional i 144-bp readable sequence was generated in the 5 'non-coding region of the ECa28-l gene.
The primers EC280M-F and EC2 (= MR were designed from the complementary non-coding regions adjacent to the i ECa28-l gene.) The PCR product amplified with these first 1 primers was sequenced directly with the same primers. complete DNA sequence (SEQ ID No. 1) for the 1 gene ECa28-l of E. canis is shown in figure 1. The PCR fragment of ECa28-l was amplified with these primers contained in the open reading frame whole and 17 j amino acids of the 5 'non-coding primer region.The I gene was directionally subcloned into the expression vector pThioHis, and E. coli (BL21) was transformed into its construct.The thioredoxin fusion protein ECa28- The expressed protein was insoluble, the protein expressed has 114 amino acids associated with the thioredoxin, 5 amino acids for the enterokinase recognition site, and 32 amino acids from the multiple cloning site and the primer region. or coding 5 'in the term N. The convalescent phase of the antiserum of a dog infected by E. canis recognized the recombinant fusion protein expressed, but does not react with thioredoxin control. (Figure 2). i
EXAMPLE 11 I
Sequence Homology i The nucleic acid sequence of ECa28-l (8¡34-bp) i and the family of E genes. Chaffeensis omp-l that injclude! Signal sequences (ECa28-l, omp-lA, B, C, D, E, and F) were aligned using the Clustl method to examine the homology between these genes (no alignment shown). Nucleic acid homology was similarly conserved (68.9%) between i ECa28-l, and E. chaffeensis p28 and omp-l. Other putative outer membrane protein genes in the E family. chaffeensis omp-l, omp-lD (68.2%), omp-lE (66.7%), omp-lC (64.1%), Cowdria ruminate ti um map-1 (61.8%), 1 g of protein of 28 -kDa E. canis (60%) and 2 protein genes of 28-kDa (partial) (59.5%) are also homologous to ECa28-l. E. chaffeensi s omp-lB has at least nucleic acid homology (45.1%) with ECa28-l. The alignment of the predicted amino acid sequences of ECa28-l (SEQ ID No. 2) and E. P28 chaffeensis reveal amino acid substitutions resulting from I four variable regions (VR). Substitutions or deletions in the amino acid sequence and the locations of i I the variable regions of ECa28-l and the family E. chaffeensis OMP-1 were identified (Figure 3). The amino acids
compared to the signal peptide revealed that I Eca28-1 shares the highest homology with OMP-1F (68%) i of the family E. chaffeensis OMP-1 followed by E. chaffeens \ is P28
(65.5%), OMP-1E (65.1%), OMP-1D (62.9%), O%), Cowdria ruminate tum MAP-1 (59.4%), 1 protein of 28-kDa E. i canis (55.6%) and 2 proteins of 28-kDa (partial) (53.6%), and OMP-1B (43.2%). The phylogenetic relationship based on the! Amino acid sequences show that ECa28-l 'and C. Ruminantium MAP-1, proteins E. chaffeensis OMP-1, and 1 protein 28-kDa of E. canis and 2 (partial) are related (Figure 4).
EXAMPLE 12
Probability of Predicted Surface and Immunoreactivity The analysis of Eca28-1 of E. canis using hydropathic profiles and hydrophilicity profiles in the exposed surface junctions predicted in Eca28-1 (Figure 6). | The eight major exposed surface regions that I consist of 3 to 9 amino acids were identified in Eca28-1 and are similar for the profile of the regions exposed to the surface in E. chaffeensi s P28 (Figure 6). Five, the 1 large regions exposed to the surface in Eca28-1 were located in the N-terminal region of the protein. The
aM ^ a | ^^ kiß | | j | ^ H ¡^ ari | ji |||| HAM || aa m ^ ^^? Jlg i hydrophilic regions exposed to the surface were found in all four variable regions of Eca28-1. 10 portions of T cells were predicted in E? ca28-1 i using the Rothbard-Taylor algorithm (Rothbard and Tjaylor, I 1988 ), and high antigenicity of Dca28-1 was predicted by the Jameson-Wolf antigenicity algorithm (Fig.
6) (Jameson and Wolf, 1988). Similarity was observed in the! I antigenicity and the portions of T cells between Eca28-1 and I E. chaffeensis P28. EXAMPLE 13
Detection of the homologous genomic copies of the Eca28-1 gene. The analysis of Southern genomic spotting of E. canis DNA i digested completely independently with the restriction enzymes Bañil. EcoRV, Haell, Kpnl, Spel, 1 restriction endonuclease in the Eca28-1 gene, and Asjel, which has internal restriction endonuclease sites at nucleotides 34, 43 and 656, revealed the presence of at least I three copies of Eca28-1 gene homologs (Figure 5). Although Eca28-1 has internal restriction sites, the 1 probe labeled with DIG used in the I hybridization experiment targets a region of the gene within a single DNA fragment generated by the digestion of
• * "* • -" - -T- * * * - ** - »». * &? < i? Asel of the gene. Digestion with the Asel produced 3 'bands
(about 566-bp, 850-bp and 3-kb) Hybridized with the DNA probe Eca28-1 indicated the presence of multiple gene homologs to Eca28-1 in the genome. The ! digestion with EcoRV and Spel produced two bands that
hybridized with the Eca28-1 gene probe. EXAMPLE 14
Identification of the Place of the 28-kDa Protein Gene. The specific primers designated i EcaSA3-2 (5 '-CTAGGATTAGGTTATAGTATAAGTT-3', SEQ ID No. 26) corresponding to the Eca28SA3 regions and the primer 793C were used. (SEQ ID No. 23) that is tempered to a region with Dca28-1
To extend the intergenic region between the SA3 gene and ^ Eca28-1. The 800b-p product was sequenced with the same primers. The DNA was amplified with a thermal cyclization profile of 95 ° C for 2 minutes, and 30 cycles of 95 ° C for 30 seconds, 50 ° C for 1 minute, 72 ° C lasting 1 ° 1 minute followed by an extension 72 ° C was maintained for 10 minutes and 4 ° C.
EXAMPLE 15 i The PCR Amplification of the 2i kDa Protein Genes and the Identification of the Place of Multiple Genes
< Mt »?? *? «Mtíit.a .... - - ii > In order to specifically amplify II the known genes in the 3 'direction of ECa28SA2, the 46f primer for ECa28SA2, and primer 1330 was used which is directed to a region conserved at the end 3 'of the ECa28-l gene for amplification A 2-kb PCR product was amplified with these primers containing 2 open reading frames The first open reading frame comprises the known region of the gene, ECa28SA2, and a 3 'portion of the gene not previously sequenced In the 3' direction of an additional ECa28SA2, a 28kDa protein gene was found that was homologous but not identical and was designed
Eca28SA3. The two known sites were joined by amplification with the SA-2 primer specific for the 3 'end of the Eca28SA3 gene were used in conjunction with a reverse primer 793C, which is hybridized by the application of heat and cooling at the end of ECa28-l. An I-PCR product of 800-bp was amplified at the end!
3 'of Eca28SA3, the intergenic region between Eca28SA3 and i ECa28-l (28NC3) and the 5' end of ECa28-l, which was bound to the previously separated site (Figure 8). The 849-bp open reading frame of ECa28SA2 encodes a prp 1 teine of 283 amino acids, and the Eca28SA3 has an open reading structure of 840-bp that encodes a protein of 280
í «É-lÍÍMÍ-rt-MÍM amino acids. The non-coding intergenic region between Eca28SA3 and ECa28-l was 345. bp in length (Figures 7 and
EXAMPLE 16! Homology of Amino Acid and Nucleic! The amino acid and nucleic sequences of all five genes of the 28-kDa protein were aligned using the Clustal method to examine the homology between these genes. Nucleic acid homology was found in the range of 58 to 75% and similar amino acid homology was observed in a range of 67 to 72% between the members of the 28-kDa protein gene of E. cani s (Figure 9). I
EXAMPLE 17
Transcriptional Promoting Regions The intergenic regions between the ¡de genes of the 28-kDa protein were analyzed for the promoter sequences by comparing the promoter I 1 promoter Escherichia coli regions and a promoter] of E. chaffeensis (Yu et al., 1997 McClure, 1985). The putative 1 1 promoter sequences including RBS, regions -10 and -35 'in 4 were identified.
intergenic sequences corresponding to the i genes ECa28SA2, Eca28SA3, ECa28-l, and ECa28-2 (Figure 10). The non-coding region in the 5 'direction of ECa28SAl < it is not known and it was not analyzed.
EXAMPLE 18 N-terminal Signal Sequence The analysis of the amino acid sequence revealed that the entire E. canis Eca28-1 has a molecular mass! deduced from 30.5-kDa and the entire Eca28SA3 has a mass
molecular weight of 30.7-kDa. You love proteins have an N-terminal signal peptide of 23 amino acids (MNCKKILITTALMSLMYYAPSIS, i
SEQ ID No. 27), which is similar to that predicted by E. chaffeensis P28 (MNYKKILITSALISLISSLPGV SFS, SEQ ID No. 28), and the OMP-1 protein family (Yu et al., 1998; Ohashi et al. ., 1998b). A preferred cleavage site will stop signal peptidases (SIS, Ser-X-Ser) (Oliver, 1985) found in amino acids 21, 22, and 23 of ECa28-l. An additional putative cleavage site I was also presented at the position of amino acid '25 (MNCKKILITTALISLMYSIPSISSFS, SEQ ID No. 29) identical to the predicted cleavage site of E. chaffeensis P28 (SjFS), and 1 result in a mature Eca28-1 with a molecular mass
n.t i? i.i.irr? • i., - i, r i ^ e-aMMÉjÉt-l ^ iiÉÉMi ^ ^ predicted of 27.7-kDa. The cleavage site I i was predicted from the partial sequence previously reported from
Eca28SA2 at amino acid 30. However, the analysis of the predicted signal sequence had an unscreened signal sequence.
Summary Proteins of similar molecular mass have been identified and cloned from multiple riches agents including E. canis, E. chaffeensis, > and C. I ruminantium (Reddy et al., 1998; Jongejan et al., < 1993; Ohashi et al., 1998). A single site has been previously described in Ehrli chia chaffeensis with 6 p28 homologous genes, and
2 places in E. canis containing some genes i of the 28-kDa protein. The present invention demonstrated that the cloning, expression and characterization of the genes encoding a mature 28-kDa protein from E. canis that are homologous to the omp-l multiple gene family of E. chaffeensis and the I i C gene ruminate ti um. Two new genes were identified from the 28-kDa protein, ECa28-l and Eca28SA3. Another I-protein gene of 28-kDa E. canis, ECa28SA2, partially sequenced previously (Reddy et al., 1998), was completely sequenced in the present invention. The identification and characterization of a single place in E is also described.
canis that contains all the five genes of the protdine of I 28-kDa E. canis. The 28-kDa E. canis protein is homologous to the I I family E. OMP-1 chaffeensis and the MAP-1 protein of C. i rumanin tium. The majority of homologous proteins of 28-kDa! E. canis (Eca28SA3, Eca28-1 and Eca28-2) meet! in the sequential range of the place. The homology of these proteins I are in the range of 67.5% to 72.3%. the divergence between these 28-kDa proteins was from 27.3% to 38.6%. the 28-kDa proteins E. canis, Eca28SAl and Eca28SA2 were at least homologous with a range of homology of 50.9% to 59.4% and I! divergence from 53.3 to 69.9%. The differences between the genes lie mostly in the four regions and it is suggested that these regions be exposed on the surface and the subject select the pressure by the immune system. The conservation of ECa28-l has been reported among the seven isolated E. canis (McBride et al., 1999). Suggesting that the E. I canis may be clonal in North America. Conversely, 1 significant diversity of p28 has been reported among isolated E. I chaffeensis (Yu et al., 1998). All proteins of 28-kDa E. canis are I I presented by being post-translational processed from
_ ^ ^^^ _ ^^ _ ^^^^^^^^^ * ¿^^^^^^^^^^^^^ A ^^^^^ < A- »MSy.J? .tf I! II a 30-kD protein to a mature 28-kD protein, ii Recently, a signal sequence was identified in E. I chaffeensis P28 (Yu et al., 1998) and the N-terminal amino acid sequence has verified that the protein is processed post-translationally resulting in the cleavage of II the signal sequence to produce a mature protein
(Ohashi et al., 1998). The leader sequences of 0MP-1F 'and OMP-1E have also been proposed as leader signal peptides.
(Ohashi et al., 1998). The signal sequences identified I in E. chaffeensis OMP-1F, 0MP-1E and P28 are homologous! to the leader sequences of the 28-kDa protein E. canis. The promoter sequences for the p28 genes have not been determined experimentally, but the putative promoter regions were identified by comparing the consensus sequences of the RBS, the promoter regions 1-10 and -35 of E. coli and other ehrlichiae (Yu et al.,: 1997); McClure, 1985). Such promoter sequences would allow each gene to be potentially transcribed and translated,! I suggesting that these genes can be differentially expressed in the host. The persistence of infection in dogs can be related to the differential expression of p28 genes resulting in changes
; j .¿., - Att -! ^ y ^ ^^ antigenic in vivo, thus allowing the body to evade the immune response. The 28-kDa E. canis proteins may be important in immunoprotective antigens. Several reports have shown that the 30-kDa antigen of E. Canis exhibit strong immunoreactivity (Rikihisa et al., I 1992). The antiserum antibodies of the convalescent phase of humans and dogs have reacted consistently with proteins in this rangq size of E. chaffeensi s and E. canis, suggesting that these may be important immunoprotective antigens (Rikihisa et al., 1994, Chen et al., 1994, Chen et al., 1997). In addition, antibodies 30, 24 and 21-kDa proteins recently developed in the immune response for E. canis (Rikihisa et al., 1994, Rikihisa et al., 1992), suggesting that these proteins can be especially important in immune responses in the acute state of the disease. Recently, a family of homologous genes that encode outer membrane proteins with molecular masses of 28-kDa in E. chaffeensis, and mice immunized with E, have been identified. Recombinant P28 chaffeensis are presented by having developed immunity against the homologous change i I (Ohashi et al., 1998). It has been shown that P281 of E.
chaffeensis is present in the outer membrane, and immunoelectron microscopy has located P28 'on the surface of the organism, and thus suggest that it can' serve as an adhesive (Ohashi et al., 1998). It is likely that the 28-kDa proteins of E. canis identified in this study have the same place and possibly serve as a similar function. , Comparison of different strains of ECa28-l from
E. canis reveal that the gene is fully conserved. Studies involving E. chaffeensis have shown molecular and immunological evidence of diversity in the
ECa28-l. Patients infected with E. chaffeensis [have variable immunoreactivity for 29/28-kDa proteins, 1 suggesting that this is antigenic diversity (Chen et al., 1997). Recently molecular evidence has been generated to support antigenic diversity in the p28 ide E gene. chaffeensis (Yu et al., 1998). A comparison of five insulating E. chaffeensis revealed that two isolates (Sapulpa and St. Vincent) were 100% identical, but three others (Arkansas, Jax, 91HE17) are divergent by no less than 13.4% at the amino acid level. The conservation of ECa28-l suggests that E. canis strains found in
The United States can be genetically identical, and so the
^^ Mútí iá protein of 28-kDa of E. canis is an attractive candidIato of the vaccine for canine ehrlichiosis in the United States. i
In addition, the analysis of E. canis isolates outside the United States can provide information regarding the origin and evolution of E. canis. The conservation of the 28-kDa proteins make an important potential candidate for a reliable serodiagnosis of canine ehrlichiosis. I The role of homologous genes is not conjoined at this point; however, the persistence of E infections. canis in dogs could conceivably be related to an antigenic variation due to the variable expression of the homologous genes of the protein
28-kDa, thus allowing E. canis to evade immune supervision. The variation of the msp-3 > in A.
Margina is partially responsible for the variation in the MSP-3 protein, resulting in persistent infections I (Alleman et al., 1997). Studies to examine the I expression of the 28-kDa protein gene by E. canis in chronically infected dogs and acutely I would provide insight into the role of the family of the 28-kDa protein gene in the persistence of the infection. '
..... i-? ^ te *.
The following references are cited here. Alleman A.R., et al., (1997) Infect Immun 65: 156-163 Anderson B.E., et al., (1991) J Clin Mi crobiol 29: 2838¡-2842
Anderson B.E., et al., (1992) Int J. Syst Bacteriol 42 :, 299- i 302. 'Brouqui P., et al., (1997) Clin Diag Lab Immunol 4: 731-735. I Chen S.M. , et al., (1994) Am J Trop Med Hyg 50: 52-58 Dawson J.E. et al., (1992) Am J Vet Res 53: 1322-1327. Dawson J.E. et al., (1991) J Infect Dis 163: 564-567. Donatien, et al., (1935) Bull Soc Pathol Exot 28: 418-9. Ewing, (1935) J Am Vet Med Assoc 143: 503-6. Groves M.G., et al., (1975) Am J Vet Res 36: 937-940. Harrus S., et al., (1998) J Clin microbiol 36: 73-76. Jameson B.A., et al., (1988) CABIOS 4: 181-186. Jongejan F., et al., (1993) Rev Elev Med Vet Pays Trop 46: 145-152. McBride J. W., et al., (1996) J Vet Diag Invest 8: 441-447 McBride, et al.,. (1999) Clin Diagn Lab Immunol .; (In I press). ,! McClure, (1985) Ann Rev Biochem 54: 171-204. 'McGeoch D.J. (1985) Virus Res 3: 271-286. Nyindo, M., et al., (1991) Am J Vet Res 52: 1225-1230. I Nyindo, et al., (1971) Am J Vet Res 32: 1651-58.
Dawson J.E. et al., (1992) Infect Immun 66: 132-9. Dawson J.E. et al., (1992) J Clin Microb 36: 2671.80. I Reddy, et al., (1998) Biochem Biophys Res Comm 247: 636-43. i Rikihisa, et al., (1994) J Clin Microbiol 32: 2107-12. i Rothbard J.B. et al., (1988) The EMBO J7: 93-100. Sambrook J., et al., (1989) In Molecular Cloning: A Laboratory Manual. Cold Spring Harbor: Cold Spring Harbor Press. Troy G. C. et al., (1990) Canine ehrlichiosis. Infectious I diseases of the dog and cat. Green C. E. (ed). Philidelphia: W. B. Sauders Co. von Heijne, (1986) Nucí Acids Res 14: 4683-90. | i Walker, et al., (1970) J Am Vet Med Assoc 157: 43-55.; Weiss E., et al., (1975) Appl Microbiol 30: 456-463. Yu, et al., (1997) Gene 184: 149-154. Yu, et al., (1998) J. Clin. Microbiol. (In press). Any patent or publications mentioned in this specification are indicative of the standards of those skilled in the art to which the invention belongs. These patents and publications are incorporated herein by reference by the same scope as the individual publication was individually indicated to be incorporated by reference.
- u-Htoa ,, ..- .. ~ j * .. - * a *? AA *. A person skilled in the art will readily appreciate that the present invention is well adapted to carry out the objectives ii and obtain the purposes and advantages mentioned, I as well as those that are inherent to them. The present examples together with the methods, procedures, I treatments, molecules, and specific compounds described herein are representative of the preferred embodiments at present, are examples, and are not intended to limit the scope of the invention. The changes thereof and other uses I will be presented by those skilled in the art which are encompassed within the spirit of the invention as defined by the scope of the claims.
It is noted that in relation to this date, the best method known to the applicant to carry out the aforementioned invention, is that which is clear from the present description of the invention.
^^ * ^^ ij¿yya »g? LIST OF SEQUENCES
< 110 > Walker, David K. McBride, Jere W. Yu, Xue-Jie < 120 > Genes of the Immunodominant Protein of 28-kilodalton homologous of Ehriichia canis and uses thereof < 130 > D6152PCT < 141 > 1999-1 1-30 < 150 > 09/261, 358 < 151 > 1999-03-03 < 160 > 33
< 210 > 1 < 211 > 1607 < 212 > DNA < 213 > Ehriichia canis < 220 > < 223 > nucleic acid sequence of? Ca28-l < 400 > 1 attttattta ttaccaatct tatataatat attaaatttc tc tacaaaa 50 atctctaatg ttttatacct aatatatata ttctggcttg tatetaettt 100 | gcacttccac tattgttaat ttattttcac tattttaggt gtaatatgaa 150 attcttataa ttgcaaaaaa caactgeatt aatatcatta atgtactcta 200 ttccaagcat atctttttct gatactatac aagatggtaa catgggtggt, 250 ttagtggaaa aacttctata gtatgtacca agtgtctcac attttggtag 1300 cttctcagct aaagaagaaa gcaaatcaac tgttggagtt tttggattaa, 350 aacatgattg ggatggaagt ccaatactta agaataaaca cgctgacttt '400 actgttccaa actattcgtt cagataegag aacaatecat ttctagggtt! 450 tgcaggagct atcggttact caatgggtgg eccaagaata gaattcgaaa i500 tatcttatga ageattegac gtaaaaagtc ctaatat'caa ttatcaaaat 550 gacgcgcaca ggtactgcgc tetatetcat cacacatcgg cagccatgga 600 agetgataaa tttgtcttct taaaaaacga agggttaatt gacatatcac 650 ttgcaataaa tgcatgttat gatataataa atgacaaagt acctgtttct, 700 cettatatat gcgcaggtat tggtactgat ttgatttcta tgtttgaagc 750 tacaagtcct aaaatttcct accaaggaaa actgggcatt agttacteta 800 | ttaatccgga aacctctgtt ttcatcggtg ggcatttcca caggatcata 850 ggtaatgagt ttagagatat tcctgcaata gtacctagta actcaactac 900 aataagtgga ccacaatttg caacagtaac actaaatgtg tgtcactttg ¡950
* - * --- • »^ gtttagaact tggaggaaga átUjfaj tt aacttct aattttactg ttgccacata llOOD ttaaaaatga tctaaacttg tttztawtat tgctacatac aaaaaaagaa the bear aaatagtggc aaaagaatgt agcaataaga gggggggggg ggaccaaatt iioo tatcttctat gcttcccaag ttt tcycg ctatttatga cttaaacaac 1150 agaaggtaat atcctcacgg aaaacttatc ttcaaatatt ttatttatta i oo ccaatcttat ataatatatt ttctct aaa! tacaaaaatc actagtattt 1250 tataccaaaa tatatattct gacrtgcttt tcttctgcac ttctactatt 1300 tttaatttat ttgtcactat taggttataa taawatgaat tgcmaaagat 1350! ttttcatagc aagtgcattg atatcactaa tgtctttctt acctagcgta 1400 tctttttctg aatcaataca tgaagataat ataaatggta acttttacat 1450 tagtgcaaag gtgcctcaca tatatgccaa ttttcagtta ctttggcgta aaacacaaca i500 i550 aagaagagaa actggagttt tcggattaaa acaagattgg gacggagcaa cactaaagga tgcaagcwgc agccacacaw tagacccaag tacaatg i600 i607
< 210 > 2 < 211 > 278 < 212 > PRT < 213 > Ehriichia canis < 220 > I < 223 > sequence of the amino acids of the ECa28-1 protein < 400 > I Met Asn Cys Lys Lys lie Leu lie Thr Thr Ala Leu lie SerLeu
Met Tyr Ser lie
Gly Asn Met Gly
Ser Val Ser His 50 55 60 Ser Thr Val Gly Val Phe Gly Leu Lys His Asp Trp Asp Gly, Ser 65 70! 75 Pro lie Leu Lys Asn Lys His Wing Asp Phe Thr Val Pro Asn¡ Tyr 80 85 90 Ser Phe Arg Tyr Glu Asn Asn Pro Phe Leu Gly Phe Wing Gly¡ Wing 95 100 105 lie Gly Tyr Ser Met Gly Gly Pro Arg He Glu Phe Glu He Ser 110 115 120
Tyr Glu Wing Phe Asp Val Lys Ser Pro Asn He Asn Tyr Gln Asp 125 130 135
Asp Ala His Arg Tyr Cys Ala Leu Ser Kis His Thr Ser Ala Ala 140 145 150
Met Glu Wing Asp Lys Phe Val Phe Leu Lys Asn Glu Gly Leu 155 155 165
Asp He Ser Leu Wing He Asn Wing Cys Tyr Asp He He Asn Asp 170 175 180
Lys Val Pro Val Ser Pro Tyr He Cys Ala Gly He Gly Thr Asp 185 190 195
Leu He Ser Met Phe Glu Wing Thr Ser Pro Lys He Ser Tyr Gln 200 205 210
Gly Lys Leu Gly He Ser Tyr Ser He Asn Pro Glu Thr Ser Val 215 220 225 i Phe He Gly Gly His Phe His Arg He He Gly Asn Glu Phe Are 230 235 240
Asp He Pro Ala He Val Pro Ser Asn Ser Thr Thr He Ser Gly 245 250 255 i Pro Gln Phe Ala Thr Val Thr Leu Asn Val Cys His Phe Gly Leu 260 265 270
Glu Leu Gly Gly Arg Phe Asn Phe 275
< 210 > 3 < 211 > 849 < 212 > DNA < 213 > Ehriichia canis < 220 > < 221 > mat_peptide < 223 > nucleic acid sequence of ECa28SA2 < 400 > 3 atgaattgta aaaaagtttt cacaataagt gcattgatat catecatata! 50 cttcctacct aatgtctcat actctaaccc agtatatggt aacagtatgt 100 atggtaattt ttacatatca ggaaagtaca tgccaagtgt tcctcatttt 150 ggaatttttt cagctgaaga agagaaaaaa aagacaactg tagtatatgg 200 cttaaaagaa aactgggcag gagatgcaat ate agtcaa agtccagatg 250 ataattttac cattcgaaat tacteattea agtatgcaag caacaagttt | 300"-" ----- • ttagggtttg cagtagctat tggttactcg ataggcagtc aga caagaa 350 agttgagatg tcttatgaag catttgatgt gaaaaatcca ggtgataatt 400 acaaaaacgg tgcttacagg tattgtgc t tatctcatca agatgatgcg 45 gatgatgaca tgactagtgc aactgacaaa tttgtatatt taattaatga 50 aggattactt aacatatcat ttatgacaaa catatgttat gaaacagcaa 55 gcaaaaatat acctctctct ccttacatat gtgcaggtat tggtac gat 60 ttaattcaca tgtttgaaac tacacatcct aaaatttctt atcaaggaaa 65 gctagggttg gcctacttcg taagtgcaga gtcttcggtt tcttttggta 70 tatattttca taaaattata aataataagt ttaaaaatgt tccagccatg 75.0 gtacctatta actcagacga gatagtagga ccacagtttg caacagtaac 8QO i attaaatgta tgctactttg gattagaact tggatgtagg ttcaacttc 849
< 210 > 4 < 211 > 283 < 212 > PRT < 213 > Ehriichia canis < 220 > < 223 > amino acid sequence of the ECa28SA2 protein < 400 > 4 Met Asn Cys Lys Lys Val Phe Thr He Ser Wing Leu He Ser Ser 5 10 15
He Tyr Phe Leu Pro Asn Val Ser Tyr Ser Asn Pro Val Tyr Gly 20 25 30
Asn Ser Met Tyr Gly Asn Phe Tyr He Ser Gly Lys Tyr Met Pro 35 40 45
Ser Val Pro His Phe Gly He Phe Ser Ala Glu Glu Glu Lys Lys 50 55 60
Lys Thr Thr Val Val Tyr Gly Leu Lys Glu Asn Trp Wing Gly Asp 65 70 75
Wing I Be Ser Gln Ser Pro Asp Asp Asn Phe Thr He Arg Asn 80 85 90
Tyr Ser Phe Lys
Ala He Gly Tyr 110 115 120
Ser Tyr Glu "Wing" Phe Asp Val Lys Asn Pro Gly Asp Asn Tyr ^ ys 125 130 ¡135
- ~ ** ~ * ~ to sn Gly Wing Tyr Arg Tyr Cys Wing Leu Ser His Gln Asp Asp Wing 14C .45 150 Asp Asp Asp Met Thr Ser Wing Thr Asp Lys Phe Val -Tyr Leu He 155 160 1165! Asn Glu Gly Leu Leu Asn He Be Phe Met Thr Asn He Cys iTyr 170 175 180 Glu Thr Ala Ser Lys Asn He Pro Leu Ser Pro Tyr He Cys Wing 185 190 195 Gly He Gly Thr Asp Leu He His Met Phe Glu Thr Thr His Pro 200 205 210 I Lys He Ser Tyr Gln Gly Lys Leu Gly Leu Ala Tyr Phe Val (Ser 215 220 225 Wing Glu Ser Ser Val Ser Phe Gly He Tyr Phe His Lys He He 230 235 | 240 Asn Asn Lys Phe Lys Asn Val Pro Wing Met Val Pro He Asnj Ser 245 250 | 255 Asp Glu He Val Gly Pro Gln Phe Ala Thr Val Thr Leu Asn¡ Val 260 265 '270 Cys Tyr Phe Gly Leu Glu Leu Gly Cys Arg Phe Asn Phe 275 280 '
< 210 > 5 < 211 > 840, < 212 > DNA < 213 > Ehriichia cernís < 220 > < 221 > mat_peptide < 223 > nucleic acid sequence of ECa28SA3 < 400 > May 1 atgaattgca aaaaaattct tataacaact gcattaatgt cattaatgta 50 ctatgctcca agcatatctt tttctgatac tatacaagac gataacactg 100 catcagtgga gtagcttcta aaatatgtac caagtgtttc acattttggt 150 gttttctcag aagaaactca ctaaagaaga actgttggag tttttggatt |! 200 aaaacatgat tggaatggag gtacaatatc taactcttct ccagaaaata 250 tattcacagt tcaaaattat tcgtttaaat acgaaaacaa cccattctta 300 gggtttgcag gagctattgg ttattcaatg ggtggcccaa gaatagaact 350 tgaag cg tacgagacat tcgatgtgaa aaatcagaac aataattata 400
J ^^ • "* - línr t - '-Mjáa-fa-aat-aÉ - ...- ^ i-ii ttijM agaacggcgc acacagatac tgtgctttat ctcatcatag t cagcaaca 450 agcatgtcct ccgcaagtaa caaatttgtt ttcttaaaaa a gaagggtt: 00 aattgactta tcatttatga taaatgcatg ctatgacata ataattgaag 5¡50 gaatgccttt ttcaccttat at GTGCAG gtgttggtac tgatgttgtt 6.00 tccatgtttg aagctataaa tcctaaaatt tcttaccaag gaaaactag 650 attaggttat agtataagtt cagaagcctc tgtttttatc ggtggacact 100 ttcacagagt cataggtaat gaatttagag acatccctgc tatggttcct 750 atcttccaga agtggatcaa aaaccaattt gcaatagtaa cactaaatg 800 840 gtgtcacttt ggcatagaac ttggaggaag atttaacttc
< 210 > 6 < 211 > 280 < 212 > PRT < 213 > Ehriichia canie < 220 > < 223 > sequence of the amino acids of the ECa28SA3 protein < 400 > 6 I Met Asn Cys Lys Lys He Leu He Thr Thr Wing Leu Met Ser Leu 5 10 | 15 Met Tyr Tyr Ala Pro Ser He Phe Ser Asp Thr He GlnjAsp 20 25 i 30 Asp Asn Thr Gly Ser Phe Tyr He Ser Gly Lys Tyr Val Pro Ser 35 40 45 Val Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg Asnj Ser 50 55 '60 Thr Val Gly Val Phe Gly Leu Lys His Asp Trp Asn Gly Gly, Thr 65 70! 75 I have been Asn Being Ser Glu Asn He Phe Thr Val Gln Asn¡ Tyr 80 85 ¡90 Being Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Wing Gly¡ Wing 95 100! 105 He Gly Tyr Ser Met Gly Gly Pro Arg He Glu Leu Glu Vali Leu 110 115 120 Tyr Glu Thr Phe Asp Val Lys Asn Gln Asn Asn Asn Tyr Lys Asn 125 130 135 Gly Ala His Arg Tyr Cys Ala Leu Ser His His Ser Ser Ala Thr 140 145 150
Being Met Being Being Wing Being Asn Lvs Phe Val Phe Leu Lvs Asn | Glu 155 160 165 Gly Leu He Asp Leu Being Phe Met He Asn Aia Cys Tyr Asp He 170 175 180 He He Glu Gly Met Pro Phe Ser Pro Tyr He Cys Ala Gly Val 185 190 195 Gly Thr Asp Val Val Ser Met Phe Glu Ala He Asn Pro Lys He 200 205 210 Ser Tyr Gln Gly Lys Leu Gly Leu Gly Tyr Ser He Ser Ser Glu 215 220 225 Wing Ser Val Phe He Gly Gly His Phe His Arg Val He Gly Asn 230 235 240 Glu Phe Arg Asp He Pro Wing Met Val Pro Ser Gly Ser Asn Leu 245 250 255 Pro Glu Asn Gln Phe Wing He Val Thr Leu Asn Val Cys His Phe 260 265 270 Gly He Glu Leu Gly Gly Arg Phe Asn Phe 275 280
< 210 > 7 < 211 > 133 < 212 > PRT < 213 > Ehriichia canis < 220 > < 223 > partial sequence of the amino acid of the ECa2ßSA2 protein < 400 > 7 Met Asn Cys Lys Lys Val Phe Thr He Ser Wing Leu He Serj Ser 5 10 15
He Tyr Phe Leu Pro Asn Val Ser Tyr Ser Asn Pro Val Tyr¡ Gly 20 25 '; 30
Asn Ser Met Tyr Gly Asn Phe Tyr He Ser Gly Lys Tyr Met; Pro 35 40 45
Ser Val Pro His Phe Gly He Phe Ser Ala Glu Glu Glu Lys Lys 50 55! 60 Lys Thr Thr Val Val Tyr Gly Leu Lys Glu Asn Trp Wing Gly Asp 65 70 Wing 75 Wing Be Ser Gln Ser Pro Asp Asp Asn Phe Thr He Arg Asn 80 85! 90"- • -" • "- • -» - * - < "- < * - i ~ -f *? ia ** m? * L? iA? I Tyr Ser Phe Lys Tyr Wing Ser Asn Lys Phe Leu Gly Phe Wing Val 95 100 105 Wing He Giy Tyr Ser He Gly Ser Pro Arg He Glu Val Glu Met 110 115 120 Ser Tyr Glu Ala Phe Asp Val Lys Asn Gln Gly Asn Asn 125 130
< 210 > 8 < 211 > 287 ¡< 212 > PRT < 213 > Ehriichia canis < 220 > < 223 > amino acid sequence of the ECa28SA1 protein i < 40C > 8 Met Lys Tyr Lys Lys Thr Phe Thr Val Thr Wing Leu Val Leu1 Leu 5 10 '15 Thr Ser Phe Thr His Phe He Pro Pro Phe Tyr Ser Pro Ala Arg Ale 20 25, 30 Ser Thr He His Asn Phe Tyr He Ser Gly Lys Tyr Met Pro1 Thr 35 40 45 Wing Ser His Phe Gly He Phe Ser Wing Lys Glu Glu Gln Serj Phe 50 55, 60 Thr Lys Val Leu Val Gly Leu Asp Gln Arg Leu Ser His Asn | He 65 70 | 75 He Asn Asn Asn Asp Thr Ala Lys Ser Leu Lys Val Gln Asri Tyr 80 85 '90 Ser Phe Lys Tyr Lys Asn Asn Pro Phe Leu Gly Phe Wing Gly Wing 95 100 105 He Gly Tyr Being He Gly Asn Being Arg He Glu Leu Glu Val Ser 110 115 120 His Glu He Phe Asp Thr Lys Asn Pro Gly Asn Asn Tyr Leu Asn 125 130 135 Asp Ser His Lys Tyr Cys Ala Leu Ser His Gly Ser His He Cys 140 145 150 Ser Asp Gly Asn Ser Gly Asp Trp Tyr Thr Wing Lys Thr Asp Lys 155 160 165 Phe Val Leu Leu Lys Asn Glu Gly Leu Leu Asp Val Ser Phe Met 170 175 180
- '- "- - * - g & Leu Asn Ala Cys Tyr Asp He Thr Thr Glu Lys Met Pro Phe Ser 185 190 L95 Pro Tyr He Cys Wing Gly He Gly Thr Asp Leu He Ser Met Phe 200 205 210 I Glu Thr Thr Gln Asn Lys He Ser Tyr Gln Giy Lys Leu Gly Leu 215 220 225 Asn Tyr Thr He Asn Ser Arg Val Ser Val Phe Aia Gly Gly His 230 235 240 Phe His Lys Val He Gly Asn Glu Phe Lys Gly He Pro Thr Leu 245 250 ¡255 Leu Pro Asp Gly Ser Asn He Lys Val Gln Gln Ser Wing Thr Val 260 265 270 Thr Leu Asp Val Cys His Phe Gly Leu Glu He Gly Ser Arg | Phe 275 280! 285 Phe Phe '
< 210 > 9 < 211 > 281 < 212 > PRT < 213 > Ehriichia chaffeensis < 220 > < 223 > amino acid sequence of E. chaffeensis P28 < 400 > 9 Met Asn Tyr Lys Lys Val Phe He Thr Ser Ala Leu He Ser! eu 5 10 15 I Ser Ser Leu Pro Gly Val Ser Phe Ser Asp Pro Wing Gly, Ser 20 25 30 Gly He Asn Gly Asn Phe Tyr He Ser Gly Lys Tyr Met Pro Ser 35 40 45 Wing Ser His Phe Gly Val Phe Ser Wing Lys Glu Glu Arg Asn Thr 50 55 60 Thr Val Gly Val Phe Gly Leu Lys Gln Asn Trp Asp Gly Ser Wing 65 70 75 He Ser Asn Ser Ser Pro Asn Asp Val Phe Thr Val Ser Asn Tyr 80 85 90 Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Ala Glyj Ala 95 100 105
He Gly Tyr Ser Met Asp Gly Pro Arg He Giu Leu Glu Val Ser 110 115 120
Tyr Glu Thr Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr Lys Asn 125 130 135
Glu Ala His Arg Tyr Cys Ala Leu Ser His Asn Ser Ala Aia Asp 140 145 150
Met Being Being Wing Being Asn Asn Phe Val Phe Leu Lys Asn Giu Gly 155 160 165
Leu Leu Asp He Being Phe Met Leu Asn Ala Cys Tyr Asp Val iVal 170 175 '180
Gly Glu Gly He Pro Phe Ser Pro Tyr He Cys Wing Gly He Gly 185 190 195
Thr Asp Leu Val Ser Met Phe Glu Wing Thr Asn Pro Lye He Be 200 205 '210
Tyr Gln Gly Lys
Ser Val Phe He Phe Arg Asp He Pro Thr He He Pro Thr Gly Ser Thr Leu Wing 245 250 '255
Gly Lys Gly Asn Tyr Pro Wing He Val He Leu Asp Val Cys His His 260 265 ¡270
Phe Gly He Glu Leu Gly Gly Arg Phe Ala Phe '275 280
< 210 > • 10 < 211 > 283 < 212 > PRT < 213 > Ehriichia chaffeensis < 220 > < 223 > amino acid sequence d »< 400 > 10 Met Asn Tyr Lys Lys He Phe Val Ser Ser Ala Leu He Ser Leu 5 10 ¡15
Met Ser He Leu Pro Tyr Gln Be Phe Wing Asp Pro Val Thr Ser 20 25! 30
***** * • • '- «»! - '^ - "*** - -' -" | | Asn Asp Thr Gly
Val Lys Tyr Asn 50 55 ¡60 Glu Glu Ala Pro He Asn Gly Asn Thr Ser He Thr Lys Lys' Val 65 70! 75 Phe Gly Leu Lys Lys Asp Gly Asp He Wing Gln Ser Wing Asn Phe 80 85 i 90 i Asn Arg Thr Asp Pro Wing Leu Glu Phe Gln Asn Asn Leu lie Be 95 100! 105 Gly Phe Ser Gly Ser He Gly Tyr Wing Met Asp Gly Pro Arg He 110 115 120 Glu Leu Glu Wing Wing Tyr Gln Lys Phe Asp Wing Lys Asn Pro Asp 125 130 '135 Asn Asn Asp Thr Asn Ser Gly Asp Tyr Tyr Lys Tyr Phe Gly Leu 140 145 150 Ser Arg Glu Asp Wing He Wing Asp Lys Lys Tyr Val Val Leu Lys 155 160 '165 Asn Glu Gly He Thr Phe Met Ser Leu Met Val Asn Thr Cys Tyr 170 175 ii 180 Asp He Thr Ala Glu Gly Val Pro Phe He Pro Tyr Wing Cys Wing 185 190 195 Gly Val Gly Wing Asp Leu He Asn Val Phe Lys Asp Phe Asn Leu 200 205 210 Lys Phe Ser Tyr Gln Gly Lys He Gly He Ser Tyr Pro lie Thr 215 220 '225 I Pro Glu Val Ser Wing Phe He Gly Gly Tyr Tyr His Gly Val He 230 235 240 Gly Asn Asn Phe Asn Lys He Pro Val He Thr Pro Val Val Leu 245 250! 255 Glu Gly Ala Pro Gln Thr Thr Ser Ala Leu Val Thr He Asp Thr 260 265 270 Gly Tyr Phe Gly Gly Glu Val Gly Val Arg Phe Thr Phe 275 280
< 210 > 11 < 211 > 280
eleven
"* - ~" "- - <I <212> PRT! <213> Ehriichia chaffeensis j <220> <223> amino acid sequence of .chaffeensie OM ^ -IC < 400> 11! Met Asn Cys Lys
Met Ser Phe Leu Asp Ser Val Ser Gly Asn Phe Tyr He Ser Gly Lys Tyr Met Pro 35 40 45 Ser Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Lys Asn 50 55 60 Pro Thr Val Ala Leu Tyr Gly Leu Lys Gin Asp Trp Asn Gly Val 65 70 75 Be Ala Be Ser His Wing Asp Wing Asp Phe Asn Asn Lys Gly Tyr 80 85 90 Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Wing Gly Wing 95 100 I05 I He Gly Tyr Ser Met Gly Gly Pro Arg He Glu Phe Glu Val Ser 110 115 120 Tyr Glu Thr Phe Asp Val Lys Asn Gln Gly Gly Asn Tyr Lys Asn 125 130 135 Asp Wing His Arg Tyr Cys Wing Leu Asp Arg Lys Wing Ser Ser Thr 140 145 150 Asn Wing Thr Wing His Ser Tyr Val Leu Leu Lys Asn Glu Gly Leu 155 160 165 Leu Asp He Ser Leu Met Leu Asn Wing Cys Tyr Asp Val Val Ser 170 175 180 Glu Gly He Pro Phe Ser Pro Tyr He Cys Wing Gly Val Gly Thr 185 190 195 Asp Leu He Ser Met Phe Glu Wing He Asn Pro Lys He Ser Tyr Gln Gly Lys Leu Gly Leu Ser Tyr Ser He Asn Pro Glu Wing Ser 215 220 225 Val Phe Val Gly Gly His Phe His Lys Val Wing Gly Asn Gli Phe 230 235 240
12
Arg Asp He Ser Thr Leu Lys Aia Phe Ala Thr Pro Ser Ser, Ala 245 250 255
Wing Thr Pro Asp Leu Wing Thr Val Thr Leu Ser Val Cys His Phe 260 265 270
Gly Val Glu Leu Giy Gly Arg Phe Asn Phe 275 280 -
< 210 > 12 < 211 > 286 < 212 > PRT < 213 > Ehriichia chaffeensis < 220 > < 223 > amino acid sequence of? OMP-1D chaffeensis
< 400 > 12 Met Asn Cys Glu Lys Phe Phe He Thr Thr Ala Leu Thr Leu Leu 5 10 15
Met Ser Phe Leu Pro Gly Be Ser Leu Ser Asp Pro Val Glni Asp 20 25 30
Asp Asn He Ser Gly Asn Phe Tyr He Ser Gly Lys Tyr Met Pro 35 40 45
Being Ala Ser His Phe Gly Val Phe Ser Ala Lys Glu Glu Arg¡ Asn 50 55 60
Thr Thr Val Gly Val Phe Gly He Glu Gln Asp Trp Asp Arg Cys 65 70 75
Val He Ser Arg Thr Thr Leu Ser Asp He Phe Thr Val Pro Asn 80 85 90
Tyr Ser Phe Lys Tyr Glu Asn Asn Leu Phe Ser Gly Phe Wing Gly 95 100. 105
Ala He Gly Tyr Ser Met Asp Gly Pro Arg He Glu Leu Glu? Val 110 115 120
Ser Tyr Glu Wing Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr, Lys i 125 130 I 135!
Asn Glu Ala His Arg Tyr Tyr Ala Leu Ser His Leu Leu Gly Thr 140 145 150
Glu Thr Gln He Asp Gly Wing Gly Being Wing Being Val Phe Leu He
155 160 165
13 Asn Giu Gly Leu Leu Asp Lys Ser Phe Met Leu Asn Wing Cys Tyr 170 175 180 Asp Val He Ser Glu Gly He Pro Phe Ser Pro Tyr He Cys Wing 185 190 195 Gly He Gly He Asp Leu Val Ser Met Phe Glu Wing He Asn Pro 200 205 210 Lys He Ser Tyr Gln Gly Lys Leu Gly Leu Ser Tyr Pro He i Ser 215 220 225 Pro Glu Wing Ser Val Phe He Gly Gly His Phe His Lys Val He 230 235 240 Gly Asn Glu Phe Arg Asp Lie Pro Thr Met He Pro Ser Glu 'Ser 245 250 255 Ala Leu Ala Gly Lys Gly Asn Tyr Pro Ala He Val Thr Leu Asp 260 265 270 Val Phe Tyr Phe Gly He Glu Leu Gly Gly Arg Phe Asn PheiGln 275 280 285
Leu
< 210 > 13! < 211 > 278 I < 212 > PRT | < 213 > Ehriichia chaffeensis, < 220 > < 223 > amino acid sequence of E. chaffeensis OMP-1E
< 400 > 13! i
Met Asn Cys Lys Lys Phe Phe He Thr Thr Ala Leu Val Ser! Leu 5 10 15
Met Ser Phe Leu Pro Gly He Ser Phe Ser Asp Pro Val Gln Gly 20 25 30
Asp Asn He Ser Gly Asn Phe Tyr Val Ser Gly Lys Tyr Met Pro 35 40 45
Being Ala Ser His Phe Gly Met Phe Being Ala Lys Glu Glu Lys¡ Asn 50 55 ¡60
Pro Thr Val Ala Leu Tyr Gly Leu Lys Gln Asp Trp Glu Gly He 65 70! 75
Being Being Being His As Asp Asn His Phe Asn Asn Lys Gly Tyr 80 85 90
14
_ ^ _ j¡y¡¿ ^ * faith * ¡^ ¿^ iíw ^^^ > Be Phe Lys Tyr Glu Asn Asn Pro Phe Leu Giy Phe Wing Gly Wing 95 100 105 He Gly Tyr Ser Met Gly Gly Pro Arg Val Glu Phe Glu Val Ser 110 115 120 Tyr Giu Thr Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr Lys Asn 125 130 135 Asp Wing His Arg Tyr Cys Wing Leu Gly Gln Gln Asp Asn Ser Gly 140 145 150 He Pro Lys Thr Ser Lys Tyr Val Leu Leu Lys Ser Glu Gly i Leu 155 160 165 Leu Asp I Am Being Phe Met Leu Asn Ala Cys Tyr Asp I have He Asn Glu Being He Pro Leu Being Pro Tyr He Cys Wing Gly Val Gly > Thr 185 190 195 Asp Leu He Ser Mét Phe Glu Wing Thr Asn Pro Lys He Ser Tyr 200 205 210 Gln Gly Lys Leu Gly Leu Ser Tyr Ser He Asn Pro Glu Ala i Ser 215 220 225 Val Phe He Gly Gly His Phe His Lys Val He Gly Asn Glu Phe 230 235 ¡240 Arg Asp He Pro Thr Leu Lys Ala Phe Val Thr Ser Ser Ala ¡Thr 245 250 255 Pro Asp Leu Ala He Val Thr Leu Ser Val Cys His Phe Gly He 260 265, 270 Glu Leu Gly Gly Arg Phe Asn Phe 275
< 210 > 14 < 211 > 280 < 212 > PRT < 213 > Ehriichia chaffeensis < 220 > < 223 > amino acid sequence of E. chaffeensis 0MP-1F < 400 > 14 I
Met Asn Cys Lys Lys Phe Phe He Thr Thr Thr Leu Val Ser¡Leu 5 10 ¡15
fifteen
Met Ser Phe Leu Pro Gly He Ser Phe Ser Asp Wing Val Gin Asn 20 25 30 Asp Asn Val Gly Asn Phe Tyr He Ser Gly Lys Tyr Val Pro 35 40 45 Ser Val Ser His Phe Gly Val Phe Ser Ala Lys Gln Glu Arg Asn 50 55 60 Thr Thr Thr Gly Val Phe Gly Leu Lys Gln Asp Trp Asp Gly Ser 65 70 75 Thr He Ser Lys Asn Ser Pro Glu Asn Thr Phe Asn Val Pro Asn 80 85 190 Tyr Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Wing Gly 95 100 105 Wing Val Gly Tyr Leu Met Asp Gly Pro Arg He Glu Leu Glu i Met 110 115 ¡120 Ser Tyr Giu Thr Phe Asp Val Lys Asn Gln Gly Asn Asn Tyr 'Lys 125 130 ¡135 Asn Asp Ala His Lys Tyr Tyr Ala Leu Thr His Asn Ser Gly¡Gly 140 145 150 Lys Leu Ser Asn Ala Gly Asp Lys Phe Val Phe Leu Lys Asn 'Glu 155 160 ¡165 Gly Leu Leu Asp He Ser Leu Met Leu Asn Ala Cys Tyr Asp Val 170 175 180 He Ser Glu Gly
Gly Thr Asp Leu
Ser Tyr Gln Gly 215 220 225 Wing Ser Val Phe Val Gly Gly His Phe His Lys Val He Gly Asn 230 235 240 Glu Phe Arg Asp He Pro Wing Met He Pro Ser Thr Ser Thr Leu 245 250 255 Thr Gly Asn His Phe Thr He Val Thr Leu Ser Val Cys His Phe 260 265 270 Gly Val Glu Leu Gly Gly Arg Phe Asn Phe 275 280
16
< 210 > 15 < 211 > 284 < 212 > PRT < 213 > Cowdria ruminantium < 220 > | < 223 > amino acid sequence of C. rupin ti um MAP-i < 400 > 15 Met Asn Cys Lys Lys He Phe He Thr Ser Thr Leu He Ser Leu 5 10 ¡15 Val Ser Phe Leu Pro Gly Val Ser Phe Ser Asp Val He Gln¡Glu 20 25 ¡30 Glu Asn Asn Pro Val Gly Ser Val Tyr I Have To Be Wing Lys Tyr Met 35 40 45 Pro Thr Wing His Phe Gly Lys Met Ser He Lys Glu Asp Ser 50 55 60 Arg Asp Thr Lys Wing Val Phe Gly Leu Lys Lys Asp Trp Asp 'Gly 65 70 75 Val Lys Thr Pro Ser Gly Asn Thr Asn Ser He Phe Thr Glu i Lys 80 85 90 Asp Tyr Ser Phe Lys Tyr Glu Asn Asn Pro Phe Leu Gly Phe Wing 95 100 105 Gly Wing Val Gly Tyr Ser Met Asn Gly Pro Arg He Glu Phe ! Glu 110 115 120 Val Ser Tyr Glu Thr Phe Asp Val Arg Asn Pro Gly Gly Asn Tyr 125 130 135 Lys Asn Asp Ala His Met Tyr Cys Ala Leu Asp Thr Ala Ser Ser 140 145 150 Ser Thr Ala Gly Ala Thr Thr Ser Val Met Val Lys Asn Glu Asn 155 160 165 Leu Thr Asp Be Ser Leu Met Leu Asn Wing Cys Tyr Asp He Met 170 175 180 Leu Asp Gly Met Pro Val Ser Pro Tyr Val Cys Ala Gly He Gly 185 190 195 Thr Asp Leu Val Ser Val He Asn Ala Thr Asn Pro Lys Leu Ser 200 205 210 Tyr Gln Gly Lys Leu Gly He Ser Tyr Ser He Asn Prt > Glu Ala 215 220 225
17
- - "•• * ~ p ** • ** .- .., .... ^ .é .. ...,." "T .., .... < M" «* ^ * M. ^ -MißH ^ ktfM Ser He Phe He Gly Gly Kis Phe Kis Arg Val He Gly Asri Glu 230 235 240 Phe Lys Asp He Wing Thr Ser Lys Val Phe Thr Ser Ser Gly Asn 245 250 255 Wing Being Ser Wing Val Ser Pro Gly Phe Wing Being Wing He Leu Asp 260 265 270 Val Cys His Phe Gly He Glu He Gly Gly Arg Phe Val Phé 275 280
< 210 > 16 < 211 > 20 < 212 > DNA < 213 > artificial sequence < 220 > < 221 > < 222 > linker 313-332 of c. ruzpinantium MAP-1, also nucleotides 307-326 of E. chaffee? sie P28 < 223 > 793 front primers for PCR < 400 > 16 gcaggagctg ttggttactc 20
< 210 > 17 < 211 > 21 < 212 > DNA < 213 > artificial sequence < 220 > < 221 > < 222 > nucleotides 823-843 of C. rt ipaptium MAP-1, also nucleotides 814-834 of E. chaffeensis P28 < 223 > reverse primer 1330 for PCR < 400 > 17
ccttcctcca agttctatgc c 21
< 210 > 18 < 211 > 24 < 212 > DNA
18
< ? 13 > artificial sequence < 220 > < 221 > link_strainer < 223 > 46f primer, specific for the gene? Ca28SA2 < 400 > 18
atatacttcc tacctaatgt ctca 24
< 210 > 19 < 211 > 20 < 212 > DNA < 213 > < 220 > < 221 > starter_link, < 223 > primer used to sequence the genes of the 28-kDa protein in E. canis ¡< 400 > 19!
agtgcagagt cttcggtttc 20
< 210 > 20 < 211 > 18 < 212 > DNA < 213 > < 220 > < 221 > link_strainer < 223 > primer used to sequence the 28-kDa protein genes in E. canis < 400 > twenty
gttacttgcg gaggacat 18
< 210 > 21 < 211 > 24 < 212 > DNA < 213 > artificial sequence < 220 > < 221 > starter_band < 222 > nucleotides 6 6887-710 of ECa28-l
19
< 223 > 394 primer for PCR < 400 > twenty-one
gcatttccac aggatcatag gtaa 24
< 210 > 22 < 211 > 24 < 212 > DNA < 213 > artificial sequence < 220 > < 221 > band_highlight < 222 > nucleotides 710-687 of ECa28-l < 223 > 394C primer for PCR < 400 > 22
ttacctatga tcctgtggaa atgc 24
< 210 > 23 < 211 > 20 < 212 > DNA < 213 > artificial sequence < 220 > < 221 > starter_link < 223 > r c > < e-hbaardtro? rr 77Q93 m hybridizes to a region with i Eca28-1 used to amplify the intergenic region between the gene ECa28SA3 and ECa28-1. < 400 > 2. 3
gagtaaccaa cage tec tgc 20
< 210 > 24 < 211 > 24 < 212 > DNA < 213 > artificial sequence < 220 > < 221 > flyer_strider
twenty
from? Ca28-l
tctactttgc acttccacta ttgt 24 < 210 > 25
< 212 > DNA
< 220 > < 221 > Cebador_banda Cß &ññ EC250W-! encoders adjacent to the open reading structure of
^ < nns
attcttttgc cactattttt cttt 24
< 210 > 26
< 212 > DNA ¡-i a-, < r22Q > '< 221 > Primer_link = ¿23? Primer ECaSAS -2 corresponded to Iss regions depíre of ECa28SA3, used to amplify the iptergenic region NC3
< 4üü 2 &
ct ggattag gctatagtat aagtt 25
< 210 > 27 ¡211 * 23 i < 212 > PRT < Z2.3 = > Jif chiu c ms < 220 > ! < 221 > PEPTIDO --223 ups signal fFFsrrpipat predicts the peptide of ECa2S-1 i
twenty-one
and? Ca23SA3 < 4CC > 27
Met Asn Cys Lys Lys He Leu He Thr Thr Aia Leu Ket Ser Leu 5 1C, 15 Met Tyr Tyr Ala Pro Ser He Ser 20
< 210 > - 28 < 211 > 25 < 212 > PRT < 213 > Ehriichia chaffeensis < 220 > < 223 > Amino acid sequence of: E. chaffeensis P28 < 400 > 28 Met Asn Tyr Lys Lys lie Leu He Thr Ser Wing Leu He Ser Leu 5 10 15 He Ser Ser Leu Pro Gly Val Ser Phe Ser 20 25
< 210 > 29 < 211 > 26 < 212 > PRT < 213 > Ehriichia canis < 220 > < 223 > Amino acid sequence of the putative cleavage site of ECa28-1 < 400 > 29 Met Asn Cys Lys Lys He Leu lie Thr Thr Ala Leu He Ser Leu 5 10! 15 Met Tyr Ser He Pro Ser He Ser Ser Phe Ser 20 25 i
< 210 > 30 < 211 > 299
~ n
_JMÍi¡_ _M- * i-Í ---- l *. . ,. . -j ^ jt-Jw- < 212 > DNA < 213 > ? hriichia canie < 220 > < 222 > Nucleic acid sequence of intergenic non-coding region 1 (28NC1)
< 400 > 30 taatacttct attgtacatc ttaaaaatas tactastttg cttctstsg '50 aagagagaaa ttataaacgc tagttagtaa taaattagaa ag aaatat 100 tagaaaagtc atatgttttt cattgtcatt gatactcaac taaaagtagt 150 crtattaata ataaatgtta gtatattaaa attttacgta tttcccttac 200 aaaagccact agtattttat actaaaagct atactttggc ttgtatttaa 250 tttgtatttt tactactg t aatttacttt cacrgtttcr ggtgtaaa- 299 < 210 > 31 < 2ii > 345 < 212 > DNA < 213 > Ehriichia canis < 220 > < 223 > Nuc intergenic acid sequence (28NC2) < 400 > 31 taatttcgtg gtacacatat cacgaagcta aaattgtttt tttatctctg 50 ctgtatacaa gagaaaaaat agtagtgaaa attaectaac aatatgacag 100 tacaagttta ecaagettat tctcacaaaa cttcttgtgt cttttatctc 150 tttacaatga aatgtacact tagetteact actgtagagt gtgtttatca 2¡00 atgctttgtt tattaatact ctacataata tgttaaattt ttcttacaaa 250 acteactagt aatttatact agaatatata ttctgacttg tatttgcttt 300 atacttccac tattgttaat ttattttcac tattttaggt gtaat 3 ^ 5
< 210 > 32 < 211 > 345 * < 212 > < 213 > Ehriichia canis < 220 > DNA < 223 > Sequence of intergenic acid (28NC3) < 400 > 32
2. 3
! egatettatt gttgccacat taaaaatg atctaaactt gtttttatta 50 ttgctacata caaaaaaaag aaaaatagtg gcaaaagaat gtagcaataa 100! gagggggggg ggggactaaa tttaccttct attcttctaa tattctttac 150 tatattcaaa tagcacaact caatgcttcc aggaaaatat gtttctaata 20C i ttttatttat taccaatcct tataaatat attaaatttc tcttacaaaa 250 atctctaatg ttttatactt aa ataata ttctggc tg tatttacttt 300i gcacttccac tattgttaat ttattttcac tattttaggt gtaat 345 '
< 210 > 33 < 211 > 355 < 212 > DNA < 213 > Ehriichia canis < 220 > < 223 > Nucleic acid sequence of integnetic non-coding region 4 (28NC4) < 400 > 33 taattttatt gttgccacat attaaaaatg atctaaactt gtttttawta 50 caaaaaaaga ttgctacata aaaatagtgg caaaagaatg tagcaataag 100 aggggggggg gggaccaaat ttatcttcta tgcttcccaa gttttttcyc 150 acttaaacaa gctatttatg cagaaggtaa tatcctcacg gaaaacttat 200 cttcaaatat tttatttatt accaatctta tataatatat taaatttctc 250 ttacaaaaat cactagtatt ttataccaaa atatatattc tgacttgctt 300 ttcttctgca cttctactat ttttaattta tttgtcacta ttaggttata taaw 350 355
24
.e $ ¿«- ** &B4
Claims (19)
1. The DNA sequences that encode a 30-kilodalton protein of Ehriichia canis, characterized in that said protein is immunoreactive with antiserum Ehriichia canis, and wherein said protein contains an amino acid sequence selected from the group consisting of SEQ ID No. 4 and SEQ ID No. 6.! I
2. The DNA sequences of claim 1, characterized in that said protein has an N-terminal signal sequence.
3. The DNA sequences of claim 1, characterized in that said protein is post-translationally modified to a 28-kilodalton protein.
4. The DNA sequences of claim 1, characterized in that the DNA has a sequence 1 selected from the group consisting of SEQ ID No. 3? and SEQ ID No. 5.
5. A vector comprising the DNA sequences I of claim 1, characterized in that said A N is contained in a single place or locus of Ehriichia canis.
6. The DNA sequences of claim 5, characterized in that said site or locus is a locus or locus of multiple genes of 5,592 kb in length.
7. The DNA sequences of claim 6, characterized in that said site or locus encodes 28-kilodalton homologous proteins of Ehriichia canis.
8. The DNA sequences of claim 7, characterized in that said homologous 28-kilodalton proteins of Ehriichia canis are selected from the group consisting of Eca28SAl, Eca28SA2, Eca28SA3, Eca28-1 and Eca28-2.
9. A vector comprising the DNA sequences of claim 1.
10. The vector of claim 9, characterized in that said vector is an expression vector capable of expressing a peptide or polypeptide encoded by the sequence selected from the group consisting of SEQ ID NO: 1 and SEQ ID NO: 3 and SEQ ID NO: 3. NO: 5, when said expression veptor is introduced into a cell.
11. A characterized recombinant protein, because it comprises the amino acid sequence is selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4 and SEQ ID NO: 6.
12. The recombinant protein of claim 12, characterized in that said amino acid sequence is encoded by a nucleic acid segment comprising a sequence selected from the group consisting of SEQ. NO: l, SEQ ID NO: 3 and SEQ ID NO: 5.
13. A host cell, characterized in that it comprises the nucleic acid segment selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3 and SEQ ID NO: 5.
14. A method for producing the recombinant proteote of claim 11, characterized in that it comprises the steps of: obtaining a vector comprising an expression region comprising a sequence encoding the amino acid sequence selected from the group consisting of SEQ ID NO. NO: 2, SEQ ID NO: 4 and SEQ ID NO: 6 operatively linked to a promoter; transfecting said vector into a cell; and culturing said cells under conditions effective for the expression of said expression region.
15. An immunoreactive antibody with an amino acid sequence, characterized in that it is selected from the group consisting of SEQ ID NO: and SEQ ID NO: 6.
16. A method for inhibiting the infection of Ehrlichia canis in a subject, characterized in that it comprises the steps of: identifying a subject suspected of being exposed to or infected with Ehriichia canis; and administering a composition comprising a 28-kDa antigen of Ehriichia canis in an amount I effective to inhibit an infection of Ehriichia canis.
17. The method of claim 16, wherein said 28 kDa antigen is a recombinant protein characterized in that it comprises an amino acid sequence 1 selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4 and SEQ ID NO: 6.
! 18. The method of claim 17, characterized in that said recombinant protein is encoded by a gene comprising a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3 and SEQ ID NO: 5. !
19. The method of claim 17, characterized in that said recombinant protein is dispersed in a pharmaceutically acceptable carrier.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US09201458 | 1998-11-30 | ||
| US09261358 | 1999-03-03 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MXPA01005378A true MXPA01005378A (en) | 2002-05-09 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6660269B2 (en) | Homologous 28-kilodalton immunodominant protein genes of Ehrlichia canis and uses thereof | |
| US6403780B1 (en) | Homologous 28-kilodalton immunodominant protein genes of ehrlichia canis and uses thereof | |
| AU2001290926A1 (en) | Homologous 28-kilodalton immunodominant protein genes of ehrlichia canis and uses thereof | |
| JP4550998B2 (en) | Ehrlichicanis 120 kDa immunodominant antigenic protein and gene | |
| US20020115840A1 (en) | Homologous 28-kilodaltion immunodominant protein genes of Ehrlichia canis and uses thereof | |
| AU762315B2 (en) | Homologous 28-kilodalton immunodominant protein genes of ehrlichia canis and uses thereof | |
| US20030147913A1 (en) | Ehrlichia chaffeensis 28 kDa outer membrane protein multigene family | |
| MXPA01005378A (en) | Homologous 28-kilodalton immunodominant protein genes of ehrlichia canis | |
| NO20110559L (en) | Use of nucleic acid sequences and amino acid sequences for diagnosis or detection of Piscirickettsia salmoni | |
| Immunoreactive | Molecular Cloning of the Gene for |