hpv通过什么传播| 肺不张是什么意思| 银花有焰万家春是什么生肖| 蜜蜡五行属什么| 什么是星座| nf是什么| 中药吃多了对人体有什么伤害| 为什么要打胰岛素| 曹操属什么| 素手是什么意思| 从什么时候开始| 2pcs是什么意思| 接站是什么意思| 话费为什么扣那么快| 下肢动脉硬化吃什么药| 伟哥是什么| 加码是什么意思| 睡觉背疼是什么原因| 8月27号是什么星座| 无性婚姻会有什么好处| 箜篌是什么乐器| 外阴病变有什么症状| 什么叫偏光眼镜| 妇科炎症小腹坠痛吃什么药| 脾胃是什么意思| 风寒咳嗽吃什么药| 手机服务密码是什么| 红馆是什么地方| 阴部毛变白是什么原因| 冬天有什么| 大拇指疼痛什么原因引起的| 飞机联程票是什么意思| 小孩脚底脱皮是什么原因造成的| 心律不齐房颤吃什么药| 休息是什么意思| 吃什么让月经量增多| 鸡蛋胶是什么鱼胶| 喝盐水有什么作用和功效| 布洛芬属于什么类药物| 来事吃什么水果好| 女人梦见棺材是什么征兆| 鼻子流黄水是什么原因| 籍贯指的是什么| 默念是什么意思| 什么是肌张力| 甘油三酯高吃什么药效果好| 什么东西进去是硬的出来是软的| 自渎什么意思| 貌不惊人是什么意思| 汉武帝叫什么| 卵泡不破是什么原因造成的| 紫萱名字的含义是什么| 谷丙转氨酶高吃什么药可以降下来| 头昏吃什么药| 八路军为什么叫八路军| 孕妇吃什么长胎不长肉| 小猫打什么疫苗| 阴谋是什么意思| 团粉是什么| 芒果和什么不能一起吃| 过敏忌口不能吃什么| 女人梦见掉头发是什么征兆| 梅毒螺旋体抗体阳性是什么意思| 后背中心疼是什么原因| 什么叫黑户| 阳历六月是什么星座| 三月初六是什么星座| 土地出让和划拨有什么区别| 对什么什么感兴趣| 蜘蛛为什么不是昆虫| 什么样人不能吃海参| 农历七月初七是什么节日| 朱砂是什么意思| 什么运动有助于长高| 反手引体向上练什么肌肉| 君子兰叶子发黄是什么原因| 浸润性是什么意思| 皮肤出现红点是什么原因| 驰字五行属什么| 佩戴沉香有什么好处| 对口升学什么意思| 什么时间运动减肥效果最好| nub是什么意思| 龟头敏感吃什么药| 龙的五行属性是什么| 佛龛是什么| 睾丸变小了是什么原因| 吃海鲜不能吃什么| 眼睛干涩吃什么中成药| 煮牛骨头放什么调料| 微商是什么意思| 诸葛亮是什么生肖| 腰肌劳损是什么原因造成的| 惯犯是什么意思| 6.25什么星座| yp是什么意思| 圻字五行属什么| 恐龙是什么时代| 四库全书是什么| 女人做梦梦到蛇是什么意思| 日本为什么经常地震| 气血不足吃什么药最好| 多发息肉是什么意思| pp材质和ppsu材质有什么区别| 净高是什么意思| 肌酐偏低有什么危害| 身上皮肤痒是什么原因| 上火吃什么| 荨麻疹是什么原因| 三七粉什么人不适合吃| 石女什么样| 血虚吃什么中成药| 卤水点豆腐的卤水是什么| 前列腺增生有什么症状表现| 宫颈钙化灶是什么意思| 小孩老是眨眼睛是什么原因| 气短吃什么药效果好| 蜜蜡脱毛有什么危害吗| 黑醋是什么醋| 专科什么意思| 优质是什么意思| 急性阴道炎是什么引起的| 右额头上有痣代表什么| 上海话十三点是什么意思| 国家为什么重视合肥| 一失足成千古恨是什么意思| 月经喝酒有什么影响| 水什么| 宫寒是什么原因引起的| 身上长红点很痒是什么原因| 嗳气是什么意思| 补是什么偏旁| 花千骨最后结局是什么| 女性阳性是什么病| 那个叫什么| 怔忡是什么意思| 冻干粉是什么| rush是什么东西| 孕期用什么护肤品| 今期难过美人关是什么生肖| 凯乐石属于什么档次| 静脉血是什么颜色| 纹身有什么危害| 什么书买不到| 婴儿什么时候可以吃盐| 紫癜有什么症状| 香蕉为什么是弯的| 弯弯的月儿像什么| 蜂蜜可以做什么美食| t代表什么| 东营有什么大学| 河汉是什么意思| 减肥期间可以吃什么水果| 脚指甲发白是什么原因| 豌豆有什么营养价值| 儿童包皮过长挂什么科| 工作是为了什么| 腰不好挂什么科| vj是什么意思| 蓝矾对人有什么危害| 烧心反酸水吃什么药| 脓毒症是什么病| 骨加后念什么| 水浒传有什么故事| 手足无措的意思是什么| 鼠辈是什么意思| 心律不齐什么症状| 碘化银什么颜色| 男性左下腹疼痛是什么原因| 打粉是什么意思| 极端是什么意思| 心脏吃什么药最好| 三级手术是什么意思| 苏州有什么好玩的地方| 女生什么时候是排卵期| 女性内科检查什么| 熬夜吃什么| 羿字五行属什么| 骨质断裂是什么意思| 00年属什么| 属龙的本命佛是什么佛| 空气炸锅可以做什么| 寄什么快递最便宜| 煤气是什么气体| 做梦梦到猪是什么意思| 9.9号是什么星座| 麻醉对身体有什么伤害| 突兀什么| 冰糖和白砂糖有什么区别| 水解奶粉是什么意思| 胃热吃什么食物好| 什么方法不掉头发| 吃一个海参相当于吃了什么| 什么是白噪音| 什么言什么色| 11月15日什么星座| 中暑是什么感觉| 柠檬什么时候开花结果| 伏地魔是什么意思| 捡什么废品最值钱| 王大治与董洁什么关系| 什么不止| 什么是eb病毒| 路痴是什么原因造成的| 什么叫总胆红素| 缺钾最忌讳吃什么| 粉尘螨是什么东西| 晚上吃什么不发胖| 岁寒三友是指什么| 反酸是什么感觉| 2月20号是什么星座| fashion是什么意思| 月牙消失了是什么原因| sherpa是什么面料| 麻油跟香油什么区别| 腮腺炎吃什么消炎药| 赵匡胤为什么不传位给儿子| 阿咖酚散是什么药| 修面皮是什么皮| 舌苔发黑是什么病| 菩提有什么寓意| 工口什么意思| 12月5日什么星座| 为什么会突然晕倒| 成功是什么| 色纸是什么| 心内科是看什么病的| 神阙穴在什么位置| 黄茶属于什么茶| 离苦得乐什么意思| 怀孕为什么会流褐色分泌物| 早饭吃什么好| 牛油果和什么榨汁好喝| 7.9是什么星座| 1846什么意思| 有生之年什么意思| 女生补气血吃什么好| 威化是什么意思| 胸是什么| 一马平川是什么意思| 医生停诊是什么意思| 同房后出血什么原因| 医院点痣挂什么科| 月字旁的字有什么| 为什么山东人个子高| 抓阄什么意思| 胎儿停止发育是什么原因造成的| 鲤鱼喜欢吃什么| 梦见掉牙齿是什么征兆| 冒菜是什么菜| 任达华是什么生肖| 艾滋病一年有什么症状| 炀是什么意思| 龟头炎有什么症状| 夫妻是什么意思| 高干文是什么意思| 怀孕十天左右有什么反应| 普罗帕酮又叫什么| 女人阴道痒是什么原因| 曹真和曹操什么关系| 雨中即景什么意思| 肝脏挂什么科| 感冒吃什么水果| 感冒可以吃什么水果| 百度Jump to content

From Wikipedia, the free encyclopedia
(Redirected from Peptide sequence)
Protein primary structureProtein secondary structureProtein tertiary structureProtein quaternary structure
The image above contains clickable links
The image above contains clickable links
This diagram (which is interactive) of protein structure uses PCNA as an example. (PDB: 1AXC?)
百度   导游服务质量将更优质  意见要求,提升导游服务质量。

Protein primary structure is the linear sequence of amino acids in a peptide or protein.[1] By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.

Formation

[edit]

Biological

[edit]

Amino acids are polymerised via peptide bonds to form a long backbone, with the different amino acid side chains protruding along it. In biological systems, proteins are produced during translation by a cell's ribosomes. Some organisms can also make short peptides by non-ribosomal peptide synthesis, which often use amino acids other than the encoded 22, and may be cyclised, modified and cross-linked.

Chemical

[edit]

Peptides can be synthesised chemically via a range of laboratory methods. Chemical methods typically synthesise peptides in the opposite order (starting at the C-terminus) to biological protein synthesis (starting at the N-terminus).

Notation

[edit]

Protein sequence is typically notated as a string of letters, listing the amino acids starting at the amino-terminal end through to the carboxyl-terminal end. Either a three letter code or single letter code can be used to represent the 22 naturally encoded amino acids, as well as mixtures or ambiguous amino acids (similar to nucleic acid notation).[1][2][3]

Peptides can be directly sequenced, or inferred from DNA sequences. Large sequence databases now exist that collate known protein sequences.

22 natural amino acid notation
Amino Acid 3-Letter[4] 1-Letter[4]
Alanine Ala A
Arginine Arg R
Asparagine Asn N
Aspartic acid Asp D
Cysteine Cys C
Glutamic acid Glu E
Glutamine Gln Q
Glycine Gly G
Histidine His H
Isoleucine Ile I
Leucine Leu L
Lysine Lys K
Methionine Met M
Phenylalanine Phe F
Proline Pro P
Pyrrolysine Pyl O
Selenocysteine Sec U
Serine Ser S
Threonine Thr T
Tryptophan Trp W
Tyrosine Tyr Y
Valine Val V
Ambiguous amino acid notation
Symbol Description Residues represented
X Any amino acid, or unknown All
B Aspartate or Asparagine D, N
Z Glutamate or Glutamine E, Q
J Leucine or Isoleucine I, L
Φ Hydrophobic V, I, L, F, W, M
Ω Aromatic F, W, Y, H
Ψ Aliphatic V, I, L, M
π Small P, G, A, S
ζ Hydrophilic S, T, H, N, Q, E, D, K, R, Y
+ Positively charged K, R, H
- Negatively charged D, E

Modification

[edit]

In general, polypeptides are unbranched polymers, so their primary structure can often be specified by the sequence of amino acids along their backbone. However, proteins can become cross-linked, most commonly by disulfide bonds, and the primary structure also requires specifying the cross-linking atoms, e.g., specifying the cysteines involved in the protein's disulfide bonds. Other crosslinks include desmosine.

Isomerisation

[edit]

The chiral centers of a polypeptide chain can undergo racemization. Although it does not change the sequence, it does affect the chemical properties of the sequence. In particular, the L-amino acids normally found in proteins can spontaneously isomerize at the atom to form D-amino acids, which cannot be cleaved by most proteases. Additionally, proline can form stable trans-isomers at the peptide bond.

Post-translational modification

[edit]

Additionally, the protein can undergo a variety of post-translational modifications, which are briefly summarized here.

The N-terminal amino group of a polypeptide can be modified covalently, e.g.,

Fig. 1 N-terminal acetylation
  • acetylation
The positive charge on the N-terminal amino group may be eliminated by changing it to an acetyl group (N-terminal blocking).
  • formylation
The N-terminal methionine usually found after translation has an N-terminus blocked with a formyl group. This formyl group (and sometimes the methionine residue itself, if followed by Gly or Ser) is removed by the enzyme deformylase.
  • pyroglutamate
Fig. 2 Formation of pyroglutamate from an N-terminal glutamine
An N-terminal glutamine can attack itself, forming a cyclic pyroglutamate group.
  • myristoylation
Similar to acetylation. Instead of a simple methyl group, the myristoyl group has a tail of 14 hydrophobic carbons, which make it ideal for anchoring proteins to cellular membranes.

The C-terminal carboxylate group of a polypeptide can also be modified, e.g.,

Fig. 3 C-terminal amidation
  • amination (see Figure)
The C-terminus can also be blocked (thus, neutralizing its negative charge) by amination.
  • glycosyl phosphatidylinositol (GPI) attachment
Glycosyl phosphatidylinositol(GPI) is a large, hydrophobic phospholipid prosthetic group that anchors proteins to cellular membranes. It is attached to the polypeptide C-terminus through an amide linkage that then connects to ethanolamine, thence to sundry sugars and finally to the phosphatidylinositol lipid moiety.

Finally, the peptide side chains can also be modified covalently, e.g.,

  • phosphorylation
Aside from cleavage, phosphorylation is perhaps the most important chemical modification of proteins. A phosphate group can be attached to the sidechain hydroxyl group of serine, threonine and tyrosine residues, adding a negative charge at that site and producing an unnatural amino acid. Such reactions are catalyzed by kinases and the reverse reaction is catalyzed by phosphatases. The phosphorylated tyrosines are often used as "handles" by which proteins can bind to one another, whereas phosphorylation of Ser/Thr often induces conformational changes, presumably because of the introduced negative charge. The effects of phosphorylating Ser/Thr can sometimes be simulated by mutating the Ser/Thr residue to glutamate.
A catch-all name for a set of very common and very heterogeneous chemical modifications. Sugar moieties can be attached to the sidechain hydroxyl groups of Ser/Thr or to the sidechain amide groups of Asn. Such attachments can serve many functions, ranging from increasing solubility to complex recognition. All glycosylation can be blocked with certain inhibitors, such as tunicamycin.
In this modification, an asparagine or aspartate side chain attacks the following peptide bond, forming a symmetrical succinimide intermediate. Hydrolysis of the intermediate produces either aspartate or the β-amino acid, iso(Asp). For asparagine, either product results in the loss of the amide group, hence "deamidation".
Proline residues may be hydroxylated at either of two atoms, as can lysine (at one atom). Hydroxyproline is a critical component of collagen, which becomes unstable upon its loss. The hydroxylation reaction is catalyzed by an enzyme that requires ascorbic acid (vitamin C), deficiencies in which lead to many connective-tissue diseases such as scurvy.
Several protein residues can be methylated, most notably the positive groups of lysine and arginine. Arginine residues interact with the nucleic acid phosphate backbone and commonly form hydrogen bonds with the base residues, particularly guanine, in protein–DNA complexes. Lysine residues can be singly, doubly and even triply methylated. Methylation does not alter the positive charge on the side chain, however.
Acetylation of the lysine amino groups is chemically analogous to the acetylation of the N-terminus. Functionally, however, the acetylation of lysine residues is used to regulate the binding of proteins to nucleic acids. The cancellation of the positive charge on the lysine weakens the electrostatic attraction for the (negatively charged) nucleic acids.
  • sulfation
Tyrosines may become sulfated on their atom. Somewhat unusually, this modification occurs in the Golgi apparatus, not in the endoplasmic reticulum. Similar to phosphorylated tyrosines, sulfated tyrosines are used for specific recognition, e.g., in chemokine receptors on the cell surface. As with phosphorylation, sulfation adds a negative charge to a previously neutral site.
  • prenylation and palmitoylation
The hydrophobic isoprene (e.g., farnesyl, geranyl, and geranylgeranyl groups) and palmitoyl groups may be added to the atom of cysteine residues to anchor proteins to cellular membranes. Unlike the GPI and myritoyl anchors, these groups are not necessarily added at the termini.
  • carboxylation
A relatively rare modification that adds an extra carboxylate group (and, hence, a double negative charge) to a glutamate side chain, producing a Gla residue. This is used to strengthen the binding to "hard" metal ions such as calcium.
  • ADP-ribosylation
The large ADP-ribosyl group can be transferred to several types of side chains within proteins, with heterogeneous effects. This modification is a target for the powerful toxins of disparate bacteria, e.g., Vibrio cholerae, Corynebacterium diphtheriae and Bordetella pertussis.
Various full-length, folded proteins can be attached at their C-termini to the sidechain ammonium groups of lysines of other proteins. Ubiquitin is the most common of these, and usually signals that the ubiquitin-tagged protein should be degraded.

Most of the polypeptide modifications listed above occur post-translationally, i.e., after the protein has been synthesized on the ribosome, typically occurring in the endoplasmic reticulum, a subcellular organelle of the eukaryotic cell.

Many other chemical reactions (e.g., cyanylation) have been applied to proteins by chemists, although they are not found in biological systems.

Cleavage and ligation

[edit]

In addition to those listed above, the most important modification of primary structure is peptide cleavage (by chemical hydrolysis or by proteases). Proteins are often synthesized in an inactive precursor form; typically, an N-terminal or C-terminal segment blocks the active site of the protein, inhibiting its function. The protein is activated by cleaving off the inhibitory peptide.

Some proteins even have the power to cleave themselves. Typically, the hydroxyl group of a serine (rarely, threonine) or the thiol group of a cysteine residue will attack the carbonyl carbon of the preceding peptide bond, forming a tetrahedrally bonded intermediate [classified as a hydroxyoxazolidine (Ser/Thr) or hydroxythiazolidine (Cys) intermediate]. This intermediate tends to revert to the amide form, expelling the attacking group, since the amide form is usually favored by free energy, (presumably due to the strong resonance stabilization of the peptide group). However, additional molecular interactions may render the amide form less stable; the amino group is expelled instead, resulting in an ester (Ser/Thr) or thioester (Cys) bond in place of the peptide bond. This chemical reaction is called an N-O acyl shift.

The ester/thioester bond can be resolved in several ways:

  • Simple hydrolysis will split the polypeptide chain, where the displaced amino group becomes the new N-terminus. This is seen in the maturation of glycosylasparaginase.
  • A β-elimination reaction also splits the chain, but results in a pyruvoyl group at the new N-terminus. This pyruvoyl group may be used as a covalently attached catalytic cofactor in some enzymes, especially decarboxylases such as S-adenosylmethionine decarboxylase (SAMDC) that exploit the electron-withdrawing power of the pyruvoyl group.
  • Intramolecular transesterification, resulting in a branched polypeptide. In inteins, the new ester bond is broken by an intramolecular attack by the soon-to-be C-terminal asparagine.
  • Intermolecular transesterification can transfer a whole segment from one polypeptide to another, as is seen in the Hedgehog protein autoprocessing.

History

[edit]

The proposal that proteins were linear chains of α-amino acids was made nearly simultaneously by two scientists at the same conference in 1902, the 74th meeting of the Society of German Scientists and Physicians, held in Karlsbad. Franz Hofmeister made the proposal in the morning, based on his observations of the biuret reaction in proteins. Hofmeister was followed a few hours later by Emil Fischer, who had amassed a wealth of chemical details supporting the peptide-bond model. For completeness, the proposal that proteins contained amide linkages was made as early as 1882 by the French chemist E. Grimaux.[5]

Despite these data and later evidence that proteolytically digested proteins yielded only oligopeptides, the idea that proteins were linear, unbranched polymers of amino acids was not accepted immediately. Some scientists such as William Astbury doubted that covalent bonds were strong enough to hold such long molecules together; they feared that thermal agitations would shake such long molecules asunder. Hermann Staudinger faced similar prejudices in the 1920s when he argued that rubber was composed of macromolecules.[5]

Thus, several alternative hypotheses arose. The colloidal protein hypothesis stated that proteins were colloidal assemblies of smaller molecules. This hypothesis was disproved in the 1920s by ultracentrifugation measurements by Theodor Svedberg that showed that proteins had a well-defined, reproducible molecular weight and by electrophoretic measurements by Arne Tiselius that indicated that proteins were single molecules. A second hypothesis, the cyclol hypothesis advanced by Dorothy Wrinch, proposed that the linear polypeptide underwent a chemical cyclol rearrangement C=O + HN C(OH)-N that crosslinked its backbone amide groups, forming a two-dimensional fabric. Other primary structures of proteins were proposed by various researchers, such as the diketopiperazine model of Emil Abderhalden and the pyrrol/piperidine model of Troensegaard in 1942. Although never given much credence, these alternative models were finally disproved when Frederick Sanger successfully sequenced insulin[when?] and by the crystallographic determination of myoglobin and hemoglobin by Max Perutz and John Kendrew[when?].

Relation to secondary and tertiary structure

[edit]

The primary structure of a biological polymer to a large extent determines the three-dimensional shape (tertiary structure). Protein sequence can be used to predict local features, such as segments of secondary structure, or trans-membrane regions. However, the complexity of protein folding currently prohibits predicting the tertiary structure of a protein from its sequence alone. Knowing the structure of a similar homologous sequence (for example a member of the same protein family) allows highly accurate prediction of the tertiary structure by homology modeling. If the full-length protein sequence is available, it is possible to estimate its general biophysical properties, such as its isoelectric point.

See also

[edit]

Notes and references

[edit]
  1. ^ a b Sanger, F (1952). "The arrangement of amino acids in proteins". In Anson, M.L.; Bailey, Kenneth; Edsall, John T. (eds.). Advances in Protein Chemistry. Vol. 7. pp. 1–67. doi:10.1016/S0065-3233(08)60017-0. PMID 14933251.
  2. ^ Aasland, Rein; Abrams, Charles; Ampe, Christophe; Ball, Linda J.; Bedford, Mark T.; Cesareni, Gianni; Gimona, Mario; Hurley, James H.; Jarchau, Thomas (2025-08-07). "Normalization of nomenclature for peptide motifs as ligands of modular protein domains". FEBS Letters. 513 (1): 141–144. Bibcode:2002FEBSL.513..141A. doi:10.1016/S0014-5793(01)03295-1. ISSN 1873-3468. PMID 11911894.
  3. ^ IUPAC-IUB Commission on Biochemical Nomenclature (July 1968). "A One‐Letter Notation for Amino Acid Sequences: Tentative Rules". European Journal of Biochemistry. 5 (2): 151–153. doi:10.1111/j.1432-1033.1968.tb00350.x.
  4. ^ a b Hausman, Robert E.; Cooper, Geoffrey M. (2004). The cell: a molecular approach. Washington, D.C.: ASM Press. p. 51. ISBN 978-0-87893-214-6.
  5. ^ a b Fruton, Joseph S. (May 1979). "Early theories of protein structure". Annals of the New York Academy of Sciences. 325 (1): xiv, 1–18. Bibcode:1979NYASA.325....1F. doi:10.1111/j.1749-6632.1979.tb14125.x. PMID 378063. S2CID 39125170.
蜈蚣为什么不能打死 宝宝睡眠不好是什么原因 红脸代表什么 玟字五行属什么 肠阻塞有什么症状
什么是马赛克 心脏早搏吃什么药效果好 新房送什么礼物好 桐字五行属什么 分泌性中耳炎吃什么药
夜盲症是什么意思 孔子的原名叫什么 桑叶有什么作用和功效 霉菌是什么菌 什么是白领
特别的意思是什么 干咳无痰是什么原因引起的 什么是帽子戏法 南瓜炒什么好吃 白天嗜睡是什么原因
砚台是什么hcv8jop7ns4r.cn 妇科炎症吃什么消炎药效果好xscnpatent.com 屁股痛是什么引起的hcv9jop2ns3r.cn 放射科检查什么zsyouku.com 胆结石属于什么科hcv9jop3ns0r.cn
小孩肠系膜淋巴结炎吃什么药hcv9jop3ns9r.cn 皮蛋与什么食物相克hcv9jop1ns1r.cn b超是查什么的hcv9jop4ns3r.cn 难舍难分是什么意思hcv8jop7ns2r.cn 什么是树脂材料ff14chat.com
巴结是什么意思hcv7jop5ns1r.cn 一什么花瓶hcv8jop1ns0r.cn 96166是什么电话hcv8jop2ns7r.cn 吃什么补钙最快hcv9jop0ns2r.cn 32年婚姻是什么婚hcv8jop0ns4r.cn
喉咙里老是有痰是什么原因hcv8jop5ns8r.cn 尿尿疼是什么原因hcv8jop3ns8r.cn 鱼子酱是什么鱼的鱼子hcv9jop3ns0r.cn 体毛多是什么原因inbungee.com 什么是乳腺结节hcv8jop8ns6r.cn
百度