Science

    Peptide Structure and Amino Acids: Bonds, Folding, and the Structural Basis of Biological Activity

    A scientific guide to peptide and protein structure — covering amino acid chemistry, the peptide bond, primary through quaternary structure levels, and how structural features determine the biological activity of research peptides.

    By Alpine Labs Editorial Team | 12 min read
    Published · Last reviewed · Last updated
    Reviewed by Alpine Labs Editorial Team

    The Building Blocks: Amino Acids

    Amino acids are the monomeric units from which all peptides and proteins are built. Each amino acid contains four groups bonded to a central alpha-carbon (Cα): an amino group (–NH₂), a carboxyl group (–COOH), a hydrogen atom, and a variable side chain (R group) that determines the amino acid’s unique chemical properties.

    Twenty standard (proteinogenic) amino acids are encoded by the genetic code. These 20 amino acids, in various sequences and combinations, produce the enormous diversity of peptides and proteins found in biology.

    Classification by Side Chain Properties

    The chemical character of each amino acid’s side chain determines its behavior in peptide structures:

    Nonpolar (hydrophobic): Glycine (Gly, G), Alanine (Ala, A), Valine (Val, V), Leucine (Leu, L), Isoleucine (Ile, I), Proline (Pro, P), Phenylalanine (Phe, F), Tryptophan (Trp, W), Methionine (Met, M)

    These residues tend to cluster in the interior of folded peptide structures, away from water. Their hydrophobic interactions are a major driving force for peptide folding.

    Polar uncharged: Serine (Ser, S), Threonine (Thr, T), Asparagine (Asn, N), Glutamine (Gln, Q), Tyrosine (Tyr, Y), Cysteine (Cys, C)

    These residues can form hydrogen bonds with water and with other polar residues. Cysteine is unique — its thiol (–SH) group can form disulfide bonds (–S–S–) with other cysteines, creating covalent cross-links within or between peptide chains.

    Positively charged (basic) at pH 7: Lysine (Lys, K), Arginine (Arg, R), Histidine (His, H)

    These residues carry positive charge at physiological pH, enabling ionic interactions with negatively charged residues, nucleic acids, and cell membrane phospholipids. Histidine (pKa ~6.0) is unique in being partially protonated at physiological pH, making it versatile in enzyme active sites.

    Negatively charged (acidic) at pH 7: Aspartic acid (Asp, D), Glutamic acid (Glu, E)

    These residues carry negative charge at physiological pH. They participate in salt bridges (ionic interactions with Lys or Arg), metal coordination (carboxylate groups chelate Ca²⁺, Zn²⁺, Cu²⁺), and enzyme catalysis.

    Special Amino Acids in Peptide Research

    Amino AcidSpecial PropertyResearch Relevance
    ProlineCyclic side chain constrains backbone, disrupts helicesBPC-157 contains Pro-Pro-Pro (stability); Proline-rich peptides resist proteases
    CysteineForms disulfide bonds (–S–S–)IGF-1 LR3 has 3 disulfide bonds; oxytocin has 1
    GlycineNo side chain, maximum backbone flexibilityCollagen requires Gly at every third position (Gly-X-Y repeat)
    MethionineSusceptible to oxidationKey stability concern for peptides containing Met
    TryptophanUV-absorbing (280 nm), fluorescent, photosensitiveEnables UV quantification; light-sensitive storage required

    Non-Standard Amino Acids

    Research peptides may contain amino acids not found in the standard 20:

    • D-amino acids: Mirror images of the natural L-forms. Resistant to proteases (which are stereospecific for L-amino acids). Used in peptide modifications to enhance stability.
    • Aib (α-aminoisobutyric acid): A non-natural amino acid with two methyl groups on Cα. Used in semaglutide at position 2 to prevent DPP-4 cleavage. Strongly promotes helical structure.
    • Norleucine (Nle): An isosteric replacement for methionine that is not susceptible to oxidation. Used in research analogs to eliminate oxidation liability.
    • β-alanine: An amino acid with the amino group on the beta-carbon rather than the alpha-carbon. Used as a spacer in peptide conjugates.

    The Peptide Bond

    The peptide bond is the covalent linkage between amino acids in a peptide chain. It is formed by a condensation reaction between the carboxyl group of one amino acid and the amino group of the next, releasing one molecule of water.

    Peptide Bond Geometry

    The peptide bond has several distinctive features that profoundly affect peptide structure:

    Partial double bond character: The peptide bond (C–N) has approximately 40% double bond character due to resonance between the C=O and C–N bonds. This restricts rotation around the C–N bond, making the peptide bond planar — the six atoms (Cα, C, O, N, H, Cα) of each peptide unit lie in a single plane.

    Trans configuration: The vast majority of peptide bonds adopt the trans configuration (the two Cα atoms are on opposite sides of the C–N bond). The trans isomer is favored by approximately 1000:1 over cis due to steric clash in the cis form. The exception is X-Pro bonds, where the cis:trans ratio is approximately 1:4 due to proline’s cyclic side chain reducing the steric penalty of cis.

    Phi (φ) and Psi (ψ) angles: While rotation around the peptide bond itself is restricted, rotation around the Cα–N bond (phi, φ) and the Cα–C bond (psi, ψ) is permitted. These two dihedral angles, defined for each residue, determine the overall three-dimensional structure of the peptide backbone. Only certain combinations of φ and ψ are sterically allowed — these are visualized in a Ramachandran plot.

    Levels of Peptide Structure

    Peptide and protein structure is described at four hierarchical levels:

    Primary Structure

    Primary structure is the linear sequence of amino acids in the peptide chain, read from N-terminus (free amino group) to C-terminus (free carboxyl group). It is the most fundamental level — it is encoded in DNA and determines all higher levels of structure.

    Primary structure is described using single-letter or three-letter amino acid codes:

    Example — BPC-157: Gly-Glu-Pro-Pro-Pro-Gly-Lys-Pro-Ala-Asp-Asp-Ala-Gly-Leu-Val (G-E-P-P-P-G-K-P-A-D-D-A-G-L-V)

    Example — GHK-Cu: Gly-His-Lys (G-H-K) complexed with Cu²⁺

    Example — Semaglutide: 31 amino acids (modified GLP-1 analog) with Aib at position 2 and a C18 fatty diacid attached to Lys at position 26 via a linker

    Primary structure determines:

    • Molecular weight (sum of amino acid residue masses minus water lost in peptide bond formation)
    • Net charge at a given pH (sum of ionizable group charges)
    • Isoelectric point (pI) — the pH at which net charge is zero
    • Susceptibility to specific proteases (which recognize specific amino acid motifs)
    • Degradation vulnerabilities (based on the presence of labile residues)

    Secondary Structure

    Secondary structure refers to local, regular patterns of backbone hydrogen bonding within the peptide chain. The two major types are:

    Alpha-helix (α-helix): A right-handed helical structure in which the backbone C=O of residue (i) hydrogen bonds to the backbone N–H of residue (i+4). This creates a coil with 3.6 residues per turn and a rise of 5.4 Å per turn. Side chains project outward from the helix.

    Helix-promoting residues: Ala, Leu, Met, Glu, Lys Helix-breaking residues: Pro (introduces a kink), Gly (too flexible)

    GLP-1 receptor agonists (semaglutide, tirzepatide) adopt alpha-helical conformations when binding to their receptors — the helix is the bioactive conformation.

    Beta-sheet (β-sheet): Extended peptide chains (beta-strands) align side by side and hydrogen bond between strands. Parallel sheets have strands running in the same direction; antiparallel sheets have strands running in opposite directions. Antiparallel sheets have stronger, more linear hydrogen bonds and are more common.

    Beta-sheet formation is relevant to peptide research because aggregation-prone peptides (like amyloid beta) form intermolecular beta-sheets that assemble into fibrils.

    Beta-turn: A sharp reversal of chain direction involving four residues. Proline and glycine are frequently found in turns due to proline’s constrained geometry and glycine’s flexibility. Turns connect beta-strands in antiparallel sheets and are common structural motifs in bioactive peptides.

    Polyproline helix (PPII): A left-handed helix with three residues per turn. Adopted by proline-rich sequences (like collagen’s Gly-Pro-Hyp repeats and BPC-157’s Pro-Pro-Pro motif). Unlike alpha-helices, PPII helices have no intramolecular hydrogen bonds — they are stabilized by steric constraints of the proline rings.

    Tertiary Structure

    Tertiary structure is the overall three-dimensional shape of a single peptide chain, determined by interactions between side chains that may be far apart in the primary sequence:

    • Hydrophobic interactions: Nonpolar side chains cluster in the interior, away from water
    • Hydrogen bonds: Between polar side chains (Ser-Asn, Thr-Gln, etc.)
    • Salt bridges: Between oppositely charged side chains (Lys–Asp, Arg–Glu)
    • Disulfide bonds: Covalent S–S bonds between cysteine residues
    • Metal coordination: Side chains (His, Cys, Asp, Glu) coordinating metal ions (GHK-Cu chelates Cu²⁺ via His and the N-terminal amine)

    Most research peptides (under 30-40 amino acids) have limited tertiary structure in solution — they are often flexible, adopting multiple conformations. Larger peptides and small proteins (IGF-1 LR3 at 83 amino acids) have defined tertiary structures stabilized by disulfide bonds.

    Quaternary Structure

    Quaternary structure describes the arrangement of multiple peptide chains (subunits) in a multi-chain complex. Most research peptides are single-chain molecules and do not have quaternary structure. However, some relevant examples exist:

    • Insulin consists of two chains (A and B) linked by two interchain disulfide bonds
    • Some peptide hormones (e.g., inhibin) are heterodimers

    Structure-Activity Relationships in Research Peptides

    The biological activity of a peptide depends directly on its three-dimensional structure at the point of receptor interaction:

    Conformational Selection

    Many small peptides are intrinsically disordered in solution (lacking stable secondary/tertiary structure) but adopt a specific bioactive conformation upon binding to their receptor. The receptor “selects” the bioactive conformation from the ensemble of conformations sampled by the free peptide.

    This has practical implications:

    • Modifications that stabilize the bioactive conformation (e.g., cyclization, helix-promoting substitutions) can increase potency
    • Modifications that prevent the bioactive conformation (e.g., proline substitution at a critical helical position) can abolish activity

    Key Structure-Activity Examples

    Semaglutide: The alpha-helical conformation of residues 7-30 is essential for GLP-1 receptor binding. The Aib substitution at position 2 prevents DPP-4 cleavage without disrupting the helix. The C18 fatty diacid at Lys26 enables albumin binding (extending half-life) without sterically blocking the receptor-binding helix.

    GHK-Cu: The tripeptide Gly-His-Lys chelates Cu²⁺ through the histidine imidazole nitrogen, the glycine alpha-amino nitrogen, and the glycine amide nitrogen. This specific coordination geometry is required for biological activity — scrambling the sequence (e.g., HGK or KHG) alters copper binding affinity and biological effects.

    BPC-157: The Pro-Pro-Pro motif (positions 3-5) creates a rigid polyproline II helix that is critical for stability and may contribute to the peptide’s interaction with its (as yet unidentified) molecular target. The overall structure in solution remains poorly characterized — it may be predominantly flexible with the PPII segment providing a structural anchor.

    Cyclic peptides (Melanotan II, PT-141): Cyclization via a lactam bridge constrains the peptide backbone, reducing the conformational entropy penalty upon receptor binding. This typically increases binding affinity and selectivity compared to the linear analogue.

    Peptide vs. Protein: Where Is the Line?

    The distinction between peptides and proteins is somewhat arbitrary but conventionally drawn at approximately 50 amino acids:

    FeaturePeptideProtein
    Length2-50 amino acids>50 amino acids
    Molecular weight<~5,000 Da>~5,000 Da
    Stable 3D structureOften flexible/disorderedUsually well-defined fold
    SynthesisChemical (SPPS) preferredRecombinant expression preferred
    Disulfide bonds0-2 typicallyOften 2+

    This boundary is not absolute. Some peptides have stable folds (insulin, 51 amino acids), and some small proteins are largely disordered. In research peptide commerce, the term “peptide” is used broadly for any chemically synthesized product, regardless of length.

    Amino Acid Properties Reference

    AA1-LetterMW (Da)pKa (side chain)Properties
    GlyG57.05Smallest, flexible, helix-breaker
    AlaA71.08Small, hydrophobic, helix-promoting
    ValV99.13Branched, hydrophobic, beta-sheet
    LeuL113.16Hydrophobic, helix-promoting
    IleI113.16Branched, hydrophobic
    ProP97.12Cyclic, rigid, helix-breaker
    PheF147.18Aromatic, hydrophobic
    TrpW186.21Aromatic, UV-absorbing, oxidation-sensitive
    MetM131.20Sulfur-containing, oxidation-sensitive
    SerS87.08~13Polar, phosphorylation site
    ThrT101.10~13Polar, branched, phosphorylation site
    AsnN114.10Polar, deamidation-prone
    GlnQ128.13Polar, slow deamidation
    TyrY163.1810.1Aromatic, polar, phosphorylation site
    CysC103.148.3Disulfide bonds, metal binding
    LysK128.1710.5Positive charge, modification site
    ArgR156.1912.5Positive charge (strongest)
    HisH137.146.0Variable charge, metal binding
    AspD115.093.9Negative charge, metal binding
    GluE129.124.1Negative charge

    Frequently Asked Questions

    What is the difference between a peptide bond and a disulfide bond?

    A peptide bond links the carboxyl group of one amino acid to the amino group of the next, forming the backbone of the peptide chain. It is formed during synthesis (ribosomal or SPPS) and defines the primary sequence. A disulfide bond is a covalent bond between the sulfur atoms of two cysteine side chains (–S–S–). It is a cross-link that stabilizes three-dimensional structure but does not affect the primary sequence. Peptide bonds define what amino acids are present; disulfide bonds help determine how the chain folds.

    Why does BPC-157 have three prolines in a row?

    The Pro-Pro-Pro sequence creates a polyproline II (PPII) helix — a rigid, extended structural element. This serves two functions: (1) it makes BPC-157 highly resistant to proteolytic degradation (most endopeptidases cannot cleave Pro-Pro bonds efficiently), and (2) it provides a conformational anchor that may be important for biological activity. The PPII helix is also compatible with BPC-157’s gastric origin — it is stable in the low-pH, protease-rich environment of gastric juice.

    How does amino acid composition affect peptide solubility?

    Peptides with a high proportion of charged residues (Lys, Arg, Asp, Glu) are generally water-soluble. Peptides rich in hydrophobic residues (Val, Leu, Ile, Phe) may require organic co-solvents (DMSO, DMF) or acidic/basic conditions for dissolution. The net charge at the working pH determines electrostatic solubility — peptides near their isoelectric point (pI) have minimal net charge and minimum solubility.

    What makes semaglutide’s structure different from natural GLP-1?

    Semaglutide has three key structural modifications relative to native GLP-1(7-37): (1) Aib (α-aminoisobutyric acid) at position 2 replaces Ala, preventing DPP-4 cleavage that inactivates native GLP-1 within minutes; (2) Arg at position 34 replaces Lys, preventing fatty acid attachment at an undesirable position; (3) a C18 fatty diacid chain attached to Lys26 via a mini-PEG linker enables non-covalent binding to albumin, extending the half-life from ~2 minutes (native GLP-1) to ~7 days (semaglutide).

    Can a single amino acid change completely alter a peptide’s activity?

    Yes. Structure-activity relationship studies routinely demonstrate that single amino acid substitutions at critical positions can eliminate, reduce, or alter biological activity. This is the basis of alanine scanning — systematically replacing each residue with alanine to identify positions essential for activity. This is also why peptide quality control (identity confirmation by mass spectrometry) is critical — a deletion or substitution impurity is a different molecule, not just a “less pure” version of the target peptide.

    References

    1. Branden C, Tooze J. “Introduction to Protein Structure.” 2nd ed. Garland Science. 1999.
    2. Ramachandran GN, Sasisekharan V. “Conformation of polypeptides and proteins.” Adv Protein Chem. 1968;23:283-438.
    3. Pauling L, Corey RB, Branson HR. “The structure of proteins: two hydrogen-bonded helical configurations of the polypeptide chain.” Proc Natl Acad Sci USA. 1951;37(4):205-211.
    4. Berg JM, Tymoczko JL, Stryer L. “Biochemistry.” 8th ed. W.H. Freeman. 2015.
    5. Lau JL, Dunn MK. “Therapeutic peptides: historical perspectives, current development trends, and future directions.” Bioorg Med Chem. 2018;26(10):2700-2707.
    6. Adzhubei AA, et al. “Polyproline-II helix in proteins: structure and function.” J Mol Biol. 2013;425(12):2100-2132.
    7. Fosgerau K, Hoffmann T. “Peptide therapeutics: current status and future directions.” Drug Discov Today. 2015;20(1):122-128.
    8. Muttenthaler M, et al. “Trends in peptide drug discovery.” Nat Rev Drug Discov. 2021;20(4):309-325.

    Related Monographs