The mustard allergen Sin a 1 belongs to the 2S-albumin family of seed-storage proteins. Because of its high abundance in mustard seeds and the potential to elicit severe allergenic reactions, Sin a 1 is considered to be a major allergen in mustard. Eight Sin a 1 isoforms have been identified using DNA cloning and sequencing, and we aim in this study to thoroughly investigate sequence heterogeneity using a novel combination of bottom-up, middle-down, and top-down proteomics. The characterization of purified Sin a 1 extract shows that sequence diversity is far more pronounced than previously assumed. We identified in total 24 sequence polymorphisms including 17 yet unpublished point mutations. Using middle-down and top-down approaches on the subunit and protein level of Sin a 1, we were able to detect eight consensus isoforms of Sin a 1(including four novel isoforms), which we detect in the majority of the four different mustard samples included in this study. In addition, we provide for the first time data on relative abundance of the main Sin a 1 isoforms and identify phytic acid as a potential ligand of Sin a 1. Together, these data can form the basis for a more detailed investigation of the effect of Sin a 1 polymorphic sites on allergenicity of isoforms.