Metabolite annotation is a critical step in discovery metabolomics, but remains a significant challenge. In this study, the accuracy of metabolite annotation was systematically evaluated by leveraging the proposed strategies for simulation of tandem mass spectrometry (MS/MS) data from high-resolution mass spectrometry (HRMS) and then construction of a large-scale virtual database. Furthermore, various similarity scoring methods were comprehensively compared to assess the performance for annotation. First, three key characteristics that are essential for simulating MS/MS spectra to closely resemble experimental data were identified: (i) the number of mass-to-charge ratio (m/z) features, (ii) the differences between neighboring m/z values, and (iii) the intensity distribution of MS/MS features. These factors were employed to generate representative MS/MS spectra for subsequent study. A meticulously designed virtual MS/MS database was constructed to facilitate accurate annotation assessment, which covered over 100,000 metabolites with diverse structural similarities and differences. To evaluate annotation quality, two simulation strategies on the basis of strong and weak data inference were respectively proposed to replicate MS/MS spectra for unknown metabolites. These simulated spectra were then compared with the virtual database, which provided insights into the expected variations in experimental MS/MS data. Furthermore, eight similarity evaluation methods, including entropy similarity (ES) and weighted dot product (W/DP) algorithms, were rigorously evaluated for their effectiveness in metabolite annotation. The results revealed that some methods, such as ES, exhibited strong resistance to interference and broad adaptability across different MS/MS patterns, whereas others selectively yielded reliable outcomes under specific conditions. This study provided a systematic framework for quality evaluation in metabolite annotation and offered strategies to mitigate false-positive identifications. The findings held great significance for advancing metabolomics research and further improving annotation reliability in complex biological samples.