Lithology identification is the basis of energy exploration and reservoir evaluation, intelligent and accurate identification of underground lithology is a key issue. The establishment of a machine learning lithology identification model using logging data is a hot research direction in recent years. However, the logging data has a high degree of non-linearity and multi-response characteristics, and there are insufficient numbers of labeled samples in the training data set. These will eventually affect the modeling accuracy and may cause over-fitting. Therefore, a framework of active learning and semi-supervised learning for lithology identification based on improved naive Bayes (ALSLINB) is proposed. The contributions are fourfold: (i) The Gaussian mixture model (GMM) based on the EM algorithm is used to estimate the probability density of the log data, which fits the probability distribution of the nonlinear multi-response log data. (ii) A framework combining active learning (AL) and semi-supervised learning is proposed for the expansion of labeled samples in the training data set. (iii) The application of pseudo-labeling detection technology can effectively improve the authenticity of pseudo-label samples. (iv) Different from the general deterministic lithology identification method, the result of the ALSLINB algorithm corresponds to the probability score, which provides an auxiliary basis for the prediction result. Finally, the ALSLINB algorithm is applied to two different data sets for a large number of experiments and compared with the related baseline methods to verify its effectiveness and generalization ability. The result proves that the ALSLINB algorithm can complete the lithology recognition task well and has high accuracy and robustness, which provides a new direction for intelligent lithology identification.