Transient stability assessment (TSA) based on deep learning has recently attracted broad attention. The offline training of deep learning model typically requires a large number of labeled samples, and the training process is very time-consuming. Moreover, it is difficult to ensure that the trained model is suitable for all the operation conditions of power systems. In order to dramatically reduce the computation cost, and further enhance the adaptability of the model, an innovative method is proposed based on the active transfer learning (TL). First, the deep belief network (DBN) is designed as the base model and trained with a large number of labeled samples to obtain better evaluation performance for TSA. Second, when the topologies or operation conditions change substantially, active learning with information entropy is exploited to select the minimum number of the most informative samples, which effectively reduces the time of samples generation. Third, Maximum Mean Discrepancy (MMD) is calculated to select different transfer schemes. Only the last BP layer needs to be fine-tuned for the scenarios with a low MMD, but for those with high MMD, the entire network needs to be fine-tuned. The transfer time is greatly shortened on the premise of ensuring the transfer effect. The comprehensive experimental results of three power systems illustrate the effectiveness of the proposed method.