The emergence of low-cost micro-electro-mechanical system inertial measurement units, cameras and global navigation satellite system (GNSS) receivers has promoted the research of multisensor fusion positioning. With the rapid development of chip technology, it has become a trend for low-cost GNSS chips to provide multi-frequency carrier phase observations while supporting multiple constellations. To enhance the positioning performance of multi-frequency and multi-system precise point positioning (PPP) in the difficult urban environment, we propose a tightly coupled system of monocular visual-inertial odometry (MVIO) and uncombined PPP (UC-PPP) based on factor graph optimization. The initialization of MVIO and UC-PPP adopts a coarse-to-fine approach to correct the transformation of local and global frames online. Moreover, the sliding window and marginalization methods are adopted to retain the constraints between adjacent observations and eliminate useless observations in the window. The pedestrian and vehicle tests in urban environments verify the performance of the proposed method. Compared with open-source software GVINS, the positioning accuracy of the proposed method has been further improved by using carrier phase observations with higher measurement accuracy. Compared with PPP alone, the improvement of the proposed method for the low-speed and short-distance pedestrian test in the east, north, and up directions is 73.3, 54.8 and 62.7%, respectively, while the improvement for the high-speed and long-distance vehicle test is 63.0, 59.3 and 70.5%, respectively. Experiment results show that the proposed method has better positioning accuracy and continuity in difficult urban environments.