Distributed networks have become a cornerstone of modern computing systems, enabling scalable, efficient, and interconnected data processing across diverse computing paradigms, including Cloud Computing, IoT, and Edge Computing. However, ensuring data security and privacy within these distributed networks remain a significant challenge, particularly in tracing the provenance and flow of sensitive data due to their dynamic and decentralized nature. To tackle this challenge, we propose TR-DPro, a novel data provenance scheme that combines Temporal Knowledge Graph (TKG) and Reinforcement Learning (RL) to achieve adaptive and accurate data provenance in distributed networks. TKG captures the temporal and relational dynamics of data flow, while RL optimizes the reasoning process by dynamically learning efficient provenance paths, reducing computational complexity and improving accuracy. Simulation experiments conducted on a private dataset of network traffic and security events demonstrate that the proposed framework, TR-DPro, can effectively and accurately trace data flow paths while outperforming existing methods in key performance metrics.