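Abstract: In a distributed system, the functional modules communicate over an intricate network of calls, so monitoring and analyzing the call link of each individual request is especially important. Building on a detailed survey of the common structure and call link characteristics of most distributed systems, this paper designs a general-purpose call link monitoring scheme for distributed systems. The system uses OpenTelemetry (an open-source observability framework) to instrument services, sample requests, and collect the resulting data, and uses Jaeger (a distributed tracing system) to organize, analyze, and visualize that data. Monitoring a distributed system with this scheme makes it possible to quickly find link problems in the system and to track service health, so that the system runs more stably and gives users a better experience.
Keywords: distributed system; link monitoring; OpenTelemetry; Jaeger
CLC number: TP391
Document code: A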
Design of Distributed System Call Link Monitoring Based on OpenTelemetry and Jaeger
ZHANG Aihua1, BAI Jinfeng2
(1. Department of Network Engineering, Dalian Neusoft University of Information, Dalian 116023, China; 2. OpenJaw Technology (Dalian) Co., Ltd., Dalian 116023, China)
zhangaihua@neusoft.edu.cn; baijinfeng1202@126.com
Abstract: In distributed systems, the functional modules communicate over complex networks, which makes monitoring and analyzing the call link of a single request particularly important. After a detailed investigation of the general structure and call link characteristics of most distributed systems, this paper designs a universal call link monitoring scheme for distributed systems. OpenTelemetry (an open-source observability framework) is used to instrument services, sample requests, and collect the resulting data, and Jaeger (a distributed tracing system) is used to organize, analyze, and visualize that data. Monitoring a distributed system with this scheme makes it possible to quickly identify link issues and track service health, so that the system operates more stably and provides users with a better experience.
Keywords: distributed system; link monitoring; OpenTelemetry; Jaeger