Skip to main navigation Skip to search Skip to main content

Robustifying large-scale cloud applications with scalable value-flow analysis

  • Wensheng TANG

Student thesis: Doctoral thesis

Abstract

In the realm of cloud-native applications, ensuring robustness amidst the complexities of distributed architectures presents a substantial challenge. The dynamic and interconnected nature of these systems, characterized by microservices and database-backed infrastructures, necessitates advanced methodologies for maintaining functional correctness, thereby preventing vulnerabilities, performance bottlenecks, and potential financial losses. This thesis aims to address this critical issue by leveraging state-of-the-art value-flow analysis techniques specifically tailored to tackle the scalability challenges and unique robustness issues on vast cloud-native codebases. Our research boldly confronts the scalability dilemma by innovating and redesigning value-flow analysis methodologies. This novel approach enhances parallelism and efficiency, enabling the handling of tens of millions of lines of code typical in cloud-native systems and their associated libraries. This is a task that traditional static program analysis methods find daunting. By achieving path-sensitive precision at such a scale, our approach significantly contributes to the robustification of cloud-native applications, advocating a new standard in software robustness. Building upon this foundational solution, our study delves into solving robustness issues within microservice-based software systems, a challenge that is prevalent in real-world applications such as WeChat Pay, a leading FinTech system. In such systems, managing the correctness of status code propagation among these sub-services poses a longstanding challenge. To address the problem, in this work, we advocate a system-wide value-flow analysis to detect anomalies effectively on top of the statically inferred correlations of status codes, thereby bolstering the system's overall robustness and addressing a key facet of software property correctness in complex, service-oriented architectures. Further, the thesis extends the application of value-flow analysis to cloud-native, database-backed applications, as exemplified by practices within the Ant Group, where data constraints additionally enforce data correctness. While data constraints promise system robustness, they increase maintenance efforts to maintain consistency between two artifacts: data constraints and the built-in checking logic in the application code. To better assess the problem's severity and investigate possible solutions, we study such a representative system and related developers inside Ant Group. In this work, we also propose a specialized value-flow analysis to retrieve traceability efficiently and effectively between the two software artifacts.
Date of Award2024
Original languageEnglish
Awarding Institution
  • The Hong Kong University of Science and Technology
SupervisorCharles Chuan ZHANG (Supervisor)

Cite this

'