Abstract:With the increasing application of Jupyter Notebook in the field of data science, the more functional requirements for multi-user management and cluster computing resource scheduling are increasing. This study, starting with the basic concepts of Jupyter, expounds the influence of Jupyter on the broadcast of scientific research achievements, summarizes the present situation of the research organizations and institutions of higher learning and other organizations in the field of research on the distributed architecture of Jupyter, analyzes the characteristics of the architecture of the Jupyter system in detail, and reconstructs the Jupyter by means of microservice. Through the resource scheduling and allocation algorithm of Kubernetes, a highly elastic distributed architecture based on container technology is implemented. Finally, the test results show that the architecture proposed in this study has been improved to a certain extent on the performance of the access load, and the target of load balancing on the cluster is achieved in the number of users' running.