Li QianZhang KaiWei HaoranZhang Jiao.Discussion on routing scheme based on P4 and machine learning[J].Designing Techniques of Posts and Telecommunications,2018,(12):7-11.[doi:10.12045/j.issn.1007-3043.2018.12.002]
基于P4和机器学习的路由选择方案探讨
- Title:
- Discussion on routing scheme based on P4 and machine learning
- Keywords:
- P4; Reinforement Learning ; Routing; Intelligent network
- 分类号:
- TN914
- 文献标志码:
- A
- Abstract:
- All the time, traditional routing solutions have always had shortcomings such as insensitivity to the network and slow convergence. With the development of the data plane programming language P4, the data plane has the ability to collect network status information in real time. However, a large amount of data is difficult to process through human methods, and application of machine learning tools can solve the problem of difficult data processing. Based on the above two foundations, the P4 and machine learning-based routing schemes are tested by building a platform. The test results show that the scheme using the reinforcement learning algorithm to assist the routing strategy generation shows good performance. From this point of view, machine learning has great potential in the field of networking, and is an important tool for realizing network intelligence
参考文献/References:
[1].Asaf Valadarsky , Michael Schapira, Dafna Shahaf, , and Aviv Tamar . Learning to route . HotNets . 2017
[2].The P4 Language Consortium . The P4 Language Specification . Version 1.0.4 . 2017
[3].The P4.org Applications Working Group. In-band Network Telemetry(INT) Dataplane Specification. 2018
[4].M. Majer , C. Bobda , A. Ahmadinia , and J. Teich . Packet routing in dynamically changing networks on chip . In Proceedings of the 19 th IEEE International Parallel and Distributed Processing Symposium (IPDPS’05) - Workshop 3 - Volume 04, IPDPS ’05, pages 154.2–, Washington, DC, USA, 2005. IEEE Computer Society.
[5].M. Mohri, A. Rostamizadeh, and A. Talwalkar. Foundations of Machine
[6].Learning. The MIT Press, 2012
[7].R. S. Sutton and A. G. Barto. Reinforcement learning: An introduction.
[8].MIT press Cambridge, 1998
[9].John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford and Oleg Klimov . Proximal Policy Optimization Algorithms . OpenAI . 2017
[10].M. Dong, Q. Li, D. Zarchy, P. B. Godfrey, and M. Schapira. PCC: Re-architecting congestion control for consistent high performance. NSDI, 2015.
[11].K. Winstein and H. Balakrishnan. Tcp ex machina: Computer-generated congestion control. SIGCOMM, 2013.
[12].B. Fortz and M. Thorup. Increasing internet capacity using local search. Computational Optimization and Applications, 2004
[13].N. Michael and A. Tang. Halo: Hop-by-hop adaptive link-state optimal routing. IEEE/ACM Transactions on Networking, 2015.
[14].D. Xu, M. Chiang, and J. Rexford. Link-state routing with hop-by-hop forwarding can achieve optimal traffc engineering. INFOCOM, 2008
[15].DeepMind AI Reduces Google Data Centre Cooling Bill by 40%. https://goo.gl/QTdU2T
[16].H. Mao, M. Alizadeh, I. Menache, and S. Kandula. Resource management with deep reinforcement learning. HotNets, 2016.
[17].H. Mao, R. Netravali, and M. Alizadeh. Neural adaptive bitrate streaming with pensive. SIGCOMM, 2017..
备注/Memo
李倩,工学学士学位,目前在北京邮电大学未来网络理论与应用实验室就读硕士研究生