The ML Offloading APIs synchronize the ML inference service with the user equipment (UE) side. They serve application developers by letting machine learning apps offload computation-intensive jobs from the UE device to nearby edge nodes. The ML offloading services satisfy the ML computing resource requirements while responding faster than cloud ML services.


The ML offloading APIs offer ML inference services (supporting different ML serving frameworks, for example TensorFlow Serving) from KubeEdge sites through ML APIs, backed by a pool of commonly used models. The pre-trained machine learning models in the pool have their detailed features published and their performance tested. Models of different categories can be deployed to the pool from the cloud environment. In the future, the pool can be opened to more categories of models to cover a wide variety of ML use cases. App developers who do not have an in-house trained model can choose from the existing models, which lets traditional app developers quickly adopt the KubeEdge ML offloading solution without having to manage models themselves.


Developers can leverage the fast response time of edge computing and the lower entry barrier for machine learning knowledge. They just use the ML offloading APIs in their app, and stable new ML features can be delivered to user devices from the nearest edge node. The KubeEdge ML offloading service has a facial recognition demo API. The demo mobile application passes a face image to it via an HTTPS request, and the edge ML offloading service identifies the expression and returns the corresponding facial expression code. It is a sample component of KubeEdge that addresses users' data security and latency concerns, with highly scalable, on-demand model acceleration. Mobile app developers do not need to worry about device resource limitations or about latency to the public cloud.


The ML offloading APIs are a set of intelligence services on the edge cloud that offer various AI services and can be triggered by mobile applications. For example, they can be used to determine whether an image contains faces or to translate text into different languages. These APIs are available only if developers deploy them through KubeEdge. The ML offloading APIs can support different ML categories, including vision, ASR, dialog engines, and more in the future, and are served as REST web services.
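
As a rough illustration of the demo flow described above (a client sends a face image over HTTPS and receives a facial expression code), the following Python sketch shows what such a call might look like. The endpoint URL, the raw-bytes request body, and the `expression_code` JSON field are all assumptions made for illustration; the actual request and response formats are defined by the deployed service and may differ.

```python
import json
import urllib.request

# Hypothetical endpoint: the real host and path depend on the KubeEdge deployment.
API_URL = "https://edge-node.example.com/ml/facial-expression"


def build_request(image_bytes: bytes, url: str = API_URL) -> urllib.request.Request:
    """Wrap a face image in an HTTPS POST request (assumed request format)."""
    return urllib.request.Request(
        url,
        data=image_bytes,
        headers={"Content-Type": "application/octet-stream"},
        method="POST",
    )


def parse_expression_code(body: bytes) -> int:
    """Read the code from an assumed JSON reply such as {"expression_code": 3}."""
    payload = json.loads(body.decode("utf-8"))
    return int(payload["expression_code"])


def classify_expression(image_bytes: bytes, url: str = API_URL, timeout: float = 5.0) -> int:
    """Send the image to the edge node and return the facial expression code."""
    request = build_request(image_bytes, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_expression_code(response.read())
```

A mobile or desktop client would read the image file into bytes and pass it to `classify_expression`. Keeping request construction and response parsing in separate functions makes it easy to adjust either one once the real API contract is known.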


Here is an example of the facial expression API:

...