Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

SCHEDULE AT-A-GLANCE

DAY1. Monday, April 29
at ONE Summit 2024(On site only)


2 hour on-site discussion
  Agenda(Idea)
    Collaboration with other communities
    Outreach 
    Next Akraino release and 2024 activities
    AI at Edge



Day2. Monday, April 29 (APAC time zone friendly)


18:00 – 20:10 PDT (UTC-7)
21:00 – 23:10 EDT (UTC-4)
03:00 – 05:10 CEST (UTC+2) (Friday)
09:00 – 11:10 CST (UTC+8) (Friday)
Presentation Sessions


Day3. Wednesday, May 1 (APAC time zone friendly)
*Reserve day


18:00 – 20:10 PDT (UTC-7)
21:00 – 23:10 EDT (UTC-4)
03:00 – 05:10 CEST (UTC+2) (Friday)
09:00 – 11:10 CST (UTC+8) (Friday)
Presentation Sessions


Day2. 

Monday, April 29 (APAC time zone friendly)

Introduction to Akraino activities in 2023, Collaboration with other open communities

Zoom Link: TBD

Recording: TBD

Time(UTC-7)Topics
18:00-18:05

Welcome note
Yin Ding TSC Chair
Fukano Haruhisa TSC Co-Chair

18:10-18:40


18:40-19:10
19:10-19:40

19:40-20:10


Closing



Monday, May 1 (APAC time zone friendly)

Introduction to Akraino activities in 2023, Collaboration with other open communities

Zoom Link: TBD

Recording: TBD

Time(UTC-7)Topics
18:00-18:05

Welcome note
Yin Ding TSC Chair
Fukano Haruhisa TSC Co-Chair

18:10-18:40


18:40-19:10
19:10-19:40

19:40-20:10


Closing


Call for proposal

NoNameCompanye-mailPresentation titleAbstractPreferred Time ZoneComments
1Jeff BrowerSignalogicjbrower at signalogic dot comSmall Language Model for Device AI Applications

Device AI applications running at the AI Edge on very small form-factor devices (for example pico ITX), and without an online cloud connection, need to perform automatic speech recognition (ASR) under difficult conditions, including background noise, urgent or stressed voice input, and other talkers in the background. For robotics applications, background noise may also include servo motor and other mechanical noise. Under these conditions, efficient open source ASRs such as Kaldi and Whisper tend to produce "sound-alike" errors, for example:

  in the early days a king rolled the stake
  
which contains two (2) sound-alike errors that must be corrected to "in the early days a king ruled the state". Sound-alike errors are particularly problematic for robotics applications in which the robot OS requires precise API commands, for example a robotaxi has stalled and must be instructed to "move forward 20 feet, to the right 10 feet, raise the hood, and turn off the engine". A first responder may use a portable backpack device and give commands "get off the road in that turn-out up ahead and shut it down" or similar. Any sound-alike errors in voice commands make translation to machine APIs problematic.

To address this issue, Signalogic is developing a Small Language Model (SLM) is needed to correct sound-alike errors, capable of running in a very small form-factor and under 10W, for example using two (2) Atom CPU cores. The SLM must run every 1/2 second and with a backwards/forwards context of 3-4 words. Unlike an LLM, a wide context window, domain knowledge, and extensive web page training are not needed.

PDT
2






3






4






5






6






7






8






9






10