Internet of Things Devices Voice Recognition with Gemini API

With artificial intelligence permeating every aspect of life, the Internet of Things (IoT) landscape is undergoing fast change

[{"selector":"#anim-a5c62f92-ede6-49be-abf0-6ddf3c0ce95b","keyframes":[{"offset":0,"transform":"translate3d(0, -318.09719%, 0)","easing":"cubic-bezier(.5, 0, 1, 1)"},{"offset":0.29,"transform":"translate3d(0, 0%, 0)","easing":"cubic-bezier(0, 0, .5, 1)"},{"offset":0.45,"transform":"translate3d(0, -89.448929828%, 0)","easing":"cubic-bezier(.5, 0, 1, 1)"},{"offset":0.61,"transform":"translate3d(0, 0%, 0)","easing":"cubic-bezier(0, 0, .5, 1)"},{"offset":0.71,"transform":"translate3d(0, -30.410091364000003%, 0)","easing":"cubic-bezier(.5, 0, 1, 1)"},{"offset":0.8,"transform":"translate3d(0, 0%, 0)","easing":"cubic-bezier(0, 0, .5, 1)"},{"offset":0.85,"transform":"translate3d(0, -11.419689121000001%, 0)","easing":"cubic-bezier(.5, 0, 1, 1)"},{"offset":0.92,"transform":"translate3d(0, 0%, 0)","easing":"cubic-bezier(0, 0, .5, 1)"},{"offset":0.96,"transform":"translate3d(0, -4.962316164%, 0)","easing":"cubic-bezier(.5, 0, 1, 1)"},{"offset":1,"transform":"translate3d(0, 0%, 0)","easing":"cubic-bezier(0, 0, .5, 1)"}],"delay":0,"duration":600,"fill":"both"}] [{"selector":"#anim-50b5a2ce-27ab-4d77-8177-5217a98fc189 [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(-12.514648210040342%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":2000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}]

The development of artificial intelligence (AI) and cloud services has made it possible to link basic microcontrollers with common sensors and actuators to create a wide range of interactive intelligent gadgets

[{"selector":"#anim-590af615-8267-48b0-84bd-ccc8f24e3a81","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-af4c6915-aa3e-429a-866b-c700f77c3a7d [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(12.514648210040342%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":2000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}]

A spoken utterance is recorded by the Internet of Things gadget that has a microphone

[{"selector":"#anim-47eb757e-1d84-4715-9a0f-d25bc6339d9f [data-leaf-element=\"true\"]","keyframes":{"transform":["translate(0%, 0%) scale(1.5)","translate(0%, 0%) scale(1)"]},"delay":0,"duration":2000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"forwards"}] [{"selector":"#anim-49907836-a397-475f-9979-53e6f495a04f","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-e5c4f10d-5b3c-41a7-92f9-23395e7fa849","keyframes":{"transform":["translate3d(-114.46541%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}]

To transfer encoded audio, the Gemini API is called via REST. This call asks for the spoken command's text or instructs Gemini to select a pre-programmed custom function (such turning on lights)

[{"selector":"#anim-69f2ca2d-91e4-49a1-80c5-5ab8c05c94f7","keyframes":{"transform":["scale(1)","scale(1.5)","scale(0.95)","scale(1)"],"offset":[0,0.33,0.66,1]},"delay":0,"duration":1450,"easing":"ease-in-out","fill":"both","iterations":1}] [{"selector":"#anim-8fd5f301-8bf3-4be3-baaa-a52585a86c7d [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(-12.514648210040342%, 0, 0) translate(-25%, 0%) scale(1.5)","translate3d(0%, 0, 0) translate(0%, 0%) scale(1)"]},"delay":0,"duration":2000,"fill":"forwards"}]

The IoT device receives information from the API in the form of a text response with additional instructions, a transcript of the audio, or the next function to call

[{"selector":"#anim-27331c55-9aa4-4d2f-8767-ca5c78fb81a3","keyframes":{"transform":["translate3d(-114.46541%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":1000,"easing":"cubic-bezier(.2, 0, .8, 1)","fill":"both"}] [{"selector":"#anim-1c508f4c-b739-4e74-a841-e912a14c0db6","keyframes":{"transform":["rotateZ(-180deg)","rotateZ(0deg)"]},"delay":0,"duration":1000,"easing":"cubic-bezier(.2, 0, .5, 1)","fill":"forwards"}] [{"selector":"#anim-b3336496-339a-4155-bdb6-3247be756b89 [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(-12.514648210040342%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":2000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}]

Even for low-memory devices, developers can readily incorporate voice input, making interaction more natural and intuitive

[{"selector":"#anim-44eccfec-dfde-45c7-9f73-76caba02c84e","keyframes":{"transform":["rotate(-540deg) scale(0.1)","none"],"opacity":[0,1]},"delay":0,"duration":1000,"fill":"both","iterations":1}] [{"selector":"#anim-d2ffe97c-6714-42d3-9eb4-30ce2f31f216 [data-leaf-element=\"true\"]","keyframes":{"transform":["translate3d(12.514648210040342%, 0, 0)","translate3d(0%, 0, 0)"]},"delay":0,"duration":2000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"both"}]

Gemini AI makes gadgets more dynamic and able to do complex tasks by automatically choosing the right action based on user intent

[{"selector":"#anim-e67a6e1d-6fad-4f07-a039-e0bcef30787e","keyframes":{"transform":["translate3d(-114.46541%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-1a96c046-c3cc-4ee2-ae3c-d1129e5dca15","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-07eaf4a8-d811-4f42-b9c5-009c4798b07d","keyframes":{"transform":["scale(0.15)","scale(1)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"forwards"}] [{"selector":"#anim-3c0a04e0-5385-49b0-88a6-4cba2fc54483 [data-leaf-element=\"true\"]","keyframes":{"transform":["translate(0%, 0%) scale(1.5)","translate(0%, 0%) scale(1)"]},"delay":0,"duration":2000,"easing":"cubic-bezier(.3,0,.55,1)","fill":"forwards"}]

Verbally command robots or provide photos or video to the Gemini API for navigation, task execution, and interactivity to automate repetitive tasks and help in many scenarios For More Details Visit Govindhtech.com