Design the part of the robot named Eliška
Návrh a realizace části robota Eliška
Authors
Supervisors
Reviewers
Editors
Other contributors
Journal Title
Journal ISSN
Volume Title
Publisher
České vysoké učení technické v Praze
Czech Technical University in Prague
Czech Technical University in Prague
Date of defense
2025-06-18
Abstract
Tato bakalářská práce se zabývá návrhem a realizací robotické hlavy Elišky, která je postavena z LEGO Mindstorms EV3. Hlava Elišky slouží pro propagaci fakulty. Eliška je schopná komunikovat s uživateli díky umělé inteligenci, což zahrnuje rozpoznávání a generování řeči. S pomocí počítačového vidění také dokáže sledovat okolní prostředí a reagovat na výzvy uživatele.
Hlava Elišky využívá pro pohyby dvě kostky EV3. Tyto pohyby zahrnují otáčení hlavy, pohyby očí, obočí, otevírání pusy a pohyby koutků. Komunikace mezi řídícím softwarem, napsaným v jazyce Python, a kostkami EV3 je realizována přes Bluetooth. Rozpoznávání hlasu je zajištěno systémy Whisper od OpenAI a Porcupine od Picovoice. Pro zpracování textu i obrazu je použit chatbot od společnosti Google--Gemini AI. Generování Eliščina hlasu obstarává ElevenLabs. Eliščino vidění je realizováno knihovnou OpenCV, která umožňuje sledování obličejů, rukou.
Cílem práce bylo vytvořit interaktivní robotickou hlavu z kostek LEGO, která slouží pro účely reprezentace fakulty. Výsledkem je systém schopný verbální i neverbální komunikace, který slouží pro demonstraci možností umělé inteligence a robotiky.
This bachelor's thesis focuses on the design and implementation of the robotic head Eliška, built using LEGO Mindstorms EV3. Eliška serves as a propagative project for the faculty. It is capable of communicating with users through artificial intelligence, including speech recognition and generation. With the help of computer vision, it can also observe its surroundings and respond to user prompts. Eliška's movements are controlled by two EV3 bricks, enabling head rotation, eye movement and eyebrow movement, mouth opening, and mouth corner movements. Communication between the control software, written in Python, and the EV3 bricks is carried out via Bluetooth. Speech recognition is handled by Whisper from OpenAI and Porcupine from Picovoice. For processing both text and images, a chatbot powered by Google's Gemini AI is used. Eliška's voice is generated using ElevenLabs. Her vision is implemented using the OpenCV library, allowing facial and hand tracking. The goal of this project was to create an interactive robotic head using LEGO bricks for faculty representation purposes. The result is a system capable of both verbal and non-verbal communication, demonstrating the possibilities of artificial intelligence and robotics.
This bachelor's thesis focuses on the design and implementation of the robotic head Eliška, built using LEGO Mindstorms EV3. Eliška serves as a propagative project for the faculty. It is capable of communicating with users through artificial intelligence, including speech recognition and generation. With the help of computer vision, it can also observe its surroundings and respond to user prompts. Eliška's movements are controlled by two EV3 bricks, enabling head rotation, eye movement and eyebrow movement, mouth opening, and mouth corner movements. Communication between the control software, written in Python, and the EV3 bricks is carried out via Bluetooth. Speech recognition is handled by Whisper from OpenAI and Porcupine from Picovoice. For processing both text and images, a chatbot powered by Google's Gemini AI is used. Eliška's voice is generated using ElevenLabs. Her vision is implemented using the OpenCV library, allowing facial and hand tracking. The goal of this project was to create an interactive robotic head using LEGO bricks for faculty representation purposes. The result is a system capable of both verbal and non-verbal communication, demonstrating the possibilities of artificial intelligence and robotics.
Description
Keywords
Robotická hlava, LEGO Mindstorms, EV3, Chatbot, Umělá inteligence, Rozpoznávání řeči, Generování hlasu, Počítačové vidění, Python, Bluetooth komunikace, Gemini AI, OpenCV, Whisper, ElevenLabs, Robot head, LEGO Mindstorms, EV3, Chatbot, Artificial intelligence, Speech recognition, Speech generation, Computer vision, Python, Bluetooth communication, Gemini AI, OpenCV, Whisper, ElevenLabs
Citation
Underlying research data set URL
Permanent link
Rights/License
Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem v platném znění.
A university thesis is a work protected by the Copyright Act of the Czech Republic. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one`s own expense. The use of thesis should be in compliance with the Copyright Act.
A university thesis is a work protected by the Copyright Act of the Czech Republic. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one`s own expense. The use of thesis should be in compliance with the Copyright Act.