A Speaking Coach for Every Student

Your voice matters. Your accent is yours. We're here to help you be heard.

Un coach de habla para cada estudiante

Tu voz importa. Tu acento es tuyo. Estamos aquí para ayudarte a que te escuchen.

MuggsOfDialect is a free, browser-based speaking coach built for classrooms — with one rule that makes it fundamentally different from every other pronunciation and speaking tool on the market: the teacher is always at the helm.

No AI-generated feedback ever reaches a student without a teacher seeing it first. The AI does the time-consuming work — transcribing recordings, scoring words, grouping pronunciation patterns. But every assessment is a draft. The teacher listens to the recording, reviews the AI's analysis, confirms what's accurate, removes what isn't, and adds their own insight and encouragement. Then — and only then — does the student see their feedback. The teacher decides what the student hears about their own voice. Not an algorithm.

This is teacher-in-the-loop by design, not as an afterthought. Most speaking tools hand students an AI score and call it done. MuggsOfDialect puts the teacher's expertise, judgment, and relationship with the student at the center of every interaction. The AI is a powerful drafting tool. The teacher is the coach.

The tool works in two modes. In Presentation Mode, any student can practice public speaking skills — pacing, clarity, filler words, confidence. In Accent Coaching Mode, bilingual students write in their home language (starting with Spanish), the text is translated to English, and they practice reading their own ideas in English. The pronunciation analysis identifies which sounds are challenging based on the student's first language and groups them into learnable patterns. In both modes, the teacher reviews everything.

MuggsOfDialect is not a speech therapy tool. It is not an accent reduction program. It is a dialect coach — the same kind of tool actors and public speakers use to expand their range. Every accent is valid. This tool helps students add English speaking confidence to the skills they already have.

MuggsOfDialect es un coach de habla gratuito, basado en el navegador, creado para el aula — con una regla que lo hace fundamentalmente diferente de cualquier otra herramienta de pronunciación en el mercado: el maestro siempre tiene el control.

Ninguna retroalimentación generada por inteligencia artificial llega a un estudiante sin que un maestro la vea primero. La IA hace el trabajo que consume tiempo — transcribir grabaciones, evaluar palabras, agrupar patrones de pronunciación. Pero cada evaluación es un borrador. El maestro escucha la grabación, revisa el análisis de la IA, confirma lo que es correcto, elimina lo que no lo es, y agrega sus propias observaciones y palabras de ánimo. Solo entonces — y solo entonces — el estudiante ve su retroalimentación. El maestro decide lo que el estudiante escucha sobre su propia voz. No un algoritmo.

Esto es un maestro en el proceso por diseño, no como algo agregado después. La mayoría de las herramientas de habla le dan al estudiante una puntuación de IA y dan por terminado. MuggsOfDialect pone la experiencia, el juicio y la relación del maestro con el estudiante en el centro de cada interacción. La IA es una herramienta de borrador poderosa. El maestro es el coach.

La herramienta funciona en dos modos. En el Modo de Presentación, cualquier estudiante puede practicar habilidades para hablar en público — ritmo, claridad, muletillas, confianza. En el Modo de Coaching de Acento, los estudiantes bilingües escriben en su idioma natal (comenzando con español), el texto se traduce al inglés, y practican leyendo sus propias ideas en inglés. El análisis de pronunciación identifica qué sonidos son difíciles según el primer idioma del estudiante y los agrupa en patrones que se pueden aprender. En ambos modos, el maestro revisa todo.

MuggsOfDialect no es una herramienta de terapia del habla. No es un programa de reducción de acento. Es un coach de dialectos — el mismo tipo de herramienta que usan los actores y oradores para ampliar su rango. Cada acento es válido. Esta herramienta ayuda a los estudiantes a agregar confianza al hablar inglés a las habilidades que ya tienen.

How It Works

Write

Student types or pastes their text. In Accent Coaching Mode, they write in Spanish and the tool translates it to English. The teacher reviews and approves the translation first.

Record

Student reads the English text aloud using their device's microphone. Works on any device with a browser — Chromebooks, phones, laptops, classroom PCs.

Analyze

The tool transcribes the recording using AI speech recognition and compares it word-by-word against the expected text. Struggling words are flagged and grouped by pronunciation pattern.

Review

The teacher sees the AI's draft assessment. They confirm flags, remove false flags, adjust groupings, and add a personal comment. Nothing reaches the student without teacher approval.

Practice

The student receives feedback with flagged words grouped by pattern. They can tap any word to hear it pronounced by real human voices. The teacher's personal note appears alongside the analysis.

Improve

The student records the same text again. The tool tracks what improved and what still needs work. Over time, students see their own growth across multiple attempts.

Cómo funciona

Escribir

El estudiante escribe o pega su texto. En el Modo de Coaching de Acento, escriben en español y la herramienta lo traduce al inglés. El maestro revisa y aprueba la traducción primero.

Grabar

El estudiante lee el texto en inglés en voz alta usando el micrófono de su dispositivo. Funciona en cualquier dispositivo con navegador — Chromebooks, teléfonos, laptops, computadoras del aula.

Analizar

La herramienta transcribe la grabación usando reconocimiento de voz con IA y la compara palabra por palabra con el texto esperado. Las palabras difíciles se marcan y se agrupan por patrón de pronunciación.

Revisar

El maestro ve la evaluación preliminar de la IA. Confirma las marcas, elimina las incorrectas, ajusta las agrupaciones y agrega un comentario personal. Nada llega al estudiante sin la aprobación del maestro.

Practicar

El estudiante recibe retroalimentación con palabras marcadas agrupadas por patrón. Puede tocar cualquier palabra para escucharla pronunciada por voces humanas reales. La nota personal del maestro aparece junto al análisis.

Mejorar

El estudiante graba el mismo texto otra vez. La herramienta rastrea qué mejoró y qué todavía necesita trabajo. Con el tiempo, los estudiantes ven su propio crecimiento a través de múltiples intentos.

Core Principles

Teacher at the helm — always. This is not a preference. It is the architecture. Every AI-generated assessment is a draft that lives in the teacher's review queue. The teacher listens to the recording, confirms or overrides the AI's flags, adjusts pronunciation groupings, and writes a personal note — all before the student sees a single word of feedback. The AI never talks to the student. The teacher talks to the student, informed by the AI. This protects students from wrong assessments, builds trust, and ensures that the person who knows the student best is the one guiding their growth.

Accent is identity, not deficit. MuggsOfDialect never tells a student their accent is wrong. It helps them expand their range. The tool is a dialect coach, not a correction engine. Bilingual students are treated as people adding a skill, not fixing a problem.

Same prompt, ever improving. Students don't take a test and get a score. They record, get feedback, practice, and record again — on the same text. Improvement is visible and audible. They can hear themselves getting better.

Private by default. Only the student and their teacher see results. No leaderboards. No public comparisons. No classroom rankings. This is a practice space, not a performance stage.

A full literacy cycle, not just speaking. MuggsOfDialect looks like a speaking tool. It functions as a reading and writing tool too. The student has to write before they can record. They have to read the English text before they can speak it. They listen to pronunciation models and their own playback. They read their feedback, their teacher's notes, and the phoneme tips. Every loop through the tool exercises writing, reading, listening, and speaking — all four domains of English language proficiency, all from a single piece of the student's own work. In Louisiana, those are the same four domains measured by the ELPT. One tool, one assignment, four skills.

Principios fundamentales

El maestro siempre tiene el control. Esto no es una preferencia. Es la arquitectura. Cada evaluación generada por la IA es un borrador que permanece en la cola de revisión del maestro. El maestro escucha la grabación, confirma o anula las marcas de la IA, ajusta las agrupaciones de pronunciación y escribe una nota personal — todo antes de que el estudiante vea una sola palabra de retroalimentación. La IA nunca le habla al estudiante. El maestro le habla al estudiante, informado por la IA. Esto protege a los estudiantes de evaluaciones incorrectas, genera confianza y asegura que la persona que mejor conoce al estudiante sea quien guíe su crecimiento.

El acento es identidad, no deficiencia. MuggsOfDialect nunca le dice a un estudiante que su acento está mal. Le ayuda a ampliar su rango. La herramienta es un coach de dialectos, no un motor de corrección. Los estudiantes bilingües son tratados como personas que están sumando una habilidad, no arreglando un problema.

El mismo texto, siempre mejorando. Los estudiantes no toman un examen y reciben una calificación. Graban, reciben retroalimentación, practican y graban de nuevo — con el mismo texto. La mejora es visible y audible. Pueden escucharse a sí mismos mejorando.

Privado por defecto. Solo el estudiante y su maestro ven los resultados. Sin tablas de clasificación. Sin comparaciones públicas. Sin rankings del aula. Este es un espacio de práctica, no un escenario de actuación.

Un ciclo completo de alfabetización, no solo habla. MuggsOfDialect parece una herramienta de habla. Pero también funciona como herramienta de lectura y escritura. El estudiante tiene que escribir antes de poder grabar. Tiene que leer el texto en inglés antes de poder decirlo. Escucha modelos de pronunciación y su propia grabación. Lee su retroalimentación, las notas de su maestro y los consejos sobre fonemas. Cada ciclo con la herramienta ejercita la escritura, la lectura, la comprensión auditiva y el habla — los cuatro dominios de competencia en el idioma inglés, todo a partir de un solo trabajo del propio estudiante. En Luisiana, esos son los mismos cuatro dominios que mide el ELPT. Una herramienta, una tarea, cuatro habilidades.

Frequently Asked Questions

Preguntas frecuentes

For Teachers

How is this different from ELSA Speak, BoldVoice, or other pronunciation apps?

Those are excellent consumer apps for individuals, but they share a fundamental design choice that MuggsOfDialect rejects: they let AI talk directly to the student with no teacher in the loop.

In ELSA, BoldVoice, and similar apps, the AI scores your pronunciation and delivers feedback immediately. No human reviews it. No one who knows you checks whether the assessment is accurate or whether the feedback is appropriate for where you are in your learning. The AI is the coach.

In MuggsOfDialect, the AI is never the coach. The AI is a drafting tool that the teacher uses. Every assessment sits in a teacher review queue. The teacher listens to the actual recording, evaluates the AI's analysis, confirms what's right, removes what's wrong, and adds a personal comment that reflects their knowledge of that specific student. The student receives feedback from their teacher, informed by AI — not feedback from AI, unsupervised.

Beyond the teacher-at-helm difference: MuggsOfDialect uses the student's own writing as the practice text, not pre-made content. It's free, not a subscription. And the philosophy is fundamentally different — we don't reduce accents, we expand range. There is no "correct" accent target. There is only growth.

The closest classroom tool is ELSA for Schools, which offers a teacher dashboard for monitoring usage and progress. But even ELSA for Schools delivers AI feedback directly to students. The teacher can see what happened after the fact, but they can't review and approve before the student sees it. That's monitoring, not steering. MuggsOfDialect puts the teacher at the helm, not in the passenger seat.

What does the teacher dashboard look like?

When you log in, you see your classes and a review queue. The queue shows two types of items waiting for you: translations to review (in Accent Coaching Mode) and recordings to review.

When you review a recording, you see the text the student was reading, an audio player to listen to their recording, and the AI's draft assessment — a list of flagged words with confidence scores and proposed pronunciation pattern groupings. You can confirm flags with one click, remove false flags, add words the AI missed, adjust how words are grouped, and write a personal comment to the student. When you're satisfied, you release the feedback and the student can see it.

You can also view any student's full history — every prompt, every attempt, improvement over time, and which pronunciation patterns are resolved versus still active.

How much of my time does this take?

The AI does the heavy lifting — transcribing audio, comparing it to the expected text, scoring every word, and grouping flagged words by pattern. Your job is to listen to the recording, glance at the AI's draft, confirm or adjust, and add a personal note. For most recordings, this takes 2–3 minutes.

Early on, you'll spend more time calibrating — learning where the AI gets it right and where it doesn't for your specific students. Over time, your reviews get faster because you know your students' patterns and the AI's tendencies.

Can students review each other's recordings?

Not in the current version. The teacher-at-helm model means all feedback flows through the teacher first. Peer review is on the roadmap as a future feature — students who have progressed enough could earn a peer coach role, reviewing classmates' recordings with the same tools the teacher uses. But that requires trust in the system first, so we're building the foundation before adding that layer.

What if I don't speak Spanish? Can I still review translations?

Yes. The AI generates the translation, and you see the original Spanish and the English version side by side. You don't need to speak Spanish to evaluate whether the English translation reads naturally, uses appropriate vocabulary for the student's level, and sounds like something a student would actually say out loud. You're reviewing the English, not the Spanish. If you do have a Spanish teacher colleague, they can collaborate on the translation review step.

What data do you collect about my students?

Student name, class enrollment, their writing, their audio recordings, and their assessment results. Audio recordings are stored securely on Cloudflare R2. No email addresses or passwords are required for students — they join with a class code and their name. We do not sell data. We do not serve ads. We do not share student information with third parties. This is a nonprofit educational tool, not a commercial product.

Does this only help with speaking?

No — and this is one of the most important things to understand about MuggsOfDialect. It looks like a speaking tool, but every time a student uses it, they are writing, reading, listening, and speaking. They write first — in Spanish or English, they have to produce text before anything else happens. They read next — the English text, silently and then aloud. They listen — to pronunciation models, to their own playback, to the difference between the two. And then they speak — the recording and the word-level practice drills. That cycle repeats with every attempt. When a student re-records the same passage, they re-read it. When they practice a flagged word, they read the tip, listen to the model, and speak. Every loop through the tool is another rep across multiple literacy skills from a single piece of the student's own writing.

Does this align with the ELPT?

Directly. Louisiana's English Language Proficiency Test measures four domains: Reading, Writing, Listening, and Speaking. MuggsOfDialect exercises all four through a single workflow. The Speaking domain is the most obvious connection — on the ELPT, students record spoken responses into a computer for off-site scoring. MuggsOfDialect is a daily practice environment for that exact skill and that exact format. But the alignment goes deeper. The ELPT's Reading domain is exercised every time a student reads their English text. The Writing domain is exercised when they produce the original text. The Listening domain is exercised through pronunciation playback and self-monitoring.

All identified English Learners in Louisiana take the ELPT every spring. MuggsOfDialect gives them a way to practice year-round, on their own writing, with their own teacher's guidance — not just once a year at test time. For administrators: EL students' ELPT progress counts in the school's Assessment Index at equal weight to all other assessments. A tool that helps EL students improve across all four ELPT domains is a direct lever on school performance scores.

Para maestros

¿En qué se diferencia esto de ELSA Speak, BoldVoice u otras aplicaciones de pronunciación?

Esas son excelentes aplicaciones para uso individual, pero comparten una decisión de diseño fundamental que MuggsOfDialect rechaza: permiten que la IA le hable directamente al estudiante sin ningún maestro en el proceso.

En ELSA, BoldVoice y aplicaciones similares, la IA califica tu pronunciación y entrega retroalimentación de inmediato. Ningún humano la revisa. Nadie que te conozca verifica si la evaluación es correcta o si la retroalimentación es apropiada para donde estás en tu aprendizaje. La IA es el coach.

En MuggsOfDialect, la IA nunca es el coach. La IA es una herramienta de borrador que usa el maestro. Cada evaluación queda en una cola de revisión del maestro. El maestro escucha la grabación real, evalúa el análisis de la IA, confirma lo correcto, elimina lo incorrecto y agrega un comentario personal que refleja su conocimiento de ese estudiante en particular. El estudiante recibe retroalimentación de su maestro, informada por la IA — no retroalimentación de la IA sin supervisión.

Más allá de la diferencia del maestro al control: MuggsOfDialect usa la escritura propia del estudiante como texto de práctica, no contenido prefabricado. Es gratis, no es una suscripción. Y la filosofía es fundamentalmente diferente — no reducimos acentos, ampliamos el rango. No hay un acento “correcto” como meta. Solo hay crecimiento.

La herramienta de aula más cercana es ELSA for Schools, que ofrece un panel de control para monitorear el uso y progreso. Pero incluso ELSA for Schools entrega retroalimentación de la IA directamente a los estudiantes. El maestro puede ver lo que pasó después, pero no puede revisar y aprobar antes de que el estudiante lo vea. Eso es monitorear, no dirigir. MuggsOfDialect pone al maestro al volante, no en el asiento del pasajero.

¿Cómo se ve el panel de control del maestro?

Cuando inicias sesión, ves tus clases y una cola de revisión. La cola muestra dos tipos de elementos esperando por ti: traducciones por revisar (en el Modo de Coaching de Acento) y grabaciones por revisar.

Cuando revisas una grabación, ves el texto que el estudiante estaba leyendo, un reproductor de audio para escuchar su grabación, y la evaluación preliminar de la IA — una lista de palabras marcadas con puntuaciones de confianza y agrupaciones de patrones de pronunciación propuestas. Puedes confirmar marcas con un clic, eliminar marcas incorrectas, agregar palabras que la IA no detectó, ajustar cómo se agrupan las palabras y escribir un comentario personal para el estudiante. Cuando estés satisfecho, liberas la retroalimentación y el estudiante puede verla.

También puedes ver el historial completo de cualquier estudiante — cada tema, cada intento, mejoras a lo largo del tiempo, y qué patrones de pronunciación están resueltos y cuáles siguen activos.

¿Cuánto de mi tiempo requiere esto?

La IA hace el trabajo pesado — transcribir audio, compararlo con el texto esperado, calificar cada palabra y agrupar las palabras marcadas por patrón. Tu trabajo es escuchar la grabación, revisar el borrador de la IA, confirmar o ajustar, y agregar una nota personal. Para la mayoría de las grabaciones, esto toma de 2 a 3 minutos.

Al principio, pasarás más tiempo calibrando — aprendiendo dónde la IA acierta y dónde no para tus estudiantes específicos. Con el tiempo, tus revisiones se hacen más rápidas porque conoces los patrones de tus estudiantes y las tendencias de la IA.

¿Pueden los estudiantes revisar las grabaciones de sus compañeros?

No en la versión actual. El modelo del maestro al control significa que toda la retroalimentación pasa primero por el maestro. La revisión entre compañeros está en la hoja de ruta como una función futura — los estudiantes que hayan progresado lo suficiente podrían ganar un rol de coach de compañeros, revisando las grabaciones de sus compañeros con las mismas herramientas que usa el maestro. Pero eso requiere confianza en el sistema primero, así que estamos construyendo la base antes de agregar esa capa.

¿Qué pasa si no hablo español? ¿Puedo revisar las traducciones de todas formas?

Sí. La IA genera la traducción, y tú ves el original en español y la versión en inglés lado a lado. No necesitas hablar español para evaluar si la traducción al inglés se lee de forma natural, usa vocabulario apropiado para el nivel del estudiante, y suena como algo que un estudiante realmente diría en voz alta. Estás revisando el inglés, no el español. Si tienes un colega maestro de español, pueden colaborar en el paso de revisión de la traducción.

¿Qué datos recopilan sobre mis estudiantes?

Nombre del estudiante, inscripción en la clase, su escritura, sus grabaciones de audio y sus resultados de evaluación. Las grabaciones de audio se almacenan de forma segura en Cloudflare R2. No se requieren direcciones de correo electrónico ni contraseñas para los estudiantes — se unen con un código de clase y su nombre. No vendemos datos. No mostramos anuncios. No compartimos información de estudiantes con terceros. Esta es una herramienta educativa sin fines de lucro, no un producto comercial.

¿Esto solo ayuda con el habla?

No — y esto es una de las cosas más importantes de entender sobre MuggsOfDialect. Parece una herramienta de habla, pero cada vez que un estudiante la usa, está escribiendo, leyendo, escuchando y hablando. Escriben primero — en español o en inglés, tienen que producir texto antes de que ocurra cualquier otra cosa. Leen después — el texto en inglés, en silencio y luego en voz alta. Escuchan — modelos de pronunciación, su propia grabación, la diferencia entre ambos. Y luego hablan — la grabación y los ejercicios de práctica palabra por palabra. Ese ciclo se repite con cada intento. Cuando un estudiante vuelve a grabar el mismo pasaje, lo vuelve a leer. Cuando practican una palabra marcada, leen el consejo, escuchan el modelo y hablan. Cada ciclo con la herramienta es otra repetición a través de múltiples habilidades de alfabetización a partir de un solo escrito del propio estudiante.

¿Esto se alinea con el ELPT?

Directamente. La prueba de competencia del idioma inglés de Luisiana (ELPT) mide cuatro dominios: Lectura, Escritura, Comprensión Auditiva y Habla. MuggsOfDialect ejercita los cuatro a través de un solo flujo de trabajo. El dominio de Habla es la conexión más obvia — en el ELPT, los estudiantes graban respuestas orales en una computadora para ser calificadas fuera del centro. MuggsOfDialect es un entorno de práctica diaria para exactamente esa habilidad y exactamente ese formato. Pero la alineación va más allá. El dominio de Lectura del ELPT se ejercita cada vez que un estudiante lee su texto en inglés. El dominio de Escritura se ejercita cuando producen el texto original. El dominio de Comprensión Auditiva se ejercita a través de la reproducción de pronunciación y el automonitoreo.

Todos los estudiantes identificados como aprendices de inglés en Luisiana toman el ELPT cada primavera. MuggsOfDialect les da una manera de practicar durante todo el año, con su propia escritura, con la guía de su propio maestro — no solo una vez al año en época de examen. Para los administradores: el progreso de los estudiantes EL en el ELPT cuenta en el Índice de Evaluación de la escuela con el mismo peso que todas las demás evaluaciones. Una herramienta que ayuda a los estudiantes EL a mejorar en los cuatro dominios del ELPT es una palanca directa sobre las puntuaciones de rendimiento escolar.

For Students

What is MuggsOfDialect?

It's your personal speaking coach. You write something, record yourself reading it, and get feedback on which words to practice and why. Then you practice, record again, and see yourself improve.

Is this a test?

No. There are no grades. There is no pass or fail. This is a practice tool. You record, you get feedback, you try again. The only person who sees your results is you and your teacher.

Do I need to download anything?

No. MuggsOfDialect works in your web browser. If your device has a microphone, you're good. It works on Chromebooks, phones, laptops, tablets, and desktop computers.

What if I have an accent?

Good. Everyone has an accent. Your accent is part of who you are and where you come from. MuggsOfDialect is not here to take that away. It's here to help you feel confident speaking English in addition to your home language. Think of it like a dialect coach for an actor — you're adding range, not erasing anything.

What happens when I click "Listen" on a practice word?

You hear that word pronounced by a real person — not a robot voice. For some words, you can hear multiple speakers from different countries, so you can hear how the same word sounds in different accents. All pronunciations are powered by real human recordings.

Can I practice as many times as I want?

Yes. You can record the same text as many times as you want. Each time, the tool shows what improved since your last attempt. That's the whole point — practice until you hear the difference yourself.

Para estudiantes

¿Qué es MuggsOfDialect?

Es tu coach personal de habla. Escribes algo, te grabas leyéndolo, y recibes retroalimentación sobre qué palabras practicar y por qué. Luego practicas, grabas de nuevo, y te ves mejorar.

¿Es esto un examen?

No. No hay calificaciones. No hay aprobar o reprobar. Esto es una herramienta de práctica. Grabas, recibes retroalimentación, intentas de nuevo. La única persona que ve tus resultados eres tú y tu maestro.

¿Necesito descargar algo?

No. MuggsOfDialect funciona en tu navegador web. Si tu dispositivo tiene micrófono, estás listo. Funciona en Chromebooks, teléfonos, laptops, tabletas y computadoras de escritorio.

¿Qué pasa si tengo acento?

Bien. Todos tienen acento. Tu acento es parte de quien eres y de dónde vienes. MuggsOfDialect no está aquí para quitarte eso. Está aquí para ayudarte a sentirte seguro hablando inglés además de tu idioma natal. Piénsalo como un coach de dialectos para un actor — estás ampliando tu rango, no borrando nada.

¿Qué pasa cuando hago clic en “Escuchar” en una palabra de práctica?

Escuchas esa palabra pronunciada por una persona real — no una voz de robot. Para algunas palabras, puedes escuchar a múltiples hablantes de diferentes países, para que puedas oír cómo suena la misma palabra en diferentes acentos. Todas las pronunciaciones son de grabaciones humanas reales.

¿Puedo practicar cuantas veces quiera?

Sí. Puedes grabar el mismo texto cuantas veces quieras. Cada vez, la herramienta muestra qué mejoró desde tu último intento. Ese es todo el punto — practicar hasta que tú mismo escuches la diferencia.

For Administrators

What does this cost?

MuggsOfDialect is free for students and schools. The tool is built and maintained as a nonprofit educational initiative sponsored by Irish Network NOLA, a 501(c)(3) organization. Infrastructure costs are minimal — the tool runs on Cloudflare's edge network, speech analysis uses Cloudflare Workers AI, and the pronunciation library uses a combination of free open-source APIs and Forvo (a human pronunciation database at $2/month under nonprofit pricing). A full classroom of students recording weekly for an entire school year costs less than a single textbook in API fees.

What technology do students need?

A device with a web browser and a microphone. That's it. No app downloads, no software installations, no special hardware. Chromebooks, smartphones, tablets, laptops, and desktop computers all work. The tool is designed for the devices schools already have.

Is this FERPA compliant?

MuggsOfDialect collects minimal student data — name and class enrollment only. No email addresses, no passwords, no personally identifiable information beyond what the teacher provides. Audio recordings are stored securely and are only accessible to the student and their teacher. We do not sell, share, or monetize student data in any way. We recommend schools review the tool with their data privacy officer, and we are happy to provide technical documentation about our data handling practices.

How is this different from hiring a speech pathologist?

MuggsOfDialect is not a clinical tool and does not replace speech-language pathology services. It is an educational tool for building speaking confidence and English pronunciation skills, the same way a writing center helps students improve their writing. Students who need clinical speech services should continue receiving them. MuggsOfDialect serves the much larger population of students who simply need practice, feedback, and encouragement to speak confidently — whether they're bilingual students adding English pronunciation, or any student preparing for a class presentation.

Who built this?

MuggsOfDialect was designed and built by Sean Muggivan, a licensed clinical social worker and educator with 11+ years of classroom experience in New Orleans public schools. It is part of the Muggs of Learning ecosystem — a collection of teacher-built educational tools focused on giving schools capabilities rather than selling them subscriptions. The tool is sponsored by Irish Network NOLA, a 501(c)(3) organization connecting Irish heritage to community service in New Orleans.

How does this affect our school's accountability scores?

In Louisiana, English Learner progress on the ELPT counts in the Assessment Index at equal weight to all other assessments. MuggsOfDialect directly supports student growth across all four ELPT domains — Reading, Writing, Listening, and Speaking — through a single workflow. Students who practice regularly with MuggsOfDialect are rehearsing the exact skills and the exact format the ELPT measures: speaking into a microphone, reading English text, listening and self-correcting. For schools with significant EL populations, consistent use of this tool throughout the year — not just test prep in the weeks before the ELPT — gives students sustained practice that translates to measurable proficiency gains. And because MuggsOfDialect also serves non-EL students through Presentation Mode, it's not a niche accommodation — it's a building-wide communication skills tool that happens to deliver targeted EL support.

Para administradores

¿Cuánto cuesta esto?

MuggsOfDialect es gratis para estudiantes y escuelas. La herramienta se construye y mantiene como una iniciativa educativa sin fines de lucro patrocinada por Irish Network NOLA, una organización 501(c)(3). Los costos de infraestructura son mínimos — la herramienta funciona en la red de borde de Cloudflare, el análisis de habla usa Cloudflare Workers AI, y la biblioteca de pronunciación usa una combinación de APIs gratuitas de código abierto y Forvo (una base de datos de pronunciación humana a $2/mes bajo precios para organizaciones sin fines de lucro). Un aula completa de estudiantes grabando semanalmente durante todo un año escolar cuesta menos que un solo libro de texto en tarifas de API.

¿Qué tecnología necesitan los estudiantes?

Un dispositivo con navegador web y micrófono. Eso es todo. Sin descargas de aplicaciones, sin instalaciones de software, sin hardware especial. Chromebooks, teléfonos inteligentes, tabletas, laptops y computadoras de escritorio funcionan. La herramienta está diseñada para los dispositivos que las escuelas ya tienen.

¿Cumple con FERPA?

MuggsOfDialect recopila datos mínimos de los estudiantes — solo nombre e inscripción en la clase. Sin direcciones de correo electrónico, sin contraseñas, sin información de identificación personal más allá de lo que el maestro proporciona. Las grabaciones de audio se almacenan de forma segura y solo son accesibles para el estudiante y su maestro. No vendemos, compartimos ni monetizamos datos de estudiantes de ninguna manera. Recomendamos que las escuelas revisen la herramienta con su oficial de privacidad de datos, y estamos encantados de proporcionar documentación técnica sobre nuestras prácticas de manejo de datos.

¿En qué se diferencia esto de contratar a un patólogo del habla?

MuggsOfDialect no es una herramienta clínica y no reemplaza los servicios de patología del habla y lenguaje. Es una herramienta educativa para desarrollar confianza al hablar y habilidades de pronunciación en inglés, de la misma manera que un centro de escritura ayuda a los estudiantes a mejorar su escritura. Los estudiantes que necesitan servicios clínicos de habla deben seguir recibiéndolos. MuggsOfDialect atiende a la población mucho más grande de estudiantes que simplemente necesitan práctica, retroalimentación y ánimo para hablar con confianza — ya sean estudiantes bilingües sumando pronunciación en inglés, o cualquier estudiante preparándose para una presentación en clase.

¿Quién construyó esto?

MuggsOfDialect fue diseñado y construido por Sean Muggivan, un trabajador social clínico licenciado y educador con más de 11 años de experiencia en el aula en escuelas públicas de Nueva Orleans. Es parte del ecosistema Muggs of Learning — una colección de herramientas educativas creadas por maestros, enfocadas en darle capacidades a las escuelas en lugar de venderles suscripciones. La herramienta es patrocinada por Irish Network NOLA, una organización 501(c)(3) que conecta la herencia irlandesa con el servicio comunitario en Nueva Orleans.

¿Cómo afecta esto las puntuaciones de responsabilidad de nuestra escuela?

En Luisiana, el progreso de los aprendices de inglés en el ELPT cuenta en el Índice de Evaluación con el mismo peso que todas las demás evaluaciones. MuggsOfDialect apoya directamente el crecimiento estudiantil en los cuatro dominios del ELPT — Lectura, Escritura, Comprensión Auditiva y Habla — a través de un solo flujo de trabajo. Los estudiantes que practican regularmente con MuggsOfDialect están ensayando las habilidades exactas y el formato exacto que mide el ELPT: hablar frente a un micrófono, leer texto en inglés, escuchar y autocorregirse. Para las escuelas con poblaciones significativas de aprendices de inglés, el uso constante de esta herramienta durante todo el año — no solo preparación para el examen en las semanas antes del ELPT — les da a los estudiantes práctica sostenida que se traduce en ganancias medibles de competencia. Y como MuggsOfDialect también atiende a estudiantes que no son EL a través del Modo de Presentación, no es una adaptación de nicho — es una herramienta de habilidades de comunicación para todo el edificio que también brinda apoyo focalizado a los estudiantes EL.

How It Works (Technical)

How does the pronunciation analysis work?

When a student records themselves reading a text, the audio is sent to a speech recognition engine (Whisper, an open-source model by OpenAI running on Cloudflare's infrastructure). Whisper transcribes the audio and provides word-level data about what it heard. The tool then compares what Whisper heard against the expected text, word by word. Words where Whisper's transcription doesn't match the expected text, or where the recognition confidence is low, are flagged for review.

Flagged words are then looked up in a pronunciation dictionary (the CMU Pronouncing Dictionary, a free, public-domain resource) to identify which specific sounds — called phonemes — are present in each word. The tool cross-references these phonemes against known pronunciation patterns for the student's first language. For example, Spanish speakers commonly find "th" sounds challenging because that sound doesn't exist in Spanish. Words containing "th" are grouped together so the student practices the underlying skill, not just individual words.

This draft analysis goes to the teacher for review before the student sees anything.

Where do the pronunciation audio samples come from?

Two sources. The default "Listen" button uses the Free Dictionary API, which provides a single high-quality pronunciation recording for most English words — free, unlimited, no account required. For students who want to hear a word spoken by multiple people from different regions, MuggsOfDialect offers a "Hear More Voices" feature powered by Forvo, the world's largest human pronunciation database with over 6 million recordings in 430+ languages. Forvo recordings are made by real people from real places — not text-to-speech — and can be filtered by country and gender. Forvo is used under a nonprofit API plan registered through Irish Network NOLA.

What is the CMU Pronouncing Dictionary?

The Carnegie Mellon University Pronouncing Dictionary is a free, open-source, machine-readable pronunciation dictionary of North American English. It maps over 134,000 English words to their phoneme sequences using a standardized phoneme set called ARPABET. MuggsOfDialect uses this dictionary to identify which specific sounds appear in each flagged word, which enables the phoneme pattern grouping that makes practice recommendations meaningful rather than just a random word list.

What are phoneme pattern groups?

When the tool flags words a student struggled with, it doesn't just list them randomly. It groups them by the specific sound pattern that made them difficult. For a Spanish-speaking student, the flagged words "three," "think," and "the" would all be grouped under "Th sounds" because the "th" phoneme doesn't exist in Spanish. The words "street," "sleep," and "speak" would be grouped under "Initial consonant clusters" because Spanish doesn't start words with an "s" followed immediately by another consonant.

This grouping means the student isn't memorizing isolated words — they're practicing a transferable skill. Once you learn the "th" sound, it applies to every word with that sound, not just the three that were flagged.

The current pattern groups for Spanish speakers include: Th sounds, Sh vs Ch distinction, short vowel pairs, initial S-clusters, V vs B, J vs Y, Z sound, -ng endings, and final consonant clusters. Additional pattern sets for other first languages (Haitian Creole, Vietnamese, Arabic) are planned.

Does the AI talk directly to students?

Never. This is the core design principle of MuggsOfDialect. The AI generates a draft assessment — a suggested list of flagged words, proposed pattern groupings, and confidence scores. That draft goes to the teacher's review queue and stays there until a teacher acts on it.

The teacher can confirm, override, remove, or adjust anything in the draft, and add their own personal comment. Only after the teacher explicitly releases the assessment does the student see any feedback. There is no auto-release by default. There is no "skip review" option. The teacher is always at the helm.

This matters because AI gets things wrong. A low confidence score might be mic noise, not a pronunciation issue. A flagged word might be one the teacher knows the student actually said fine. A wrong assessment can damage a student's confidence. The teacher catches those errors before they ever reach the student. That's the whole point of teacher at the helm — the AI handles volume, the teacher handles judgment.

Can students use this at home?

Yes. MuggsOfDialect runs entirely in the browser. Any device with a microphone and internet access works. Students can practice at home, on the bus, in the library — anywhere they can speak into their device. Recordings upload to the cloud and enter the teacher's review queue regardless of where the student is.

What happens to recordings over time?

All recordings are stored securely and linked to the student's account. When a student records multiple attempts on the same text, all attempts are preserved so the teacher and student can track improvement over time. Recordings are never shared publicly and are only accessible to the student and their teacher.

Cómo funciona (técnico)

¿Cómo funciona el análisis de pronunciación?

Cuando un estudiante se graba leyendo un texto, el audio se envía a un motor de reconocimiento de voz (Whisper, un modelo de código abierto de OpenAI que funciona en la infraestructura de Cloudflare). Whisper transcribe el audio y proporciona datos a nivel de palabra sobre lo que escuchó. Luego, la herramienta compara lo que Whisper escuchó con el texto esperado, palabra por palabra. Las palabras donde la transcripción de Whisper no coincide con el texto esperado, o donde la confianza del reconocimiento es baja, se marcan para revisión.

Las palabras marcadas luego se buscan en un diccionario de pronunciación (el CMU Pronouncing Dictionary, un recurso gratuito de dominio público) para identificar qué sonidos específicos — llamados fonemas — están presentes en cada palabra. La herramienta cruza estos fonemas con patrones de pronunciación conocidos del primer idioma del estudiante. Por ejemplo, los hispanohablantes comúnmente encuentran difíciles los sonidos “th” porque ese sonido no existe en español. Las palabras que contienen “th” se agrupan para que el estudiante practique la habilidad subyacente, no solo palabras individuales.

Este análisis preliminar va al maestro para revisión antes de que el estudiante vea algo.

¿De dónde vienen las muestras de audio de pronunciación?

Dos fuentes. El botón predeterminado de “Escuchar” usa la API de Free Dictionary, que proporciona una sola grabación de pronunciación de alta calidad para la mayoría de las palabras en inglés — gratis, ilimitada, sin necesidad de cuenta. Para los estudiantes que quieren escuchar una palabra dicha por múltiples personas de diferentes regiones, MuggsOfDialect ofrece una función de “Escuchar más voces” impulsada por Forvo, la base de datos de pronunciación humana más grande del mundo con más de 6 millones de grabaciones en más de 430 idiomas. Las grabaciones de Forvo son hechas por personas reales de lugares reales — no texto a voz — y se pueden filtrar por país y género. Forvo se usa bajo un plan de API para organizaciones sin fines de lucro registrado a través de Irish Network NOLA.

¿Qué es el CMU Pronouncing Dictionary?

El CMU Pronouncing Dictionary (Diccionario de Pronunciación de Carnegie Mellon University) es un diccionario de pronunciación de inglés norteamericano gratuito, de código abierto y legible por máquinas. Mapea más de 134,000 palabras en inglés a sus secuencias de fonemas usando un conjunto estandarizado de fonemas llamado ARPABET. MuggsOfDialect usa este diccionario para identificar qué sonidos específicos aparecen en cada palabra marcada, lo que permite que la agrupación por patrones de fonemas haga que las recomendaciones de práctica sean significativas en lugar de solo una lista aleatoria de palabras.

¿Qué son los grupos de patrones de fonemas?

Cuando la herramienta marca palabras con las que un estudiante tuvo dificultad, no las lista al azar. Las agrupa por el patrón de sonido específico que las hizo difíciles. Para un estudiante hispanohablante, las palabras marcadas “three,” “think” y “the” se agruparían todas bajo “Sonidos Th” porque el fonema “th” no existe en español. Las palabras “street,” “sleep” y “speak” se agruparían bajo “Grupos consonánticos iniciales” porque el español no comienza palabras con una “s” seguida inmediatamente de otra consonante.

Esta agrupación significa que el estudiante no está memorizando palabras aisladas — está practicando una habilidad transferible. Una vez que aprendes el sonido “th”, aplica a cada palabra con ese sonido, no solo a las tres que fueron marcadas.

Los grupos de patrones actuales para hispanohablantes incluyen: sonidos Th, distinción Sh vs Ch, pares de vocales cortas, grupos S iniciales, V vs B, J vs Y, sonido Z, terminaciones -ng y grupos consonánticos finales. Se planean conjuntos de patrones adicionales para otros primeros idiomas (criollo haitiano, vietnamita, árabe).

¿La IA habla directamente con los estudiantes?

Nunca. Este es el principio de diseño fundamental de MuggsOfDialect. La IA genera una evaluación preliminar — una lista sugerida de palabras marcadas, agrupaciones de patrones propuestas y puntuaciones de confianza. Ese borrador va a la cola de revisión del maestro y permanece ahí hasta que un maestro actúe.

El maestro puede confirmar, anular, eliminar o ajustar cualquier cosa en el borrador, y agregar su propio comentario personal. Solo después de que el maestro libera explícitamente la evaluación, el estudiante ve alguna retroalimentación. No hay liberación automática por defecto. No hay opción de “saltar revisión”. El maestro siempre tiene el control.

Esto importa porque la IA se equivoca. Una puntuación de confianza baja podría ser ruido del micrófono, no un problema de pronunciación. Una palabra marcada podría ser una que el maestro sabe que el estudiante en realidad dijo bien. Una evaluación incorrecta puede dañar la confianza de un estudiante. El maestro detecta esos errores antes de que lleguen al estudiante. Ese es todo el sentido de que el maestro tenga el control — la IA maneja el volumen, el maestro maneja el juicio.

¿Pueden los estudiantes usar esto en casa?

Sí. MuggsOfDialect funciona completamente en el navegador. Cualquier dispositivo con micrófono y acceso a internet funciona. Los estudiantes pueden practicar en casa, en el autobús, en la biblioteca — en cualquier lugar donde puedan hablar a su dispositivo. Las grabaciones se suben a la nube y entran en la cola de revisión del maestro sin importar dónde esté el estudiante.

¿Qué pasa con las grabaciones con el tiempo?

Todas las grabaciones se almacenan de forma segura y se vinculan a la cuenta del estudiante. Cuando un estudiante graba múltiples intentos del mismo texto, todos los intentos se conservan para que el maestro y el estudiante puedan seguir la mejora a lo largo del tiempo. Las grabaciones nunca se comparten públicamente y solo son accesibles para el estudiante y su maestro.