Claude Code Voice Mode

(support.claude.com)

24 points | by linolevan 2 days ago

12 comments

  • Brajeshwar 2 days ago

    If you already use a universal (OS-wide) voice tool, it should just work in Claude or anywhere else. Even though I’m a smooth and pretty quick typist, I sometimes speak using Handy[1] and Claude types it out. Handy is seriously accurate.

    It does look like more and more apps and tools are going to ship with this built in: you can either type or talk, go wild.

    Ask: Someone commented some time back that they mapped their CAPSLOCK key to push-to-talk. I’ve looked around and could not figure out how to do it without a third-party tool. I’d love to know how to do it in macOS: map the unused CAPSLOCK key to push-to-talk.

    1. https://handy.computer

    • nivertech a day ago

      Claude Code's /voice mode isn't very accurate compared to other speech-to-text tools (e.g. Google Translate).

      Also, it takes about 500ms to activate after you press SPACE, so the first words get cut off. It took me some time to realize this.

      Does Handy have better speech-to-text accuracy? Is there an activation delay?

    • jaen 2 days ago

      I've had success in the past in customizing macOS key bindings using Karabiner: https://karabiner-elements.pqrs.org/

      • Brajeshwar 8 hours ago

        Trying this one. It worked with a simple rule mapping CAPSLOCK to the default push-to-talk shortcut (OPTION + Space):

           ```
           {
               "description": "Caps Lock to Option+Space (Handy push-to-talk)",
               "manipulators": [
                   {
                       "type": "basic",
                       "from": { "key_code": "caps_lock" },
                       "to": [
                           {
                               "key_code": "spacebar",
                               "modifiers": ["left_option"]
                           }
                       ]
                   }
               ]
           }
           ```
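
        For anyone who would rather import the rule as a file than paste it into the settings UI, here is a minimal Python sketch. It assumes Karabiner-Elements accepts importable rule files shaped as a "title" plus a "rules" list. The function names, the file name `capslock_ptt.json`, and the title string are made up for illustration; the rule itself is the one above.

           ```python
           import json
           from pathlib import Path

           # The rule from the comment above, unchanged.
           RULE = {
               "description": "Caps Lock to Option+Space (Handy push-to-talk)",
               "manipulators": [
                   {
                       "type": "basic",
                       "from": {"key_code": "caps_lock"},
                       "to": [{"key_code": "spacebar", "modifiers": ["left_option"]}],
                   }
               ],
           }


           def build_rule_file(title="Caps Lock push-to-talk"):
               """Wrap the rule in a {"title", "rules"} document (assumed import format)."""
               return {"title": title, "rules": [RULE]}


           def write_rule_file(directory, filename="capslock_ptt.json"):
               """Write the wrapped rule as JSON into `directory` and return the file path."""
               path = Path(directory) / filename
               path.parent.mkdir(parents=True, exist_ok=True)
               path.write_text(json.dumps(build_rule_file(), indent=2) + "\n")
               return path
           ```

        To try it, call `write_rule_file(Path.home() / ".config/karabiner/assets/complex_modifications")`. As far as I know that is the directory Karabiner-Elements reads importable rules from, and the rule should then show up under Complex Modifications, where it still has to be enabled by hand.
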
  • ywatanabe1989 16 hours ago

    If voice mode isn't limited to interactive conversations, I use my custom TTS MCP server: https://github.com/ywatanabe1989/scitex-audio

    It lets agents deliver audio feedback through the speakers on my desk, which act as a central notifier, relayed over SSH tunneling.

    If this is off topic here, I apologize for that.

  • jmathai 2 days ago

    Slightly related. Something I love to do is to tell Claude to generate a detailed explanation of a topic I want to learn more about (recently, it was Donor Advised Funds).

    If I’m going to pick up my kids, I tell it to make the explanation 10 minutes long when read aloud. Then I listen to that on my drive.

  • emschwartz 2 days ago

    This is great. It would be nice to be able to add a custom dictionary for words that are often misunderstood.

  • aeve890 2 days ago

    I'm not seeing anything related to Claude Code. It's the desktop and mobile Claude app with the same feature I've already seen in competing products. Voice isn't even available in the remote control mode of Claude Code.

    • _aavaa_ 2 days ago

      They’re doing a staged rollout. /voice will work in Claude Code if they’ve rolled it out to your account.

      • furyofantares 2 days ago

        Well, it's not mentioned at all in the linked page.

        I tried /voice in Claude Code and it lets me hold space to get speech to text into my CLI. Not really what one expects from a "voice mode". And, again, not related to anything in the article.

        • nivertech a day ago

          Yes, here voice mode != hands-off mode, so you can't DWC (Driving-While-Coding).

          • furyofantares a day ago

            Yeah, it doesn't really add much. If you want dictation, your OS very likely has it built in. Certainly wouldn't be worth submitting to HN.