27 comments

  • augusteo 4 days ago

    Building on zemo's point about parasocial relationships: traditional parasocial interaction involves a performer who doesn't know you exist. Here the AI does respond to you specifically, which changes the dynamic.

    Is it still parasocial if the other party is responsive but not conscious? Or is this something new that we don't have good language for yet?

    • idiotsecant 4 days ago

      I think maybe there needs to be a new word. It's still an asymmetric relationship. It's kind of a mix of DMing an influencer and chatting with the barista because you think she actually likes you. You're talking to a mirage.

    • summerlee9611 3 days ago

      I think “parasocial” still captures part of it (one-to-many distribution, performer vibe), but there’s also a true interactive dyad here. It’s closer to “synthetic social interaction” or “responsive parasocial.” I don’t have a perfect word yet, but the asymmetry and the responsiveness both matter.

    • Charon77 4 days ago

      You need to first prove that AI is not conscious.

      I find it hard to even convince others that I am a conscious person.

      Maybe consciousness is just a matter of belief: if I see this AI and believe that it's a person, then I am talking to a conscious entity.

      • summerlee9611 3 days ago

        I’m not trying to make any claims about consciousness. For us, the practical question is: does the interaction feel supportive and useful, while staying transparent that it’s a model? The rest is philosophy, and I’m happy to read more perspectives.

      • idiotsecant 4 days ago

        Give it access to a terminal and see what it does, unprompted. Does it explore? Does it develop interests? Does it change when exposed to new information?

        • summerlee9611 3 days ago

          We’re not giving it unconstrained tool access. In-product, actions are either not available or gated behind explicit user intent and strict allowlists. The interesting part for us is the real-time conversational loop and memory personalization, not autonomous exploration.
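
          A minimal sketch of the "explicit user intent plus strict allowlist" gate described above. The action names, the `ActionRequest` type, and `is_permitted` are hypothetical; the thread does not say how the product implements this.

```python
from dataclasses import dataclass

# Hypothetical allowlist; the product's real action names are not given in the thread.
ALLOWED_ACTIONS = frozenset({"set_reminder", "play_music"})


@dataclass(frozen=True)
class ActionRequest:
    name: str
    user_confirmed: bool  # True only when the user explicitly asked for this action


def is_permitted(request, allowlist=ALLOWED_ACTIONS):
    """Permit an action only if it is allowlisted AND the user explicitly asked for it."""
    return request.name in allowlist and request.user_confirmed
```

          Both conditions have to hold, so a model-initiated action is refused even when it is on the allowlist, and a user-requested action is refused when it is not.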

        • krackers 4 days ago

          >Does it change when exposed to new information?

          By this metric most humans are not conscious.

      • neidnejnd 4 days ago

        [flagged]

        • DauntingPear7 4 days ago

          Yeah the “rocks are alive and conscious” crowd are certainly something

  • nitroedge 4 days ago

    For better lip sync you could try using rhubarb to extract from the mp3. What is your backend speech processor so you can get the real-time streaming response? Rhubarb would add a bit of latency for sure.
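
    A rough sketch of how Rhubarb-style mouth cues could drive a mouth-open value. It assumes the JSON export shape with a `mouthCues` list of `start`/`end`/`value` entries; the sample data and the per-shape openness numbers are made up for illustration. As far as I know Rhubarb reads WAV or Ogg Vorbis input, so an mp3 would likely need converting first.

```python
import json
from bisect import bisect_right

# Illustrative sample in the shape of a Rhubarb JSON export; not real output.
SAMPLE = """{"mouthCues": [
  {"start": 0.00, "end": 0.10, "value": "X"},
  {"start": 0.10, "end": 0.25, "value": "D"},
  {"start": 0.25, "end": 0.40, "value": "B"}
]}"""

# Hypothetical openness per mouth shape (0 = closed, 1 = wide open).
OPENNESS = {"X": 0.0, "A": 0.0, "B": 0.3, "C": 0.6, "D": 1.0,
            "E": 0.6, "F": 0.3, "G": 0.2, "H": 0.5}


def load_cues(text):
    """Parse the JSON text and return the mouth cues sorted by start time."""
    cues = json.loads(text)["mouthCues"]
    return sorted(cues, key=lambda cue: cue["start"])


def openness_at(cues, t):
    """Return the mouth openness for playback time t (seconds); 0.0 outside any cue."""
    starts = [cue["start"] for cue in cues]
    i = bisect_right(starts, t) - 1  # index of the last cue starting at or before t
    if i < 0 or t >= cues[i]["end"]:
        return 0.0
    return OPENNESS.get(cues[i]["value"], 0.0)
```

    Because the cues are computed from a finished audio file, this fits pre-rendered clips better than a live stream, which is where the latency concern comes from.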

    • summerlee9611 3 days ago

      For real-time: we use WebRTC for streaming. The pipeline is streaming STT, then a low-latency LLM, then TTS, and then we drive Live2D parameters on the client. Lip sync: we currently use a simple phoneme / amplitude-based approach and are testing viseme extraction. Rhubarb is on our list, but we’re cautious about added latency.
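
      A minimal sketch of the amplitude-based side of that lip sync, assuming mono float samples in [-1.0, 1.0] at 48 kHz. The function names and the `gain` and `smoothing` values are illustrative, not the product's actual code; the resulting value would be fed to a mouth-open parameter on the Live2D model (commonly `ParamMouthOpenY`, though the model's own parameter ID may differ).

```python
import math


def rms(frame):
    """Root-mean-square amplitude of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def mouth_open_values(samples, frame_size=480, gain=4.0, smoothing=0.5):
    """Map audio samples to one mouth-open value in [0, 1] per frame.

    frame_size=480 is 10 ms at 48 kHz (an assumed sample rate).
    gain and smoothing are illustrative tuning knobs, not measured values.
    """
    values = []
    previous = 0.0
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        # Louder frames open the mouth more; clamp so the value stays in [0, 1].
        target = min(1.0, rms(frame) * gain)
        # Exponential smoothing keeps the mouth from flickering between frames.
        previous = smoothing * previous + (1.0 - smoothing) * target
        values.append(previous)
    return values
```

      This only needs the current audio frame, so it adds almost no latency, but it cannot tell an "oo" from an "ah". That is the gap viseme extraction is meant to close.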

  • october8140 4 days ago

    This is disturbing.

    • xattt 4 days ago

      It will quickly distill down to clients using the service just for sex and sex-adjacent activities.

      No kink-shaming, but this sort of thing enables self-destructive hard-to-return-from anti-social behaviour.

    • summerlee9611 3 days ago

      Totally fair reaction. We’re building this with clear boundaries: we don’t position it as a therapy replacement, we add safety rails, and we give users a choice of mode, with guardrails that differ by mode. There is also an age restriction as a safety boundary.

  • dummydummy1234 4 days ago

    What are you using for TTS/STT/models?

    • summerlee9611 3 days ago

      Realtime API + ElevenLabs, but the LLMs will be diversified by persona going forward. With ChatGPT/Gemini as the baseline model, we feel prompting alone has limitations.

  • sghimire2022 4 days ago

    This is cool.

    • summerlee9611 3 days ago

      Appreciate it. If you try it and anything feels off (latency, turn-taking, uncanny moments), I’d love concrete feedback. That’s what we’re grinding on right now.

  • singular_atomic 4 days ago

    Where’s the asteroid at

    • summerlee9611 3 days ago

      Same place as my latency budget: disappearing fast.

  • ryannampham 3 days ago

    wow we got personal vtubers now!

  • dfajgljsldkjag 4 days ago

    It creates a conflict to build a system that is both a private friend and a public performer. You cannot maximize intimacy and fame at the same time.

    • zemo 4 days ago

      You're describing parasocial interaction: https://en.wikipedia.org/wiki/Parasocial_interaction

      Far from being impossible, it's the entire influencer economy. This form of social media has been extremely widespread for a decade or so; it's probably the dominant form of social media.

    • summerlee9611 3 days ago

      100% agree. Maximizing intimacy and scaling distribution pull in opposite directions. We’re experimenting with keeping the “character” consistent while letting personalization live in private memory and user-controlled settings. Still early, and this tension is real.
