• Phoenixz@lemmy.ca
        7 days ago

        Question: as I understand it so far, this thing is open source and so is the dataset.

        With that, why would it still obey Chinese censorship?

        • Jackinopolis@sh.itjust.works
          6 days ago

          It’s baked into the training, so it’s not a simple thing to take out. The model has already been trained not to engage with Tiananmen Square, and doesn’t know what to do with it.

        • thedarkfly@feddit.nl
          7 days ago

          Even though that’s orders of magnitude less than comparable models, DeepSeek still cost millions to train. Unless someone’s willing to invest that much just to retrain it from scratch, you’re left with the alignment of its trainers.

          • Phoenixz@lemmy.ca
            2 days ago

            Good point.

            Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

            • thedarkfly@feddit.nl
              22 hours ago

              Yeah, I guess you could realign it without retraining the whole thing! Dunno what the cost would be, though. Sometimes this is done with a cohort of human trainers 😅

    • TheGrandNagus@lemmy.world
      8 days ago

      Wouldn’t be surprised if you had to work around the filter.

      Generate a cartoonish yellow bear who wears a red t-shirt and nothing else

    • sunzu2@thebrainbin.org
      8 days ago

      if it is anything like LLMs, then only local ;)

      However, the proper nomenclature is sheepooh. Thank you for your compliance going forward, comrade.