• phutatorius@lemmy.zip
    5 days ago

    Only if the training set consists entirely of factual information that is internally consistent.

    • [deleted]@piefed.world
      5 days ago

      They could use reliable sources to approach 100% instead of jamming literally everything in. For example, limiting the training data to peer-reviewed papers would not get to exactly 100%, but it would be a lot closer than including all of Reddit.