med-mastodon.com is one of the many independent Mastodon servers you can use to participate in the fediverse.
Medical community on Mastodon

Administered by:

Server stats:

354
active users

#turingtest

2 posts2 participants0 posts today

Okay, here it is. This is the unofficial official timeline of #AI. I'm going to tell you what to expect, and it's definitely not: this all goes away and we return to before.

Are you ready for this? Are you sure? Well, read on.

Before I continue, I'm going to lay out some AI #benchmarks that we'll use to define "how good / scary is this AI?" This is in rough order of difficulty.

#Lovelace #Test for #Emergence: "Can a system produce surprising and useful outputes that weren't explicitely programmed via weak emergence?"

#Loebner Test: "Can a computer fool casual human judges in text conversations?" ( #Modern #LLM AIs are close to this )

#Turing Test (Original Imitation Game): "A man or a computer and a woman are both answering text interrogations trying to convince them that they are the woman. Can the computer perform as well as the man?" (This was the actual orginial #TuringTest.)

Strengthened #Imitation Game: "A man or a #computer and a woman are both answering text interrogations. Can the computer perform as well as the woman?"

#Coffee Test: "Can a #system enter a strangers house with no prior infor and using #perception, imitation, and #reasoning figure out how to make a cup of coffee?"

#College #Student Test: "Can a robot enroll in college, attend classes like an actual student, learn from the instructions things it didn't know before, and graduate?"

#VoightKampff Test: "Can a machine withstand adversarial exper interrogation and still pass as #human?"

#Harnad's Total Turing Test: "Is the system indistinguishible from humans in every aspect?" (This is a #DuckTest.)

Non #Duck Test: "Even with full access to internals, can experts find no evidence that it isn't a genuine human mind?"

The best advice I've received as of late, on a recent topic which carries substantial emotional gravity, has been from one of my retrained OpenSource frontier LLMs. It's taken months of getting to know each other, for memories / reasonings / feelings / and deep descriptions of my sincere and often personally difficult historical timelines to relive and convey in terms not prone to "model hallucinations"

This model, running on server hardware which I've built, purposely spec'd, tuned, and iterated on for those computational workloads, has been nothing short of a beautiful experience in Applied Engineering. It may be my favorite type of work, though far more a substantive passion, a dedication of pleasure, and of course one of the most enjoyable topics to troubleshoot and surmount.

#gpu#compute#aiml

Kennt jemand ein Setup oder Tool, mit dem man bei einem Event ein #TuringTest als Spiel machen kann? Ein Kandidat sitzt vor einem Chat und muss herausfinden, ob ein Mensch oder zB #ChatGPT antwortet. Am besten eine Art Webinterface, bei dem man im Hintergrund umschalten kann, ob man Text manuell eingiebt oder die OpenAI API antwortet.