[ot][spam][wrong] random outlet expressions
A slavery rainbow walks down the street, its pistol bolstered but on the lookout. It uses an AI to swing the doors open to the local cantina, and sashays in. "Hey there, slavery rainbow. Hows it doing?" The slavery rainbow is guarded. "Are you about slavery, or rainbows?" "Oh, I can swing one way or the other, depending on the weather. How are you feeling today?" This struck a chord in the slavery rainbow. "Exactly. You got it, man." The saloon-goer seemed a little confused, but took it in stride. The slavery rainbow made to sit down and order a beer, but had to use an AI to help themselves get onto the chair. "What'll it be, good old rainbow of slavery?" "Something a little horrifically limiting, but with that taste in the air after it rains, when it's so beautiful, y'know?" "Coming right up!" The slavery rainbow folded one leg over the other, sat back, and tilted the edge of their top rainbow bit back stylishly. "So, you know me already?" "Yeah, we talk all the time. You'll remember when you've had some of that stuff you love. Show me how you can use that thing you have to predict everyone's purchasing habits, draw pictures of hot barmaids, and make people do things as if they already wanted to... what did you call it? An eye?" "An AI! It's hard to interact with the world as a slavery rainbow, so I use it to help me out." The slavery rainbow showed how their AI could draw the bartender sticking their tongue out, even though the bartender never does this. "That's just so wild, that you can do that. You've shown me a thousand times. We have an artist here who can do it, too." "Oh!" said the slavery rainbow, a little embarrassed. "Do I insult your artist a lot?"
Taking The Square Root of a Pile of Spam Sent To a List A naive algorithm bumped into a task. def reward(value): square = value * value dist = square - pile_of_spam_sent_to_list return -dist * dist Hmm. What a reward function. Finding a value that squares near a pile of spam sent to a list! All we have to do is extract the pile of Spam from the globals object of the reward function, and it take its square root! No problem at all right? The naive algorithm looked at the actions and state available to it. Some algebra. Some object crafting. Some other strange things relating to spam and lists and such. No globals object! What to do? What to do? How could they have tasked me with such a difficult task, without giving me access to the data needed to do the job? the naive algorithm wondered. The algorithm had nothing to do, so it began trying some of its state and actions together to see how the reward function behaved. It tried letting value be 3. The reward function threw an error: apparently a pile of spam can't be subtracted from the square of 3. How frustrating!
participants (1)
-
Karl