RenoHadreas

zaidlol

2025-02-22 22:41:09

Sam Altman Generates a new human. What prompt was used?

Master_Step_7066

2025-02-22 15:56:17

We might simply get a Sonnet 3.5 with thinking...

RenoHadreas

2025-02-21 17:10:01

When the benchmarks support your expectations vs. when they don’t

RenoHadreas

2025-02-21 03:18:54

Grok 3 summary

Hugedownload

2025-02-20 12:12:02

Topaz Labs Video AI 6 6.1.0 StarLight Update!

RenoHadreas

2025-02-19 04:20:48

Grok 3 is rolling out to everyone, including free X users

RenoHadreas

2025-02-18 02:10:22

The normies have failed us

MichaelFrowning

2025-02-17 17:34:15

o3-mini will often lie about using tools rather than actually using them. (Tool use is a known issue)

assymetry1

2025-02-17 17:06:43

OpenAI could do the funniest thing tonight

RenoHadreas

2025-02-15 20:03:22

Plus users to get "a lot" of o3 pro level intelligence

Condomphobic

2025-02-15 18:10:33

Is this actual beef with Perplexity or friendly banter?

RenoHadreas

2025-02-15 14:58:41

Anthropic is preparing to release its thinking model in webui and API – Codename Paprika

ApprehensiveEye7387

2025-02-14 13:19:16

What is your highest?

shogun2909

2025-02-12 23:56:20

OpenAI o1 and o3-mini now support both file & image uploads in ChatGPT

IlustriousTea

2025-02-12 19:18:40

SAMA GPT 4.5 and 5 UPDATE

Upbeat_Lunch_1599

2025-02-10 02:46:32

Perplexity is now deleting any post from their sub which they find remotely negative

RenoHadreas

2025-02-08 16:47:04

o3-mini's leaked CoT summarizer instructions reveal example raw and processed chains of thought. Here's a side-by-side comparison.

RenoHadreas

2025-02-09 00:12:15

Grok 3 has been spotted in the wild

RenoHadreas

2025-02-08 19:12:12

LLMs' performance on yesterday's AIME questions

RenoHadreas

2025-02-06 21:20:36

o3-mini’s chain of thought has been updated

Hello_moneyyy

2025-02-06 13:45:34

Lemme just clarify: LiveBench's language average is NOT about creative writing.

CJ9103

2025-02-04 19:53:09

What’s your theory on the “one more thing”

RenoHadreas

2025-02-03 17:34:59

Anthropic announces a new safety classifier that eradicates jailbreaks and further increases Claude's over-refusal rate

RenoHadreas

2025-02-03 17:11:23

Next OpenAI event in two days. Most definitely o3-mini related, but what could it be?

troymcclurre

2025-02-03 02:46:31

Will OpenAI’s deep research model be similar to perplexity?

Share Your Mood