hypedar

AI trend radar for developers. Catch emerging papers, repos, and discussions before the hype peaks.

© 2026 Codepawl

Evaluation + Agents + Reasoning

17.0

Develop an evaluation suite of odor-reasoning tasks to test how well LLMs are grounded in sensory experience. Such a suite would make it possible to benchmark models on human sensory experience that is not captured in text.
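
As a rough illustration of what such a suite could look like, here is a minimal sketch of a multiple-choice scoring harness. Everything in it is an assumption made for illustration: the `OdorItem` schema, the `evaluate` function, the two placeholder questions, and the `always_first` baseline are hypothetical and are not taken from any of the signals listed on this page. A real suite would swap the baseline for a call to an actual model.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class OdorItem:
    """One multiple-choice odor-reasoning question (hypothetical schema)."""
    prompt: str
    choices: Sequence[str]
    answer_index: int  # index into `choices` of the labeled answer


def evaluate(model: Callable[[str, Sequence[str]], int],
             items: Sequence[OdorItem]) -> float:
    """Return the fraction of items where the model picks the labeled choice."""
    if not items:
        raise ValueError("items must not be empty")
    correct = sum(
        1 for item in items
        if model(item.prompt, item.choices) == item.answer_index
    )
    return correct / len(items)


# Placeholder items for illustration only; not drawn from any published benchmark.
ITEMS = [
    OdorItem("Which descriptor best matches the smell of freshly cut grass?",
             ["green", "smoky", "fishy"], 0),
    OdorItem("Which descriptor best matches the smell of a campfire?",
             ["floral", "smoky", "minty"], 1),
]


def always_first(prompt: str, choices: Sequence[str]) -> int:
    """Trivial baseline standing in for an LLM call: always picks choice 0."""
    return 0


accuracy = evaluate(always_first, ITEMS)
print(f"baseline accuracy: {accuracy:.2f}")
```

Keeping the model behind a plain callable means the same items can be scored against any backend, and a trivial baseline like this one gives a floor to compare real models against.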

+0

emerging · implementation gap

agents · evaluation · reasoning · multimodal · security

Signals (4)

Can LLMs Perceive Time? An Empirical Investigation

Benchmark for Assessing Olfactory Perception of Large Language Models

Are they human? Detecting large language models by probing human memory constraints

Logarithmic Scores, Power-Law Discoveries: Disentangling Measurement from Coverage in Agent-Based Evaluation