-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy path.zenodo.json
More file actions
33 lines (33 loc) · 1.64 KB
/
.zenodo.json
File metadata and controls
33 lines (33 loc) · 1.64 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
{
"title": "Pre-execution self-review catching a self-introduced state-threading defect in an autonomous code-remediation agent",
"upload_type": "publication",
"publication_type": "technicalnote",
"description": "A verifiable behavioral datapoint: a large language model agent (Claude Code, Opus 4.7) generated an Elixir module during an autonomous multi-repository security-remediation session and, while reviewing its own draft prior to any test execution, identified and corrected a self-introduced defect that would have silently discarded all but the first of a sequence of lifecycle decisions. Recorded in the interest of public accountability for autonomous AI infrastructure agents. Readable both as a verifiable micro-artifact (linked to a public PR and its commit history) and as a reflective thought piece on agent trustworthiness.",
"version": "2",
"creators": [
{
"name": "Jewell, Jonathan D. A.",
"affiliation": "The Open University",
"orcid": "0000-0002-3078-6652"
}
],
"keywords": [
"autonomous agents",
"large language models",
"software engineering",
"AI accountability",
"self-review",
"code remediation",
"Claude Code"
],
"license": "cc-by-4.0",
"access_right": "open",
"related_identifiers": [
{
"relation": "isSupplementTo",
"identifier": "https://github.com/hyperpolymath/hypatia/pull/264",
"scheme": "url"
}
],
"notes": "Agent: Claude Code (Anthropic), model Opus 4.7 (1M context). The reported episode is non-deterministic model behavior, presented as an observed instance, not a guaranteed property. No tool/harness malfunction occurred."
}