General Feedback Kernel (Proposal)

Status

Proposal. This page describes a proposed next evolved-cooperation module. It is not yet an implemented canonical case study in the EvolvedCooperation repository.

For a broader follow-on proposal that also routes negative fitness-relevant effects and competition, see Cooperation and Competition Kernel Model.

Why A New Module

Retained Benefit is already the site's most abstract current evolved-cooperation model. It isolates one clean question: when does cooperation spread if the decisive issue is how much of the value created by cooperation is routed back toward cooperators or copies of the cooperative rule rather than being lost through evolutionary leakage?

That abstraction is useful precisely because it is narrow:

one continuous cooperation trait evolves
one lineage label acts as a protected return channel
one scalar retained-benefit fraction determines how much cooperative value avoids leakage

The next step up should not be a larger Retained Benefit model with many ad hoc extensions. It should be a new module that treats Retained Benefit as one special case of a more general return structure.

So the proposed question becomes:

what general forms of feedback are sufficient to let cooperation emerge, persist, or break down?

Core Idea

The proposed module replaces the single retained fraction with a general feedback kernel.

Instead of assuming that cooperative value is split only into open versus same-lineage retained components, the model asks how value created by one site eventually returns to another through a more flexible set of channels.

Those channels may depend on:

relatedness or lineage similarity
spatial proximity
delay through time
repeated interaction history
reputation or social memory
partner choice
institutional enforcement
ecological overlap or shared fate

Under that framing, many classical cooperation theories become special parameterizations of one general return operator.

What "Kernel" Means Here

In this context, a kernel is not an operating-system kernel or a GPU compute kernel. It is the rule that maps the benefit produced by one cooperative action to the agents that receive that benefit.

In RL-style language, the full causal chain is:

\text{action} \rightarrow \text{produced benefit} \rightarrow \text{returned benefit} \rightarrow \text{fitness}

The kernel is the middle map: it turns produced benefit into returned benefit.

Put more simply, if agent or site $j$ creates cooperative benefit, the kernel answers:

who receives that benefit, when do they receive it, and with what weight?

A compact way to write this is:

R_i(t) = \sum_j \sum_{\tau \ge 0} K_{j \to i}(\tau, X_t) \, B_j(t-\tau)

where:

$R_i(t)$ is the total returned benefit received by site $i$ at time $t$
$B_j(t-\tau)$ is benefit produced earlier by site $j$
$K_{j \to i}(\tau, X_t)$ is the kernel weight from producer $j$ to recipient $i$
$\tau$ is the time delay between production and return
$X_t$ is the current world state, including spatial structure, lineage labels, memory, reputation, or ecological context
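As a minimal sketch of the double sum above, the kernel can be stored as a nested list indexed by delay, producer, and recipient. The function name `returned_benefit`, the `kernel[tau][j][i]` layout, and the numbers are illustrative assumptions, and the kernel is held fixed so the dependence on $X_t$ is ignored here.

```python
from typing import Sequence


def returned_benefit(
    i: int,
    t: int,
    kernel: Sequence[Sequence[Sequence[float]]],  # kernel[tau][j][i] (assumed layout)
    produced: Sequence[Sequence[float]],          # produced[t][j] = B_j(t)
) -> float:
    """Sum K[tau][j][i] * B_j(t - tau) over producers j and delays tau."""
    total = 0.0
    for tau, weights in enumerate(kernel):
        if t - tau < 0:
            break  # nothing was produced before time 0
        for j, row in enumerate(weights):
            total += row[i] * produced[t - tau][j]
    return total


# Hypothetical two-site example with delays 0 and 1.
kernel = [
    [[0.3, 0.2], [0.1, 0.4]],  # tau = 0
    [[0.1, 0.0], [0.0, 0.1]],  # tau = 1
]
produced = [
    [10.0, 4.0],  # B_j(0)
    [6.0, 8.0],   # B_j(1)
]
r0_at_t1 = returned_benefit(0, 1, kernel, produced)  # 1.8 + 0.8 + 1.0 + 0.0
```

Here site 0 at time 1 receives 3.6 in total: 2.6 from benefit produced in the same step and 1.0 from benefit it produced one step earlier.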

What Is "Produced Benefit"?

Produced benefit is the positive, fitness-relevant output generated by an action before that output is allocated to recipients.

Formally, for agent $j$:

B_j(t)

means:

the amount of positive, fitness-relevant output agent $j$'s action creates at time $t$
before subtracting private cost
before deciding who receives that output
before turning it into anyone's final fitness or selection score

So:

\text{action by } j \;\Rightarrow\; B_j(t)

and then:

R_i(t) = \sum_j \sum_{\tau \ge 0} K_{j \to i}(\tau, X_t)\, B_j(t-\tau)

where:

$B_j(t)$ is the positive output created by agent $j$'s action
$K_{j \to i}$ is the rule that decides how much of that output ends up benefiting $i$
$R_i(t)$ is the returned benefit actually received by $i$

In Retained Benefit

In Retained Benefit, produced benefit takes a very simple form:

B_i = b h_i

where:

$h_i$ is the cooperation level of site $i$
$b$ is the benefit produced per unit cooperation
$B_i$ is the total cooperative value produced by site $i$

Then that $B_i$ is split into:

B_i^{open} = (1-r)B_i

B_i^{retained} = rB_i

where:

$r$ is the retained-benefit fraction
$B_i^{open}$ is the open share available to the whole local neighborhood
$B_i^{retained}$ is the protected share available only to same-lineage neighbors
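The split can be written out in a few lines. This is only a sketch of the arithmetic above; the function name and the example values of $h_i$, $b$, and $r$ are assumptions, not values from the Retained Benefit module.

```python
def split_produced_benefit(h: float, b: float, r: float) -> tuple[float, float]:
    """Return (open_share, retained_share) for cooperation level h."""
    if not 0.0 <= r <= 1.0:
        raise ValueError("retained fraction r must lie in [0, 1]")
    produced = b * h                  # B_i = b * h_i
    retained = r * produced           # B_i^retained = r * B_i
    open_share = produced - retained  # B_i^open = (1 - r) * B_i
    return open_share, retained


# Hypothetical values: h = 0.5, b = 4, r = 0.25 gives B_i = 2.0.
open_share, retained = split_produced_benefit(h=0.5, b=4.0, r=0.25)
```

With these values the open share is 1.5 and the retained share is 0.5, so the two shares always add back up to $B_i$.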

So "produced benefit" here means:

how much fitness-relevant cooperative output the agent creates before the model decides who gets it

RL Translation

In RL-style language, you could write:

B_j(t) = g(a_j(t), X_t)

where:

$a_j(t)$ is the action of agent $j$ at time $t$
$X_t$ is the current world state
$g$ is the function mapping action and state to produced value

Then the kernel routes that value. In the no-delay case, where only $\tau = 0$ contributes, this is:

R_i(t) = \sum_j K_{j \to i}(X_t)\, B_j(t)

So:

action = what the agent does
produced benefit = the positive, fitness-relevant output that action creates
returned benefit = the part of that output that actually reaches a recipient
kernel = the rule mapping produced benefit into returned benefit

Important Distinction

Produced benefit is not the same as fitness.

Produced benefit: positive, fitness-relevant output created by an action
Returned benefit: the part of that output that comes back to a particular agent after routing
Fitness: returned benefit minus costs, possibly plus baseline terms

So:

\text{produced benefit} \neq \text{fitness}

A compact decomposition is:

\text{action} \rightarrow B_j \rightarrow R_i \rightarrow W_i

where:

$B_j$ is produced benefit
$R_i$ is returned benefit
$W_i$ is the final fitness or selection score

Intuition

Examples of produced benefit include:

food obtained by a cooperative hunt
safety created by an alarm call
offspring survival value created by parental care
shared information created by helping
abstract cooperative value in retained_benefit

So the clean definition is:

\boxed{ \text{Produced benefit is the positive, fitness-relevant output generated by an action before that output is routed to recipients.} }

For Retained Benefit, the kernel has one simple shape: nearby sites receive an open share, and same-lineage nearby sites receive an additional protected retained share. A more general Feedback Kernel Model would let that return operator take other forms as well, such as relatedness weighting, spatial distance weighting, delayed reciprocity, reputation-gated return, or institutional exclusion of free-riders.

Example: Reading A Kernel Matrix

For a simple no-delay example, suppose there are three agents:

A,\quad B,\quad C

Suppose agent $A$ takes an action that produces benefit:

B_A = 10

One row of the kernel might say:

K_{A \to A} = 0.30,\quad K_{A \to B} = 0.20,\quad K_{A \to C} = 0.05

This means:

$30\%$ of the benefit produced by $A$ returns to $A$
$20\%$ goes to $B$
$5\%$ goes to $C$
the remaining $45\%$ leaks away or is not fitness-relevant

The returned benefits from $A$'s action are therefore:

R_A = K_{A \to A}B_A = 0.30 \times 10 = 3

R_B = K_{A \to B}B_A = 0.20 \times 10 = 2

R_C = K_{A \to C}B_A = 0.05 \times 10 = 0.5

If $A$ pays private cooperation cost:

C_A = 2

then the direct actor-return comparison is:

R_A - C_A = 3 - 2 = 1

Because the direct return exceeds the private cost, $A$'s cooperation is favored from the direct-return perspective alone.

The full no-delay kernel can be written as a matrix:

K = \begin{bmatrix} 0.30 & 0.20 & 0.05 \\ 0.10 & 0.25 & 0.15 \\ 0.05 & 0.10 & 0.35 \end{bmatrix}

Rows are producers and columns are recipients:

K = \begin{bmatrix} K_{A \to A} & K_{A \to B} & K_{A \to C} \\ K_{B \to A} & K_{B \to B} & K_{B \to C} \\ K_{C \to A} & K_{C \to B} & K_{C \to C} \end{bmatrix}

If the produced-benefit vector is:

B = \begin{bmatrix} 10 \\ 4 \\ 6 \end{bmatrix}

then total returned benefit to each recipient is:

R = K^\top B

which gives:

R_A = 0.30(10) + 0.10(4) + 0.05(6) = 3.7

R_B = 0.20(10) + 0.25(4) + 0.10(6) = 3.6

R_C = 0.05(10) + 0.15(4) + 0.35(6) = 3.2

So the returned-benefit vector is:

R = \begin{bmatrix} 3.7 \\ 3.6 \\ 3.2 \end{bmatrix}
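The same calculation can be checked in plain Python. This sketch uses the matrix and vector given above; the variable names and the extra `leakage` row summary are illustrative additions.

```python
# Rows are producers (A, B, C); columns are recipients (A, B, C).
K = [
    [0.30, 0.20, 0.05],  # producer A
    [0.10, 0.25, 0.15],  # producer B
    [0.05, 0.10, 0.35],  # producer C
]
B = [10.0, 4.0, 6.0]
n = len(B)

# R_i = sum_j K[j][i] * B[j], which is R = K^T B.
R = [sum(K[j][i] * B[j] for j in range(n)) for i in range(n)]

# Share of each producer's benefit that reaches no agent (row leakage).
leakage = [1.0 - sum(row) for row in K]
```

Up to floating-point rounding, `R` is `[3.7, 3.6, 3.2]`, and the first entry of `leakage` is the 45% that leaks away from $A$'s action.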

The matrix $K$ is the detailed map of benefit flow. The scalar $\Phi$ is a compressed summary of the part of that map that matters for a particular cooperation condition. If only direct return to $A$ matters, then:

\Phi_A = K_{A \to A} = 0.30

If $B$ is a close copy or lineage relative of $A$ with relatedness $q_{AB}=0.5$, and $C$ is unrelated with $q_{AC}=0$, then:

\Phi_A = K_{A \to A} + q_{AB}K_{A \to B} + q_{AC}K_{A \to C}

\Phi_A = 0.30 + 0.5(0.20) + 0(0.05) = 0.40

The compact cooperation condition becomes:

\Phi_A B_A > C_A

0.40 \times 10 > 2

4 > 2

In that example, cooperation by $A$ is favored because the feedback-weighted return exceeds the private cost.
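A short sketch of that compression step, assuming the actor counts its own return fully (an implicit $q_{AA} = 1$, which matches the unweighted $K_{A \to A}$ term above). The function name `effective_feedback` is hypothetical.

```python
def effective_feedback(kernel_row, relatedness):
    """Phi for one producer: relatedness-weighted sum over its kernel row."""
    return sum(q * k for q, k in zip(relatedness, kernel_row))


kernel_row_A = [0.30, 0.20, 0.05]  # K_{A->A}, K_{A->B}, K_{A->C}
q_A = [1.0, 0.5, 0.0]              # assumed q_AA = 1, then q_AB, q_AC

phi_A = effective_feedback(kernel_row_A, q_A)
favored = phi_A * 10.0 > 2.0       # Phi_A * B_A > C_A
```

Setting `q_A = [1.0, 0.0, 0.0]` recovers the direct-return case $\Phi_A = 0.30$.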

Proposed General Equation

At time $t$, let site or agent $j$ take action $a_j(t)$, let that action produce benefit $B_j(t)$, and let site or agent $i$ pay cost $C_i(t)$ for its current cooperation level.

The proposed fitness or reproduction score is:

W_i(t) = w_0 - C_i(t) + \sum_j \sum_{\tau \ge 0} K_{j \to i}(\tau, X_t) \, B_j(t-\tau)

where:

$W_i(t)$ is the selection-relevant score for site $i$
$w_0$ is baseline fitness or baseline replacement weight
$C_i(t)$ is the private cost of cooperation paid by $i$
$a_j(t)$ is the action taken by agent $j$ at time $t$
$B_j(t-\tau)$ is benefit previously produced by the action of site or agent $j$
$K_{j \to i}(\tau, X_t)$ is the feedback kernel specifying how much of that value returns to $i$ after delay $\tau$ in world state $X_t$

The cooperation condition then becomes more general:

\text{cooperation is favored when feedback-weighted marginal return exceeds marginal cost}

or, locally,

\frac{\partial \mathbb{E}[\text{return to actor or copies}]}{\partial h_i} > \frac{\partial C_i}{\partial h_i}

This does not commit the model to one particular mechanism. It asks only whether the return structure is strong enough to overcome private cost.
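One way to probe the local condition numerically is a finite-difference check. The sketch below assumes the expected return can be summarized as a constant effective feedback coefficient times produced benefit; the benefit and cost functions and all numbers are hypothetical.

```python
def marginal(f, h, eps=1e-6):
    """Central finite-difference estimate of df/dh at h."""
    return (f(h + eps) - f(h - eps)) / (2.0 * eps)


def cooperation_favored(h, benefit, cost, phi):
    """True when feedback-weighted marginal return exceeds marginal cost."""
    return phi * marginal(benefit, h) > marginal(cost, h)


def benefit(h):
    return 10.0 * h      # hypothetical linear benefit


def cost(h):
    return 3.0 * h ** 2  # hypothetical convex cost


phi = 0.4  # assumed constant effective feedback coefficient

low = cooperation_favored(0.5, benefit, cost, phi)   # 4.0 vs 3.0
high = cooperation_favored(0.9, benefit, cost, phi)  # 4.0 vs 5.4
```

With a convex cost, increasing cooperation is favored at low $h_i$ and disfavored at high $h_i$, so the condition picks out an interior level rather than all-or-nothing cooperation.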

Minimal State Variables

The module should stay abstract enough to compare many mechanisms without becoming a kitchen-sink ecology.

Minimal state:

site index or agent index $i$
cooperation trait $h_i \in [0, 1]$
lineage or rule label $\ell_i$
optional internal social state $s_i$ for memory, reputation, or learned partner choice
world state $X_t$ capturing spatial structure, current population distribution, and any institutional or ecological context

Derived quantities:

produced benefit $B_i(t)$
private cost $C_i(t)$
returned value $R_i(t)$
selection score or fitness $W_i(t)$

Parameter Families

The aim is to vary a few interpretable parameter families rather than add one-off switches for every theory.

1. Contribution Rule

How much value is created as cooperation rises?

linear benefit
saturating benefit
threshold synergy
nonlinear group amplification
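The four shapes could be sketched as simple functions. The functional forms, parameter names, and defaults below are assumptions chosen for illustration, not the module's contribution rules.

```python
import math


def linear_benefit(h, b=1.0):
    return b * h


def saturating_benefit(h, b=1.0, k=3.0):
    """Rises quickly, then flattens toward b."""
    return b * (1.0 - math.exp(-k * h))


def threshold_benefit(h, b=1.0, threshold=0.5):
    """No value until cooperation reaches the threshold."""
    return b if h >= threshold else 0.0


def group_amplified_benefit(levels, b=1.0, alpha=2.0):
    """Group output grows faster than linearly in total cooperation when alpha > 1."""
    return b * sum(levels) ** alpha
```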

2. Cost Rule

How costly is cooperation to the producer?

linear cost
convex cost
context-dependent cost
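Cost rules admit the same kind of sketch. The forms below, including the hypothetical `scarcity` multiplier for the context-dependent case, are assumptions.

```python
def linear_cost(h, c=1.0):
    return c * h


def convex_cost(h, c=1.0, power=2.0):
    """Each extra unit of cooperation costs more than the last when power > 1."""
    return c * h ** power


def context_cost(h, c=1.0, scarcity=0.0):
    """Hypothetical context term: cost scales up when resources are scarce."""
    return c * h * (1.0 + scarcity)
```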

3. Kernel Structure

What determines how value returns?

local distance weighting
lineage similarity weighting
temporal delay weighting
reputation weighting
reciprocity weighting from past interactions
partner-choice gating
institutional transfer or punishment terms
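One possible way to keep these terms composable is to write each as a small function and multiply them. Multiplicative composition, the exponential forms, and every name below are assumptions, and the resulting weights are unnormalized.

```python
import math


def distance_term(positions, length_scale):
    """Weight decays exponentially with distance between producer and recipient."""
    def term(j, i, tau):
        return math.exp(-abs(positions[j] - positions[i]) / length_scale)
    return term


def lineage_term(lineages, bonus):
    """Same-lineage recipients get an extra multiplicative bonus."""
    def term(j, i, tau):
        return (1.0 + bonus) if lineages[j] == lineages[i] else 1.0
    return term


def delay_term(discount):
    """Return is discounted geometrically with delay tau."""
    def term(j, i, tau):
        return discount ** tau
    return term


def compose(*terms):
    """Multiply the terms into one kernel weight K(j -> i, tau)."""
    def kernel(j, i, tau):
        weight = 1.0
        for term in terms:
            weight *= term(j, i, tau)
        return weight
    return kernel


positions = [0.0, 1.0, 3.0]
lineages = ["x", "x", "y"]
kernel = compose(
    distance_term(positions, 1.0),
    lineage_term(lineages, 0.5),
    delay_term(0.5),
)
```

A real implementation would still need a normalization or boundedness step on top of this, since the raw products can exceed 1.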

4. Selection Rule

How does returned value change persistence?

local replacement lottery
reproductive competition
survival threshold
bounded carrying-capacity competition
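As an illustration of the first option, a local replacement lottery might pick the site whose trait is copied in proportion to score. The function name, the neighbor representation, and the clamping of negative scores to zero are all assumptions.

```python
import random


def replacement_lottery(scores, neighbors, site, rng):
    """Pick which site's trait is copied into `site`, weighted by score."""
    candidates = [site] + list(neighbors[site])
    weights = [max(scores[c], 0.0) for c in candidates]  # clamp negatives
    if sum(weights) == 0.0:
        return site  # no positive score nearby, keep the incumbent
    return rng.choices(candidates, weights=weights, k=1)[0]


rng = random.Random(0)
scores = [0.0, 5.0, 0.0]
neighbors = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
winner = replacement_lottery(scores, neighbors, 0, rng)
```

In this example only site 1 has a positive score, so it wins the lottery for site 0 regardless of the seed.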

5. Variation Rule

How do new traits enter the system?

mutation in cooperation level
mutation in kernel sensitivity parameters
mutation in memory or reciprocity traits
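Mutation in the cooperation level could look like the following sketch, which assumes Gaussian perturbation and clamping to $[0, 1]$; the function name and parameters are hypothetical.

```python
import random


def mutate_cooperation(h, rate, step, rng):
    """With probability `rate`, perturb h by Gaussian noise and clamp to [0, 1]."""
    if rng.random() >= rate:
        return h  # no mutation this time
    return min(1.0, max(0.0, h + rng.gauss(0.0, step)))


rng = random.Random(42)
unchanged = mutate_cooperation(0.3, rate=0.0, step=0.1, rng=rng)
mutated = [mutate_cooperation(0.95, rate=1.0, step=0.2, rng=rng) for _ in range(100)]
```

The same pattern would apply to kernel-sensitivity or memory traits, with bounds suited to each trait.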

Why This Is Distinct From Retained Benefit

Retained Benefit should remain a clean benchmark. The General Feedback Kernel proposal adds generality by making the return operator explicit.

1. Abstraction

Retained Benefit uses one specific return rule:

W_i = w_0 + O_i + R_i - C_i

where:

$W_i$ is the fitness or replacement weight of site $i$
$w_0$ is baseline fitness
$O_i$ is open benefit received from local neighbors
$R_i$ is retained benefit received from same-lineage neighbors
$C_i$ is the private cost paid by site $i$

The General Feedback Kernel model generalizes this to:

W_i(t) = w_0 - C_i(t) + \sum_j \sum_{\tau \ge 0} K_{j \to i}(\tau, X_t) \, B_j(t-\tau)

where:

$W_i(t)$ is the fitness or selection score of agent $i$ at time $t$
$w_0$ is baseline fitness
$C_i(t)$ is the private cost paid by agent $i$
$B_j(t-\tau)$ is benefit produced by agent $j$ at an earlier time
$K_{j \to i}(\tau, X_t)$ is the kernel weight saying how much of $j$'s benefit returns to $i$
$\tau$ is the delay between production and return
$X_t$ is world state, such as space, lineage, memory, reputation, institutions, or ecology

So:

\text{Retained Benefit} = \text{one fixed kernel}

\text{General Feedback Kernel} = \text{a family of possible kernels}

2. Minimal Conditions For Cooperation

In Retained Benefit, cooperation is favored roughly when:

rB > C

where:

$r$ is the retained-benefit fraction
$B$ is cooperative benefit produced
$C$ is private cost of cooperation

That condition is useful but narrow, because $r$ represents only one feedback channel: immediate same-lineage retained benefit.

The more general kernel condition is:

\frac{\partial}{\partial h_i} \mathbb{E} \left[ \sum_j \sum_{\tau \ge 0} K_{j \to i}(\tau, X_t) B_j(t-\tau) \right] > \frac{\partial C_i(t)}{\partial h_i}

where:

$h_i$ is the cooperation level of agent $i$
the left side is the marginal returned benefit caused by increasing cooperation
the right side is the marginal private cost of increasing cooperation

In plain terms:

Cooperation increases when returned benefit from cooperation exceeds private cost.

The minimal conditions are:

There is variation in cooperation, $h_i$.
Cooperation creates benefit, $B_i(h_i)$.
Cooperation has private cost, $C_i(h_i)$.
Some kernel $K$ routes enough benefit back to cooperators, copies, partners, or descendants.
Selection copies or preserves agents or rules with higher $W_i$ .
Leakage to free-riders is not too large.

3. Universal Cooperation Law

The retained-benefit law is:

\text{cooperation favored if retained same-lineage return} > \text{cost}

The proposed universal law is:

\text{cooperation favored if feedback-weighted return} > \text{cost}

A compact form is:

\Phi B > C

where:

$\Phi$ is the effective feedback coefficient
$B$ is the marginal benefit created by cooperation
$C$ is the marginal private cost
$\Phi B$ is the part of the created benefit that effectively returns to the actor, copies, partners, or future descendants

More explicitly:

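\Phi_i = \sum_j \sum_{\tau \ge 0} q_{ij} \, K_{i \to j}(\tau, X_t)

where $q_{ij}$ is the weight actor $i$ places on benefit that reaches recipient $j$ (with $q_{ii} = 1$, and relatedness or copy similarity for other recipients, as in the kernel matrix example). The local cooperation condition becomes: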

\Phi_i B_i > C_i

The important relationship is:

r \subset \Phi

That is, the retained-benefit fraction $r$ is one specific component of the more general effective feedback coefficient $\Phi$. The broader coefficient can also include lineage, spatial assortment, reciprocity, reputation, enforcement, partner choice, delayed return, and ecological synergy.

So the proposed general law is:

\boxed{ \begin{aligned} &\text{Cooperation evolves when the expected feedback-weighted marginal return} \\ &\text{to the cooperative rule exceeds its marginal private cost.} \end{aligned} }

The proposed module is therefore not a replacement for Retained Benefit. It is the more universal parent model within which Retained Benefit becomes one interpretable special case.

Important Special Cases

One reason to build this as a separate module is that many classical theories then become explicit kernel choices.

Retained Benefit

immediate local kernel
same-lineage protection
open leakage outside the protected channel

Kin Selection

kernel weights return by relatedness
cooperation spreads when the inclusive-fitness return is large enough

Direct Reciprocity

kernel weights future return by repeated interaction history
delayed return matters as much as immediate return

Indirect Reciprocity

kernel depends on reputation or social image
return comes from third parties, not only direct recipients

Spatial Assortment

kernel depends mainly on local clustering and repeated local encounters
return is structural rather than explicitly cognitive

Parental Investment

offspring may be immediate non-contributors
but delayed lineage return can still be strong through inclusive fitness

Proposed Python Module Layout

The canonical implementation should live as a new top-level module in the sibling EvolvedCooperation repository.

general_feedback_kernel/
  __init__.py
  README.md
  general_feedback_kernel_model.py
  kernel.py
  state.py
  selection.py
  metrics.py
  tests/
    test_kernel_normalization.py
    test_retained_equivalent_regression.py
    test_delay_buffer.py
  config/
    general_feedback_kernel_default_config.py
    retained_equivalent_config.py
    kin_local_config.py
    delayed_reciprocity_config.py
  utils/
    export_github_pages_demo.py
    plot_phase_diagrams.py
  experiments/
    sweep_kernel_coupling.py
    sweep_delay_vs_cost.py
    compare_special_cases.py

Core responsibilities:

general_feedback_kernel_model.py: simulation loop, state update, mutation, and reproduction logic
kernel.py: return operator definitions and composable kernel terms
state.py: data structures for agent traits, lineage labels, optional social state, and environment state
selection.py: local replacement or survival competition rules
metrics.py: mean cooperation, assortment, kernel-weighted return, lineage persistence, and breakdown thresholds

Canonical Repo Task Breakdown

The breakdown below is the concrete handoff plan for the sibling EvolvedCooperation repository.

Phase 0: Module Skeleton And Retained-Equivalent Baseline

Tasks:

Create the general_feedback_kernel/ module folder and default config layout.
Implement the minimal simulation loop with continuous cooperation trait, lineage label, mutation, and local replacement.
Add retained_equivalent_config.py that reproduces Retained Benefit as a kernel special case.
Add summary metrics for mean cooperation, local assortment, dominant-lineage share, and mean returned value.

Exit criterion:

the retained-equivalent configuration reproduces the expected qualitative Retained Benefit behavior and exports the same headline metrics.

Phase 1: Kernel Abstractions

Tasks:

Implement a composable FeedbackKernel interface.
Support immediate local weighting, lineage weighting, and optional delay weighting.
Add normalization or boundedness checks so kernel mass remains interpretable.
Add tests for non-negativity, normalization, and deterministic behavior under fixed seeds.

Exit criterion:

kernel terms can be combined without breaking score bounds or producing unstable return weights.
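A boundedness check for the no-delay case might read as follows. It assumes one particular reading of "normalization": each producer's outgoing weights are non-negative and sum to at most 1, with any shortfall treated as leakage. The function name is hypothetical.

```python
def check_kernel_rows(K, tol=1e-9):
    """Raise ValueError if any weight is negative or a producer row exceeds mass 1."""
    for j, row in enumerate(K):
        if any(w < 0.0 for w in row):
            raise ValueError(f"producer {j} has a negative kernel weight")
        if sum(row) > 1.0 + tol:
            raise ValueError(f"producer {j} routes more than 100% of its benefit")


# The example matrix from the kernel-matrix section passes the check.
check_kernel_rows([
    [0.30, 0.20, 0.05],
    [0.10, 0.25, 0.15],
    [0.05, 0.10, 0.35],
])
```

Under this convention a row summing to less than 1 is valid and simply means part of the produced benefit reaches no one.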

Phase 2: Delayed-Return Case

Tasks:

Add a time buffer for previously created cooperative value.
Implement at least one delayed-return configuration.
Record metrics for return lag, persistence, and cooperation breakdown threshold under delay.
Add a smoke test confirming the delayed-return config runs end-to-end.

Exit criterion:

one delayed-return case runs stably and produces interpretable history traces.
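The time buffer could be as small as a bounded deque of past produced-benefit vectors. The class name `DelayBuffer` and its methods are assumptions for illustration, not the planned API.

```python
from collections import deque


class DelayBuffer:
    """Keeps the last max_delay + 1 produced-benefit vectors, newest first."""

    def __init__(self, max_delay, n_sites):
        self._history = deque(maxlen=max_delay + 1)
        self._n_sites = n_sites

    def push(self, produced):
        """Record the benefit vector produced in the current step."""
        if len(produced) != self._n_sites:
            raise ValueError("produced vector has the wrong length")
        # appendleft on a full deque drops the oldest entry from the right.
        self._history.appendleft(list(produced))

    def get(self, tau):
        """Benefit vector produced tau steps ago; zeros if not recorded."""
        if 0 <= tau < len(self._history):
            return self._history[tau]
        return [0.0] * self._n_sites


buf = DelayBuffer(max_delay=1, n_sites=2)
buf.push([1.0, 2.0])
buf.push([3.0, 4.0])
buf.push([5.0, 6.0])  # the first vector is now older than max_delay
```

Returning zeros for delays outside the window keeps the kernel sum well defined during the first few steps of a run.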

Phase 3: Non-Lineage Feedback Case

Tasks:

Implement one non-lineage channel such as reciprocity-weighted or reputation-weighted return.
Add the required agent social state for that mechanism.
Compare its emergence boundary with the retained-equivalent baseline under a shared metric set.
Confirm that cooperation can be sustained through that channel even when pure same-lineage routing is absent or weak.

Exit criterion:

at least one non-lineage mechanism sustains cooperation in a way that is clearly distinct from the retained-equivalent case.

Phase 4: Sweep And Comparison Layer

Tasks:

Add parameter sweeps over cost, delay, coupling strength, and locality.
Produce phase diagrams for emergence versus breakdown boundaries.
Add compare_special_cases.py to compare retained-equivalent, kin-local, and delayed-return regimes.
Define one common comparison table used by docs and plots.

Exit criterion:

the module can generate a clean cross-case comparison showing which return structures sustain cooperation under which conditions.

Phase 5: Export And Website Integration

Tasks:

Add a frozen website-demo config.
Export sampled replay data and summary metrics for one chosen canonical case.
Add the figure and replay bundle needed for the website page.
Write a repo README that states clearly which theoretical cases are already implemented versus still planned.

Exit criterion:

the sibling repo has one canonical demo case that can be documented on the site without claiming more implementation coverage than actually exists.

Suggested Issue List

If this work is tracked as issues in the canonical repo, the first issue set should be:

Scaffold general_feedback_kernel/ and retained-equivalent baseline.
Implement composable kernel terms plus normalization tests.
Add delayed-return buffer and delayed feedback config.
Add one non-lineage return mechanism.
Build comparison sweeps and shared metrics.
Export a website demo and write README/docs.

Recommended Test Strategy

The first implementation should be disciplined about regression checks.

unit tests for kernel boundedness and normalization
regression tests for the retained-equivalent baseline under a fixed seed
smoke tests for every config in config/
metric invariants such as valid probability weights, bounded cooperation trait values, and nonnegative returned-value tallies where appropriate

Suggested Internal API

A disciplined first implementation could expose the following abstractions.

ContributionRule: maps cooperation level to produced value
CostRule: maps cooperation level to private cost
FeedbackKernel: maps sender, recipient, delay, and world state to a return weight
SelectionRule: maps scores to persistence or reproductive success
MutationRule: perturbs cooperation and optional kernel-sensitivity traits

That keeps the model extensible without hardcoding one mechanism after another.
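These abstractions could be expressed as structural interfaces with `typing.Protocol`. The class names follow the list above, but every method name and signature here is an assumption about what the final API might look like.

```python
from typing import Any, Protocol, Sequence


class ContributionRule(Protocol):
    def produced(self, h: float) -> float: ...


class CostRule(Protocol):
    def cost(self, h: float) -> float: ...


class FeedbackKernel(Protocol):
    def weight(self, sender: int, recipient: int, delay: int, world: Any) -> float: ...


class SelectionRule(Protocol):
    def select(self, scores: Sequence[float], site: int) -> int: ...


class MutationRule(Protocol):
    def mutate(self, h: float) -> float: ...


class LinearContribution:
    """Hypothetical concrete rule: produced value is b * h."""

    def __init__(self, b: float) -> None:
        self.b = b

    def produced(self, h: float) -> float:
        return self.b * h


def total_produced(rule: ContributionRule, levels: Sequence[float]) -> float:
    """Works with any object that has a matching `produced` method."""
    return sum(rule.produced(h) for h in levels)


total = total_produced(LinearContribution(2.0), [0.5, 1.0])
```

Because protocols are structural, a new mechanism only needs to supply the right method; it does not have to inherit from a shared base class.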

Minimal Implementation Milestones

The first version should stay narrow enough to test the abstraction rather than immediately chase every mechanism.

Reproduce Retained Benefit as a kernel special case.
Add one delayed-return case to test time-lagged feedback.
Add one non-lineage case, such as reputation-weighted or reciprocity-weighted return.
Compare emergence and breakdown boundaries across those cases with shared metrics.

Only after that should the module add richer institutional or ecological channels.

Why This Matters

If this module works, it would let the site express a deeper claim than Retained Benefit alone.

Retained Benefit says:

cooperation rises when enough of its return is protected from leakage

The General Feedback Kernel proposal would say:

cooperation rises when the total feedback-weighted return to cooperators or copies of the cooperative rule exceeds the private cost, regardless of whether that return is carried by kinship, spatial structure, reciprocity, reputation, institutions, or delayed ecological coupling

That would make it a stronger candidate for a genuinely general cooperation law.

