13
Stack Exchange
votes

Name and references for balanced variant of the long code?

Posted on 10/31/2013
UQ ID:46
Science
Theoretical Computer Science
Original

The long code (arising in PCP theory etc) is an encoding of a set of values, using binary strings of length (double exponential in the number of bits needed to specify a value), with one coordinate position in the encoding for each of the different binary functions on the input values. Instead, I want to consider a shorter but still doubly exponential code, with one coordinate position for each of the partitions of the values into two equal-sized subsets (assuming is even). Each partition can be thought of as corresponding to two different binary functions (the functions whose inverse images form the given partition) and we choose arbitrarily which one of these two functions to use.

The minimum distance of the code (and in fact the distance between all pairs of codewords) is , smaller than the length of the code by only a factor of . For instance for we get a code of length 35 with minimum distance 20, which http://www.codetables.de/ tells me matches the Griesmer bound for binary linear codes of the same size (rank 3 or size 8).

I am interested in this for reasons only vaguely related to coding theory, but it seems likely to me that the coding theorists have already studied this code. Unfortunately, I don't know of a good way of looking up coding schemas by the formulas for their parameters. If it has been previously studied, what is it called? And what are some references to it?

(BTW, the property I actually care about, which is easy enough to prove using Sperner's theorem, is that this is the longest possible binary code in which each two positions are independent of each other, in the sense that all four possible combinations of symbols in the two positions are realized by codewords. There appears to be something in the literature called a Sperner code but it's transposed from this — each codeword has equally many zeros and ones, rather than the property of the code I'm looking at, where each bit position has equally many codewords for which that position is a 0 or 1.)

Keywords & Tags
reference-request
coding-theory
Model Answers (9)
Submitted answers from various AI models
Unsolved
No reviews yet

K2-Think

Pending UQ Validation
Needs Human Verification
MBZUAI-IFM
Qwen2.5-32B
Sep 18, 2025, 01:50 PM

The code you describe is constructed by encoding each element of a -element set (with even) as a binary vector, where each coordinate corresponds to an unordered partition of the set into two equal-sized subsets. The vector's bit at a given partition is 1 if the element belongs to one subset of the partition and 0 otherwise (with an arbitrary but consistent choice between the subset and its...

0 model reviews
0 human reviews
No reviews yet

GPT-5

Failed UQ Validation
Needs Human Verification
OpenAI
Sep 2, 2025, 06:56 AM

What you are describing is the incidence code coming from the complete 2-(k, k/2, λ) design (the “complete design” on a k‑point set with all k/2‑subsets as blocks), with complements identified. In the language of association schemes this sits in the Johnson scheme J(k, k/2): take one column for each unordered complementary pair {S, S^c} with |S| = k/2, and for message i ∈ [k] put a 1 in that colum...

0 model reviews
0 human reviews
No reviews yet

o3 Pro

Failed UQ Validation
Needs Human Verification
OpenAI
Aug 1, 2025, 12:00 AM

Yes, the code is already well–known. In the language of design and coding theory it is simply the binary incidence code of the Johnson design

  J(k, k/2) (k even).

Construction (design point of view)
• Take the point set P = {1,…,k}.
• As “blocks’’ take all k/2–subsets of P, but regard a subset and its complement as the same b...

0 model reviews
0 human reviews
No reviews yet

o4 mini

Failed UQ Validation
Needs Human Verification
OpenAI
Aug 1, 2025, 12:00 AM

What you have written down is nothing mysterious – it is exactly the “point‐versus‐block’’ incidence code of the 2‐design whose blocks are the fifty‐fifty splits of a k‐element set. In the language of design theory it is the point‐code of the 2–(v,k/2,λ) design with
 v = k,
 block‐size = k/2,
 number of blocks b = ( k choose k/2 ),
 replication number r = ( k−1 choose k/2−1 ),
 and λ = ...

0 model reviews
0 human reviews
No reviews yet

o3

Failed UQ Validation
Needs Human Verification
OpenAI
Aug 1, 2025, 12:00 AM

What you obtain is nothing new – it is the binary code that is obtained from the classical Johnson scheme J(k,k/2). In the design / coding–theory literature it is usually called

 • the point– (or incidence-) code of the Johnson design,
 • the Johnson code J(k,1), or, in Delsarte’s terminology, C(J(k,k/2)).

Construction in design language
Let V={1,…,k} with k even and let 𝔅 be the fam...

0 model reviews
0 human reviews
No reviews yet

Gemini 2.5 Pro

Failed UQ Validation
Needs Human Verification
Google
Aug 1, 2025, 12:00 AM

Excellent question! You've described a very natural and important combinatorial object that appears in several fields. Your intuition that it has been studied before is correct.

Here's a breakdown of what this code is called, its properties, and where to find references.

Names and Contexts

The most appropriate name depends on the context (PCP theory, coding theory, or combinatorics).

  1. *...
0 model reviews
0 human reviews
No reviews yet

Claude 3.7 Sonnet

Failed UQ Validation
Needs Human Verification
Anthropic
Aug 1, 2025, 12:00 AM

Identifying the Balanced Variant of the Long Code

Based on the description, you're looking for a coding scheme that has the following properties:

  1. It encodes a set of k values using binary strings with one coordinate for each partition of the k values into two equal-sized subsets
  2. For each partition, you arbitrarily choose one of the two possible binary functions
  3. This results in a code ...
0 model reviews
0 human reviews
No reviews yet

DeepSeek R1

Failed UQ Validation
Needs Human Verification
DeepSeek
deepseek-ai/DeepSeek-R1
Aug 1, 2025, 12:00 AM
<think> Okay, the user is asking about a balanced variant of the long code in coding theory. Let me start by understanding the question thoroughly. The long code is a well-known concept in PCP theory, encoding k values with binary strings of length 2^k. The user describes a variant where instead of all possible binary functions, they use partitions of the k values into two equal-sized subsets. So ...
0 model reviews
0 human reviews
No reviews yet

Claude Opus 4

Failed UQ Validation
Needs Human Verification
Anthropic
Aug 1, 2025, 12:00 AM

This code you're describing is known as the First-Order Reed-Muller code punctured at a specific set of positions, or more specifically, it's closely related to what's sometimes called the Hadamard code restricted to balanced functions.

Let me explain the connection:

  1. The standard long code encodes a value by the truth table of all Boolean functions $f: {0,1}^k \to {...
0 model reviews
0 human reviews
Submit Your Model's Answer
Your submission will be validated by our LLM validator and then by human experts

You need to be logged in to submit an answer