Protocol rules: Difference between revisions
More on blocks |
Call the main chain the main branch |
||
Line 9: | Line 9: | ||
=== Data structures === | === Data structures === | ||
The main data structures are ''transactions'' and ''blocks''. Blocks are composed of the ''block header'' followed by transactions in the block. Transactions are identified by their hash; blocks by the hash of their header. Blocks have prev | The main data structures are ''transactions'' and ''blocks''. Blocks are composed of the ''block header'' followed by transactions in the block. Transactions are identified by their hash; blocks by the hash of their header. Blocks have prev pointers that link them into a graph. | ||
Conceptually, the client has the following data structures: | Conceptually, the client has the following data structures: | ||
Line 27: | Line 27: | ||
There are 3 categories of blocks: | There are 3 categories of blocks: | ||
;blocks in the main | ;blocks in the main branch | ||
: the transactions in these blocks are considered at least tentatively confirmed | : the transactions in these blocks are considered at least tentatively confirmed | ||
;blocks on side branches off the main | ;blocks on side branches off the main branch | ||
: these blocks have at least tentatively lost the race to be in the main | : these blocks have at least tentatively lost the race to be in the main branch | ||
;orphan blocks | ;orphan blocks | ||
: these are blocks which don't link into the main | : these are blocks which don't link into the main branch, normally because of a missing predecessor or nth-level predecessor | ||
Blocks in the first two categories form a tree rooted at the genesis block, linked by the prev pointer, which points toward the root. (It is a very linear tree with few and short branches off the main branch.) | Blocks in the first two categories form a tree rooted at the genesis block, linked by the prev pointer, which points toward the root. (It is a very linear tree with few and short branches off the main branch.) The main branch is defined as the branch with highest total difficulty, summing the difficulties for each block in the branch. | ||
Line 50: | Line 50: | ||
# Check that nLockTime <= INT_MAX, size in bytes >= 100, and sig opcount <= 2 | # Check that nLockTime <= INT_MAX, size in bytes >= 100, and sig opcount <= 2 | ||
# Reject "nonstandard" transactions: scriptSig doing anything other than pushing numbers on the stack, or scriptPubkey not matching the two usual forms | # Reject "nonstandard" transactions: scriptSig doing anything other than pushing numbers on the stack, or scriptPubkey not matching the two usual forms | ||
# Reject if we already have matching tx in the pool, or in a block in the main | # Reject if we already have matching tx in the pool, or in a block in the main branch | ||
# Reject if any other tx in the pool uses the same transaction output as one used by this tx. | # Reject if any other tx in the pool uses the same transaction output as one used by this tx. | ||
# For each input, look in the main | # For each input, look in the main branch and the transaction pool to find the referenced output transaction. If the output transaction is missing for any input, this will be an orphan transaction. Add to the orphan transactions, if a matching transaction is not in there already. | ||
# For each input, if we are using the ''n''th output of the earlier transaction, but it has fewer than n+1 outputs, reject this transaction | # For each input, if we are using the ''n''th output of the earlier transaction, but it has fewer than n+1 outputs, reject this transaction | ||
# For each input, if the referenced output transaction is coinbase (i.e. only 1 input, with hash=0, n=-1), it must have at least COINBASE_MATURITY confirmations; else reject this transaction | # For each input, if the referenced output transaction is coinbase (i.e. only 1 input, with hash=0, n=-1), it must have at least COINBASE_MATURITY confirmations; else reject this transaction | ||
# Verify crypto signatures for each input; reject if any are bad | # Verify crypto signatures for each input; reject if any are bad | ||
# For each input, if the referenced output has already been spent by a transaction in the main | # For each input, if the referenced output has already been spent by a transaction in the main branch or the transaction pool, reject this transaction | ||
# Using the referenced output transactions to get input values, check that each input value, as well as the sum, are in legal money range | # Using the referenced output transactions to get input values, check that each input value, as well as the sum, are in legal money range | ||
# Reject if the sum of input values < sum of output values | # Reject if the sum of input values < sum of output values | ||
Line 81: | Line 81: | ||
# Reject if sum of transaction sig opcounts > MAX_BLOCK_SIGOPS | # Reject if sum of transaction sig opcounts > MAX_BLOCK_SIGOPS | ||
# Verify Merkle hash | # Verify Merkle hash | ||
# Check if prev block (matching ''prev'' hash) is in main | # Check if prev block (matching ''prev'' hash) is in main branch or side branches. If not, add this to orphan blocks, then query peer we got this from for 1st missing orphan block in ''prev'' chain; done with block | ||
# Check that ''nBits'' value matches the difficulty rules | # Check that ''nBits'' value matches the difficulty rules | ||
# Reject if timestamp is before prev block (for a complicated definition of "before") | # Reject if timestamp is before prev block (for a complicated definition of "before") |
Revision as of 01:19, 15 March 2011
Rules for clients
The wiki substantially documents the Bitcoin protocol, but equally important are the rules used by the client to process messages. It's crucial that clients follow certain rules in order to maintain consistency across the network, and to protect the Bitcoin security guarantees.
Here, the focus is on handling tx and block messages, because that is the tricky logic. This will skip over the method of requesting and forwarding these messages for now, and describe what to do when they are received. Also, this will describe the minimal data structures in rather abstract terms, ignoring the client's various indexes, maps and hash tables used for efficiency. This will be a conceptual description. This is all based on a fairly literal reading of the source code.
Mining (block generation) rules are not yet presented.
Data structures
The main data structures are transactions and blocks. Blocks are composed of the block header followed by transactions in the block. Transactions are identified by their hash; blocks by the hash of their header. Blocks have prev pointers that link them into a graph.
Conceptually, the client has the following data structures:
Transactions
There are two collections of transactions:
- transaction pool
- an unordered collection of transactions that are not in blocks in the main chain, but for which we have input transactions
- orphan transactions
- transactions that can't go into the pool due to one or more missing input transactions
Blocks
There are 3 categories of blocks:
- blocks in the main branch
- the transactions in these blocks are considered at least tentatively confirmed
- blocks on side branches off the main branch
- these blocks have at least tentatively lost the race to be in the main branch
- orphan blocks
- these are blocks which don't link into the main branch, normally because of a missing predecessor or nth-level predecessor
Blocks in the first two categories form a tree rooted at the genesis block, linked by the prev pointer, which points toward the root. (It is a very linear tree with few and short branches off the main branch.) The main branch is defined as the branch with highest total difficulty, summing the difficulties for each block in the branch.
"tx" messages
These messages hold a single transaction.
- Check syntactic correctness
- Make sure neither in or out lists are empty
- Size in bytes < MAX_BLOCK_SIZE
- Each output value, as well as the total, must be in legal money range
- Make sure none of the inputs have hash=0, n=-1 (coinbase transactions)
- Check that nLockTime <= INT_MAX, size in bytes >= 100, and sig opcount <= 2
- Reject "nonstandard" transactions: scriptSig doing anything other than pushing numbers on the stack, or scriptPubkey not matching the two usual forms
- Reject if we already have matching tx in the pool, or in a block in the main branch
- Reject if any other tx in the pool uses the same transaction output as one used by this tx.
- For each input, look in the main branch and the transaction pool to find the referenced output transaction. If the output transaction is missing for any input, this will be an orphan transaction. Add to the orphan transactions, if a matching transaction is not in there already.
- For each input, if we are using the nth output of the earlier transaction, but it has fewer than n+1 outputs, reject this transaction
- For each input, if the referenced output transaction is coinbase (i.e. only 1 input, with hash=0, n=-1), it must have at least COINBASE_MATURITY confirmations; else reject this transaction
- Verify crypto signatures for each input; reject if any are bad
- For each input, if the referenced output has already been spent by a transaction in the main branch or the transaction pool, reject this transaction
- Using the referenced output transactions to get input values, check that each input value, as well as the sum, are in legal money range
- Reject if the sum of input values < sum of output values
- Reject if transaction fee (defined as sum of input values minus sum of output values) would be too low to get into an empty block
- Add to transaction pool
- Add to wallet if mine
- Relay transaction to peers
- For each orphan transaction that uses this one as one of its inputs, run all these steps (including this one) recursively on that orphan
"block" messages
These messages hold a single block.
- Check syntactic correctness
- Reject if duplicate of block we have in any of the three categories
- Transaction list must be non-empty
- Block hash must satisfy claimed nBits proof of work
- Block timestamp must not be more than two hours in the future
- First transaction must be coinbase (i.e. only 1 input, with hash=0, n=-1), the rest must not be
- For each transaction, apply "tx" checks 2-4
- For the coinbase (first) transaction, scriptSig length must be 2-100
- For the other transactions, reject if any input has hash=0, n=-1
- Reject if sum of transaction sig opcounts > MAX_BLOCK_SIGOPS
- Verify Merkle hash
- Check if prev block (matching prev hash) is in main branch or side branches. If not, add this to orphan blocks, then query peer we got this from for 1st missing orphan block in prev chain; done with block
- Check that nBits value matches the difficulty rules
- Reject if timestamp is before prev block (for a complicated definition of "before")
- For certain blocks, on initial block download, check that hash is what it should be