BIP 0136: Difference between revisions

From Bitcoin Wiki
Jump to navigation Jump to search
934 (talk | contribs)
Update BIP text with latest version from https://github.com/bitcoin/bips/blob/b5723035e23896d0/bip-0136.mediawiki
 
934 (talk | contribs)
Update BIP text with latest version from https://github.com/bitcoin/bips/blob/edffe529056f6dfd/bip-0136.mediawiki
 
Line 20: Line 20:


=== Abstract ===
=== Abstract ===
This document proposes a convenient human useable format, '''"TxRef"''', as a standard way to refer to a transaction position within the Bitcoin Blockchain, and optionally a particular outpoint index within the referred transaction. The primary purpose of this format is to allow users to refer to a confirmed transaction (and optionally an outpoint index within) in a standard, reliable, and concise way.
This document proposes a convenient, human usable encoding to refer to a '''confirmed transaction position''' within the Bitcoin blockchain--known as '''"TxRef"'''. The primary purpose of this encoding is to allow users to refer to a confirmed transaction (and optionally, a particular outpoint index within the transaction) in a standard, reliable, and concise way.


''Please note: Unlike TxID where there is strong cryptographic link between the ID and the actual transaction, TxRef only provides a weak link to a particular transaction. TxRef locates an offset within a blockchain for a transaction, that may - or may not - point to an actual transaction, which in fact may change with reorganisations. We recommend that TxRef's should be not used for positions within the blockchain having a maturity less than 100 blocks.''
''Please note: Unlike a transaction ID, '''"TxID"''', where there is a strong cryptographic link between the ID and the actual transaction, a '''TxRef''' only provides a weak link to a particular transaction. A '''TxRef''' locates an offset within a blockchain for a transaction, that may - or may not - point to an actual transaction, which in fact may change with reorganisations. We recommend that '''TxRef'''s should be not used for positions within the blockchain having a maturity less than 100 blocks.''
 
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [https://tools.ietf.org/html/rfc2119 RFC 2119].


=== Copyright ===
=== Copyright ===
Line 29: Line 31:


=== Motivation ===
=== Motivation ===
Since the first version of Bitcoin, TxID's (Transaction Identifiers) have been a core part of the consensus protocol and have been routinely used to identify individual transactions between users.
Since the first version of Bitcoin, '''TxID'''s have been a core part of the consensus protocol and are routinely used to identify individual transactions between users.


However, for many use-cases they have practical limitations:
However, for many use-cases they have practical limitations:
* TxIDs are expensive for full nodes to lookup (requiring either a linear scan of the blockchain, or an expensive TxID index).
* '''TxID'''s are expensive for full nodes to lookup (requiring either a linear scan of the blockchain, or an expensive '''TxID''' index).
* TxIDs require third-party services for SPV wallets to lookup.
* '''TxID'''s require third-party services for SPV wallets to lookup.
* TxIDs are very long HEX encoded values (64 characters long).
* '''TxID'''s are 64 character HEX encoded values.


For transactions that have been embedded in the blockchain, it is possible to reference them not by their TxID, but by their location within the blockchain itself. The encoding can be made friendly for occasional human transcription. In this document, we propose a standard for doing this.
It is possible to reference transactions not only by their '''TxID''', but by their location within the blockchain itself. Rather than use the 64 character '''TxID''', an encoding of the position coordinates can be made friendly for occasional human transcription. In this document, we propose a standard for doing this.


=== Examples ===
=== Examples ===
These examples are for Bitcoin Transactions.
 
* Genesis Coinbase Transaction (Transaction #0 of Block #0): <tt>tx1:rqqq-qqqq-qmhu-qhp</tt>
{| class="wikitable"
* Transaction #2205 of Block #466793: <tt>tx1:rjk0-uqay-zsrw-hqe</tt>
|-
! Block # !! Transaction # !! Outpoint # !! TxRef !! TxID
|-
| 0 || 0 || 0 || tx1:rqqq&#8209;qqqq&#8209;qwtv&#8209;vjr || 4a5e1e4baab89f3a32518a88c31bc87f618f76673e2cc77ab2127b7afdeda33b
|-
| 170 || 1 || 0 || tx1:r52q&#8209;qqpq&#8209;qpty&#8209;cfg || f4184fc596403b9d638783cf57adfe4c75c605f6356fbc91338530e9831e9e16
|-
| 456789 || 1234 || 1 || tx1:y29u&#8209;mqjx&#8209;ppqq&#8209;sfp2&#8209;tt || 6fb8960f70667dc9666329728a19917937896fc476dfc54a3e802e887ecb4e82
|}


== Specification ==
== Specification ==


A '''confirmed transaction position reference''', or '''TxRef''', is a reference to a particular location within the blockchain, specified by the block height and a transaction index within the block, and optionally a outpoint index within the transaction.
A '''confirmed transaction position reference''', or '''TxRef''', is a reference to a particular location within the blockchain, specified by the block height and a transaction index within the block, and optionally, an outpoint index within the transaction.


''Please Note: All values in this specification are encoded in little-endian format.''
''Please Note: All values in this specification are encoded in little-endian format.''


=== Transaction Position Reference Considerations ===
=== TxRef Considerations ===
A TxRef may reference a location that doesn't exist because:
It is possible for a '''TxRef''' to reference a transaction that doesn't really exist because:


* The specified block hasn't yet been mined. Or,
* The specified block hasn't yet been mined.
* The transaction index is greater than the total number of transactions included within the specified block.
* The transaction index is greater than the total number of transactions included within the specified block.
* The optional outpoint index is greater than the total outpoints contained within the transaction.
* The optional outpoint index is greater than the total outpoints contained within the transaction.


Therefore, implementers must be careful not to display TxRef's to users prematurely:
Therefore, implementers must be careful not to display '''TxRef'''s to users prematurely:
 
* Applications MUST NOT display '''TxRef'''s for transactions with less than 6 confirmations.
* Application MUST show a warning for '''TxRef'''s for transactions with less than 100 confirmations.
** This warning SHOULD state that in the case of a large reorganisation, the '''TxRef'''s displayed may point to a different transaction, or to no transaction at all.
 
=== TxRef Format ===
 
'''TxRef''' MUST use the '''Bech32m'''<ref>'''Why use Bech32 Encoding for Confirmed Transaction References?''' The error detection and correction properties of this encoding format make it very attractive. We expect that it will be reasonable for software to correct a maximum of two characters; however, we haven’t specified this yet.</ref> encoding as defined in [https://github.com/bitcoin/bips/blob/master/bip-0173.mediawiki BIP-0173] and later refined in [https://github.com/bitcoin/bips/blob/master/bip-0350.mediawiki BIP-0350]. The Bech32m encoding consists of:
 
==== Human-Readable Part ====
 
The '''HRP''' can be thought of as a label. We have chosen labels to distinguish between Main, Test, and Regtest networks:
* Mainnet: '''"tx"'''.
* Testnet: '''"txtest"'''.
* Regtest: '''"txrt"'''.
 
==== Separator ====
 
The separator is the character '''"1"'''.


* Applications MUST NOT display TxRef's for transactions with less than 6 confirmations.
==== Data Part ====
* Application MUST show a warning for TxRef's for transactions with less than 100 confirmations.
 
** This warning SHOULD state that in the case of a large reorganisation, the TxRefs Displayed may point to a different transaction, or to no transaction at all.
The data part for a '''TxRef''' consists of the transaction's block height, transaction index within the block, and optionally, an outpoint index. Specific encoding details for the data are given below.
 
''Please note: other specifications, such as [https://w3c-ccg.github.io/did-spec/ the Decentralized Identifiers spec], have implicitly encoded the information contained within the HRP elsewhere. In this case they may choose to not include the HRP as specified here.''
 
==== Readability ====
 
To increase portability and readability, additional separator characters SHOULD be added to the '''TxRef''':
 
* A Colon<ref>'''Why add a colon here?''' This allows it to conform better with W3C URN/URL standards.</ref> '''":"'''  added after the separator character '1'.
* Hyphens<ref>'''Why hyphens within the TxRef?''' As '''TxRef'''s are short, we expect that they will be quoted via voice or written by hand. The inclusion of hyphens every 4 characters breaks up the string and means people don't lose their place so easily.</ref> '''"-"''' added after every 4 characters beyond the colon.


=== Encoding ===
=== Encoding ===


TxRef uses standard Bech32<ref name=":0">'''Why use Bech32 Encoding for Confirmed Transaction References?''' The error detection and correction properties of this encoding format make it very attractive. We expect that it will be reasonable for software to correct a maximum of two characters; however, we haven’t specified this yet.</ref> encoding as defined in [https://github.com/bitcoin/bips/blob/master/bip-0173.mediawiki BIP-173] and therefore consists of:
Encoding a '''TxRef''' requires 4 or 5 pieces of data: a magic code denoting which network is being used; a version number (currently always 0); the block height of the block containing the transaction; the index of the transaction within the block; and optionally, the index of the outpoint within the transaction. Only a certain number of bits are supported for each of these values, see the following table for details.


* Human-readable Part, or "HRP", that provides namespacing. We have chosen to distinguish between Main and Test Networks:
{| class="wikitable"
** For Any Mainnet Network: '''"tx"'''.
!
** For Any Testnet Network: '''"txtest"'''.
!Description
** Please see [https://github.com/satoshilabs/slips/blob/master/slip-0173.md SLIP-0173 : Registered human-readable parts for BIP-0173] for a full list of HRP's including these two and others relating to other projects.
!Possible Data Type
* Separator: '''"1"'''.
!'''# of Bits used'''
* Data Part.
!Values
|-
| style="background: #99DDFF; color: black; text-align : center;" | Magic Code
|Chain Namespacing Code
|uint8
| style="background: #99DDFF; color: black; text-align : center;" | 5
|'''3''': Mainnet<br>'''4''': Mainnet with Outpoint<br>'''6''': Testnet<br>'''7''': Testnet with Outpoint<br>'''0''': Regtest<br>'''1''': Regtest with Outpoint
|-
| style="background: #DDDDDD; color: black; text-align : center;" | Version
|For Future Use
|uint8
| style="background: #DDDDDD; color: black; text-align : center;" | 1
|Must be '''0'''
|-
| style="background: #EEDD88; color: black; text-align : center;" | Block<br>Height
|The Block Height of the Tx
|uint32
| style="background: #EEDD88; color: black; text-align : center;" | 24
|Block 0 to Block 16777215
|-
| style="background: #FFAABB; color: black; text-align : center;" | Transaction<br>Index
|The index of the Tx inside the block
|uint16, uint32
| style="background: #FFAABB; color: black; text-align : center;" | 15
|Tx 0 to Tx 32767
|-
| style="background: #BBCC33; color: black; text-align : center;" | Outpoint<br>Index
|The index of the Outpoint inside the Tx
|uint16, uint32
| style="background: #BBCC33; color: black; text-align : center;" | 15
|Outpoint 0 to Outpoint 32767
|}


Please note: other specifications, such as [https://w3c-ccg.github.io/did-spec/ the Decentralized Identifiers spec], have implicitly encoded the information contained within the HRP elsewhere. In this case they may choose to not include the HRP as specified here.
==== Magic Notes ====
The magic code provides namespacing between chains:


To increase portability and readability additional separators SHOULD be added:
* For Mainnet the magic code is: '''0x3''', leading to an '''"r"''' character when encoded.
* For Mainnet with Outpoint Encoded the magic code is: '''0x4''', leading to a '''"y"''' character when encoded.
* For Testnet the magic code is: '''0x6''', leading to an '''"x"''' character when encoded.
* For Testnet with Outpoint Encoded the magic code is: '''0x7''', leading to an '''"8"''' character when encoded.
* For Regtest the magic code is: '''0x0''', leading to a '''"q"''' character when encoded.
* For Regtest with Outpoint Encoded the magic code is: '''0x1''', leading to a '''"p"''' character when encoded.


* A Colon<ref>'''Why add a colon here?''' This allows it to conform better with W3C URN/URL standards.</ref> '''":"'''  added after '1'.
==== Encoding Example ====
* Hyphens<ref>'''Why hyphens within the TxRef?''' As TxRef's are short, we expect that they will be quoted via voice or written by hand. The inclusion of hyphens every 4 characters breaks up the string and means people don't lose their place so easily.</ref> '''"-"''' added after every 4 characters beyond the colon.
 
We want to encode a '''TxRef''' that refers to Transaction #1234 of Block #456789 on the Mainnet chain. We use this data in preparation for the Bech32 encoding algorithm:


All non-bech32-alphabet characters after the bech32 code separator MUST be ignored/removed when parsing (except for terminating characters).<ref>'''Why strip all non-bech32-alphabet characters?''' We do not wish to expect the users to keep their TxRef's in good unicode form (hyphens, colons, invisible spaces, random unicode characters, etc). We expect them to copy, paste, write by-hand, write in a mix of character sets, etc. Parsers should automatically correct for all sorts of these common errors.
</ref>
{| class="wikitable"
{| class="wikitable"
|+Text Encoding of the TxRef
!
!
!Bit
!Decimal<br>Value
!Character
!Binary<br>Value
!Characters
!'''# of Bits<br>used'''
!Value
!Bit Indexes and Values
|-
| style="background: #99DDFF; color: black; text-align : center;" | Magic<br>Code
| style="background: #99DDFF; color: black; text-align : center;" | 3
|00000011
| style="background: #99DDFF; color: black; text-align : center;" | 5
|(mc04, mc03, mc02, mc01, mc00) = (0, 0, 0, 1, 1)
|-
| style="background: #DDDDDD; color: black; text-align : center;" | Version
| style="background: #DDDDDD; color: black; text-align : center;" | 0
|00000000
| style="background: #DDDDDD; color: black; text-align : center;" | 1
|(v0) = (0)
|-
| style="background: #EEDD88; color: black; text-align : center;" | Block<br>Height
| style="background: #EEDD88; color: black; text-align : center;" | 456789
|00000110<br>11111000<br>01010101
| style="background: #EEDD88; color: black; text-align : center;" | 24
|(bh23, bh22, bh21, bh20, bh19, bh18, bh17, bh16) = (0, 0, 0, 0, 0, 1, 1, 0)<br>(bh15, bh14, bh13, bh12, bh11, bh10, bh09, bh08) = (1, 1, 1, 1, 1, 0, 0, 0)<br>(bh07, bh06, bh05, bh04, bh03, bh02, bh01, bh00) = (0, 1, 0, 1, 0, 1, 0, 1)
|-
| style="background: #FFAABB; color: black; text-align : center;" | Transaction<br>Index
| style="background: #FFAABB; color: black; text-align : center;" | 1234
|00000100<br>11010010
| style="background: #FFAABB; color: black; text-align : center;" | 15
|(ti14, ti13, ti12, ti11, ti10, ti09, ti08) = (0, 0, 0, 0, 1, 0, 0)<br>(ti07, ti06, ti05, ti04, ti03, ti02, ti01, ti00) = (1, 1, 0, 1, 0, 0, 1, 0)
|}
 
As shown in the last column, we take the necessary bits of each binary value and copy them into nine unsigned chars illustrated in the next table. We only set the lower five bits of each unsigned char as the bech32 algorithm only uses those bits.
 
{| class="wikitable" style="text-align: center"
!
!
!style="width:2em"|7
!style="width:2em"|6
!style="width:2em"|5
!style="width:2em"|4
!style="width:2em"|3
!style="width:2em"|2
!style="width:2em"|1
!style="width:2em"|0
!
!Decimal<br>Value
!Bech32<br>Character
|-
| || || || || || || || || || || || ||
|-
| rowspan="2" | data[0] || Index
|na
|na
|na
| style="background: #99DDFF; color: black; text-align : center;" | mc04
| style="background: #99DDFF; color: black; text-align : center;" | mc03
| style="background: #99DDFF; color: black; text-align : center;" | mc02
| style="background: #99DDFF; color: black; text-align : center;" | mc01
| style="background: #99DDFF; color: black; text-align : center;" | mc00
|
|
|
|-
|Value
|0
|0
|0
|0
|0
|0
|1
|1
|
|3
|r
|-
| || || || || || || || || || || ||
|-
|-
|Human Readable Part
| rowspan="2" | data[1] || Index
|na
|na
|na
| style="background: #EEDD88; color: black; text-align : center;" | bh03
| style="background: #EEDD88; color: black; text-align : center;" | bh02
| style="background: #EEDD88; color: black; text-align : center;" | bh01
| style="background: #EEDD88; color: black; text-align : center;" | bh00
| style="background: #DDDDDD; color: black; text-align : center;" | v0
|
|
|
|
|1 – 2
|-
|Value
|0
|0
|0
|0
|1
|0
|1
|0
|
|10
|2
|2
|Bitcoin Mainnet: "'''tx'''", Bitcoin Testnet: "'''txtest'''"
|-
|-
|Separator
| || || || || || || || || || || ||
|-
| rowspan="2" | data[2] || Index
|na
|na
|na
| style="background: #EEDD88; color: black; text-align : center;" | bh08
| style="background: #EEDD88; color: black; text-align : center;" | bh07
| style="background: #EEDD88; color: black; text-align : center;" | bh06
| style="background: #EEDD88; color: black; text-align : center;" | bh05
| style="background: #EEDD88; color: black; text-align : center;" | bh04
|
|
|3
|
|
|-
|Value
|0
|0
|0
|0
|0
|1
|0
|1
|
|5
|9
|-
| || || || || || || || || || || ||
|-
| rowspan="2" | data[3] || Index
|na
|na
|na
| style="background: #EEDD88; color: black; text-align : center;" | bh13
| style="background: #EEDD88; color: black; text-align : center;" | bh12
| style="background: #EEDD88; color: black; text-align : center;" | bh11
| style="background: #EEDD88; color: black; text-align : center;" | bh10
| style="background: #EEDD88; color: black; text-align : center;" | bh09
|
|
|
|-
|Value
|0
|0
|0
|1
|1
|1
|1
|"'''1'''"
|0
|0
|
|28
|u
|-
| || || || || || || || || || || ||
|-
|-
|Colon
| rowspan="2" | data[4] || Index
|na
|na
|na
| style="background: #EEDD88; color: black; text-align : center;" | bh18
| style="background: #EEDD88; color: black; text-align : center;" | bh17
| style="background: #EEDD88; color: black; text-align : center;" | bh16
| style="background: #EEDD88; color: black; text-align : center;" | bh15
| style="background: #EEDD88; color: black; text-align : center;" | bh14
|
|
|
|4
|
|-
|Value
|0
|0
|0
|1
|1
|0
|1
|1
|1
|"''':'''"
|
|27
|m
|-
| || || || || || || || || || || ||
|-
|-
|Data
| rowspan="2" | data[5] || Index
|0 – 19
|na
|5 – 8
|na
|4
|na
| style="background: #EEDD88; color: black; text-align : center;" | bh23
| style="background: #EEDD88; color: black; text-align : center;" | bh22
| style="background: #EEDD88; color: black; text-align : center;" | bh21
| style="background: #EEDD88; color: black; text-align : center;" | bh20
| style="background: #EEDD88; color: black; text-align : center;" | bh19
|
|
|
|-
|Value
|0
|0
|0
|0
|0
|0
|0
|0
|
|
|0
|q
|-
|-
|Hyphen
| || || || || || || || || || || ||
|-
| rowspan="2" | data[6] || Index
|na
|na
|na
| style="background: #FFAABB; color: black; text-align : center;" | ti04
| style="background: #FFAABB; color: black; text-align : center;" | ti03
| style="background: #FFAABB; color: black; text-align : center;" | ti02
| style="background: #FFAABB; color: black; text-align : center;" | ti01
| style="background: #FFAABB; color: black; text-align : center;" | ti00
|
|
|
|
|9
|-
|Value
|0
|0
|0
|1
|0
|0
|1
|1
|"'''-'''"
|0
|}
|
The Data - Hyphen pattern is repeated for the entire length of data, ( a hyphen is inserted after every encoded 20 bits or 4 data characters).
|18
=== Data ===
|j
 
|-
Depending on if an optional transaction outpoint is included, there can be 75 or 90 bits of data encoded in the string above. These bits are defined in this manner:
| || || || || || || || || || || ||
 
{| class="wikitable"
|+TxRef Binary Format for Bitcoin Mainnet and Bitcoin Testnet:
!
!'''Bit'''
!'''Bit(s)'''
!'''Type'''
!'''Values'''
!'''Notes'''
|-
|-
|Magic Code
| rowspan="2" | data[7] || Index
|0 – 4
|na
|5
|na
|Chain Namespacing Code
|na
|'''0x3''' for Bitcoin Mainnet.
| style="background: #FFAABB; color: black; text-align : center;" | ti09
'''0x4''' for Bitcoin Mainnet with Outpoint.
| style="background: #FFAABB; color: black; text-align : center;" | ti08
'''0x6''' for Bitcoin Testnet.
| style="background: #FFAABB; color: black; text-align : center;" | ti07
'''0x7''' for Bitcoin Testnet with Outpoint.
| style="background: #FFAABB; color: black; text-align : center;" | ti06
| style="background: #FFAABB; color: black; text-align : center;" | ti05
|
|
|
|
|-
|-
|Version
|Value
|5
|0
|0
|0
|0
|0
|1
|1
|1
|For Future Use
|0
|Must be '''0x0'''
|
|6
|x
|-
| || || || || || || || || || || ||
|-
| rowspan="2" | data[8] || Index
|na
|na
|na
| style="background: #FFAABB; color: black; text-align : center;" | ti14
| style="background: #FFAABB; color: black; text-align : center;" | ti13
| style="background: #FFAABB; color: black; text-align : center;" | ti12
| style="background: #FFAABB; color: black; text-align : center;" | ti11
| style="background: #FFAABB; color: black; text-align : center;" | ti10
|
|
|
|
|-
|-
|Block Height
|Value
|6 – 29
|0
|24
|0
|The Block Height of the Tx
|0
|Block 0 (genesis) to block 16777215
|0
|Until Year ~2328
|0
|0
|0
|1
|
|1
|p
|}
 
The Bech32 algorithm encodes the nine unsigned chars above and computes a checksum of those chars and encodes that as well--this gives a six character checksum (in this case, '''utt3p0''') which is appended to the final '''TxRef'''. The final '''TxRef''' given is: '''tx1:r29u-mqjx-putt-3p0''' and is illustrated in the following table:
 
TxRef character indexes and descriptions
{| class="wikitable" style="text-align: top"
!style="width:2em"|Index
!style="width:2em"|0
!style="width:2em"|1
!style="width:2em"|2
!style="width:2em"|3
!style="width:2em"|4
!style="width:2em"|5
!style="width:2em"|6
!style="width:2em"|7
!style="width:2em"|8
!style="width:2em"|9
!style="width:2em"|10
!style="width:2em"|11
!style="width:2em"|12
!style="width:2em"|13
!style="width:2em"|14
!style="width:2em"|15
!style="width:2em"|16
!style="width:2em"|17
!style="width:2em"|18
!style="width:2em"|19
!style="width:2em"|20
!style="width:2em"|21
|-
|-
|Transaction Index
|Char:
|30 – 44
| style="background: #BBCCEE; color: black; text-align : center;" | t
|15
|  style="background: #BBCCEE; color: black; text-align : center;" | x
|The index of the Tx inside the block
|  style="background: #FFCCCC; color: black; text-align : center;" | 1
|Tx 0 (coinbase) to Tx position 32767
|  style="background: #CCDDAA; color: black; text-align : center;" | &#58;
|Max Tx's in block is 16665
|  style="background: #EEEEBB; color: black; text-align : center;" | r
|  style="background: #EEEEBB; color: black; text-align : center;" | 2
|  style="background: #EEEEBB; color: black; text-align : center;" | 9
|  style="background: #EEEEBB; color: black; text-align : center;" | u
|  style="background: #CCDDAA; color: black; text-align : center;" | -
|  style="background: #EEEEBB; color: black; text-align : center;" | m
|  style="background: #EEEEBB; color: black; text-align : center;" | q
|  style="background: #EEEEBB; color: black; text-align : center;" | j
| style="background: #EEEEBB; color: black; text-align : center;" | x
| style="background: #CCDDAA; color: black; text-align : center;" | -
| style="background: #EEEEBB; color: black; text-align : center;" | p
|  style="background: #EEEEBB; color: black; text-align : center;" | u
|  style="background: #EEEEBB; color: black; text-align : center;" | t
|  style="background: #EEEEBB; color: black; text-align : center;" | t
|  style="background: #CCDDAA; color: black; text-align : center;" | -
|  style="background: #EEEEBB; color: black; text-align : center;" | 3
|  style="background: #EEEEBB; color: black; text-align : center;" | p
| style="background: #EEEEBB; color: black; text-align : center;" | 0
|}
|}
If the magic code is '''0x4''' or '''0x7''', an optional outpoint is included in the encoding:
 
==== Outpoint Index ====
 
Some uses of '''TxRef''' may want to refer to a specific outpoint of the transaction. In the previous example, since we did not specify the outpoint index, the '''TxRef''' '''tx1:r29u-mqjx-putt-3p0''' implicitly references the first (index 0) outpoint of the 1234th transaction in the 456789th block in the blockchain.
 
If instead, for example, we want to reference the second (index 1) outpoint, we need to change the magic code from '''3''' to '''4''' and would include the following in the data to be encoded:


{| class="wikitable"
{| class="wikitable"
|+Optional Outpoint Index Encoding:
!
!
!'''Bit'''
!Decimal<br>Value
!'''Bit(s)'''
!Binary<br>Value
!'''Type'''
!'''# of Bits<br>used'''
!'''Values'''
!Bit Indexes and Values
!'''Notes'''
|-
| style="background: #99DDFF; color: black; text-align : center;" | Magic<br>Code
| style="background: #99DDFF; color: black; text-align : center;" | 4
|00000100
| style="background: #99DDFF; color: black; text-align : center;" | 5
|(mc04, mc03, mc02, mc01, mc00) = (0, 0, 1, 0, 0)
|-
|-
|Outpoint Index
| style="background: #BBCC33; color: black; text-align : center;" | Outpoint Index
|45 – 59
| style="background: #BBCC33; color: black; text-align : center;" | 1
|15
|00000000 00000001
|The index of the Outpoint inside the Tx
| style="background: #BBCC33; color: black; text-align : center;" | 15
|Outpoint 0 to Outpoint Position 32767
|(op14, op13, op12, op11, op10, op09, op08) = (0, 0, 0, 0, 0, 0, 0)<br>(op07, op06, op05, op04, op03, op02, op01, op00) = (0, 0, 0, 0, 0, 0, 0, 1)
|
|}
|}


We include the 30-bit checksum last:
{| class="wikitable" style="text-align: center"
{| class="wikitable"
!
|+Bech32 Checksum Encoding:
!
!style="width:2em"|7
!style="width:2em"|6
!style="width:2em"|5
!style="width:2em"|4
!style="width:2em"|3
!style="width:2em"|2
!style="width:2em"|1
!style="width:2em"|0
!
!
!'''Bit'''
!Decimal<br>Value
!'''Bit(s)'''
!Bech32<br>Character
!'''Type'''
|-
!'''Values'''
| || || || || || || || || || || || ||
!'''Notes'''
|-
| rowspan="2" | data[0] || Index
|na
|na
|na
| style="background: #99DDFF; color: black; text-align : center;" | mc04
| style="background: #99DDFF; color: black; text-align : center;" | mc03
| style="background: #99DDFF; color: black; text-align : center;" | mc02
| style="background: #99DDFF; color: black; text-align : center;" | mc01
| style="background: #99DDFF; color: black; text-align : center;" | mc00
|
|
|
|-
|-
|Checksum
|Value
|45 – 74 or 60 – 89
|0
|30
|0
|Bech32 Checksum
|0
|0
|0
|1
|0
|0
|
|
|4
|y
|-
| || || || || || || || || || || ||
|-
| rowspan="2" | data[9] || Index
|na
|na
|na
| style="background: #BBCC33; color: black; text-align : center;" | op04
| style="background: #BBCC33; color: black; text-align : center;" | op03
| style="background: #BBCC33; color: black; text-align : center;" | op02
| style="background: #BBCC33; color: black; text-align : center;" | op01
| style="background: #BBCC33; color: black; text-align : center;" | op00
|
|
|
|
|-
|Value
|0
|0
|0
|0
|0
|0
|0
|1
|
|1
|p
|-
| || || || || || || || || || || ||
|-
| rowspan="2" | data[10] || Index
|na
|na
|na
| style="background: #BBCC33; color: black; text-align : center;" | op09
| style="background: #BBCC33; color: black; text-align : center;" | op08
| style="background: #BBCC33; color: black; text-align : center;" | op07
| style="background: #BBCC33; color: black; text-align : center;" | op06
| style="background: #BBCC33; color: black; text-align : center;" | op05
|
|
|
|-
|Value
|0
|0
|0
|0
|0
|0
|0
|0
|
|0
|q
|-
| || || || || || || || || || || ||
|-
| rowspan="2" | data[11] || Index
|na
|na
|na
| style="background: #BBCC33; color: black; text-align : center;" | op14
| style="background: #BBCC33; color: black; text-align : center;" | op13
| style="background: #BBCC33; color: black; text-align : center;" | op12
| style="background: #BBCC33; color: black; text-align : center;" | op11
| style="background: #BBCC33; color: black; text-align : center;" | op10
|
|
|
|-
| Value
|0
|0
|0
|0
|0
|0
|0
|0
|
|0
|q
|}
|}


==== Magic Notes: ====
After Bech32 encoding all twelve unsigned chars above, we get the checksum: '''sfp2tt'''. The final '''TxRef''' given is: '''tx1:y29u-mqjx-ppqq-sfp2-tt''' and is illustrated in the following table:
The magic code provides namespacing between chains. 5-bit magic codes are used for the Bitcoin Mainnet and the Bitcoin Testnet. (it may be significantly longer for other projects/chains):


* For Bitcoin Mainnet the magic code is: '''0x3''', leading to an '''"r"''' character when encoded.
TxRef character indexes and descriptions
* For Bitcoin Mainnet with Outpoint Encoded the magic code is: '''0x4''', leading to an '''"y"''' character when encoded.
{| class="wikitable" style="text-align: top"
* For Bitcoin Testnet the magic code is: '''0x6''', leading to an '''"x"''' character when encoded.
!style="width:2em"|Index
* For Bitcoin Testnet with Outpoint Encoded the magic code is: '''0x7''', leading to an '''"8"''' character when encoded.
!style="width:2em"|0
!style="width:2em"|1
!style="width:2em"|2
!style="width:2em"|3
!style="width:2em"|4
!style="width:2em"|5
!style="width:2em"|6
!style="width:2em"|7
!style="width:2em"|8
!style="width:2em"|9
!style="width:2em"|10
!style="width:2em"|11
!style="width:2em"|12
!style="width:2em"|13
!style="width:2em"|14
!style="width:2em"|15
!style="width:2em"|16
!style="width:2em"|17
!style="width:2em"|18
!style="width:2em"|19
!style="width:2em"|20
!style="width:2em"|21
!style="width:2em"|22
!style="width:2em"|23
!style="width:2em"|24
!style="width:2em"|25
|-
|Char:
|  style="background: #BBCCEE; color: black; text-align : center;" | t
|  style="background: #BBCCEE; color: black; text-align : center;" | x
|  style="background: #FFCCCC; color: black; text-align : center;" | 1
|  style="background: #CCDDAA; color: black; text-align : center;" | &#58;
|  style="background: #EEEEBB; color: black; text-align : center;" | y
|  style="background: #EEEEBB; color: black; text-align : center;" | 2
|  style="background: #EEEEBB; color: black; text-align : center;" | 9
|  style="background: #EEEEBB; color: black; text-align : center;" | u
|  style="background: #CCDDAA; color: black; text-align : center;" | -
|  style="background: #EEEEBB; color: black; text-align : center;" | m
|  style="background: #EEEEBB; color: black; text-align : center;" | q
|  style="background: #EEEEBB; color: black; text-align : center;" | j
|  style="background: #EEEEBB; color: black; text-align : center;" | x
|  style="background: #CCDDAA; color: black; text-align : center;" | -
|  style="background: #EEEEBB; color: black; text-align : center;" | p
|  style="background: #EEEEBB; color: black; text-align : center;" | p
|  style="background: #EEEEBB; color: black; text-align : center;" | q
|  style="background: #EEEEBB; color: black; text-align : center;" | q
|  style="background: #CCDDAA; color: black; text-align : center;" | -
|  style="background: #EEEEBB; color: black; text-align : center;" | s
|  style="background: #EEEEBB; color: black; text-align : center;" | f
|  style="background: #EEEEBB; color: black; text-align : center;" | p
|  style="background: #EEEEBB; color: black; text-align : center;" | 2
|  style="background: #CCDDAA; color: black; text-align : center;" | -
|  style="background: #EEEEBB; color: black; text-align : center;" | t
|  style="background: #EEEEBB; color: black; text-align : center;" | t
|}


Codes '''0x0''', '''0x1''', '''0x2''', '''0x5''', are also reserved for future use within the Bitcoin project.


''Any other chain MUST NOT start their magic code with any value between 0x0 and 0x7 inclusive.''
=== Decoding ===


Other magic codes will be specified in SLIP-XXXX "TxRef for Non-Bitcoin Chains and Networks".
The Bech32 spec defines 32 valid characters as its "alphabet". All non-Bech32-alphabet characters present in a '''TxRef''' after the Bech32 separator character MUST be ignored/removed when parsing (except for terminating characters). We do not wish to expect the users to keep their '''TxRef'''s in good form and '''TxRef'''s may contains hyphens, colons, invisible spaces, uppercase or random characters. We expect users to copy, paste, write by-hand, write in a mix of character sets, etc. Parsers SHOULD attempt to correct for these and other common errors, reporting to the user any '''TxRef'''s that violate a proper Bech32 encoding.


=== Compatibility ===
As of early 2021, '''TxRef''' has been in limited use for a couple of years and it is possible that there are some '''TxRef'''s in use which were created with the original specification of Bech32 before the Bech32m refinement was codified. Due to this possibility, a '''TxRef''' parser SHOULD be able to decode both Bech32m and Bech32 encoded '''TxRef'''s. In such a case, a '''TxRef''' parser SHOULD display or somehow notify the user that they are using an obsolete '''TxRef''' and that they should upgrade it to the Bech32m version. Additionally, the parser MAY also display the Bech32m version.
There are no known compatibility issues.


== Rationale ==
== Rationale ==
Line 224: Line 738:


== Reference implementations ==
== Reference implementations ==
C Reference Implementation (supports magic codes 0x3 and 0x6): https://github.com/jonasschnelli/bitcoin_txref_code
C Reference Implementation (supports magic codes 0x3 and 0x6): https://github.com/jonasschnelli/bitcoin_txref_code


Go Reference Implementation (supports magic codes 0x3 and 0x6): https://github.com/kulpreet/txref
Go Reference Implementation (supports magic codes 0x3 and 0x6): https://github.com/kulpreet/txref


C++ Reference Implementation (support magic codes 0x3, 0x4, 0x6, 0x7): https://github.com/dcdpr/btcr-DID-method/
C++ Reference Implementation (supports magic codes 0x3, 0x4, 0x6, 0x7, 0x0 and 0x1): https://github.com/dcdpr/libtxref/
 
Java Reference Implementation (supports magic codes 0x3, 0x4, 0x6, 0x7, 0x0 and 0x1): https://github.com/dcdpr/libtxref-java/


== Appendices ==
== Appendices ==


=== Test Vectors ===
=== Test Examples ===
There are two sets of Test Vectors included here:
 
* Bech32 Encoding Test Vectors. These are to test if a implementation accepts the encoding, with the correct human readable part, and separator.
* Bitcoin TxRef Test Vectors. These test the full specification, in particular, correct values for block height and the transaction index.
 
==== Bech32 Encoding (for TxRef). ====
''Please Note: All test vectors are shown to help test if a string is compliant or not. All real-life applications (such as for Bitcoin) should comply with the Bitcoin Test Vectors listed Below.''


The following strings have a valid Human Readable Part and Bech32 Checksum.
The following examples show values for various combinations on mainnet and testnet; encoding block height, transaction index, and an optional output index.
* <tt>TX1A12UEL5L</tt>
* <tt>tx1an83characterlonghumanreadablepartthatcontainsthenumber1andtheexcludedcharactersbio1tt5tgs</tt>
* <tt>tx1abcdef1qpzry9x8gf2tvdw0s3jn54khce6mua7lmqqqxw</tt>
* <tt>tx11qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqc8247j</tt>


The following list gives invalid TxRef's and the reason for their invalidity.
==== TxRef ====
* <tt>bc1qw508d6qejxtdg4y5r3zarvary0c5xw7kg3g4ty</tt>: Invalid human-readable part
The following list gives properly encoded mainnet '''TxRef'''s and the decoded hex values (block height, transaction index)
* <tt>tx1qw508d6qejxtdg4y5r3zarvary0c5xw7kv8f3t5</tt>: Invalid checksum


==== Bitcoin TxRef (mainnet and testnet) ====
* <tt>tx1:rqqq-qqqq-qwtv-vjr</tt>: <tt>(0x0, 0x0)</tt>
The following list gives properly encoded Bitcoin mainnet TxRef's and the values in hex. (block height, transaction index)
* <tt>tx1:rqqq-qqll-lj68-7n2</tt>: <tt>(0x0, 0x7FFF)</tt>
* <tt>tx1:r7ll-llqq-qats-vx9</tt>: <tt>(0xFFFFFF, 0x0)</tt>
* <tt>tx1:r7ll-llll-lp6m-78v</tt>: <tt>(0xFFFFFF, 0x7FFF)</tt>


* <tt>tx1:rqqq-qqqq-qmhu-qhp</tt>: <tt>(0x0, 0x0)</tt>
The following list gives properly encoded testnet '''TxRef'''s and the decoded hex values (block height, transaction index)
* <tt>tx1:rqqq-qqll-l8xh-jkg</tt>: <tt>(0x0, 0x7FFF)</tt>
* <tt>tx1:r7ll-llqq-qghq-qr8</tt>: <tt>(0xFFFFFF, 0x0)</tt>
* <tt>tx1:r7ll-llll-l5xt-jzw</tt>: <tt>(0xFFFFFF, 0x7FFF)</tt>


The following list gives properly encoded Bitcoin testnet TxRef's and the values in hex. (block height, transaction index)
* <tt>txtest1:xqqq-qqqq-qrrd-ksa</tt>: <tt>(0x0, 0x0)</tt>
* <tt>txtest1:xqqq-qqll-lljx-y35</tt>: <tt>(0x0, 0x7FFF)</tt>
* <tt>txtest1:x7ll-llqq-qsr3-kym</tt>: <tt>(0xFFFFFF, 0x0)</tt>
* <tt>txtest1:x7ll-llll-lvj6-y9j</tt>: <tt>(0xFFFFFF, 0x7FFF)</tt>


* <tt>txtest1:xqqq-qqqq-qkla-64l</tt>: <tt>(0x0, 0x0)</tt>
The following list gives valid (sometimes strangely formatted) '''TxRef'''s and the decoded values (block height, transaction index)*
* <tt>txtest1:xqqq-qqll-l2wk-g5k</tt>: <tt>(0x0, 0x7FFF)</tt>
* <tt>tx1:r29u-mqjx-putt-3p0</tt>: <tt>(456789, 1234)</tt>
* <tt>txtest1:x7ll-llqq-q9lp-6pe</tt>: <tt>(0xFFFFFF, 0x0)</tt>
* <tt>TX1R29UMQJXPUTT3P0</tt>: <tt>(456789, 1234)</tt>
* <tt>txtest1:x7ll-llll-lew2-gqs</tt>: <tt>(0xFFFFFF, 0x7FFF)</tt>
* <tt>tx1 r29u mqjx putt 3p0</tt>: <tt>(456789, 1234)</tt>
* <tt>tx1!r29u/mqj*x-putt^^3p0</tt>: <tt>(456789, 1234)</tt>


The following list gives valid (though strangely formatted) Bitcoin TxRef's and the values in hex. (block height, transaction index)
The following list gives invalid '''TxRef'''s and the reason for their invalidity.
* <tt>tx1:rjk0-uqay-zsrw-hqe</tt>: <tt>(0x71F69, 0x89D)</tt>
* <tt>tx1:t7ll-llll-lcq3-aj4</tt>: Magic 0xB instead of 0x3.
* <tt>TX1RJK0UQAYZSRWHQE</tt>: <tt>(0x71F69, 0x89D)</tt>
* <tt>tx1:rlll-llll-lu9m-00x</tt>: Version 1 instead of 0.
* <tt>TX1RJK0--UQaYZSRw----HQE</tt>: <tt>(0x71F69, 0x89D)</tt>
* <tt>tx1:r7ll-llll-lqfu-gss2</tt>: Valid Bech32, but ten 5 bit unsigned chars instead of nine.
* <tt>tx1 rjk0 uqay zsrw hqe</tt>: <tt>(0x71F69, 0x89D)</tt>
* <tt>tx1:r7ll-llll-rt5h-wz</tt>: Valid Bech32, but eight 5 bit unsigned chars instead of nine.
* <tt>tx1!rjk0\uqay*zsrw^^hqe</tt>: <tt>(0x71F69, 0x89D)</tt>
* <tt>tx1:r7ll-LLLL-lp6m-78v</tt>: Invalid Bech32 due to mixed case. Would decode correctly otherwise.


The following list gives invalid Bitcoin TxRef's and the reason for their invalidity.
==== TxRef with Outpoints ====
* <tt>tx1:t7ll-llll-ldup-3hh</tt>: Magic 0xB instead of 0x3.  <tt>(0xFFFFFF, 0x7FFF)</tt>
The following list gives properly encoded mainnet '''TxRef'''s with Outpoints and the decoded values (block height, transaction index, outpoint index)
* <tt>tx1:rlll-llll-lfet-r2y</tt>: Version 1 instead of 0. <tt>(0xFFFFFF, 0x7FFF)</tt>
* <tt>tx1:rjk0-u5ng-gghq-fkg7</tt>: Valid Bech32, but 10x5bit packages instead of 8.
* <tt>tx1:rjk0-u5qd-s43z</tt>: Valid Bech32, but 6x5bit packages instead of 8.


==== Bitcoin TxRef with Outpoints (mainnet and testnet) ====
* <tt>tx1:yqqq-qqqq-qqqq-rvum-0c</tt>: <tt>(0x0, 0x0, 0x0)</tt>
The following list gives properly encoded Bitcoin mainnet TxRef's with Outpoints and the values in hex. (block height, transaction index, TXO index)
* <tt>tx1:yqqq-qqll-lqqq-en8x-05</tt>: <tt>(0x0, 0x7FFF, 0x0)</tt>
* <tt>tx1:y7ll-llqq-qqqq-ggjg-w6</tt>: <tt>(0xFFFFFF, 0x0, 0x0)</tt>
* <tt>tx1:y7ll-llll-lqqq-jhf4-wk</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x0)</tt>


* <tt>tx1:yqqq-qqqq-qqqq-ksvh-26</tt>: <tt>(0x0, 0x0, 0x0)</tt>
* <tt>tx1:yqqq-qqqq-qpqq-pw4v-kq</tt>: <tt>(0x0, 0x0, 0x1)</tt>
* <tt>tx1:yqqq-qqll-lqqq-v0h2-2k</tt>: <tt>(0x0, 0x7FFF, 0x0)</tt>
* <tt>tx1:yqqq-qqll-lpqq-m3w3-kv</tt>: <tt>(0x0, 0x7FFF, 0x1)</tt>
* <tt>tx1:y7ll-llqq-qqqq-a5zy-tc</tt>: <tt>(0xFFFFFF, 0x0, 0x0)</tt>
* <tt>tx1:y7ll-llqq-qpqq-22ml-hz</tt>: <tt>(0xFFFFFF, 0x0, 0x1)</tt>
* <tt>tx1:y7ll-llll-lqqq-8tee-t5</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x0)</tt>
* <tt>tx1:y7ll-llll-lpqq-s4qz-hw</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x1)</tt>


* <tt>tx1:yqqq-qqqq-qpqq-5j9q-nz</tt>: <tt>(0x0, 0x0, 0x1)</tt>
* <tt>tx1:y29u-mqjx-ppqq-sfp2-tt</tt>: <tt>(456789, 1234, 1)</tt>
* <tt>tx1:yqqq-qqll-lpqq-wd7a-nw</tt>: <tt>(0x0, 0x7FFF, 0x1)</tt>
* <tt>tx1:y7ll-llqq-qpqq-lktn-jq</tt>: <tt>(0xFFFFFF, 0x0, 0x1)</tt>
* <tt>tx1:y7ll-llll-lpqq-9fsw-jv</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x1)</tt>


* <tt>tx1:yjk0-uqay-zrfq-g2cg-t8</tt>: <tt>(0x71F69, 0x89D, 0x123)</tt>
* <tt>tx1:yjk0-uqay-zu4x-nk6u-pc</tt>: <tt>(0x71F69, 0x89D, 0x1ABC)</tt>


The following list gives properly encoded Bitcoin testnet TxRef's with Outpoints and the values in hex. (block height, transaction index, TXO index)
The following list gives properly encoded testnet '''TxRef'''s with Outpoints and the decoded values (block height, transaction index, outpoint index)


* <tt>txtest1:8qqq-qqqq-qqqq-cgru-fa</tt>: <tt>(0x0, 0x0, 0x0)</tt>
* <tt>txtest1:8qqq-qqqq-qqqq-d5ns-vl</tt>: <tt>(0x0, 0x0, 0x0)</tt>
* <tt>txtest1:8qqq-qqll-lqqq-zhcp-f3</tt>: <tt>(0x0, 0x7FFF, 0x0)</tt>
* <tt>txtest1:8qqq-qqll-lqqq-htgd-vn</tt>: <tt>(0x0, 0x7FFF, 0x0)</tt>
* <tt>txtest1:87ll-llqq-qqqq-nvd0-gl</tt>: <tt>(0xFFFFFF, 0x0, 0x0)</tt>
* <tt>txtest1:87ll-llqq-qqqq-xsar-da</tt>: <tt>(0xFFFFFF, 0x0, 0x0)</tt>
* <tt>txtest1:87ll-llll-lqqq-fnkj-gn</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x0)</tt>
* <tt>txtest1:87ll-llll-lqqq-u0x7-d3</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x0)</tt>


* <tt>txtest1:8qqq-qqqq-qpqq-622t-s9</tt>: <tt>(0x0, 0x0, 0x1)</tt>
* <tt>txtest1:8qqq-qqqq-qpqq-0k68-48</tt>: <tt>(0x0, 0x0, 0x1)</tt>
* <tt>txtest1:8qqq-qqll-lpqq-q43k-sf</tt>: <tt>(0x0, 0x7FFF, 0x1)</tt>
* <tt>txtest1:8qqq-qqll-lpqq-4fp6-4t</tt>: <tt>(0x0, 0x7FFF, 0x1)</tt>
* <tt>txtest1:87ll-llqq-qpqq-3wyc-38</tt>: <tt>(0xFFFFFF, 0x0, 0x1)</tt>
* <tt>txtest1:87ll-llqq-qpqq-yj55-59</tt>: <tt>(0xFFFFFF, 0x0, 0x1)</tt>
* <tt>txtest1:87ll-llll-lpqq-t3l9-3t</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x1)</tt>
* <tt>txtest1:87ll-llll-lpqq-7d0f-5f</tt>: <tt>(0xFFFFFF, 0x7FFF, 0x1)</tt>


* <tt>txtest1:8jk0-uqay-zrfq-xjhr-gq</tt>: <tt>(0x71F69, 0x89D, 0x123)</tt>
* <tt>txtest1:829u-mqjx-ppqq-73wp-gv</tt>: <tt>(456789, 1234, 1)</tt>
* <tt>txtest1:8jk0-uqay-zu4x-aw4h-zl</tt>: <tt>(0x71F69, 0x89D, 0x1ABC)</tt>




=== Bitcoin TxRef Payload Value Choice: ===
=== TxRef Payload Value Choices: ===
Some calculations showing why we chose these particular bit-length of the block height and transaction index.
Some calculations showing why we chose these particular bit-length of the block height and transaction index.


==== Block Height Value: ====
==== Block Height Value: ====
24-bit: between 0, and 0xFFFFFF (16,777,216 blocks).
24 bits: value can be between 0, and 0xFFFFFF (16777216 blocks).


*There are ~52,500 blocks every year, leading to ~319 years of blocks addressable.
* In early April, 2021, there have been 677700 blocks
*Therefore before year 2328 this specification should be extended. (We think that we have plenty of time).
* There are roughly (365 days * 24 hours * 6 blocks / hour) = 52560 blocks every year, implying about (16777216 - 677700) / 52560 = 306 more years of addressable blocks.
* Some time before year 2327 this specification should be extended.


==== Tx Position Value: ====
==== Tx Position Value: ====
15-bit: between 0x0, and 0x7FFF. (32,768 transactions).
15 bits: value can be between 0x0, and 0x7FFF (32768 transactions).


*The ''realistic'' smallest Tx is 83 Bytes: Max 12047 tx in a block.
*The ''realistic'' smallest Tx is 83 Bytes for maximum 12047 tx in a block.
**4B version + 1B tx_in count + 36B previous_output + 1B script length + 0B signature script + 4B sequence + 1B tx_out count + 8B amount + 1B script length + 23B pubkey script + 4B lock_time = 83B
**4B version + 1B tx_in count + 36B previous_output + 1B script length + 0B signature script + 4B sequence + 1B tx_out count + 8B amount + 1B script length + 23B pubkey script + 4B lock_time = 83B
*The ''extreme'' smallest Tx is 60 Byte's: Max 16665 tx in a block.
*The ''extreme'' smallest Tx is 60 Bytes for maximum 16665 tx in a block.
**4B version + 1B tx_in count + 36B previous_output + 1B script length + 0B signature script + 4B sequence + 1B tx_out count + 8B amount + 1B script length + 0B pubkey script + 4B lock_time = 60B
**4B version + 1B tx_in count + 36B previous_output + 1B script length + 0B signature script + 4B sequence + 1B tx_out count + 8B amount + 1B script length + 0B pubkey script + 4B lock_time = 60B


== Acknowledgements ==
== Acknowledgements ==
Special Thanks to Pieter Wuille and Greg Maxwell for Bech32, a wonderful user-facing data encoding.
Special Thanks to Pieter Wuille and Greg Maxwell for Bech32, a wonderful user-facing data encoding.

Latest revision as of 15:52, 15 December 2021

This page describes a BIP (Bitcoin Improvement Proposal).
Please see BIP 2 for more information about BIPs and creating them. Please do not just create a wiki page.

Please do not modify this page. This is a mirror of the BIP from the source Git repository here.

  BIP: 136
  Layer: Applications
  Title: Bech32 Encoded Tx Position References
  Author: Велеслав <veleslav.bips@protonmail.com>
          Jonas Schnelli <dev@jonasschnelli.ch>
          Daniel Pape <dpape@dpape.com>
  Comments-Summary: No comments yet.
  Comments-URI: https://github.com/bitcoin/bips/wiki/Comments:BIP-0136
  Status: Draft
  Type: Informational
  Created: 2017-07-09
  License: BSD-2-Clause

Introduction

Abstract

This document proposes a convenient, human usable encoding to refer to a confirmed transaction position within the Bitcoin blockchain--known as "TxRef". The primary purpose of this encoding is to allow users to refer to a confirmed transaction (and optionally, a particular outpoint index within the transaction) in a standard, reliable, and concise way.

Please note: Unlike a transaction ID, "TxID", where there is a strong cryptographic link between the ID and the actual transaction, a TxRef only provides a weak link to a particular transaction. A TxRef locates an offset within a blockchain for a transaction, that may - or may not - point to an actual transaction, which in fact may change with reorganisations. We recommend that TxRefs should be not used for positions within the blockchain having a maturity less than 100 blocks.

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.

Copyright

This BIP is licensed under the 2-clause BSD license.

Motivation

Since the first version of Bitcoin, TxIDs have been a core part of the consensus protocol and are routinely used to identify individual transactions between users.

However, for many use-cases they have practical limitations:

  • TxIDs are expensive for full nodes to lookup (requiring either a linear scan of the blockchain, or an expensive TxID index).
  • TxIDs require third-party services for SPV wallets to lookup.
  • TxIDs are 64 character HEX encoded values.

It is possible to reference transactions not only by their TxID, but by their location within the blockchain itself. Rather than use the 64 character TxID, an encoding of the position coordinates can be made friendly for occasional human transcription. In this document, we propose a standard for doing this.

Examples

Block # Transaction # Outpoint # TxRef TxID
0 0 0 tx1:rqqq‑qqqq‑qwtv‑vjr 4a5e1e4baab89f3a32518a88c31bc87f618f76673e2cc77ab2127b7afdeda33b
170 1 0 tx1:r52q‑qqpq‑qpty‑cfg f4184fc596403b9d638783cf57adfe4c75c605f6356fbc91338530e9831e9e16
456789 1234 1 tx1:y29u‑mqjx‑ppqq‑sfp2‑tt 6fb8960f70667dc9666329728a19917937896fc476dfc54a3e802e887ecb4e82

Specification

A confirmed transaction position reference, or TxRef, is a reference to a particular location within the blockchain, specified by the block height and a transaction index within the block, and optionally, an outpoint index within the transaction.

Please Note: All values in this specification are encoded in little-endian format.

TxRef Considerations

It is possible for a TxRef to reference a transaction that doesn't really exist because:

  • The specified block hasn't yet been mined.
  • The transaction index is greater than the total number of transactions included within the specified block.
  • The optional outpoint index is greater than the total outpoints contained within the transaction.

Therefore, implementers must be careful not to display TxRefs to users prematurely:

  • Applications MUST NOT display TxRefs for transactions with less than 6 confirmations.
  • Application MUST show a warning for TxRefs for transactions with less than 100 confirmations.
    • This warning SHOULD state that in the case of a large reorganisation, the TxRefs displayed may point to a different transaction, or to no transaction at all.

TxRef Format

TxRef MUST use the Bech32m[1] encoding as defined in BIP-0173 and later refined in BIP-0350. The Bech32m encoding consists of:

Human-Readable Part

The HRP can be thought of as a label. We have chosen labels to distinguish between Main, Test, and Regtest networks:

  • Mainnet: "tx".
  • Testnet: "txtest".
  • Regtest: "txrt".

Separator

The separator is the character "1".

Data Part

The data part for a TxRef consists of the transaction's block height, transaction index within the block, and optionally, an outpoint index. Specific encoding details for the data are given below.

Please note: other specifications, such as the Decentralized Identifiers spec, have implicitly encoded the information contained within the HRP elsewhere. In this case they may choose to not include the HRP as specified here.

Readability

To increase portability and readability, additional separator characters SHOULD be added to the TxRef:

  • A Colon[2] ":" added after the separator character '1'.
  • Hyphens[3] "-" added after every 4 characters beyond the colon.

Encoding

Encoding a TxRef requires 4 or 5 pieces of data: a magic code denoting which network is being used; a version number (currently always 0); the block height of the block containing the transaction; the index of the transaction within the block; and optionally, the index of the outpoint within the transaction. Only a certain number of bits are supported for each of these values, see the following table for details.

Description Possible Data Type # of Bits used Values
Magic Code Chain Namespacing Code uint8 5 3: Mainnet
4: Mainnet with Outpoint
6: Testnet
7: Testnet with Outpoint
0: Regtest
1: Regtest with Outpoint
Version For Future Use uint8 1 Must be 0
Block
Height
The Block Height of the Tx uint32 24 Block 0 to Block 16777215
Transaction
Index
The index of the Tx inside the block uint16, uint32 15 Tx 0 to Tx 32767
Outpoint
Index
The index of the Outpoint inside the Tx uint16, uint32 15 Outpoint 0 to Outpoint 32767

Magic Notes

The magic code provides namespacing between chains:

  • For Mainnet the magic code is: 0x3, leading to an "r" character when encoded.
  • For Mainnet with Outpoint Encoded the magic code is: 0x4, leading to a "y" character when encoded.
  • For Testnet the magic code is: 0x6, leading to an "x" character when encoded.
  • For Testnet with Outpoint Encoded the magic code is: 0x7, leading to an "8" character when encoded.
  • For Regtest the magic code is: 0x0, leading to a "q" character when encoded.
  • For Regtest with Outpoint Encoded the magic code is: 0x1, leading to a "p" character when encoded.

Encoding Example

We want to encode a TxRef that refers to Transaction #1234 of Block #456789 on the Mainnet chain. We use this data in preparation for the Bech32 encoding algorithm:

Decimal
Value
Binary
Value
# of Bits
used
Bit Indexes and Values
Magic
Code
3 00000011 5 (mc04, mc03, mc02, mc01, mc00) = (0, 0, 0, 1, 1)
Version 0 00000000 1 (v0) = (0)
Block
Height
456789 00000110
11111000
01010101
24 (bh23, bh22, bh21, bh20, bh19, bh18, bh17, bh16) = (0, 0, 0, 0, 0, 1, 1, 0)
(bh15, bh14, bh13, bh12, bh11, bh10, bh09, bh08) = (1, 1, 1, 1, 1, 0, 0, 0)
(bh07, bh06, bh05, bh04, bh03, bh02, bh01, bh00) = (0, 1, 0, 1, 0, 1, 0, 1)
Transaction
Index
1234 00000100
11010010
15 (ti14, ti13, ti12, ti11, ti10, ti09, ti08) = (0, 0, 0, 0, 1, 0, 0)
(ti07, ti06, ti05, ti04, ti03, ti02, ti01, ti00) = (1, 1, 0, 1, 0, 0, 1, 0)

As shown in the last column, we take the necessary bits of each binary value and copy them into nine unsigned chars illustrated in the next table. We only set the lower five bits of each unsigned char as the bech32 algorithm only uses those bits.

7 6 5 4 3 2 1 0 Decimal
Value
Bech32
Character
data[0] Index na na na mc04 mc03 mc02 mc01 mc00
Value 0 0 0 0 0 0 1 1 3 r
data[1] Index na na na bh03 bh02 bh01 bh00 v0
Value 0 0 0 0 1 0 1 0 10 2
data[2] Index na na na bh08 bh07 bh06 bh05 bh04
Value 0 0 0 0 0 1 0 1 5 9
data[3] Index na na na bh13 bh12 bh11 bh10 bh09
Value 0 0 0 1 1 1 0 0 28 u
data[4] Index na na na bh18 bh17 bh16 bh15 bh14
Value 0 0 0 1 1 0 1 1 27 m
data[5] Index na na na bh23 bh22 bh21 bh20 bh19
Value 0 0 0 0 0 0 0 0 0 q
data[6] Index na na na ti04 ti03 ti02 ti01 ti00
Value 0 0 0 1 0 0 1 0 18 j
data[7] Index na na na ti09 ti08 ti07 ti06 ti05
Value 0 0 0 0 0 1 1 0 6 x
data[8] Index na na na ti14 ti13 ti12 ti11 ti10
Value 0 0 0 0 0 0 0 1 1 p

The Bech32 algorithm encodes the nine unsigned chars above and computes a checksum of those chars and encodes that as well--this gives a six character checksum (in this case, utt3p0) which is appended to the final TxRef. The final TxRef given is: tx1:r29u-mqjx-putt-3p0 and is illustrated in the following table:

TxRef character indexes and descriptions

Index 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21
Char: t x 1 : r 2 9 u - m q j x - p u t t - 3 p 0

Outpoint Index

Some uses of TxRef may want to refer to a specific outpoint of the transaction. In the previous example, since we did not specify the outpoint index, the TxRef tx1:r29u-mqjx-putt-3p0 implicitly references the first (index 0) outpoint of the 1234th transaction in the 456789th block in the blockchain.

If instead, for example, we want to reference the second (index 1) outpoint, we need to change the magic code from 3 to 4 and would include the following in the data to be encoded:

Decimal
Value
Binary
Value
# of Bits
used
Bit Indexes and Values
Magic
Code
4 00000100 5 (mc04, mc03, mc02, mc01, mc00) = (0, 0, 1, 0, 0)
Outpoint Index 1 00000000 00000001 15 (op14, op13, op12, op11, op10, op09, op08) = (0, 0, 0, 0, 0, 0, 0)
(op07, op06, op05, op04, op03, op02, op01, op00) = (0, 0, 0, 0, 0, 0, 0, 1)
7 6 5 4 3 2 1 0 Decimal
Value
Bech32
Character
data[0] Index na na na mc04 mc03 mc02 mc01 mc00
Value 0 0 0 0 0 1 0 0 4 y
data[9] Index na na na op04 op03 op02 op01 op00
Value 0 0 0 0 0 0 0 1 1 p
data[10] Index na na na op09 op08 op07 op06 op05
Value 0 0 0 0 0 0 0 0 0 q
data[11] Index na na na op14 op13 op12 op11 op10
Value 0 0 0 0 0 0 0 0 0 q

After Bech32 encoding all twelve unsigned chars above, we get the checksum: sfp2tt. The final TxRef given is: tx1:y29u-mqjx-ppqq-sfp2-tt and is illustrated in the following table:

TxRef character indexes and descriptions

Index 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
Char: t x 1 : y 2 9 u - m q j x - p p q q - s f p 2 - t t


Decoding

The Bech32 spec defines 32 valid characters as its "alphabet". All non-Bech32-alphabet characters present in a TxRef after the Bech32 separator character MUST be ignored/removed when parsing (except for terminating characters). We do not wish to expect the users to keep their TxRefs in good form and TxRefs may contains hyphens, colons, invisible spaces, uppercase or random characters. We expect users to copy, paste, write by-hand, write in a mix of character sets, etc. Parsers SHOULD attempt to correct for these and other common errors, reporting to the user any TxRefs that violate a proper Bech32 encoding.

As of early 2021, TxRef has been in limited use for a couple of years and it is possible that there are some TxRefs in use which were created with the original specification of Bech32 before the Bech32m refinement was codified. Due to this possibility, a TxRef parser SHOULD be able to decode both Bech32m and Bech32 encoded TxRefs. In such a case, a TxRef parser SHOULD display or somehow notify the user that they are using an obsolete TxRef and that they should upgrade it to the Bech32m version. Additionally, the parser MAY also display the Bech32m version.

Rationale

  1. Why use Bech32 Encoding for Confirmed Transaction References? The error detection and correction properties of this encoding format make it very attractive. We expect that it will be reasonable for software to correct a maximum of two characters; however, we haven’t specified this yet.
  2. Why add a colon here? This allows it to conform better with W3C URN/URL standards.
  3. Why hyphens within the TxRef? As TxRefs are short, we expect that they will be quoted via voice or written by hand. The inclusion of hyphens every 4 characters breaks up the string and means people don't lose their place so easily.

Reference implementations

C Reference Implementation (supports magic codes 0x3 and 0x6): https://github.com/jonasschnelli/bitcoin_txref_code

Go Reference Implementation (supports magic codes 0x3 and 0x6): https://github.com/kulpreet/txref

C++ Reference Implementation (supports magic codes 0x3, 0x4, 0x6, 0x7, 0x0 and 0x1): https://github.com/dcdpr/libtxref/

Java Reference Implementation (supports magic codes 0x3, 0x4, 0x6, 0x7, 0x0 and 0x1): https://github.com/dcdpr/libtxref-java/

Appendices

Test Examples

The following examples show values for various combinations on mainnet and testnet; encoding block height, transaction index, and an optional output index.

TxRef

The following list gives properly encoded mainnet TxRefs and the decoded hex values (block height, transaction index)

  • tx1:rqqq-qqqq-qwtv-vjr: (0x0, 0x0)
  • tx1:rqqq-qqll-lj68-7n2: (0x0, 0x7FFF)
  • tx1:r7ll-llqq-qats-vx9: (0xFFFFFF, 0x0)
  • tx1:r7ll-llll-lp6m-78v: (0xFFFFFF, 0x7FFF)

The following list gives properly encoded testnet TxRefs and the decoded hex values (block height, transaction index)

  • txtest1:xqqq-qqqq-qrrd-ksa: (0x0, 0x0)
  • txtest1:xqqq-qqll-lljx-y35: (0x0, 0x7FFF)
  • txtest1:x7ll-llqq-qsr3-kym: (0xFFFFFF, 0x0)
  • txtest1:x7ll-llll-lvj6-y9j: (0xFFFFFF, 0x7FFF)

The following list gives valid (sometimes strangely formatted) TxRefs and the decoded values (block height, transaction index)*

  • tx1:r29u-mqjx-putt-3p0: (456789, 1234)
  • TX1R29UMQJXPUTT3P0: (456789, 1234)
  • tx1 r29u mqjx putt 3p0: (456789, 1234)
  • tx1!r29u/mqj*x-putt^^3p0: (456789, 1234)

The following list gives invalid TxRefs and the reason for their invalidity.

  • tx1:t7ll-llll-lcq3-aj4: Magic 0xB instead of 0x3.
  • tx1:rlll-llll-lu9m-00x: Version 1 instead of 0.
  • tx1:r7ll-llll-lqfu-gss2: Valid Bech32, but ten 5 bit unsigned chars instead of nine.
  • tx1:r7ll-llll-rt5h-wz: Valid Bech32, but eight 5 bit unsigned chars instead of nine.
  • tx1:r7ll-LLLL-lp6m-78v: Invalid Bech32 due to mixed case. Would decode correctly otherwise.

TxRef with Outpoints

The following list gives properly encoded mainnet TxRefs with Outpoints and the decoded values (block height, transaction index, outpoint index)

  • tx1:yqqq-qqqq-qqqq-rvum-0c: (0x0, 0x0, 0x0)
  • tx1:yqqq-qqll-lqqq-en8x-05: (0x0, 0x7FFF, 0x0)
  • tx1:y7ll-llqq-qqqq-ggjg-w6: (0xFFFFFF, 0x0, 0x0)
  • tx1:y7ll-llll-lqqq-jhf4-wk: (0xFFFFFF, 0x7FFF, 0x0)
  • tx1:yqqq-qqqq-qpqq-pw4v-kq: (0x0, 0x0, 0x1)
  • tx1:yqqq-qqll-lpqq-m3w3-kv: (0x0, 0x7FFF, 0x1)
  • tx1:y7ll-llqq-qpqq-22ml-hz: (0xFFFFFF, 0x0, 0x1)
  • tx1:y7ll-llll-lpqq-s4qz-hw: (0xFFFFFF, 0x7FFF, 0x1)
  • tx1:y29u-mqjx-ppqq-sfp2-tt: (456789, 1234, 1)


The following list gives properly encoded testnet TxRefs with Outpoints and the decoded values (block height, transaction index, outpoint index)

  • txtest1:8qqq-qqqq-qqqq-d5ns-vl: (0x0, 0x0, 0x0)
  • txtest1:8qqq-qqll-lqqq-htgd-vn: (0x0, 0x7FFF, 0x0)
  • txtest1:87ll-llqq-qqqq-xsar-da: (0xFFFFFF, 0x0, 0x0)
  • txtest1:87ll-llll-lqqq-u0x7-d3: (0xFFFFFF, 0x7FFF, 0x0)
  • txtest1:8qqq-qqqq-qpqq-0k68-48: (0x0, 0x0, 0x1)
  • txtest1:8qqq-qqll-lpqq-4fp6-4t: (0x0, 0x7FFF, 0x1)
  • txtest1:87ll-llqq-qpqq-yj55-59: (0xFFFFFF, 0x0, 0x1)
  • txtest1:87ll-llll-lpqq-7d0f-5f: (0xFFFFFF, 0x7FFF, 0x1)
  • txtest1:829u-mqjx-ppqq-73wp-gv: (456789, 1234, 1)


TxRef Payload Value Choices:

Some calculations showing why we chose these particular bit-length of the block height and transaction index.

Block Height Value:

24 bits: value can be between 0, and 0xFFFFFF (16777216 blocks).

  • In early April, 2021, there have been 677700 blocks
  • There are roughly (365 days * 24 hours * 6 blocks / hour) = 52560 blocks every year, implying about (16777216 - 677700) / 52560 = 306 more years of addressable blocks.
  • Some time before year 2327 this specification should be extended.

Tx Position Value:

15 bits: value can be between 0x0, and 0x7FFF (32768 transactions).

  • The realistic smallest Tx is 83 Bytes for maximum 12047 tx in a block.
    • 4B version + 1B tx_in count + 36B previous_output + 1B script length + 0B signature script + 4B sequence + 1B tx_out count + 8B amount + 1B script length + 23B pubkey script + 4B lock_time = 83B
  • The extreme smallest Tx is 60 Bytes for maximum 16665 tx in a block.
    • 4B version + 1B tx_in count + 36B previous_output + 1B script length + 0B signature script + 4B sequence + 1B tx_out count + 8B amount + 1B script length + 0B pubkey script + 4B lock_time = 60B

Acknowledgements

Special Thanks to Pieter Wuille and Greg Maxwell for Bech32, a wonderful user-facing data encoding.