More Jabber about the Collatz Conjecture and a Closed Form for Detecting Cycles on Special Subsequences [Assertion: Collatz cycles]

More Jabber about the Collatz Conjecture and a Closed form for Detecting Cycles on Special Subsequences. Thomas W . L ynch 201 1 06 02 Copyright 201 1 © b y Thomas W . L y nch, All Rights Reserved Introduction In [ref 1] apparently published in 2006 professor Cadogan presents an interesting function that maps natural numbers into a two dimensional array based on how many ones the number trails off with. T he length of trailing one's is said to be the row index. The rest of the number , divided by 2, is the column index. He then shows that applying the Collatz function reduces the ending string of ones by one causing it to drop down a row . At the bottom of the table are odd numbers ending in 'xxx..01' - a tail length of one. He reports that a restart function and continued iteration then causes a to monotonic reduction in column indexes, and the existence of this algorithm proves the conjecture. This paper provides a bit more formal framework as a basis for further discussion. Herein I closely examine the functions in involved, provide some nomenclature, and examine the general concept of column reduction. In contrast to prior results, this paper purports that no algorith m which makes use of only row index reduction termination points can display the trait of monotonic column reduction. However , the concept of a row reduction does more than just lead to a colu m n termination point, it also identifies strands in the more general Collatz sequences that have a special property , that of the existence of a unique inverse function. This makes analysis on these strands simpler than for the sequence in general. By using this property I derive a closed form formula which w hen evaluated would identify cycles in such strands. However , it involves factorization and mixed integer solutions of exponential functions. C y cles on the strands, if the y exi st, appear to only exist for very large numbers. 1 Collatz's Conjecture Unless otherwise stated all variables used in this paper are non-negative integers i.e. Natural numbers. I use N for the set of natural numbers, O for the set of odd numbers, E for events, and Q for rationals. A superscript on a function name represents repeated applications of the function. I don't use powers of functions in this paper . Let us start with the Collatz function: def 1.1; Collatz Function : γ ( n ) = ( 3 n + 1 ) 2 − k ∧ pick k ∣ γ ( n ) ∈ O 1/24 T o calculate the result of this function from the given input n we multiply n by 3, add 1, and then divide the result repeatedly b y 2 until the result is an odd nu m b er . The number of times we divide by two is in fact another calculated result, denoted as k . def 1.1.1 W e sa y k = ∥γ ( n ) ∥ It follows from the definition that γ ( n ) can not produce an even number . In addition it can not produce a multiple of 3. This can be shown by the fact there are exceptions to possible inputs to the inverse function, see lemma 1.1 below . W e may now simply state the Collatz conjecture: def 1.2; Collatz Conjectur e : ∃ m : γ m ( n ) = 1 The Collatz conjecture supposes that there alwa y s exists a finite number of applications of the Collatz function that will reduce an y given positive integer to 1. As an example, the Collatz conjecture holds for the value 0 with m=1. As m is one, the Collatz function is applied only once, and the parameter k for that application will be zero. The inverse of the Collatz function maps odd numbers back to natural numbers. This function has one explicit input, say n ∈ O , and an implicit input k . Because the re sul t is a natural number , k must make n 2 k − 1 divisible by 3. Hence, lemma 1.1; Collatz inverse : γ − 1 ( k , n )=( n 2 k − 1 ) 3 − 1 But suppose we had an odd number , say x , for which no k existed that would produce a result γ − 1 ∈ N , that would imply ¬ ∃ k x 2 k − 1 = 0 mod 3 W e can break the proof down into three test cases as x mod 3 is either 0,1, or 2: case x = 0 mod 3 x 2 k − 1 = 2 mod 3 as x mod 3 is zero, we are left with -1 case x = 1 mod 3 x 2 k − 1 = 2 k − 1 mod 3 holds for k even, as 2 2n = 1 mod 3 case x = 2 mod 3 x 2 k − 1 = 2 k + 1 − 1 m od 3 holds for k odd, as 2 2n = 1 mod 3 W e discover that for the case of x = 0 mod 3 k drops out and we are left with a constant of 2. No value of k can cause this to be a multiple of 3, so : lemma 1.2; γ ( n ) can not produce numbers that are multiples of 3. 2 The Exception Set Lets consider a set of non-negative integers that are exceptions to the Collatz conjecture. Set of exceptions to the Collatz Conjecture: E 0 = { n ∣ n ∈ N ∧ ¬ ∃ m : γ m ( n ) = 1 } This says that for each exception, n, no matter how man y times we apply the Collatz function to n , we will never converge to the number 1. This set will be ∅ if and only if the Collatz conjecture is true. 2/24 W e know we don't need to test the number 0 for inclusion in this set because the Conjecture holds in that case. Also we don't need to include the number 1, as that is the definition of convergence. So we can make our definition more expressive without danger of having removed an exception from the set : def 2.1; E 0 = { n ∣ n ∈ N ∧ n > 1 ∧ ¬ ∃ m : γ m ( n ) = 1 } lemma 2.1; No Need to T est Even Numbers as Exceptions By definition the result of applying the Collatz function is odd. Thus if there is an even number , n ,that is an exception, then the odd number γ ( n ) must also be an exception. Consequently the set will not become empty if we take out the even nu m bers. E 1 ={ n ∣ n ∈ O ∧ n > 1 ∧ ¬∃ m : γ m ( n )= 1 } W e may use an analogous argum e nt to show we do not need to test multiples of three. E 2 ={ n ∣ n ∈ O ∧ n mod 3 ≠ 0 ∧ n > 1 ∧ ¬ ∃ m : γ m ( n ) = 1 } 3. Professor Cadogan's Function [ref 1] defines a number as a function of two parameters: def 3.1; ∀ i ≥ 1, j : c i , j − 1 = j 2 i + 1 + 2 i − 1 in [ref 1]: t i , j = j 2 i + 1 + 2 i − 1 lemma 3.1 c ( n ) = ( i , j ) c(n) maps the set of odd numbers, O, to N × O : O c → N × O . W e only consider odd valued c i , j − 1 this implies that i will always be 1 or greater . The next section shows that c ( n ) exists and can be uniquely constructed for an y odd natural number . Constructing c(n) from Any Odd Natural Number The parameter i will be determined while evaluating the tail function. The parameter j will be determined while evaluating the head function as described in this section. 3/24 Construction of the tail(n) and determination of i Given an odd integer , n, consider that number to be represented in binary notation. Now look at the least significant (right most) bit of the number . It is a one. T his has to be the case as we are looking at an odd number . Consider this 1 to be the right most bit of the tail . Then scan left bit by bit until a zero is discovered. All bits to the right of the zero are part of the tail . The parameter i is the number of ones placed in the tail. As we have an odd number the smallest value for i will be one. It follows that the tail will have a numerical value of 2 i − 1 . def 3.3: for any odd n pick the largest i such that n − ( 2 i − 1 )= 2 i ⌊ n 2 − i ⌋ then 2 i − 1 is the tail ( n ) and i is said to be the length of the tail, also i = ∥ tail ( n ) ∥ Again the subscript on the function name is used to indicate an implicit result, here it is cal led i . Often the subscript will be dropped. lemma 3.3; tail ( n ) will always be an odd natural number less than or equal to n . The operations used to create n are closed on the integer field for non-negat ive i . n itself is a positive odd natural number . As such, ⌊ n 2 − i ⌋ will be positive or zero. As n is an odd natural number , at least 1 can alwa y s be subtracted fro m it wit hout the result becoming negative, so i is an integer greater than or equal to one. As powers of two are even, 2 i − 1 is odd. It is a consequence of the definition that subtracting tail fro m n will leave a zero or positive value, therefore tail is less than or equal to n . Construction of the head(n) and determi nat i on of j In the definition of the tail we referred to the floor ⌊ n 2 − i ⌋ this number will be even. This is a consequence of the fact that the tail function stripped of f all the bottom ones. If the left over after this operation were odd, there would be more to ones to strip off. Hence we define the head as: def 3.4 head ( n ) = ⌊ n 2 − ( i + 1 ) ⌋ This value could be any natural number , including zero. It follows that: lemma 3.4 n = head ( n ) 2 i + 1 + ta i l ( n )= c − 1 ( c ( n ) ) substitute def 3.3 and 3.4 into def 3.1 def 3.5 when c ( n ) → ( i , j ) we say that j = head ( n ) lemma 3.5; This section has demonstrated a construc tive procedure for converting any odd natural number into (i,j) , so c(n) is defined for all odd natural nu m bers. A T able of n Pla c ed at Coordinates c(n) For each number , n , in this table, i is the row index and j is the column index (from [ref 1]) . Note that the value 1 appears at location (1,0). j=0 1 2 3 4 5 i=1 1 5 9 13 17 21 2 3 1 1 19 27 35 43 4/24 3 7 23 39 55 71 87 4 15 47 79 1 1 1 143 175 5 31 95 159 223 287 351 table 3.1 The value 1 uniquely appears at location (1,0) c − 1 ( i , j ) = j 2 i + 1 + 2 i − 1 def 3.1 As already noted, the minimum i for an odd n is 1. The parameter i onl y occurs in positive terms of the equation shown in def 3.1 , indeed onl y as a power of 2. Henc e, c − 1 ( i , j ) increases strictly monotonically with increasing i. Likewise j only appears in a positive term as a coef fi cient of a non-zero value. Hence c − 1 ( i , j ) increases strictly monotonically with j . When i is 2, and j is 0, c − 1 ( i , j ) is already larger than 1. When j is 1 and i is at its minimum value of 1, c − 1 ( i , j ) is already larger than 1. It follows that c − 1 ( i , j ) = 1 can only occur when j=0 and i=1 . lemma 3.6; The number 1 only occurs at location (1,0) in the table. Relationships between tail and k Multiplying a string of ones by three is the same as left shifting one number b y 1, and then adding it back into itself. However , we can take the number to be added and u se it to fill in the zero caused by the left shift. Hence, we will end up with a string of ones added to a string of ones. For the bottom of the number this is the same as multiplying b y 2. It follows that k will have to be 1 in order to normalize the result. Though this section appears towards the front it was added last, so I hope you will forgive the ASCII graphics, here are some examples: (2 + 1) tail + 1 = (2 tail + 1) + tail 01 1 1 10 h01 1 1 1 --------- b01 101 + 1 --------- b01 1 10 --> k=1 h01 1 01 1 ------ b01 1 + 1 ------ b1 10 --> k=1 5/24 hh01 h01 ------ 0h1 1 + 1 ?b00 ---> woops, for a tail length of 1, k is related to head lemma 3.7; For numbers in rows greater than 1, k = 1 . ∀ i > 1 : ∥γ ( c − 1 ( i , j ) ) ∥ = 1 Corollary 3.7.1 n = c − 1 ( i , j ) ∧ ∥ tail ( n ) ∥ > 1 → ∥γ ( n ) ∥ = 1 Corollary 3.7.2 c ( n ) i > 1 →∥γ ( n ) ∥ = 1 Here the subscript i on c(n) means we want the i component of the vector result, (i,j). lemma 3.7.3; Due to lemma 3.7 we can define a one operand inverse Colla tz function The value for k that would take one back to a prior value, say n 0 , from a current value, say n 1 , is identical to the value k that calculate by γ when going from from n 0 to n 1 in the first place. This is because the inverse was derived algebraically for lemma 1.1, so the k in both functions is in fact the same number . k = ∥γ ( n0 ) ∥ ; by def 1.1.1 n 0 = γ − 1 ( ∥γ ( n 0 ) ∥ , n 1 ) Hence, when the inverse is to go into the table, other than to row 1: γ − 1 ( n )=( 2 n − 1 ) 3 − 1 Inverse paths are unique for c(n) with row index greater than 1. There are the other inverse paths, indeed there are k of them and k is unbounded. These other paths must be in row 1. This inverse is still only defined over natural numbers when n mod 3 = 2 It is defined for row 1 numbers, but will take them back up to higher row numbers rather than following an y path back into row 1. 4 Row Index Reduction in c ( n ) against γ lemma 4.1; Applying the Collatz function, γ , to a number on row i 0 for all i 0 > 1 , results in a number on row i 0 − 1 [ref 1]. W e can derive this result algebraically in a similar manner as done in [ref 1] while using our head and tail definitions as a device: 6/24 γ ( head ( n ) 2 i0 + 1 + tail ( n ) ) inserting eq 3.5 term 4.1 ( 3 ( head ( n ) 2 i0 + 1 ) + 3 tail ( n )+ 1 ) 2 − k ; chose k so result is an odd integer definition γ Now consider what happens when the tail is multiplied b y 3 and 1 is added: 3 tail ( n )+ 1 taken from term 4.1 2 tail ( n )+ tail ( n ) + 1 3 = (2 + 1) 2 ( 2 i0 − 1 )+( 2 i0 − 1 )+ 1 apply def tail 2 i0 + 1 − 2 + 2 i0 collect terms 2 i0 + 1 + 2 ( 2 i0 − 1 − 1 ) separate out form of a tail term 4.2 2 i0 + 1 + 2 ( 2 i1 − 1 ) say that i 1 = i 0 − 1 Substituting this back into term 1 : ( 3 head ( n ) 2 i0 + 1 + 3 ( 2 i1 − 1 ) + 1 ) 2 − k term 4.1 again ( 3 head ( n ) 2 i0 + 1 + [ 2 i0 + 1 + 2 ( 2 i1 − 1 ) ]) 2 − k substitute in term 4.2 ( [ 3 head ( n ) 2 i0 + 1 + 2 i0 + 1 ] + 2 ( 2 i1 − 1 )) 2 − k regrouping ( [ 3 head ( n ) 2 i0 + 2 i0 ] + ( 2 i1 − 1 )) 2 − k + 1 adjust the scale b y one ( [ 3 head ( n ) 2 i1 + 1 + 2 i1 + 1 ] + ( 2 i1 − 1 )) 2 − k − 1 apply def i 1 = i 0 − 1 ( [ 3 head ( n )+ 1 ] 2 i1 + 1 + ( 2 i1 − 1 ) ) 2 − k − 1 pull 2 i1 + 1 out of [] term 4.3 head ( γ ( n )) 2 i1 + 1 + tail ( γ ( n ) ) 2 − k − 1 i 1 = i 0 − 1 head ( γ ( n )) = 3 head ( n )+ 1 tail ( γ ( n ) ) = 2 i1 − 1 lemma 4.2; Recall that j is identical to head(n) . So head ( γ ( n ))= 3 head ( n )+ 1 means the same thing as j 1 = 3 j 0 + 1 . Which leads to: c ( n ) = ( i , j ) → c ( γ ( n )) = ( i − 1, 3 j + 1 ) lemma 4.3; ( i , j ) γ → ( i − 1, 3 j + 1 ) restatement of prior line, from [ref 1] The nifty thing about this result is that it shows we can calculate changes in i and j against applications of γ without having to explicitly invoke Cadogan's function. 7/24 Behavior of Row Index i under Repeated A pplications of γ lemma 4.4; It follows from lemma 4.1 that: ∀ ( i 0 ≥ 1 )∧( m ≤ i 0 − 1 ) : i 0 γ m → i 0 − m Hence there always exists a Collatz Sequence leading to c ( n )=( 1, j ) . [ref 1]. As for the first term in the quantification, by definition all row indexes, i, are greater than or equal to one. Since a row index can not be less than one, m cannot be greater than i 0 − 1 ; Behavior of Column Index j under Re pe a ted Applications of γ Lemma 4.1 establishes an increase in parameter j against the application of the Collatz function. W e will call this the function g . def 4.1 g ( j ) = 3 j + 1 term from lemma 4.2 I find it interesting that the column expansion function g has nearly the same form as the Collatz function γ . The only dif ference is that γ normalizes with a right shift, 2 − k , where as g does not. lemma 4.5 g ( j ) ≥ γ ( j ) Note here in lemma 4.5, we are doing an unusual thing and appl y ing ga mm a to a column in index rather than a number from the table. If the number had been from the table, other than in row 1. we would know k to be 1 due to lemma 3.7 In which case we would have: corollary 4.5.1 i > 1 ∧ n = c − 1 ( i , j ) → g ( n )= 2 γ ( n ) This time we have done the unusual thing of appl y ing g to a number from the table, as usually it is applied to a column index, but a function doesn't care where its operands came fro m . By algebraically manipulating the definition of g ( j ) to form an inverse we get: lemma 4.6 g − 1 ( j )=( j − 1 ) 3 − 1 derived algebraically from def 4.1 Over the domain of natural numbers g onl y has an inve rse for certain values of j which are 1 larger than a multiple of 3. def 4.2; we extend the definitions of g − 1 and g to operate over the rational field. g is closed over natural numbers, but it may or may not produce a natural number given a rational input. W e can sa y this: lemma 4.7; ∀ j , m , n ∈ N ∧ m ≥ n : g n ( g − m ( j ) ) ∈ N When n is equal to m we get back j . There may have been some rational intermediate values, but those get undone again. When we continue after getting back to the starting point by making n larger then m we are working with j , a natural number , and g is closed over natural numbers, so the results from all further applications of g will be in N . 8/24 The Behavior of k relative to g − 1 In lemma 4.3 the result of applying γ to a number n 0 , located at the point c ( n 0 ) , caused the j 0 component of c ( n 0 ) to go to 3 j 0 + 1 = j 1 . W e called this column index transformation function g. In other words: g ( c ( n 0 ) j ) = c ( γ ( n 0 ) ) j Note the subscript j on c ( n ) j means we want the j component from the vector result (i,j) . Then we derived an inverse function that would take us back again. g − 1 ( j 1 ) = j 0 The interesting part about this inverse is that it accepts onl y one para m eter while γ − 1 requires two parameters. g − 1 ( c ( n 1 ) j ) = c ( γ − 1 ( k , c ( n 1 ) j ) ) j This implies that built into the definition of g − 1 is an assumption about the value k that would be put into γ − 1 to cause it to follow one specific backwards path. This value of k must be the same k that was used for normalization in the forward direction, namel y ∥γ ( n 0 ) ∥ , and indeed, due to lemma 3.7, we know k to be 1 ∀ i > 1 ∥γ ( c − 1 ( i , j ) ) ∥ = 1 Iteration Now lets consider what repeated applications of g does to a j 0 If we write the sequence of j m resulting from repeated applications, in base 3 number representation we we get the pat tern: in base 3 in base 10 as g application application count j 0 1 3 3 j 0 + 1 g ( j 0 ) m=1 j 0 11 3 9 j 0 + 4 g 2 ( j 0 ) m=2 j 0 111 3 27 j 0 + 13 g 3 ( j 0 ) m=3 W e get a head followed by a tail of 1s. In this case the tail is of ones in base 3. In an y base subtracting 1 from the b n , will produce a string of digits of value b − 1 . In this case it will be a sequence of 2s. Of course a sequence of 2s divided b y two is a s tring of ones. Hence we get: Summation form: j m = j 0 3 m + ∑ n = 0 m − 1 ( 3 n ) creating the string of 1s j m = j 0 3 m + 1 2 ( 3 m − 1 ) lemma 4.8; g m ( j ) = 3 m j + ∑ n = 0 m − 1 ( 3 n ) lemma 4.9; g ( j ) m = j3 m + 1 2 ( 3 m − 1 ) 9/24 I find it interesting that g m has a head and a tail and is of similar form as c − 1 . Perhaps this implies that c − 1 could also be derived as the repeatedly application of a simpler function. lemma 4.1; combined with either lemma 4.8 or lemma 4.9 produces: lemma 4.10; ( i , j ) γ m → ( i − m , 3 m j + ∑ n = 0 m − 1 ( 3 n ) ) lemma 4.1 1; ( i , j ) γ m → ( i − m , 3 m j + 1 2 ( 3 m − 1 ) ) About Row 1 Numbers This result is presented in [ref 1]. By definition in row 1, i = 1 . Hence the tail is one bit long, scanning to the left this bit is followed b y a zero, and then there is the head of the number , j. lemma 4.12; c − 1 ( 1, j ) = 4 j + 1 def 3.1 evaluated with i = 1 Hence, all numbers in the first row have values that can be calculated s imply from their column index - which could in fact be said of numbers on any row . Rather we di scuss row 1 because of this surprising property: lemma 4.14; γ ( 4 j + 1 ) = γ ( j ) This says that applying the Collatz function to a row 1 number yields the same result as appl y ing the Collatz function to the column index. The derivation goes like this: γ ( 4n + 1 ) = ( 3 ( 4 n + 1 ) + 1 ) 2 − k = ( 12 n + 3 + 1 ) 2 − k = ( 3 n + 1 ) 2 − k + 2 γ ( n ) = ( 3 n + 1 ) 2 − k same form as above Here are a couple of examples: 10/24 5 γ → ( 5 ⋅ 3 + 1 ) 2 − 4 = 1 21 γ → ( 21 ⋅ 3 + 1 ) 2 − 6 = 1 j=5 4 j+ 1 = 21 2 γ → ( 2 ⋅ 3 + 1 ) 2 0 = 7 9 γ → ( 9 ⋅ 3 + 1 ) 2 − 2 = 7 j=2; 4 j + 1 = 9 This is a strange result, as it says we can exchange a number in row one of the table for its colu m n index and continue iterating. While numbers in the table are of special form, the column index can be anything, including an even number . If the column index has a tail ( j )> 1 , then it has a corresponding entry in the table at a row index greater than one, and thus has the form c − 1 ( ∥ tail ( j ) ∥ , head ( j ) ) . In such a case head ( j ) , the new column number , will be smaller than j by at least a factor of 2 − i , due to removing the tail to get to the head. However , recall that there was growth of g m to get to the termination column in the first place, which is of order 3 m . Whic h term dominates is a question to be answered further below in this paper . If the column index for the termination point, j, has a tail of length 1, then once again i t is a number of the form 4 j + 1 , so we again land back on row 1. Though this time with a smaller j by a factor of 1/4. If the column index for the termination point j is even, then we move to this even numbers odd partner after one application, which again could be an y nu mber . The zeros on the bottom of such a number multiplied by 3 during the Collatz function, remain zero. Adding 1 then makes the number odd. So the number increased in size b y a factor of 3. Iteration form for Row Index Reduction and Summary T o recap, we start with a number , sa y n 0 , we apply the Cadogan function c ( n 0 ) to get i 0 and j 0 . W e apply the Collatz function repeatedl y , γ m until i m = 1 . W e are guaranteed a monotonic incremental arrival in row index reduction due to lemm a 4.1 . Lets give row reduction a name: lemma 4.8 ρ (( i 0, j 0 ) ) = g i 0 − 1 ( j 0 ) which can also be written ρ ( c ( n 0 ) ) = g i 0 − 1 ( j 0 ) ρ accepts a start point and produces the termination column. B y definition the termination row is the constant 1, so it need not be returned. 1 1/24 Due to lemma 4.3 we know we can calculate the c ( n m ) coordinates at each step m ≥ 1 directly from the coordinates from the prior step at c ( n m − 1 ) . There is no need to evaluate c ( n m ) to find the coordinates. Indeed, due to lemma 4.10 we can skip iterat ion altogether and just calculate g i 0 − 1 ( j 0 ) directly . W e can also s ee that the column number j will be increasing as a power of 3 agatinst the number of steps from lemma 4.9. At ea ch application of γ the row index goes down by 1, so we know that the number of steps, m, needed to reach the ter m ination point will be i 0 − 1 . Upon arriving at the termination point after row reduction we know from lemma 4.12 that we are at the number , n i 0 − 1 = 4 j i 0 − 1 + 1 Furthermore, we know from lemma 4.14 that continuing from n i 0 − 1 by applying γ again, is the same as continuing from the column index itself j i 0 , i.e. that γ ( n i 0 − 1 ) = γ ( j i 0 ) . Another thing we get from row reduction is an equivalence class of numbers, namel y those that belonged to the 'row index reduction trajectory'. These numbers, apart from the last one in the trajectory , have long tails and related head values. Perhaps this causes other relational properties between them. 5 Repeated Row Reductions Unless ρ ( c ( n )) happens to equal 0, the column for the number 1, the row reduction has left us with another number to apply the Collatz function to. Now suppose we iterate row reductions, after each row reduction step r , we arrive at column, j m , r . W e will call this quantit y mor e simply J r . def 5.1 ; J r = j m , r where m will be i 0, r − 1 . Successive row reductions will produce a sequence of the form J 0, J 1, J 2 .. J r . Row reduction transforms the question of the Collatz conjecture to the question of whether there exists a J r that is 0. (recall the number 1 is in row 1, column 0 by lemma 3.6 ) Hence, lemma 5.1; The Collatz Conjecture holds if, and onl y i f, ∃ r : J r ∈ Sequence ( J r ) ∧ J r = 0 Due to lemma 4.7 we have the option of getting the new start point b y appl y ing γ to the number J rather to the number c − 1 ( 1, J ) . Thus to start the next iteration we will find the length of tail ( γ ( J ) ) to set i 0 and the head ( γ ( J )) to set j 0 . lemma 5.2 ; q ( J ) = ( i 0, j 0 ) = ( ∥ tail ( γ ( J ) ) ∥ , head ( γ ( J ))) ; calculating a start point for row reduction corollary 5.1.1 Collatz holds if and only if: ρ ( J ) = 0 ∨ [ n 0 = ρ ( J ) ∧ ∃ r ( ρ ∘ q ) r ( n 0 ) = 0 ] The Convergence Boundary The numbers placed in the cells in table 1 are not necessar y as they a re implied by the coordinates, so we can use a dot diagram to describe c(n). This is good because g is exponential against step count, m , so we are going to need a lot of dots to create a useful visualization. 12/24 table 5.1 T able 5.1 is a dot diagram view of table 3.1 . It has been flipped vertically so that moving to lesser row indexes moves down the diagram. It has been extended so more cases can be shown. Here i stems from 1 to 9, and j stems from 0 to 42. Now the table looks more like a graph. Be careful to notice that coordinates are of the form (row , column) which are ( y , x ) relative to the graph. In table 2 we show a row index reduction trajectory starting from the number n 0 = 1 11 , i.e. one hundred and eleven. c ( 111 ) = ( i , j )= ( 3,4 ) . Thi s trajectory is then shown going to the right, first arriving at c ( γ ( 111 ) )= ( 2,13 ) and then at the termination point c ( γ 2 ( 111 ) ) = ( 1,40 ) . W e then go back to the start point (3,4) and use g − 1 to complete the trajectory b y calcula ting trajectory crossing points on each successive row with increasingl y hi gher indexes than our start point. This continues until we arrive at column 0. W e first move up one row and find point (4,1), and then at the next step arrive at column 0 with (5,0). For this trajector y we s a y the start point for row reduction was (3,4) but the first point of the completed trajector y is (5,0). First points alway s have a column coordinate of zero. The termination point for both row reduction and for the trajectory is (1,40). T ermination points always have a row coordinate of 1. W e happened to pick a case where g − 1 remained in the integer domain. However , for some start points when working backwards to complete the trajectory the column value for a row ma y be a rational value. In this case we treat our dot diagram as though it were laid over a Q × Q plane. The dots just mark special points on the plane, namel y those w ith integral coordinates. Each row is interpreted as a line going horizontally . Similarl y c olumns are lines going vertically . The trajectory has an intersection point with each row that occurs below or at the first point, though this intersection might not coincide with a dot. Similarl y , the intersection where the trajectory crosses the first column (y-axis) might not have an integral row coordinate. W e will make use the term 'dot' to refer to a point that has integral coordinates. lemma 5.3; definition of lesser than , on , and greate r than the boundary Completed row reduction trajectories will alwa y s extend fro m colum n 0 at the y-axis and then cross the plane while monotonically decreasing in value until crossing the x-axis at row = 1. The form of the curve is due to lemma 4.1 1 . Hence, a completed trajector y bi-partitions the plane. For this reason we call it a r ow index reduction boundary , or just boundary . Dots on a given row that occur to the left of the boundar y crossing are said to be lesser than the boundary , those to the right are gr eater than . Dots coincident to the boundar y intersection are on the boundar y . It may be the case for a given row that no dot lies on the boundary . All dots on rows above the first point of the boundary are said to be greater than the boundar y . Due to shape of the boundary curve the first point will always be the highest point on a boundar y . 13/24 0 1 2 3 4 5 6 7 8 9 10 1 1 12 13 14 15 16 17 18 1 9 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 9 8 7 6 5 4 3 2 1 lemma 5.4 Suppose we are given a boundary defined b y completing a row reduction trajector y . Suppose we are also given an integral coordinate start 'dot' for another trajector y , say (i, j). a) If (i,j) is greater than the boundary all dots on the trajector y s temm i ng from it remain greater than the boundary . b) If (i,j) is on the boundary , then all dots on the trajector y steam ing from (i,j) are on the boundar y . c) If (i,j) is lesser than the boundary then all points on the trajector y will be lesser than the boundary . In other words, all trajectories remain on the same part of the plane as their start points. The proof of this is rather simple. Recall lemma 4.1 1, ( i , j ) γ m → ( i − m , 3 m j + 1 2 ( 3 m − 1 ) ) lemma 4.1 1 The column component calculation is independent of the row calculation as it does not include i . The row calculation just shifts vertically b y a constant for different start points. Hence all completed row reduction trajectories are parallel. Of course, parallel trajectories do not cross. lemma 5.5: T rajectories ste mm ing fro m lesser than dots have lesser than termination points. This also follows from the fact that the trajectories are parallel and that the function is monotonically decreasing while starting from above the x-axis. When a trajectory is moved down, it intersects the x-axis sooner lemma 5.6 monotonic conver gence; It follows from lemma 5.5, that a sufficient criteria to show the Collatz Conjecture holds is that the new start point for each next row reduction iteration always lies on the lesser side of the boundar y set by the prior row reduction trajectory . ∀ J > 0 : ( q ( J ) = ( i 0, j 0 ) ) < ( i 0, g − ( i 0 − 1 ) ( J ) ) Note that the term g − ( i 0 − 1 ) ( J ) is the boundary intersection with row i 0 , it is a column coordinate. At each application of g − 1 we walk up to the intersection on the next row , with one application taking us to the intersection with row 2, two applications taking us to row 3, etc. After i 0 − 1 applications we have the column intersection for row i 0 . Due to the behavior of g − 1 (see the discussion of lemma 4.6 ) the column coordinate ma y be a non-integer rational number . lemma 5.7 within r ow index r eduction trajectory cycle; There will be a cycle within a row , i.e. the Collatz Con jecture will be false, if any possible new start point lands on the boundary set by the prior row reduction trajector y . ∃ J > 0 : ( q ( J )=( i 0, j 0 ) )=( i 0, g − ( i 0 − 1 ) ( J ) ) lemma 5.8: monotonic diver gence; Furthermore, it is sufficient to demonstrate diver gence by showing t h at the start point will always be greater than the boundary set b y the prior row reduction trajector y . ∀ J > 0 : ( q ( J ) = ( i 0, j 0 ) ) > ( i 0, g − ( i 0 − 1 ) ( J ) ) 14/24 ---- These criteria can all be stated in terms of existentia l quanti f ication: lemma 5.9; ¬ ∃ J > 0 : ( q ( J ) = ( i 0, j 0 ) ) > ( i 0, g −( i 0 − 1 ) ( J ) ) monotonic convergence lemma 5.10; ∃ J > 0 : ( q ( J )=( i 0, j 0 ) )=( i 0, g − ( i 0 − 1 ) ( J ) ) within boundary cycle lemma 5.1 1; ¬ ∃ J > 0 : ( q ( J ) = ( i 0, j 0 ) ) < ( i 0, g −( i 0 − 1 ) ( J ) ) monotonic divergence 6 Start Point Behavior In lemma 5.3 for the definition of less than, on, and greater than relative to a boundar y requ ired comparison of the column coordinates along the row i 0 . As the row is given, we only need to calculate the column components of the comparison to understand the convergence behavior of the start point. Of course, it is possible that none of our three criteria are true meaning that start points ju m p around the boundary . Such a result would tell us little about the tru th of the Collatz Conjecture. g − 1 ( j )=( j − 1 ) 3 − 1 lemma 4.6 g − 2 ( j )=( ( j − 1 ) 3 − 1 − 1 ) 3 − 1 g − 3 ( j )=( ( ( j − 1 ) 3 − 1 − 1 ) 3 − 1 − 1 ) 3 − 1 lemma 6.1; g − m ( j )=( j − 1 ) 3 − m − ∑ n = 1 m − 1 ( 3 − n ) Consider an example. note that g 2 ( 5 ) = 49 . Now lets apply 49 to g − 2 : j + 1 = 49 . j − 1 = 48 . The ( j − 1 ) 3 − m term is then 48 ⋅ 3 − 2 = 5 1 3 , a rational value. m − 1 is just 1 so the summation term is 1 3 , so we get the correct answer back of 5. Note, 49 = 1210 3 when this is shifted over two places we get 12.10 3 Then when we subtract 0.1 3 (a string of m − 1 ones) we are left with 12 3 , which is 5. 15/24 g − m ( j )=( j − 1 ) 3 − m − ∑ n = 1 m − 1 ( 3 − n ) lemma 6.1 g − m ( j )= 3 − m [ j − ∑ n = 1 m ( 3 m − n ) ] pull out the 3 − m lemma 6.2; g − m ( j )= 3 − m [ j − ∑ n = 0 m − 1 3 n ] fix the summation counter lemma 6.3; g − m ( j )= 3 − m [ j − 1 2 ( 3 m − 1 ) ] replace summation Recall the restart function from lemma 5.2. I drop the s ub s cripts here as there are no other i, j variables in this context, also these are just intermediate values used s o the criteria equations will not become too long. def 6.1; i = ∥ t ail ( γ ( J )) ∥ def 6.2; j = head ( γ ( J )) and placing these into the convergence criteria of lemma 5.6, lemma 5.7, and lemma 5.8: lemma 6.3; ∀ J : j < D monotonic convergence lemma 6.4; ∃ J : j = D cycles within boundary lemma 6.5; ∀ J : j > D monotonic divergence lemma 6.6; D = 3 − ( i − 1 ) [ J − 1 2 ( 3 i − 1 − 1 ) ] All the criteria have the same right hand side, so we will call that the deter m i nate, D . Lets test a point to see if the column index reduces after a row index reduction, sa y J = 40 termination point for an iteration → γ ( J )= 121 ∥ tail ( γ ( J ) ) ∥ head ( γ ( J ) ) = 30 → 30 < 3 0 [ 40 − ∑ n = 0 − 1 3 n ] lemma 6.3 → 30 < 120 there is a column index reduction 16/24 It is true in general that when tail ( γ ( J ) ) is 1, lemma 6.3 reduces to head ( γ ( J )) < J , which will be true for all γ ( J ) , because γ ( J ) is always odd, and head ( γ ( J )) is gotten by subtracting a positive non-zero value from J , namel y by subtracting tail ( γ ( J )) . 7 Change of Quantification V ariables W e desire to switch the universal quantification from J in our convergence/cycles/diver ges cri teria of lemmas 6.3, 6.4, and 6.5 to i and j, where i is the tail length of γ ( J ) , and j is the head of γ ( J ) . By doing this we would remove i = ∥ tail ( γ ( J ) ) ∥ and j = head ( γ ( J )) from the criteria. Those are complicated non-linear functions so eliminating them would simplify analysis of the convergence criteria. However we know from lemma 1.2 that multiples of three are missing fro m the set { ∀ J γ ( J ) } Multiples of three are most easily discussed in modulo 3. Head and tail are defined in modulo 2. As 2 and 3 are relatively prime 'digits' in base 2 and base 3 are uncorrelated. This is a sort of cryptographic problem. Hence, we would not expect to find an eas y wa y to express how multiples of three missing in { ∀ J γ ( J ) } would affect head and tail . Perhaps this is the crux of the difficulty in making conclusions about the Collatz conjecture. I propose that we union { ∀ J γ ( J ) } with the multiples of three so we can quantify on i and j . Lets call the set of odd multiples of three, T , and then create { ∀ J γ ( J ) } ∪ T will cause spurious results that are apparently randomly dispersed, but we have a simple wa y to test if a value is spurious, namely by assembling the head and the tail, and then appl y ing the invers e γ function to see if we get an integer . For existential proofs we will have to test members from the set of results and show at least one of them can be generated from { ∀ J γ ( J ) } rather than from T . If a universal criteria holds, then it holds for both the values that derived from { ∀ J γ ( J ) } and those from T . In which case it will not matter that we added the spurious values. def 7.1; any odd natural number can be found in the set { ∀ J γ ( J ) } ∪ T lemma 7.2; Any n ∈ ( { ∀ J γ ( J ) } ∪ T ) can be synthesized from two natural numbers j and i ≥ 1 W e just use the construction technique given with the equation from def 1, c − 1 ( i , j )= j 2 i + 1 + 2 i − 1 lemma 7.3; from def 7.1 and lemma 7.2 we have derived that w e m a y replace γ ( J ) with a formulation in terms of i and j, though at the cost of introducing spurious values into our criteria. γ ( J ) → j 2 i + 1 + 2 i − 1 ___ Now if we find a function to calculate J given j and i , we can substitute this function in and remove J from the equation and universally quantify our criteria onl y on j and i . Here is a derivation of such a function: 17/24 lemma 7.4; J = γ − 1 ( γ ( J ) ) identity J = γ − 1 ( j 2 i + 1 + 2 i − 1 ) lemma 7.3, γ ( J ) → j 2 i + 1 + 2 i − 1 ∃ k : J =(( j 2 i + 1 + 2 i − 1 ) 2 k − 1 ) 3 − 1 lemma 1.1, γ k − 1 ( n ) = ( n 2 k − 1 ) 3 − 1 lemma 7.5; ∃ k : J =( 2 k ( 2 i ( 2 j + 1 )− 1 )− 1 ) 3 − 1 For the gamma inverse we could not use the one operand form of lemma 3.7.3 as the inverse is taking us from the new start point back to row 1. That inverse explicitly doesn't apply to going to row 1. Now is it really true that any value J can be calculated in b y the expression on the right in lemma 7.5? Suppose we had a value, say , x-1, which could not be created with this expression, no matter what value we chose for i and j : ∀ j , i , k : x ≠ ( 2 k ( 2 i ( 2 j + 1 )− 1 )− 1 ) 3 − 1 3 x + 1 ≠ 2 k ( 2 i ( 2 j + 1 )− 1 ) 2 − k ( 3 x + 1 ) ≠ 2 i ( 2 j + 1 )− 1 γ ( x )≠ j 2 i + 1 + 2 i − 1 term on LHS γ , expand RHS The applying the Collatz function is always odd, and we already did the proof that an y odd nu m ber can be constructed with the form on the right hand side in chapter 3. Therefore there can be no such x. lemma 7.5 : all possible J can be represented b y picking j, i, and k in the formula, J =( 2 k ( 2 i ( 2 h + 1 ) − 1 ) − 1 ) 3 − 1 Furthermore, due to the manner we derived J, J will have the correct mathematical relationship to our choice of i , j, and k . W e picked up k because the Collatz inverse is not unique without it. J is an integer , so we cannot use any possible k , rather k must be chosen so t hat 2 k ( 2 i ( 2 j + 1 ) − 1 )− 1 a multiple of 3. I.e. 2 k ( 2 i ( 2 j + 1 ) − 1 ) − 1 is a multiple of 3: 2 k ( 2 i ( 2 j + 1 ) − 1 )− 1 = 0 mod 3 → 2 k ( 2 i ( 2 j + 1 ) − 1 )= 1 mod 3 Consider the following fact: k ∈ E → 2 k mod 3 = 1 k ∈ O → 2 k mod 3 = 2 18/24 Consider the case k is even: 2 i ( 2 j + 1 ) − 1 = 1 m od 3 k ∈ E 2 i ( 2 j + 1 )= 2 mod 3 k ∈ E This two can be broken into two cases due to 2 i 2 j + 1 = 2 mod 3 k ∈ E ∧ i ∈ E → 2 j = 1 mod 3 k ∈ E ∧ i ∈ E This says that h must be a multiple of 3, but we said we were going to make j universally quantified. The contradiction here is that we started this derivation with lemma 7.3 , J = γ − 1 ( γ ( J ) ) which makes use of {γ ( J ) } not {γ ( J ) } ∪ T . As we continue to explore the other cases, we will find constraints that eliminate all members of T in t his fashion, after this point we will not have to be concerned about having spurious results. W e will now derive a parametrization for each i,j,k that will faci litate universal quantification on the parameters. W e will make the corresponding parameters t,u,v . So for all cases we have lemma 7.6; parametrization k i 2 k ( 2 i ( 2 j + 1 ) − 1 )= 1 mod 3 parametrization k ∈ E i ∈ E 2 j = 1 mod 3 j = 3 t + 2 ; i = 2 u ; k = 2 v k ∈ E i ∈ O 2 ( 2 j + 1 ) = 2 mod 3 1 ⋅ j + 2 = 2 mod 3 j = 0 mod 3 j = 3 t ; i = 2 u + 1 ; k = 2 v k ∈ O i ∈ E 2 ( ( 2 j + 1 ) − 1 )= 1 m od 3 2 ( 2 j ) = 1 mod 3 j = 1 mod 3 j = 3 t + 1 ; i = 2 u ; k = 2 v + 1 k ∈ O i ∈ O 2 ( 2 ( 2 j + 1 ) − 1 )= 1 mod 3 2 j + 2 = 1 mod 3 2 j = 2 mod 3 j = 3 t + 1 ; i = 2 u + 1 ; k = 2 v + 1 So we can universally quantify on t, u and v , and not worry about spurious results, though we have to consider 4 cases. 19/24 lemma 7.7; V alues of J in terms of t , u , and v . k i 3 J = ( 2 k ( 2 i ( 2 j + 1 ) − 1 )− 1 ) lemma 7.5 k ∈ E i ∈ E 3 J = 2 2 v ( 2 2 u ( 6 t + 5 )− 1 ) − 1 j = 3 t + 2 ; i = 2 u ; k = 2 v k ∈ E i ∈ O 3 J = 2 2 v ( 2 2 u + 1 ( 6 t + 1 )− 1 ) − 1 j = 3 t ; i = 2 u + 1 ; k = 2 v k ∈ O i ∈ E 3 J = 2 2 v + 1 ( 2 2 u ( 6 t + 3 )− 1 ) − 1 j = 3 t + 1 ; i = 2 u ; k = 2 v + 1 k ∈ O i ∈ O 3 J = 2 2 v + 1 ( 2 2 u + 1 ( 6 t + 3 )− 1 )− 1 j = 3 t + 1 ; i = 2 u + 1 ; k = 2 v + 1 Now the criteria from lemmas 6.3, 6.4, and 6.5 become: lemma 7.8; ∀ t , u , v : j < D k ∈ E ∧ i ∈ E → ∀ t , u , v : 3 t + 2 < D k ∈ E ∧ i ∈ O → ∀ t , u , v : 3 t < D k ∈ O ∧ i ∈ E → ∀ t , u , v : 3 t + 1 < D k ∈ O ∧ i ∈ O → ∀ t , u , v : 3 t + 1 < D lemma 7.9; ∃ t , u , v : j = D k ∈ E ∧ i ∈ E → ∃ t , u , v : 3 t + 2 = D k ∈ E ∧ i ∈ O → ∃ t , u , v : 3 t = D k ∈ O ∧ i ∈ E → ∃ t , u , v : 3 t + 1 = D k ∈ O ∧ i ∈ O → ∃ t , u , v : 3 t + 1 = D lemma 7.10; ∀ t , u , v : j > D k ∈ E ∧ i ∈ E → ∀ t , u , v : 3 t + 2 > D k ∈ E ∧ i ∈ O → ∀ t , u , v : 3 t > D k ∈ O ∧ i ∈ E → ∀ t , u , v : 3 t + 1 > D k ∈ O ∧ i ∈ O → ∀ t , u , v : 3 t + 1 > D In every case the left hand side is of linear in t and contains no other variabl es. 20/24 lemma 7.1 1; Now A pply ing le mma 7.7 to lemma 6.6, D, we have: k ∈ E , i ∈ E i = 2 u ; k = 2 v 3 J = 2 2 v ( 2 2 u ( 6 t + 5 )− 1 )− 1 → D = 3 − ( 2 u − 1 ) [ ( 2 2 v ( 2 2 u ( 6 t + 5 )− 1 ) − 1 ) 3 − 1 − 1 2 ( 3 2 u − 1 − 1 ) ] → 3 2 u − 1 D =( 2 2 v ( 2 2 u ( 6 t + 5 )− 1 )− 1 ) 3 − 1 − 1 2 ( 3 2 u − 1 − 1 ) ; mul by 3 2u − 1 → 2 ⋅ 3 2 u D = 2 2 v + 1 ( 2 2 u ( 6 t + 5 ) − 1 ) − 3 2 u + 1 ; mul by 6 → 2 ⋅ 3 2 u D = ( 6 t + 5 ) 2 2 ( u + v )+ 1 − 2 2 v + 1 − 3 2 u + 1 ; expand → ( 2 D + 1 ) 3 2 u = ( 6 t + 5 ) 2 2 ( u + v )+ 1 − 2 2 v + 1 + 1 ; add 3 2 u both sides → ( 2 D + 1 ) 3 2 u = ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 ; collect → ( 2 D + 1 ) 3 2 u = ( 6 t + 5 ) 2 2 u 2 2 v + 1 − 2 2 v + 1 + 1 k ∈ E , i ∈ O i = 2 u + 1 ; k = 2 v 3 J = 2 2 v ( 2 2 u + 1 ( 6 t + 1 )− 1 )− 1 → D = 3 − ( 2 u ) [ ( 2 2 v ( 2 2 u + 1 ( 6 t + 1 ) − 1 ) − 1 ) 3 − 1 − 1 2 ( 3 2 u − 1 ) ] → ( 2 D + 1 ) 3 2 u + 1 =( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 )+ 1 k ∈ O , i ∈ E ; i = 2 u ; k = 2 v + 1 3 J = 2 2 v + 1 ( 2 2 u ( 6 t + 3 )− 1 )− 1 → D = 3 − ( 2u − 1 ) [ ( 2 2 v + 1 ( 2 2 u ( 6 t + 3 )− 1 )− 1 ) 3 − 1 − 1 2 ( 3 2u − 1 − 1 ) ] → ( 2 D + 1 ) 3 2 u = ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u − 1 )+ 1 k ∈ O , i ∈ O ; i = 2 u + 1 ; k = 2 v + 1 3 J = 2 2 v + 1 ( 2 2 u + 1 ( 6 t + 3 )− 1 )− 1 → D = 3 − ( 2u − 1 ) [ ( 2 2 v + 1 ( 2 2 u + 1 ( 6 t + 3 ) − 1 ) − 1 ) 3 − 1 − 1 2 ( 3 2u − 1 − 1 )] → ( 2 D + 1 ) 3 2 u + 1 =( 6 t + 5 ) 2 2 v + 2 ( 2 2 u + 1 − 1 )+ 1 No multiplying by negative numbers was involved. Hence, when we unwrap each D we can then do the inverse operations on the left hand side of the criteria, in otherwords, the Ds above can be replaced with the left hand side directly . 21/24 lemma 7.12; ∀ t , u , v : j < D k ∈ E ∧ i ∈ E → ∀ t , u , v : D ( 6 t + 5 ) 3 2 u < ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 k ∈ E ∧ i ∈ O → ∀ t , u , v : ( 6 t + 1 ) 3 2 u + 1 <( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 )+ 1 k ∈ O ∧ i ∈ E → ∀ t , u , v : ( 6 t + 3 ) 3 2 u < ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u − 1 ) + 1 k ∈ O ∧ i ∈ O → ∀ t , u , v : ( 6 t + 3 ) 3 2 u + 1 < ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u + 1 − 1 ) + 1 lemma 7.13; ∃ t , u , v : j = D k ∈ E ∧ i ∈ E → ∃ t , u , v : ( 6 t + 5 ) 3 2 u = ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 k ∈ E ∧ i ∈ O → ∃ t , u , v : ( 6 t + 1 ) 3 2 u + 1 =( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 )+ 1 k ∈ O ∧ i ∈ E → ∃ t , u , v : ( 6 t + 3 ) 3 2 u = ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u − 1 )+ 1 k ∈ O ∧ i ∈ O → ∃ t , u , v : ( 6 t + 3 ) 3 2 u + 1 = ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u + 1 − 1 )+ 1 lemma 7.14; ∀ t , u , v : j > D k ∈ E ∧ i ∈ E → ∀ t , u , v : ( 6 t + 5 ) 3 2 u > ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 k ∈ E ∧ i ∈ O → ∀ t , u , v : ( 6 t + 1 ) 3 2 u + 1 >( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 )+ 1 k ∈ O ∧ i ∈ E → ∀ t , u , v : ( 6 t + 3 ) 3 2 u > ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 k ∈ O ∧ i ∈ O → ∀ t , u , v : ( 6 t + 3 ) 3 2 u + 1 > ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u + 1 − 1 ) + 1 Lets look at monotonic column convergence first. As v is a universally quantified variable, it appears only on the right side of the greater than comparison, and the right hand side becomes monotonically larger when v becomes lar ger , so we set the strongest right hand side constraint by setting v to zero: lemma 7.15; ∀ t , u , v : j < D k ∈ E ∧ i ∈ E → ∀ t , u , v : 2 ln ( 3 ) ln ( 2 ) 2 u < 2 2 u + 1 − 1 k ∈ E ∧ i ∈ O → ∀ t , u , v : ( 6 t + 1 ) 2 ln ( 3 ) ln ( 2 ) ( 2 u + 1 ) < ( 6 t + 5 ) ( 2 2 u + 2 − 2 ) + 1 k ∈ O ∧ i ∈ E → ∀ t , u , v : ( 6 t + 3 ) 2 ln ( 3 ) ln ( 2 ) 2 u < ( 6 t + 5 ) ( 2 2 u + 2 − 4 )+ 1 k ∈ O ∧ i ∈ O → ∀ t , u , v : ( 6 t + 3 ) 2 ln ( 3 ) ln ( 2 ) ( 2u + 1 ) < ( 6 t + 5 ) ( 2 2 u + 3 − 4 )+ 1 It is interesting that t has little or no role. In all cases the power of u is larger on the left hand side than on the right hand side, so the constraint can not hold universally . It follows that: Theor em 7.1; No Monotonic Column convergence algorithm based on the Cadogan function exists. 22/24 Lets take a look at divergence. lemma 7.14; ∀ t , u , v : j > D k ∈ E ∧ i ∈ E → ∀ t , u , v : ( 6 t + 5 ) 3 2 u > ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 k ∈ E ∧ i ∈ O → ∀ t , u , v : ( 6 t + 1 ) 3 2 u + 1 >( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 )+ 1 k ∈ O ∧ i ∈ E → ∀ t , u , v : ( 6 t + 1 ) 3 2 u + 1 >( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 ) + 1 k ∈ O ∧ i ∈ O → ∀ t , u , v : ( 6 t + 3 ) 3 2 u + 1 > ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u + 1 − 1 ) + 1 As v is a free variable on the RHS it can be set to an unbounded number of values to create exceptions to the constraints, no matter how t and u are set. Now lets take a look at within trajector y c y c les. This constraint is fundamentall y dif ferent because it only relates numbers found within a single row index reduction trajectory . The other two constraints spoke of relationships between trajectories. lemma 7.13; ∃ t , u , v : j = D k ∈ E ∧ i ∈ E → ∃ t , u , v : ( 6 t + 5 ) 3 2 u = ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 k ∈ E ∧ i ∈ O → ∃ t , u , v : ( 6 t + 1 ) 3 2 u + 1 =( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 )+ 1 k ∈ O ∧ i ∈ E → ∃ t , u , v : ( 6 t + 1 ) 3 2 u + 1 =( 6 t + 5 ) 2 2 v + 1 ( 2 2 u + 1 − 1 ) + 1 k ∈ O ∧ i ∈ O → ∃ t , u , v : ( 6 t + 3 ) 3 2 u + 1 = ( 6 t + 5 ) 2 2 v + 2 ( 2 2 u + 1 − 1 )+ 1 ( 6 t + 5 ) 2 ln3 ln2 2 u = ( 6 t + 5 ) 2 2 v + 1 ( 2 2 u − 1 ) + 1 case 0 3 2 u − 2 2 ( u + v )+ 1 − 2 2 v + 1 = 1 / ( 6 t + 5 ) As all the terms on the left are integers, and the term on the right is a fracti on, there are no solutions to this equation. Lets tr y case 1: ( 6 t + 1 ) 3 2 u + 1 = ( 6 t + 5 ) 2 2 v + 1 ( 2 2u + 1 − 1 )+ 1 case EO ( 6 t + 1 ) 3 2 u + 1 =( 6 t + 5 ) 2 2 ( u + v + 1 ) − ( 6 t + 5 ) 2 2 v + 1 + 1 3 2 u + 1 + 2 2 v + 1 = 2 2 ( u + v + 1 ) + 4 ⋅ 3 2 u + 1 + 1 6 t + 5 3 2 u + 1 + 2 2 v + 1 − 2 2 ( u + v + 1 ) = 4 ⋅ 3 2 u + 1 + 1 6 t + 5 n ( 6 t + 5 ) = 4 ⋅ 3 2 u + 1 + 1 looking at fraction, it must be an integer 3 2 u + 1 + 2 2 v + 1 > 2 2 ( u + v + 1 ) term on left must remain positive What a nice equation, a mixed integer exponential problem in two bas es along with factoring. 23/24 The other cases have a similar form based on s m all cons tant transformations of u, v , and t. Conclusion The head of an arbitrary number is another arbitrary number; hence it is difficult for me to see how stripping off the tail helps us gain understanding of the conver gence of the Collatz function. However , I do find interesting that strands of numbers can be easil y ident ified in the sequence where k=1, and algebraic functions may be used. Analysis of these strands lead to a closed form description for possible cycles in them. However , evaluating the form requires large integer factorization and mixed integer calculation. The Reference 2006, C. Cadogan, Caribbean Journal of Mathematics and Computer Science, 13, 2006, 1-11 Appendix Here is a Mathematica program for evaluating the following form, 3 a − 2 b ( 2 a − 1 )= 4 ⋅ 3 a + 1 c There is a solution at {a = 1, b=1, c=13} but these are not the correct form for the parameters, e.g. no 6 t + 5 = 13. 24/24

More Jabber about the Collatz Conjecture and a Closed Form for Detecting Cycles on Special Subsequences [Assertion: Collatz cycles]

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment