What’s the shortest known Normal Number?

Well, the answer is that it has to be infinitely long, but the question is what is the most compact form of a Normal Number possible.

I was motivated to look into this from a lovely Numberphile video about all the real numbers.

 

Normal numbers in base 10 are those for which, in the base 10 decimal expansion, you can find every natural number.

Champernowne’s number is a very simple example of this where it is simply written as:

0.12345678910111213…etc.

I thought that it might be interesting to see if one could write a more compact Normal Number, but using a similar procedure to Champernowne. I haven’t seen this done anywhere else. For example, in the above expression, you don’t need to include the 12 explicitly as it’s already there at the beginning. You could write

0.12345678910113

So you skip the 12, and also 11 and 13 becomes 113. We will do all of this just with the list of digits, rather than the number in base 10.…

By | September 6th, 2019|Uncategorized|1 Comment

p-values (part 3): meta distribution of p-values

Introduction

So far we have discussed what p-values are and how they are calculated, as well as how bad experiments can lead to artificially small p-values. The next thing that we will look at comes from a paper by N.N. Taleb (1), in which he derives the meta-distribution of p-values i.e. what ranges of p-values we might expect if we repeatedly did an experiment where we sampled from the same underlying distribution.

The derivations are pretty in depth and this content and the implications of the results are pretty new to me, so any discrepancies/misinterpretations found should be pointed out and/or discussed.

Thankfully, in this video (2) there is an explanation that covers some of what the paper says as well as some Monte-Carlo simulations. My discussion will focus on some simulations of my own that are based on those that are done in the video.

What we are talking about

We have already discussed what p-values mean and how they can go wrong.…

By | September 5th, 2019|English, Level: intermediate|1 Comment

p-values (part 2) : p-Hacking Why drinking red wine is not the same as exercising

What is p-hacking?

You might have heard about a reproducibility problem with scientific studies. Or you might have heard that drinking a glass of red wine every evening is equivalent to an hour’s worth of exercise.

Part of the reason that you might have heard about these things is p-hacking: ‘torturing the data until it confesses’. The reason for doing this is mostly pressure on researchers to find positive results (as these are more likely to be published) but it may also arise from misapplication of Statistical procedures or bad experimental design.

Some of the content here is based on a more serious video from Veritasium: https://www.youtube.com/watch?v=42QuXLucH3Q. John Oliver has also spoken about this on Last Week Tonight, for those who are interested in some more examples of science that makes its way onto morning talk shows.

p-hacking can be done in a number of ways- basically anything that is done either consciously or unconsciously to produce statistically significant results where there aren’t any.…

By | September 2nd, 2019|English, Undergraduate|1 Comment

A quick argument for why we don’t accept the null hypothesis

Introduction

When doing hypothesis testing, an often-repeated rule is ‘never accept the null hypothesis’. The reason for this is that we aren’t making probability statements about true underlying quantities, rather we are making statements about the observed data, given a hypothesis.

We reject the null hypothesis if the observed data is unlikely to be observed given the null hypothesis. In a sense we are trying to disprove the null hypothesis and the strongest thing we can say about it is that we fail to reject the null hypothesis.

That is because observing data that is not unlikely given that a hypothesis is true does not make that hypothesis true. That is a bit of a mouthful, but basically what we are saying is that if we make some claim about the world and then we see some data that does not disprove this claim, we cannot conclude that the claim is true.…

By | August 28th, 2019|English, Level: Simple, Uncategorized, Undergraduate|0 Comments

p-values: an introduction (Part 1)

The starting point

This is the first of (at least) 3 posts on p-values. p-values are everywhere in statistics- especially in fields that require experimental design.

They are also pretty tricky to get your head around at first. This is because of the nature of classical (frequentist) statistics. So to motivate this I am going to talk about a non-statistical situation that will hopefully give some intuition about how to think when interpreting p-values and doing hypothesis testing.

My New Car

I want to buy a car. So I go down to the second hand car dealership to get one. I walk around a bit until I find one that I like.

I think to myself: ‘this is a good car’. 

Now because I am at a second-hand car dealership I find it appropriate to gather some data. So I chat to the lady there (looks like a bit of a scammer, but I am here for a deal) about the car.…

By | August 21st, 2019|English, Level: Simple, Undergraduate|0 Comments

R-squared values for linear regression

What we are talking about

Linear regression is a common and useful statistical tool. You will have almost certainly come across it if your studies have presented you with any sort of statistical problems.

The pros of regression are that it is relatively easy to implement and that the relationship between inputs and outputs is linear (it’s in the name, but this simplifies the interpretation of the relationship significantly). On the downside, it relies fairly heavily on frequentist interpretation of probability (which is a little counterintuitive) and it’s very easy to draw erroneous conclusions from different models.

This post will deal with a measure of how good a model is: R^2. First, I will go through what this value means and what it measures. Then, I will discuss an example of how reliance on  R^2  is a dangerous game when it comes to linear models.

What you should know

Firstly, let’s establish a bit of context.…

By | August 18th, 2019|English, Undergraduate|1 Comment

Cantor–Schröder–Bernstein Theorem

Knowledge this posts assumes: What is a set, set cardinality, a function, an image of a function and an injective (one-to-one) function.

David Hilbert imagines a hotel with an infinite number of rooms. In this hotel, each room can only be occupied by one guest, and each room is indeed occupied by exactly one guest. What happens if more guests show up? Can they be accommodated for?

PAUSE: WHAT DO YOU THINK AND WHY?

Suppose we propose they cannot be accommodated for, since all the rooms are occupied. Hilbert then claims that he can define the functions f:A \mapsto B, and g:B \mapsto C, where A is a set containing all current guests, and f simply maps each guest to a room in the set B, and g maps each room in B to a new one in C. Notice that these functions must be injective, since if a room contains two different guests, those two different guests must be the same guest; recall f(a) = f(b) \rightarrow a = b.…

By | August 16th, 2019|Uncategorized|0 Comments

1.6 Partitions

Recall the  relation \equiv \text{ mod} (4) on the set \mathbb{ N}.

One of the equivalence classes is [0] = \{ ..., -8, -4, 0, 4, 8, ...\} which is equivalent to writing [0] = [4] = [-4] = [8] = [-8] ...

We could do this because the equivalence class collects all the natural numbers that are related to zero under the relation \equiv \text{ mod} (4)

 

The following theorem generalises this idea for any relation \equiv \text{ mod} (n) on the set \mathbb{ N}: for the integer n.

Let R be an equivalence relation on set A. If a, b \in A,  then [a] = [b] \iff  aRb.

Essentially, equivalence classes  [a] = [b] are equal if the elements  a, b \in A, are related under the relation R. And simultaneously, knowing that elements a, b \in A, are related under R means their equivalence classes  [a] = [b] are equal.

An equivalence class  \equiv \text{ mod} (n) divides set a A into n equivalence classes. We call this situation a partition of set A.

A partition of a set A is defined as a set of non-empty subsets of A, such that both these conditions are simultaneously satisfied:

 (i) the union of all these subsets equals A.

(ii) the intersection of any two different subsets is

 

Let’s return to our example: \equiv \text{ mod} (4) on the set \mathbb{ N}. We could represent this set as:

Modulus 4, General

  • NOTE: Each equivalence class above represents an infinite set and despite the drawing suggesting [0] is larger than [3] for instance, this is not true.
By | August 9th, 2019|Uncategorized|0 Comments

Review: Calculus Reordered

Book title: Calculus Reordered: A History of the Big Ideas
Author : David M. Bressoud

9780691181318

Princeton University Press
Link to the book: Calculus Reordered: A History of the Big Ideas

Discussions on the history of different fields are usually dry, wordy and generally, when you are studying the field, hard to read. This is because they are usually geared towards the general audience, and in doing so most authors tend to strip away the very exciting technical details. I expected the same treatment from the author, but I was pleasantly surprised.

The book contains 5 chapters, which are the following:

1) Accumulations
2) Ratios of Change
3) Sequences of Partial Sums
4) The Algebra of Inequalities
5) Analysis

Each of these chapters has a central theme that is being covered, but they are not at all disjoint. For instance, the last three contain the history of concepts that would normally be found in a first course for Real Analysis, while the first two are essentially the more applied spectrum to serve as some form of motivation for going through all this trouble, although they can certainly stand on their own.…

By | August 4th, 2019|Uncategorized|1 Comment

1.5 Equivalence classes (Infinite sets)

 

Let’s find the equivalence classes of the following finite set S:

Given S = \{ -1, 1, 2, 3, 4 \}, we can form the following relation R = \{ (-1, -1), (1,1), (2,2), (3,3), (4,4), (1,3), (3,1), (2,4), (4,2) \}.

Note: writing the relation R on set S in the following ways is equivalent:

-1R-1, 1R1, 2R2, 3R3, 4R4, 1R3, 3R1, 2R4, 4R2

or

-1\le -1, 1 \le1, 2 \le2, 3  \le3, 4 \le4, 1 \le 3, 3 \le 1, 2 \le 4, 4 \le 2

This relation, R has been given the symbol \le but it means “the same sign and parity” in this case. For instance, (1,3) or 1 \le 3 tells us that one and three are both odd and both have the same sign in set A (both positive).

The equivalence classes for this relation are the following sets:

\{ -1 \}, \{ 1, 3\} \text{ and } \{2, 4 \}

We obtained the above equivalence classes by asking ourselves:

  • How is the element -1 related to any other element in the set S under the definition of R?

Since R is defined as “the same sign and same parity,” then we’re really asking ourselves whether -1 has the same sign as any other element in S. Since all the other elements are positive, then -1 has the equivalence class containing only itself. Another question we would’ve asked ourselves is whether -1 is even or odd. …

By | July 25th, 2019|Uncategorized|1 Comment