Feed john-d-cook John D. Cook

John D. Cook

Link	https://www.johndcook.com/blog
Feed	http://feeds.feedburner.com/TheEndeavour?format=xml
Updated	2026-02-11 02:16

Comparing Truncation to Differential Privacy

by

John

on 2019-05-15 21:09 (#4F76C)

Traditional methods of data de-identification obscure data values. For example, you might truncate a date to just the year. Differential privacy obscures query values by injecting enough noise to keep from revealing information on an individual. Letâ€™s compare two approaches for de-identifying a personâ€™s age: truncation and differential privacy. Truncation First consider truncating birth date [â€¦]

Golden ratio primes

by

John

on 2019-05-13 03:31 (#4F0BW)

The golden ratio is the larger root of the equation Ï†Â² â€“ Ï† â€“ 1 = 0. By analogy, golden ratio primes are prime numbers of the form p = Ï†Â² â€“ Ï† â€“ 1 where Ï† is an integer. To put it another way, instead of solving the equation Ï†Â² â€“ Ï† â€“ 1 [â€¦]

Goldilocks and the three multiplications

by

John

on 2019-05-13 01:29 (#4F09K)

Mike Hamburg designed an elliptic curve for use in cryptography he calls Ed448-Goldilocks. The prefix Ed refers to the fact that itâ€™s an Edwards curve. The number 448 refers to the fact that the curve is over a prime field where the prime p has size 448 bits. But why Goldilocks? Golden primes and Goldilocks [â€¦]

Tricks for arithmetic modulo NIST primes

by

John

on 2019-05-12 19:43 (#4EZWS)

The US National Institute of Standards and Technology (NIST) originally recommended 15 elliptic curves for use in elliptic curve cryptography [1]. Ten of these are over a field of size 2n. The other five are over prime fields. The sizes of these fields are known as the NIST primes. The NIST curves over prime fields [â€¦]

Elliptic curve P-384

by

John

on 2019-05-11 18:21 (#4EYCT)

The various elliptic curves used in ellitpic curve cryptography (ECC) have different properties, and weâ€™ve looked at several of them before. For example, Curve25519 is implemented very efficiently, and the parameters were transparently chosen. Curve1174 is interesting because itâ€™s an Edwards curve and has a special addition formula. This post looks at curve P-384. Whatâ€™s [â€¦]

Bessel function crossings

by

John

on 2019-05-09 01:48 (#4ERM5)

The previous looked at the angles that graphs make when they cross. For example, sin(x) and cos(x) always cross with the same angle. The same holds for sin(kx) and cos(kx) since the k simply rescales the x-axis. The post ended with wondering about functions analogous to sine and cosine, such as Bessel functions. This post [â€¦]

Orthogonal graphs

by

John

on 2019-05-08 20:11 (#4ER24)

Colin Wright posted a tweet yesterday that said that the plots of cosine and tangent are orthogonal. Hereâ€™s a plot so you can see for yourself. Jim Simons replied with a proof so short it fits in a tweet: The product of the derivatives is -sin(x)secÂ²(x) = -tan(x)/cos(x), which is -1 if cos(x)=tan(x). This made [â€¦]

Fascination burnout

by

John

on 2019-05-08 02:22 (#4EP92)

Here a little dialog from Anathem by Neal Stephenson that I can relate to: â€œâ€¦ I donâ€™t care â€¦â€ Asribalt was horrified. â€œBut how can you not be fascinated byâ€”â€ â€œI am fascinated,â€ I insisted. â€œThatâ€™s the problem. Iâ€™m suffering from fascination burnout. Of all the things that are fascinating, I have to choose just [â€¦]

Area and volume of Menger sponge

by

John

on 2019-05-06 01:03 (#4EHJZ)

The Menger sponge is the fractal you get by starting with a cube, dividing each face into a 3 by 3 grid (like a Rubikâ€™s cube) and removing the middle square of each face and everything behind it. Thatâ€™s M1, the Menger sponge at the 1st stage of its construction. The next stage repeats this [â€¦]

Regular expression for ICD-9 and ICD-10 codes

by

John

on 2019-05-05 21:52 (#4EHCQ)

Suppose youâ€™re searching for medical diagnosis codes in the middle of free text. One way to go about this would be to search for each of the roughly 14,000 ICD-9 codes and each of the roughly 70,000 ICD-10 codes. A simpler approach would be to use regular expressions, though that may not be as precise. [â€¦]

A misunderstanding of complexity

by

John

on 2019-04-30 12:29 (#4E65T)

Iterating simple rules can lead to complex behavior. Many examples of this are photogenic, and so theyâ€™re good for popular articles. Itâ€™s fun to look at fractals and such. Iâ€™ve written several articles like that here, such as the post that included the image below. But thereâ€™s something in popular articles on complexity that bothers [â€¦]

Improving on the sieve of Eratosthenes

by

John

on 2019-04-30 01:32 (#4E5AQ)

Ancient algorithm Eratosthenes had a good idea for finding all primes less than an upper bound N over 22 centuries ago. Make a list of the numbers 2 to N. Circle 2, then scratch out all the larger multiples of 2 up to N. Then move on to 3. Circle it, and scratch out all [â€¦]

How category theory is applied

by

John

on 2019-04-29 19:11 (#4E4MV)

Instead of asking whether an area of mathematics can be applied, itâ€™s more useful to as how it can be applied. Differential equations are directly and commonly applied. Ask yourself what laws govern the motion of some system, write down these laws as differential equations, then solve them. Statistical models are similarly direct: propose a [â€¦]

Rare and strange ICD-10 codes

by

John

on 2019-04-27 22:07 (#4E1CH)

ICD-10 is a set of around 70,000 diagnosis codes. ICD stands for International Statistical Classification of Diseases and Related Health Problems. The verbosity of the name is foreshadowing. Some of the ICD-10 codes are awfully specific, and bizarre. For example, V95.4: Unspecified spacecraft accident injuring occupant V97.33XA: Sucked into jet engine, initial encounter V97.33XD: Sucked [â€¦]

State privacy laws to watch

by

John

on 2019-04-25 11:48 (#4DVY5)

A Massachusetts court ruled this week that obtaining real-time cell phone location data requires a warrant. Utah has passed a law that goes into effect next month that goes further. Police in Utah will need a warrant to obtain location data or to search someoneâ€™s electronic files. (Surely electronic files are the contemporary equivalent of [â€¦]

by

John

on 2019-04-24 14:58 (#4DSV8)

A literal quantum leap is a discrete change, typically extremely small [1]. A metaphorical quantum leap is a sudden, large change. I canâ€™t think of a good metaphor for a small but discrete change. I was reaching for such a metaphor recently and my first thought was â€œquantum leap,â€ though that would imply something much [â€¦]

Professional, amateur, and something else

by

John

on 2019-04-23 16:11 (#4DQPJ)

I opened a blog posts a while back by saying One of the differences between amateur and professional software development is whether youâ€™re writing software for yourself or for someone else. Itâ€™s like the difference between keeping a journal and being a journalist. This morning I saw where someone pulled that quote and I thought [â€¦]

Easter and exponential sums

by

John

on 2019-04-21 22:50 (#4DM5Z)

For the last couple years, the exponential sum of the day for Easter Sunday has been a cross. This was not planned, since the image each day is determined by the numbers that make up the date, as explained here. This was the exponential sum for last Easter last year, April 1, 2018: and this [â€¦]

Groups in categories

by

John

on 2019-04-21 22:27 (#4DM60)

The first time I saw a reference to a â€œgroup in a categoryâ€ I misread it as something in the category of groups. But thatâ€™s not what it means. Due to an unfortunately choice of terminology, â€œinâ€ is more subtle than just membership in a class. This is related to another potentially misleading term, algebraic [â€¦]

What is an isogeny?

by

John

on 2019-04-21 21:31 (#4DM48)

The previous post said that isogenies between elliptic curves are the basis for a quantum-resistant encryption method, but we didnâ€™t say what an isogeny is. Itâ€™s difficult to look up what an isogeny is. Youâ€™ll find several definitions, and they seem incomplete or incompatible. If you go to Wikipedia, youâ€™ll read â€œan isogeny is a [â€¦]

Isogeny-based encryption

by

John

on 2019-04-20 15:17 (#4DJ85)

If and when large quantum computers become practical, all currently widely deployed method for public key cryptography will break. Even the most optimistic proponents of quantum computing believe such computers are years away, maybe decades. But it also takes years, maybe decades, to develop, test, and deploy new encryption methods, and so researchers are working [â€¦]

Calling Python from Mathematica

by

John

on 2019-04-19 00:42 (#4DF7E)

The Mathematica function ExternalEvalute lets you call Python from Mathematica. However, there are a few wrinkles. I first pasted in an example from the Mathematica documentation and it failed. ExternalEvaluate[ "Python", {"def f(x): return x**2", "f(3)"} ] It turns out you (may) have to tell Mathematica where to find Python. I ran the following, tried [â€¦]

Random projection

by

John

on 2019-04-16 14:00 (#4D8YG)

Last night after dinner, the conversation turned to high-dimensional geometry. (I realize how odd that sentence sounds; I was with some unusual company.) Someone brought up the fact that two randomly chosen vectors in a high-dimensional space are very likely to be nearly orthogonal. This is a surprising but well known fact. Next the conversation [â€¦]

A truly horrible random number generator

by

John

on 2019-04-14 22:27 (#4D5CM)

I needed a bad random number generator for an illustration, and chose RANDU, possibly the worst random number generator that was ever widely deployed. Donald Knuth comments on RANDU in the second volume of his magnum opus. When this chapter was first written in the late 1960â€™s, a truly horrible random number generator called RANDU [â€¦]

Maybe you should’t script it after all

by

John

on 2019-04-11 14:56 (#4CYPK)

Programmers have an easier time scaling up than scaling down. You could call this foresight or over-engineering, depending on how things work out. Scaling is a matter of placing bets. Experienced programmers are rightfully suspicious of claims that something only needs to be done once, or that quick-and-dirty will be OK [*]. Theyâ€™ve been burned [â€¦]

Squircle perimeter and the isoparametric problem

by

John

on 2019-04-11 11:25 (#4CY8P)

If you have a fixed length of rope and you want to enclose the most area inside the rope, make it into a circle. This is the solution to the so-called isoparametric problem. Didoâ€™s problem is similar. If one side of your bounded area is given by a straight line, make your rope into a [â€¦]

Taking the derivative of a muscle car

by

John

on 2019-04-08 22:02 (#4CR3X)

Iâ€™ve been getting a lot of spam lately saying my web site does not rank well on â€œcertain keywords.â€ This is of course true: no web site ranks well for every keyword. I was joking about this on Twitter, saying that my site does not rank well for womenâ€™s shoes, muscle cars, or snails because [â€¦]

Safe Harbor and the calendar rollover problem

by

John

on 2019-04-08 17:35 (#4CQKQ)

Data privacy is subtle and difficult to regulate. The lawmakers who wrote the HIPAA privacy regulations took a stab at what would protect privacy when they crafted the â€œSafe Harborâ€ list. The list is neither necessary or sufficient, depending on context, but itâ€™s a start. Extreme values of any measurement are more likely to lead [â€¦]

Data privacy Twitter account

by

John

on 2019-04-08 01:02 (#4CP3A)

My newest Twitter account is Data Privacy (@data_tip). There I post tweets about ways to protect your privacy, statistical disclosure limitation, etc. I had a clever idea for the icon, or so I thought. I started with the default Twitter icon, a sort of stylized anonymous person, and colored it with the same blue and [â€¦]

Ratio of Lebesgue norm ball volumes

by

John

on 2019-04-07 20:28 (#4CNTZ)

As dimension increases, the ratio of volume between a unit ball and a unit cube goes to zero. Said another way, if you have a high-dimensional ball inside a high-dimensional box, nearly all the volume is in the corners. This is a surprising result when you first see it, but itâ€™s well known among people [â€¦]

Higher dimensional squircles

by

John

on 2019-04-04 14:00 (#4CFDK)

The previous post looked at what exponent makes the area of a squircle midway between the area of a square and circle of the same radius. We could ask the analogous question in three dimensions, or in any dimension. (What do you call a shape between a cube and a sphere? A cuere? A sphube?) [â€¦]

History of the “Squircle”

by

John

on 2019-04-03 01:04 (#4CBYM)

Architect Peter Panholzer coined the term â€œsquircleâ€ in the summer of 1966 while working for Gerald Robinson. Robinson had seen a Scientific American article on the superellipse shape popularized by Piet Hein and suggested Panholzer use the shape in a project. Piet Hein used the term superellipse for a compromise between an ellipse and a [â€¦]

Covered entities: TMPRA extends HIPAA

by

John

on 2019-04-02 15:58 (#4CAW9)

The US HIPAA law only protects the privacy of health data held by â€œcovered entities,â€ which essentially means health care providers and insurance companies. If you give your heart monitoring data or DNA to your doctor, it comes under HIPAA. If you give it to Fitbit or 23andMe, it does not. Government entities are not [â€¦]

Inferring religion from fitness data

by

John

on 2019-04-01 03:04 (#4C8XA)

Fitness monitors reveal more information than most people realize. For example, it may be possible to infer someoneâ€™s religious beliefs from their heart rate data. If you have location data, itâ€™s trivial to tell whether someone is attending religious services. But you could make a reasonable guess from cardio monitoring data alone. Muslim prayers occur [â€¦]

Putting topological data analysis in context

by

John

on 2019-03-29 13:04 (#4C2RD)

I got a review copy of The Mathematics of Data recently. Five of the six chapters are relatively conventional, a mixture of topics in numerical linear algebra, optimization, and probability. The final chapter, written by Robert Ghrist, is entitled Homological Algebra and Data. Those who grew up with Sesame Street may recall the song â€œWhich [â€¦]

Assumed technologies

by

John

on 2019-03-27 13:30 (#4BY4Q)

I just had a client ship me a laptop. We never discussed what OS the computer would run. I havenâ€™t opened the box yet, but I imagine itâ€™s running Windows 10. Iâ€™ve had clients assume I run Windows, but also others who assume I run Linux or Mac. I donâ€™t recall anyone asking me whether [â€¦]

Elementary solutions to differential equations

by

John

on 2019-03-26 12:00 (#4BVHV)

Differential equations rarely have closed-form solutions. Some do, and these are emphasized in textbooks. For this post we want to look specifically at homogeneous second order linear equations: y â€ + a(x) yâ€˜ + b(x) y = 0. If the coefficient functions a and b are constant, then the solution can be written down in terms [â€¦]

by

John

on 2019-03-25 18:56 (#4BT37)

It occurred to me recently that I rarely hear about finite rings. I did a Google Ngram search to make sure this isnâ€™t just my experience. Source Why are finite groups and finite fields common while finite rings are not? Finite groups have relatively weak algebraic structure, and demonstrate a lot of variety. Finite fields [â€¦]

Monads and generalized elements

by

John

on 2019-03-24 20:52 (#4BR71)

Paolo Perrone gives a nice, succinct motivation for monads in the introduction to his article on probability and monads. â€¦ a monad is like a consistent way of extending spaces to include generalized elements of a specific kind. He develops this idea briefly, and links to his dissertation where he gives a longer exposition (pages [â€¦]

Mixing error-correcting codes and cryptography

by

John

on 2019-03-23 16:58 (#4BP8Y)

Secret codes and error-correcting codes have nothing to do with each other. Except when they do! Error-correcting codes Error correcting code make digital communication possible. Without some way to detect and correct errors, the corruption of a single bit could wreak havoc. A simple example of an error-detection code is check sums. A more sophisticated [â€¦]

US Army applying new areas of math

by

John

on 2019-03-21 14:27 (#4BHP4)

Many times on this blog Iâ€™ve argued that the difference between pure and applied math is motivation. As my graduate advisor used to say, â€œApplied mathematics is not a subject classification. Itâ€™s an attitude.â€ Traditionally there was general agreement regarding what is pure math and what is applied. Number theory and topology, for example, are [â€¦]

Riffing on mistakes

by

John

on 2019-03-19 16:32 (#4BD1R)

I mentioned on Twitter yesterday that one way to relieve the boredom of grading math papers is to explore mistakes. If a statement is wrong, what would it take to make it right? Is it approximately correct? Is there some different context where it is correct? Several people said theyâ€™d like to see examples, so [â€¦]

A genius can admit finding things difficult

by

John

on 2019-03-19 13:58 (#4BCMF)

Karen Uhlenbeck has just received the Abel Prize. Many say that the Fields Medal is the analog of the Nobel Prize for mathematics, but others say that the Abel Prize is a better analog. The Abel prize is a recognition of achievement over a career whereas the Fields Medal is only awarded for work done [â€¦]

Thermocouple polynomials and other sundries

by

John

on 2019-03-19 01:00 (#4BBKJ)

I was looking up something on the NIST (National Institute of Standards and Technology) web site the other day and ran across thermocouple polynomials. I wondered what that could be, assuming â€œthermocoupleâ€ was a metaphor for some algebraic property. No, it refers to physical thermocouples. The polynomials are functions for computing voltage as a function [â€¦]

Digital signatures with oil and vinegar

by

John

on 2019-03-18 11:43 (#4BA03)

â€œUnbalanced oil and vinegarâ€ is a colorful name for a cryptographic signature method. This post will give a high-level description of the method and explain where the name comes from. The RSA encryption algorithm depends on the fact that computers can easily multiply enormous numbers, but they cannot efficiently factor the product of two enormous [â€¦]

Counting irreducible polynomials over finite fields

by

John

on 2019-03-14 17:40 (#4B2VC)

You can construct a finite field of order pn for any prime p and positive integer n. The elements are polynomials modulo an irreducible polynomial of degree n, with coefficients in the integers mod p. The choice of irreducible polynomial matters, though the fields you get from any two choices will be isomorphic. For example, [â€¦]

Scaling up differential privacy: lessons from the US Census

by

John

on 2019-03-14 15:52 (#4B2H7)

The paper Issues Encountered Deploying Differential Privacy describes some of the difficulties the US Census Bureau has run into while deploying differential privacy for the 2020 census. Itâ€™s not surprising that they would have difficulties. Itâ€™s surprising that they would even consider applying differential privacy on such an enormous scale. If your data project is [â€¦]

Average distance between planets

by

John

on 2019-03-13 03:17 (#4AYQ9)

What is the closest planet to Earth? The planet whose orbit is closest to the orbit of Earth is clearly Venus. But what planet is closest? That changes over time. If Venus is between the Earth and the sun, Venus is the closest planet to Earth. But if Mercury is between the Earth and the [â€¦]

All elliptic curves over fields of order 2 and 3

by

John

on 2019-03-11 15:52 (#4ATT1)

Introductions to elliptic curves often start by saying that elliptic curves have the form yÂ² = xÂ³ + ax + b. where 4aÂ³ + 27bÂ² â‰ 0. Then later they say â€œexcept over fields of characteristic 2 or 3.â€ What does characteristic 2 or 3 mean? The order of a finite field is the number of [â€¦]

US Census Bureau embraces differential privacy

by

John

on 2019-03-10 14:11 (#4ARRT)

The US Census Bureau is convinced that traditional methods of statistical disclosure limitation have not done enough to protect privacy. These methods may have been adequate in the past, but it no longer makes sense to implicitly assume that those who would like to violate privacy have limited resources or limited motivation. The Bureau has [â€¦]

...41 42 43 444546 47 48 49 50...