In 1905 and 1908, Einstein developed two formulations for the diffusion of a small particle in a liquid. As a side-benefit of the first derivation, he demonstrated the visible existence of molecules, a remarkable piece of work. In the second formulation, he derived the same result using non-equilibrium thermodynamics, something he seems to have developed on the spot. I’ll give a brief version of the second derivation, and will then I’ll show off my own extension. It’s one of my proudest intellectual achievements.
But first a little background to the problem. In 1827, a plant biologist, Robert Brown examined pollen under a microscope and noticed that it moved in a jerky manner. He gave this “Brownian motion” the obvious explanation: that the pollen was alive and swimming. Later, it was observed that the pollen moved faster in acetone. The obvious explanation: pollen doesn’t like acetone, and thus swims faster. But the pollen never stopped, and it was noticed that cigar smoke also swam. Was cigar smoke alive too?
Einstein’s first version of an answer, 1905, was to consider that the liquid was composed of atoms whose energy was a Boltzmann distribution with an average of E= kT in every direction where k is the Boltzmann constant, and k = R/N. That is Boltsman’s constant equals the gas constant, R, divided by Avogadro’s number, N. He was able to show that the many interactions with the molecules should cause the pollen to take a random, jerky walk as seen, and that the velocity should be faster the less viscous the solvent, or the smaller the length-scale of observation. Einstein applied the Stokes drag equation to the solute, the drag force per particle was f = -6πrvη where r is the radius of the solute particle, v is the velocity, and η is the solution viscosity. Using some math, he was able to show that the diffusivity of the solute should be D = kT/6πrη. This is called the Stokes-Einstein equation.
In 1908 a French physicist, Jean Baptiste Perrin confirmed Einstein’s predictions, winning the Nobel prize for his work. I will now show the 1908 Einstein derivation and will hope to get to my extension by the end of this post.
Consider the molar Gibbs free energy of a solvent, water say. The molar concentration of water is x and that of a very dilute solute is y. y<<1. For this nearly pure water, you can show that µ = µ° +RT ln x= µ° +RT ln (1-y) = µ° -RTy.
Now, take a derivative with respect to some linear direction, z. Normally this is considered illegal, since thermodynamic is normally understood to apply to equilibrium systems only. Still Einstein took the derivative, and claimed it was legitimate at nearly equilibrium, pseudo-equilibrium. You can calculate the force on the solvent, the force on the water generated by a concentration gradient, Fw = dµ/dz = -RT dy/dz.
Now the force on each atom of water equals -RT/N dy/dz = -kT dy/dz.
Now, let’s call f the force on each atom of solute. For dilute solutions, this force is far higher than the above, f = -kT/y dy/dz. That is, for a given concentration gradient, dy/dz, the force on each solute atom is higher than on each solvent atom in inverse proportion to the molar concentration.
Now calculate the speed of each solute atom. It is proportional to the force on the atom by the same relationship as appeared above: f = 6πrvη or v = f/6πrη. Inserting our equation for f= -kT/y dy/dz, we find that the velocity of the average solute molecule,
v = -kT/6πrηy dy/dz.
Let’s say that the molar concentration of solvent is C, so that, for water, C will equal about 1/18 mols/cc. The atomic concentration of dilute solvent will then equal Cy. We find that the molar flux of material, the diffusive flux equals Cyv, or that
Molar flux (mols/cm2/s) = Cy (-kT/6πrηy dy/dz) = -kTC/6πrη dy/dz -kT/6πrη dCy/dz.
where Cy is the molar concentration of solvent per volume.
Classical engineering comes to a similar equation with a property called diffusivity. Sp that
Molar flux of y (mols y/cm2/s) = -D dCy/dz, and D is an experimentally determined constant. We thus now have a prediction for D:
D = kT/6πrη.
This again is the Stokes Einstein Equation, the same as above but derived with far less math. I was fascinated, but felt sure there was something wrong here. Macroscopic viscosity was not the same as microscopic. I just could not think of a great case where there was much difference until I realized that, in polymer solutions there was a big difference.
Polymer solutions, I reasoned had large viscosities, but a diffusing solute probably didn’t feel the liquid as anywhere near as viscous. The viscometer measured at a larger distance, more similar to that of the polymer coil entanglement length, while a small solute might dart between the polymer chains like a rabbit among trees. I applied an equation for heat transfer in a dispersion that JK Maxwell had derived,

where κeff is the modified effective thermal conductivity (or diffusivity in my case), κl and κp are the thermal conductivity of the liquid and the particles respectively, and φ is the volume fraction of particles.
To convert this to diffusion, I replaced κl by Dl, and κp by Dp where
Dl = kT/6πrηl
and Dp = kT/6πrη.
In the above ηl is the viscosity of the pure, liquid solvent.
The chair of the department, Don Anderson didn’t believe my equation, but agreed to help test it. A student named Kit Yam ran experiments on a variety of polymer solutions, and it turned out that the equation worked really well down to high polymer concentrations, and high viscosity.
As a simple, first approximation to the above, you can take Dp = 0, since it’s much smaller than Dl and you can take Dl to equal Dl = kT/6πrηl as above. The new, first order approximation is:
D = kT/6πrηl (1 – 3φ/2).
We published in Science. That is I published along with the two colleagues who tested the idea and proved the theory right, or at least useful. The reference is Yam, K., Anderson, D., Buxbaum, R. E., Science 240 (1988) p. 330 ff. “Diffusion of Small Solutes in Polymer-Containing Solutions”. This result is one of my proudest achievements.
R.E. Buxbaum, March 20, 2024















