One second per second is harder than it sounds (rachelbythebay.com)
225 points by luu on July 22, 2014 | hide | past | favorite | 97 comments


For a long time, synchronized clocks were the networking world's equivalent of the regular expressions joke ("Now you've got two problems..."). That said, having implemented a 100 ns accurate clock using the PPS output of a $30 GPS module (Adafruit) and a BeagleBone Black (multiple of them), there are some interesting alternatives these days. It would be interesting if the Open Compute folks would specify a 1PPS GPIO input as part of the spec; you could deliver rack-level 1PPS-accurate signals throughout the data center pretty cost effectively (a few wires, a transmitter, a TDR tool). Not that it will ever be a 'solved' problem, but it certainly makes some transactional protocols safer.


I've used PTP with special timestamping hardware (network cards) and could synchronize machines on a LAN pretty well. That doesn't guarantee global sync, though, just sync between the machines.

GPS NTP servers can be had for as little as $600 or so, so they're becoming less exotic.


So FWIW, my BeagleBone is a "GPS NTP server", and it was built out of the following:

1) BeagleBone $55 (http://www.adafruit.com/product/1876)

2) Proto cape $10 (http://www.adafruit.com/products/572)

3) Power supply $8 (http://www.adafruit.com/products/276)

4) Enclosure $7 (http://www.makershed.com/products/beaglebone-black-enclosure)

5) GPS board $40 w/ PPS (http://www.adafruit.com/products/746)

6) Ext Antenna ($13) (http://www.adafruit.com/products/960)

7) SMA to Antenna ($4) (http://www.adafruit.com/products/851)

So, about $137 plus tax and shipping, running Robert Nelson's Ubuntu image with OpenNTPD on it. The trick in data centers, though, is always getting visibility to GPS satellites. I wonder when the data center guys will figure out they can add a 'cross connect' to a GPS antenna on the roof for $75/month or something :-)


That is pretty neat. I got my Beaglebone but haven't quite figured out what I am going to do with it. A GPS time server is a neat idea.

The price sounds about right. But you also put in your own expertise, time, and labor. Then add an enclosure, documentation, a support line, and CE+FCC certifications, and it can quickly get up there in price.


Oh absolutely, I completely get why folks sell them for $600 each. (They were $25,000 each at one time, which I found rather extortionate.) I used to run ntpd on my home server to keep the machines in my house synced, until the NTP reflection attack came out and folks were dragging down my cheezy home net by using it to attack others. Now I only run NTP inside the firewall :-)


We had radio time receivers (Rugby MSF, now Anthorn MSF) for a long time in the UK. Just plug one into an RS232 port, fire up the daemon, and that box is your site-wide master NTP server. Done.

I think the last one we paid for was in the region of $250 as the hardware is very simple.

http://www.npl.co.uk/science-technology/time-frequency/produ...

GPS is an interesting prospect, but the signals are pretty high frequency, which is a PITA if you have a unit inside a rack inside a machine room inside a building. MSF is 60 kHz, so it propagates by ground wave and has great coverage.


But it's worth noting that it is only ±1 ms accurate, which is not enough for many applications.


Very true, but system-wide consistency across all participants is a different problem from external reference accuracy. Still, it's a reasonable basis that you can reason about.


"A man with one watch always knows what time it is. A man with two watches is never quite sure..."


There is now a standard for keeping time on LANs: IEEE 1588, which is built into a lot of new network cards. It enables the NIC to timestamp incoming packets, which can be used to achieve sub-microsecond timing.

Normally it is deployed with a single GPS receiver on the network that then feeds all servers/devices. It is starting to become very common as a reference in broadcast applications. Furthermore, most manufacturers already offer application notes/libraries to make use of it, and Linux offers PTPd, which implements the standard.

http://ptpd.sourceforge.net/

http://en.wikipedia.org/wiki/Precision_Time_Protocol

http://www.nist.gov/el/isd/ieee/ieee1588.cfm


PTP is a good choice for synchronizing clocks on a per-segment basis, but it doesn't work across a WAN and there are lots and lots of devices out there that don't support it.

NTP is good for doing time sync across the WAN, but to get the most out of it for certain applications you need to hook it up to reference clocks.

So, NTP can use PTP as a reference clock, and Bob's your uncle.
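The WAN-side math NTP relies on is compact enough to sketch. Under the simplifying assumption of symmetric network delay, the client can estimate its offset from the server using the four on-wire timestamps (this is the textbook RFC 5905 calculation, with made-up numbers):

```python
# Sketch of NTP's clock-offset estimate (RFC 5905). The client records
# t1 (request sent) and t4 (response received) on its own clock; the
# server stamps t2 (request received) and t3 (response sent).
def ntp_offset_delay(t1, t2, t3, t4):
    offset = ((t2 - t1) + (t3 - t4)) / 2  # how far the client clock trails the server
    delay = (t4 - t1) - (t3 - t2)         # round-trip network delay
    return offset, delay

# Example: client clock 0.5 s slow, symmetric 10 ms one-way delay,
# 10 ms server processing time.
off, d = ntp_offset_delay(100.000, 100.510, 100.520, 100.030)
# off == 0.5 (seconds slow), d == 0.02 (round-trip delay)
```

With asymmetric paths the offset estimate silently absorbs half the asymmetry, which is one reason reference clocks close to the machines still matter.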


28 people killed because a clock was off by 0.34 seconds (the system hadn't been restarted in 100 hours):

http://fas.org/spp/starwars/gao/im92026.htm


Isn't it actually "28 lives couldn't be saved" because of the clock issue? Also, the article states that the defense had never been tried against a Scud and could have failed anyway.


It probably would have failed.

The Gulf War Patriot was designed to shoot down Mach 2 jets, not Mach 5 Scuds.

Detonating slightly late against an aircraft is actually a good thing, since it makes the Patriot explode near the aircraft's middle instead of its nose.

A Scud is a short-range ballistic missile. Once it has burnt out its fuel, it just carries on under the influence of gravity. From that point on, the only important part is the warhead in the very tip of the nose.

Detonating by the mid section of a Scud means you're just shredding empty fuel tanks. And the huge speeds involved might actually make Gulf War Patriot missiles detonate against the rear of the Scud and just shred the non-functioning engine, or even detonate behind the Scud entirely.


"...the delay in distributing the software from the United States to all Patriot locations was due to the time it took to arrange for air and ground transportation in a wartime environment."

Fascinating to think that just 23 years ago, the only way to transmit the updated software was by physically transporting it from one location to another.


Just for fun: this predates ssh. Everyone was still using telnet to log in to remote boxes.

23 years ago, Slashdot didn't exist.

23 years ago, Linux had just gotten started. For all practical purposes, it wouldn't exist for another month yet.

23 years ago, Mosaic (the first "modern" web browser) didn't yet exist. That was OK, because CERN was the only website; the www didn't exist yet either.


I suspect you still can't do OTA updates of this kind of mission-critical system. If you can, I'd be very concerned about security.


I don't see any reason the software update process needs to be more secure than the firing process, and presumably these batteries can be fired remotely.


You don't need OTA updates per se - they could transfer it using sftp etc to the local base.


I think it means that this was the first conflict in which it was used against Scuds. There was a software update a few days before this incident, made in response to another Patriot/Scud engagement. The big issue this time was that the missile didn't even fire.

http://embeddedgurus.com/barr-code/2014/03/lethal-software-d...

It's interesting that there were six software updates to the Patriot missile batteries during Desert Storm, several of them specifically to fix clock errors.

It would be interesting to see the data for the misses and the hits compared to how long each system had been online for and what version of the software it was using.

Today's Patriots are better (PAC-3, with PAC-4 in testing) but still have several issues. I remember reading that a pilot got so mad about constantly being locked onto by friendly Patriot batteries that he shot a missile at one of them. http://books.google.com/books?id=7cU8Kpzu9osC&pg=PA339&lpg=P...

These seem to be mostly radar issues. I can't imagine how difficult it must be to program a missile to intercept another missile; there are hundreds of variables involved. I'd be willing to bet that these days they can run them through a computer simulator and detect most of these issues.

Unrelated to the missile but relating to the original topic: http://www.wired.com/2012/11/google-spanner-time/


I'm guessing that the people wouldn't have been standing within range of missile attacks if they didn't expect that the missile defense system would protect them.


Almost everywhere on earth is currently "within range of missile attacks" without any reliable missile defense shield protecting us. From the report, this was during Operation Desert Storm and they were in an Army barracks. Given that these systems had never been used to defend against SCUD missiles before, it seems likely that they were taking a calculated risk by being there, not simply assuming the missile shield would protect them.


I wasn't aware of the speed issues with intercepting SCUDs. Makes a lot more sense when you factor that in - these systems were being used as a last line of defense, not a shield.


Having followed the link, I'd like to mention that a Scud missile was also involved.


And a war.


I think the question is: when does it matter? What applications rely on an accurate clock? I guess it would matter if you are running some algorithm over logs that uses time as an input. I'd be interested to hear other examples.

DJB has a clockspeed package that addresses this problem. Anyone used it?

http://thedjbway.b0llix.net/clockspeed.html

It seems interesting but I haven't (yet) run into the need for super accurate clocks (or perhaps I am not running enough machines).


As soon as you have any kind of highly-distributed system, accurate clocks are usually very important. When events can start on one machine, and end on another, it's important to have some kind of consistency (knowing that one event occurred before another). Probably the most famous paper on this is Lamport's "Time, clocks, and the ordering of events in a distributed system" - http://dl.acm.org/citation.cfm?id=359563.

But to give a practical example: on something like Facebook, if you receive two comments on a post within a millisecond, how do you determine the order they appear in the list? Each request may hit a different datacentre in the world, yet somehow the machines determine the ordering, and every request sees that ordering from then on. You can't just have a single server, because it couldn't handle the load for the entire world. These kinds of things require very strict temporal ordering, which is why time is so important. This leads to the CAP theorem, and distributed systems in general.


So that's exactly why you don't depend on hardware clocks -- you use software techniques when your algorithm depends on some notion of time.

The point of Lamport's paper is that time in distributed systems is a partial ordering, not a total ordering. If you are taking values from hardware clocks, you are using a total ordering.
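The logical clock from Lamport's paper fits in a few lines (the class and method names here are mine, not from any particular library): each process bumps a counter on every local event and, on receive, folds in the sender's stamp, yielding a consistent "happened-before" ordering with no hardware clock at all.

```python
# Minimal Lamport logical clock: tick on local events, and on receive
# take max(local, received) + 1 so every message's send precedes its
# receipt in the logical order.
class LamportClock:
    def __init__(self):
        self.time = 0

    def tick(self):               # a local event occurred
        self.time += 1
        return self.time

    def send(self):               # stamp an outgoing message
        return self.tick()

    def receive(self, msg_time):  # merge the sender's stamp
        self.time = max(self.time, msg_time) + 1
        return self.time

a, b = LamportClock(), LamportClock()
stamp = a.send()        # a.time == 1
b.tick(); b.tick()      # b.time == 2
t = b.receive(stamp)    # max(2, 1) + 1 == 3
```

Note that two events with different stamps on different processes may still be causally unrelated; the stamps give a total order consistent with causality, not causality itself.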


Google Spanner is a DB that uses atomic clocks distributed worldwide. They seem to think the pain of dealing with partial ordering is not worth the hassle, given that there genuinely is a total ordering on operations.

http://en.wikipedia.org/wiki/Spanner_(database)


"given there genuinely is a total ordering on operations."

I thought relativity says there genuinely isn't (but it's close enough we can fudge it...)


Ha! I think it's OK as long as all parties involved are travelling on the Earth and not around it at near the speed of light.

http://en.wikipedia.org/wiki/Relativity_of_simultaneity


Right. Like I said, we can (presently) fudge it. When we get moving faster that ceases to be the case.


Yeah, but you can't have your clocks drifting a lot. It's fine if they are consistently wrong, but when a second isn't a second, things can start to screw up.


Does Lamport's design use seconds at all, or use only "happens before" statements?


"As soon as you have any kind of highly-distributed system, accurate clocks are usually..."

Let me stop you right there. If you have a highly distributed system, its design CANNOT rely on perfectly synchronized clocks among its components. That's actually the point of the Lamport paper you cited, as well as of the OP. I started writing a detailed rehash of Lamport's paper, but I realized I'd do a poor job of it at best. Instead, I think it's best to just point to the excellent and highly readable paper itself: http://research.microsoft.com/en-us/um/people/lamport/pubs/t...


Using a highly accurate clock to solve ordering like that is punting on the actual problem. It's possible for requests to different servers to happen at exactly the same time. You need a mechanism to handle that. And any mechanism that can handle that can also handle clocks being a couple hundred milliseconds out of sync.

At a certain level of precision you have to accept that data only travels at the speed of light and there is no way to have a global ordering that shows data consistently and immediately. So you can use an easy non-immediate method like a vector clock.
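A vector clock of the kind mentioned above can be sketched in a few lines (an illustrative sketch, not any specific production implementation): one counter per node, and comparing two vectors can report "concurrent", which makes the partial order explicit.

```python
# Vector clock sketch: a dict of per-node counters. Comparing two
# vectors yields "before", "after", "equal", or "concurrent" -- a
# partial order, unlike a single wall-clock timestamp.
def vc_merge(local, remote):
    return {k: max(local.get(k, 0), remote.get(k, 0))
            for k in set(local) | set(remote)}

def vc_compare(a, b):
    keys = set(a) | set(b)
    le = all(a.get(k, 0) <= b.get(k, 0) for k in keys)
    ge = all(a.get(k, 0) >= b.get(k, 0) for k in keys)
    if le and ge:
        return "equal"
    if le:
        return "before"
    if ge:
        return "after"
    return "concurrent"   # neither dominates: no causal order exists

x = {"n1": 2, "n2": 0}   # two writes seen at node n1
y = {"n1": 1, "n2": 3}   # concurrent history at node n2
vc_compare(x, y)         # "concurrent" -- the writes cannot be ordered
```

When two updates compare as "concurrent", the application has to resolve the conflict itself (last-writer-wins, merge, or surfacing both versions), which is exactly the problem a perfectly synchronized clock was hoped to paper over.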


Power grids require highly accurate, distributed clocks.

Well they don't strictly require them, but having that capability increases efficiency, robustness and safety.

They need to be accurate because electricity is fast, so a lot of protection schemes need a decision made across very long distances (which breaker, at which point on a 100-mile transmission line, should be flipped to isolate faults or stop 'bad things' from propagating) with minimal impact to the system as a whole.

They also need to be accurate because a good measure of frequency and phasors at geographically disparate points allows system tuning. A tiny change, e.g. speeding up one of the hundreds of generators, can result in a much smoother, overall more efficient transmission system. Similarly, adjustments to capacitor banks and transformers can provide slight phase adjustments for optimal power flow. But these resources are hundreds of miles apart; e.g., the western half of the US and Canada (and parts of Mexico, but not Texas) is one big system.

Being a large physical system with feedback, and also being electricity, you get some very interesting effects to try to reason about. There can be relatively slow oscillations in the system, but at the same time many high-frequency noise signals are mixed into the electrical signal. To find trends and patterns in the slow oscillations you need a very high sample rate, to determine which noise to ignore. And those samples need to be time-aligned, because even a perfect wave will have different phases at different points along its propagation path.

Anyway - that's an example I'm familiar with. I hope it made sense, seeing as I may not have had quite enough coffee yet.


Clock years out of sync: All SSL certificates appear invalid.

Clock minutes out of sync: "Here's a temporary security token, I don't think it's expired" "That credential has expired" "This never came up in testing, I think I'll crash"

Clock seconds out of sync: Log files on different servers can't be merged reliably. Different requests went to different load-balanced servers and now you can't tell what order the logged events happened in.
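A tiny illustration of that last failure mode, with made-up timestamps: server B's clock runs two seconds fast, so merging logs by timestamp shows the response leaving before B ever handled the request.

```python
import heapq

# Per-server logs as (timestamp, message) pairs. B's clock is 2 s
# fast: its event really happened at ~10.6 but is stamped 12.6.
log_a = [(10.0, "A: request in"), (10.7, "A: response out")]
log_b = [(12.6, "B: handled request")]

merged = list(heapq.merge(log_a, log_b))
# The merged log now claims A sent the response before B handled it.
```

With skew bounded to well under the typical request latency, the merged order is trustworthy; once skew exceeds it, causality in the merged log is fiction.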


And that last one has had many people chasing ghosts for years. I can also attest to time-sync issues killing a number of SAML assertions and their validation windows.


Communicating with other servers is a good example.

I need an accurate clock, to within a few seconds, when I sign Amazon S3 URLs. I have code that gives access to restricted files on S3. To get access, the customer makes a request to my server, I authenticate them, sign a new URL, and redirect them to the signed S3 URL.

For security reasons, the URL gets a signature that expires after a few seconds. The expiration date is X seconds from my server's clock. If my server's time differs from S3's server time, then according to S3 the link could be expired before it's even created.

In theory, a redirect takes a second or less to complete, so you'd think an expiration 5 seconds in the future would be enough. In reality, over time, the server clock can differ by as much as a minute or so from S3's clock. So I just set the expiration to 120 seconds in the future and call it a day.
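The pattern can be sketched with the standard library. This is a generic HMAC-signed expiring URL, not Amazon's actual signature algorithm; `SECRET`, `SKEW_MARGIN`, and the URL shape are all invented for illustration. The point is that the expiry is computed from the signer's clock, so the margin has to absorb any skew against the verifier:

```python
import hashlib
import hmac
import time

SECRET = b"demo-secret"   # hypothetical signing key, not a real S3 credential
SKEW_MARGIN = 120         # seconds of slack for clock drift, as in the comment

def sign_url(path, now=None):
    # Expiry comes from *our* clock, so SKEW_MARGIN must cover however
    # far we might drift from the verifier's clock.
    expires = int(now if now is not None else time.time()) + SKEW_MARGIN
    msg = f"{path}:{expires}".encode()
    sig = hmac.new(SECRET, msg, hashlib.sha256).hexdigest()
    return f"{path}?expires={expires}&sig={sig}"

def verify(url, now):
    # The verifier (playing the role of S3) checks the signature and
    # the expiry against its own clock.
    path, query = url.split("?", 1)
    params = dict(p.split("=", 1) for p in query.split("&"))
    msg = f"{path}:{params['expires']}".encode()
    good = hmac.compare_digest(
        params["sig"], hmac.new(SECRET, msg, hashlib.sha256).hexdigest())
    return good and now <= int(params["expires"])

url = sign_url("/bucket/file", now=1000)
verify(url, now=1060)   # verifier's clock 60 s ahead: still inside the margin
```

A 5-second margin would have rejected this URL the moment the verifier's clock drifted past it; 120 seconds trades a slightly longer attack window for robustness against a minute of drift.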


If ultra-short expiry times really mattered to you and ntpd weren't working reliably, you could always make a request to S3 and use the response's version of the date (which presumably is the version of time that really matters, as far as S3 is concerned).
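A sketch of that idea: HTTP responses carry a `Date` header, and the stdlib can parse it, so you can estimate your skew against the remote service without NTP at all. The header value below is a made-up example; in practice you would read it off an actual response.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime

def clock_skew(date_header, local_now):
    # Estimate our offset from a server using the RFC-format Date
    # header of any response it sends. Positive means the server's
    # clock is ahead of ours. Resolution is only ~1 s (plus network
    # latency), but that's plenty for sizing a URL-expiry margin.
    server_now = parsedate_to_datetime(date_header)
    return (server_now - local_now).total_seconds()

# Hypothetical header; in practice, take it from a real response.
hdr = "Tue, 22 Jul 2014 12:00:30 GMT"
local = datetime(2014, 7, 22, 12, 0, 0, tzinfo=timezone.utc)
clock_skew(hdr, local)   # 30.0 -- the server is half a minute ahead
```

You can then add the measured skew (plus some slack) to the expiry instead of a fixed guess.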


For a bunch of distributed systems, nodes need to roughly match real time, but also stay within some tolerance of each other; it's simplest to make everything as accurate to real time as possible.

I work in broadcast automation. It's a distributed system and the clocks need to be within a certain tolerance of each other. Historically that's done by having a dedicated piece of timekeeping hardware that dictates the station clock, but recently we've been experimenting with using NTP.

We have one node (A) where you author time-sensitive events, and another node (B) that polls for pages of those events, loops through them and takes some action when they should occur. Without both nodes agreeing somewhat on the time you'll end up missing events.


IRC: everything is timestamped to the subsecond. Any drifting causes channel takeovers and "random" nick kills, even links fail. Also, since a message from a server is broadcasted across the entire network, one misconfigured server can cause chaos in your hubs or completely unrelated leafs.

Or at least that's how I remember it from managing an unstable (as in DDoSed, every IRC service in active development, a few in-house bots, etc.) IRC network and from reading the IRC and IRCv3 specs and Unreal's and TS6's server message specs. If I'm wrong, I'm sure someone here will correct me.


Your comment is very surprising. I don't see how clock drift can cause the problems you describe. There is no timestamping in the IRC protocol.

You can ask for a server's local time, but that is just for information.

There is a recently-added timestamp extension, but that seems to be to correct for delays so clients can record logs correctly. It is not for keeping the IRC network together.


Yes, there is. It's just not on the client side; it's server-to-server, and due to the centralised nature of IRC, it's critical.

For example, here's a paragraph from the UnrealIRCd server protocol (http://www.unrealircd.com/files/docs/technical/serverprotoco...):

> Unreal is very time-dependant. Users and channels, for example, are timestamped, and if server clocks are not synchronized properly, things can go very wrong very fast. See http://vulnscan.org/UnrealIrcd/faq/#67 for more information on this. Note that there is a slight difference between server time and what is actually reported by the UNIX date command or by the C time() function.

The other major side of IRC server protocol is the TS6 protocol (you can see for yourself at http://irc-wiki.org/IRCd_Comparison). TS stands for TimeStamp, and you can see how critical timestamps are at https://github.com/avenj/irc-server-pluggable/blob/master/do... and https://github.com/atheme/charybdis/blob/master/doc/technica.... Timestamps control almost everything related to nicks, channels, and their properties.

Before the implementation of TimeStamps and Services, it was really easy to abuse netsplits to take over channels and people's nicks.

There's a complete explanation at http://codeidol.com/internet/irc/The-IRC-Protocol/Timestamp-... (and of course, a Wikipedia link: http://en.wikipedia.org/wiki/Internet_Relay_Chat#Timestampin...).


Several security protocols (e.g. Kerberos) depend on systems having a somewhat reliable clock. They'll issue some sort of authentication token on system A that a user is meant to present to system B; the token has some expiry time, and the security of the system depends on systems A and B having comparable clocks.
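The usual compromise in such protocols is a bounded skew tolerance rather than exact agreement; Kerberos traditionally defaults to five minutes. A sketch of that kind of validity check (the function and field names are mine, not from any Kerberos library):

```python
from datetime import datetime, timedelta, timezone

MAX_SKEW = timedelta(minutes=5)   # Kerberos' traditional default tolerance

def ticket_valid(not_before, not_after, local_now):
    # Accept the token if our clock could be up to MAX_SKEW off in
    # either direction: a sketch of why Kerberos insists on roughly
    # synchronized clocks rather than exact ones.
    return (local_now >= not_before - MAX_SKEW and
            local_now <= not_after + MAX_SKEW)

issue = datetime(2014, 7, 22, 12, 0, tzinfo=timezone.utc)
expiry = issue + timedelta(hours=8)
ticket_valid(issue, expiry, issue - timedelta(minutes=3))   # True: within skew
ticket_valid(issue, expiry, issue - timedelta(minutes=10))  # False: too far behind
```

The tolerance is a trade-off: widening it hides more clock drift but also lengthens the window in which a replayed token is accepted.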


I know offhand that gaming and financial services rely on very accurate clocks at the application level. And you can't necessarily rely on clients to tell you what their time is, as it can be spoofed.


Gaming is actually a really bad example since, as you say, you can't rely on the clients telling you a time, as it may be spoofed. Thus the clients' times are always ignored, and time sync is irrelevant.

I'll say that in about 10 years of working on multiplayer games, I've never once put a wall-clock or other absolute time on the wire for any purpose other than printing the server time to the player for informational purposes.


Time sync is very important because such games are typically played across clusters of machines, not just a single machine.


If by such games you mean MMOs specifically, then sure, I'll concede that time sync may be more valuable; but even then, event serialization is normally done via an authority hand-off model that does not actually require time sync.

However, if you mean that all multiplayer games are typically played across "clusters of machines" and thus require time sync, then my original point addressed that.


I've used clockspeed. tlsdate + clockspeed turns out to be pretty reliable, and a lot less code than ntpd.


Wireless networking protocols may need very precise time synchronization at the PHY/MAC layer. Generally, wireless technologies like WiMAX or LTE divide time into frames (e.g. an LTE frame is 10 ms). Then, depending on whether you're frequency duplexing (FDD) or time duplexing (TDD), the frames are divided into subframes with defined meanings. If different radios on the network don't agree on frame start/end times, interference occurs.

Here's an interesting paper describing the software implementation of certain features that are usually handled by the hardware (well, by software running on the baseband processor): http://people.freebsd.org/~sam/FreeBSD_TDMA-20090921.pdf.



Since the advent of virtualization (most notoriously VMware ESX), timekeeping has gotten much more difficult. Back in my unix admin days, I spent more than a few days on problematic Linux kernels in VMs under ESX (made far worse when hosts were overprovisioned and VMs were starved for CPU). I know later enhancements like tickless kernels may have helped, but I do not envy anyone who has to wrestle these beasts.


I spent about 2+ weeks in early 2003/2004 trying to get VMware Server to keep time in sync properly. I had memorized the VMware whitepaper on the topic and followed every possible bit of advice they offered (including not using ntpd); nothing worked. Eventually I just cronned ntpdate to run every minute, and that resolved the issue on all our systems.

This, by the way, horrified all our NTP theoreticians, who made it clear that running ntpdate was going to cause catastrophic things in our operations environment. But given that our logs were getting progressively more useless as we were unable to correlate event times between servers, the worst-case scenario (in my mind) had already occurred.

I don't recall any particularly negative side effects as a result of our ntpdate sledgehammer.

I presume things have gotten better in the last 10 years with VMware.


Somewhat.

Years ago, the answer was to run NTP on the host and use VMware Tools to help ensure that good-quality time was served to the guests. That's when the earlier versions of the page at <http://support.ntp.org/bin/view/Support/KnownOsIssues#Sectio... were written.

Since then, that advice has apparently been recanted.

The ultimate problem with any virtualization system is the frequency and accuracy of clock updates to the client, and the loss of clock interrupts.

To the degree you can solve that problem, protocols like NTP can work reasonably well to monitor and correct for various types of known clock issues.


Windows time sync only cares about being accurate to within 5 minutes, for Kerberos. Hyper-V makes Windows unable to manage even that. Oddly enough, Linux guests have no trouble with subsecond accuracy.

I'm told the w32time codebase is not something MS employees like to look at.


I don't think that w32time is something that users like to look at either! I recall a situation where w32time runs fine on a server and doesn't report an error, but still silently ignores the option you set, because it is using a time source from another machine that it has designated master.

I could be wrong, but I was disappointed that the utility didn't return any errors when it knew that it was going to ignore you.


I run various production systems in both VMware ESX and VirtualBox, and I have time issues in all VMs, regardless of hypervisor vendor. The VM tools are installed and operating "properly." The host machine clocks are synced correctly with ntp. Yet the VMs will sometimes get tens of minutes out of alignment in short order.

At this point I'm thinking I need to run ntp on the physical hardware with the least jitter, then just run ntpdate in a cron job on all the VMs. This would work better than ntp clients in the VMs or the VM tools.


Every so often, my linux vm running on VMware fusion gets really out of date. I tried the "sync time" feature in VMware, didn't fix it.

Eventually I had ntpdate running every minute via cron. Pretty annoying.


Run a cronjob every hour which:

a. ensures ntpd is running, and starts it if it's not.

b. checks `ntpq -rv` for ntpd's self-reported state, and bounces the daemon if it is not synced.

c. 'manually' checks the drift against a known stratum 1 server using ntpdate, and bounces the daemon if the drift is out of bounds (something like 50 ms is a pretty good cutoff, since ntpd normally does much better than that).

You can run this out of chef/puppet/whatever but it needs to run with a frequency of about an hour (faster than that and ntpd often won't settle well, and slower than that and you can be needlessly waiting too long to fix problems). Running it as a cronjob uncouples it from how often your config management runs.

This will catch all kinds of issues -- crashing ntpds, ntdps that randomly lose sync, kernel problems that keep them from keeping sync, ntpds that lie about their sync status, etc.

I set it up to e-mail me as a form of monitoring (with heavy-duty procmail filters).

I had 30,000 servers with this script running against them. I think 6 of them had shitty clocks that were getting reset every single hour, but were still keeping time correctly to under a second. Every couple of days a server would flip out and send a few e-mails until it quieted down and synced. At one point we got a shipment of several hundred new servers that all had kernel driver issues, and the cronjob bounced ntpd constantly on them until a kernel upgrade fixed the problem.

I also used to bounce ntpd once a night, but had to take that out because it would cause non-monotonic slew in the clock, which timers around service calls would report as negative seconds that, due to unsigned int conversion, turned into 4-billion-second p100 times.

You do want to be careful about things like network outages; if you can't reach your upstream stratum 1/2, it's probably better to leave things as they are. And again, you really want to limit the frequency at which you bounce the daemon, and watch for servers that get into a state where ntpd is being bounced constantly.
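The decision logic in steps (a)-(c) can be sketched roughly as below. The `ntpq` output strings are abbreviated approximations for illustration, not captured from a real daemon, and `MAX_OFFSET_MS` is the 50 ms cutoff suggested above:

```python
import re

MAX_OFFSET_MS = 50.0   # drift cutoff from the comment above

def needs_bounce(ntpq_rv_output):
    # Decide whether to restart ntpd from `ntpq -rv`-style output.
    # Bounce if ntpd self-reports losing sync (leap_alarm), if no
    # offset can be read at all, or if the offset is out of bounds.
    if "leap_alarm" in ntpq_rv_output:
        return True
    m = re.search(r"offset=(-?[\d.]+)", ntpq_rv_output)
    if m is None:
        return True            # can't even read an offset: assume sick
    return abs(float(m.group(1))) > MAX_OFFSET_MS

healthy = "status=0615 leap_none, sync_ntp, clock_sync, offset=0.163"
sick = "status=c011 leap_alarm, sync_unspec, offset=812.440"
needs_bounce(healthy)  # False
needs_bounce(sick)     # True
```

The actual cronjob would wrap this with the restart itself, the "don't bounce more than once an hour" throttle, and the reachability check for the upstream stratum 1/2 servers.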


This is a terrible idea when dealing with an application other than "users checking the time on their system clock".

Performing hard resets of the clock continually is a catastrophic infrastructure failure. A high performance 24/7 application that relies on sub millisecond let alone sub microsecond accuracy across multiple physical systems will be utterly destroyed by this approach.


Not just that, monotonicity could be impaired.


To elaborate: monotonicity means that a value only ever moves in one direction and never reverses. If a program is built on the assumption that time always goes forward, it may not like you setting the clock back even 0.000000001 s every hour.
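This is why most languages expose a monotonic clock alongside the wall clock; in Python, for example:

```python
import time

# time.time() follows the wall clock and can jump backwards when the
# clock is stepped; time.monotonic() is guaranteed never to go back,
# so it is the right tool for measuring durations.
start = time.monotonic()
time.sleep(0.01)
elapsed = time.monotonic() - start  # never negative, even if NTP steps the wall clock
```

Code that instead computes durations from the wall clock is exactly the code that produces negative elapsed times when a step correction lands mid-measurement.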


Many license management software are especially bothered by time changes of this sort.


Very true, a lot of software with time limited trials or features like "check out a network license for 30 days of offline use" used to be easy to bypass by turning your clock back so that it never expired.

This is pretty widely fixed now, but often overzealously. Clock changed unexpectedly? User is a dirty pirate and the license should deactivate.


Reading comprehension...

I had 30,000 servers, all but 6 of them were not getting hard resets. 6 of them were getting hard resets every hour because they were sick.

Occasionally a ntpd on a server would go a little crazy and it would get a few hard resets. This script got it back into spec so that it didn't need hard resets.

I mentioned specifically that you can't hard-reset ntpd more than about once an hour, or it won't have time to settle and you'll wind up constantly hard-resetting it (which is bad).

I took out the nightly forced hard resets of ntpd specifically because it affected monotonic time.

If you look at the algorithm I wound up with ntpd will only be reset if:

a. it is already crashed and not running

b. it is self-reporting that it is out of sync

c. it is more than 50 ms out of sync with an upstream time server

In all of those cases, you have already lost the battle to get sub-microsecond accuracy.


If the clock is drifting substantially then your high-precision application will already be in serious trouble.


If your clock solution, GPS + Sync, is drifting then your infrastructure is broken and your platform is not fit for service.


In the age of mobile phones that can get the time directly from GPS satellites with sub-100ns accuracy, the isolated hardware clock of a typical server strikes me as an arcane method of keeping time.

Is there an expansion card or USB device that can get the time from GPS and feed it to something like ntpd (adjusting for leap seconds using data downloaded from other networks)? Or would that be impractical because of all the walls and metal cages surrounding a typical server? What about an antenna on the roof of the datacenter that supplies GPS time to every server in the building?


Datacenter GPS-backed time sources exist. You can get little boxes which sit in a rack and hook to an antenna and feed nice time to the rest of the place. Those other machines pick it up with ntpd.

The problem is when your system clock is so far out of whack (and/or unpredictable) that even ntpd won't help you. Get enough supposedly-identical machines running and this will probably bite you too.


This is 100% true. If you have ever watched ntpd ignore a stratum 0 clock (and every other time source) because some stratum 6 device reports the wrong time, you know the pain. Yes, this isn't supposed to happen, and yes it has happened to me many times.


Well, you can override ntpd to always sync no matter how far out of whack the clock is, but it's not recommended:

tinker panic 0


While most smartphones could, theoretically, get their time from GPS, do any actually use this method? As far as I know, my Android handsets get their time via NTP.

Ironically, a misconfigured NTP server often screws up GPS fixes (it somehow seems to be necessary for assisted GPS, which relies on the cooperation of the mobile network or the Internet to acquire a GPS fix faster).

http://stackoverflow.com/questions/8308412/gps-how-ntp-time-...


I think cell phones get their time from the tower, since they have to sync clocks with the tower to talk to it anyway, and the towers get time from GPS.


>In the age of mobile phones that can get the time directly from GPS satellites with sub-100ns accuracy, the isolated hardware clock of a typical server strikes me as an arcane method of keeping time.

Genuinely curious, is it actually true that you can get sub-100ns time accuracy on a mobile phone? Is this because you have a direct connection with the satellite, which is calculating where you are? I would have assumed that the uncertainty on the latency of any such connection would be higher than 100 ns.


GPS satellites don't do any calculating for you, and there is no "direct connection" between the satellite and your phone. The GPS receiver in your phone is purely passive. It uses latency (usually a few dozen milliseconds) to calculate its distance from each satellite.

As long as you have access to at least 4 satellites, triangulation (quadri-angulation?) allows you to determine your precise location in all 4 dimensions, three of space and one of time.
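That solve can be sketched as a small Gauss-Newton iteration over the four unknowns (receiver x, y, z and clock bias). Everything below is a synthetic illustration with made-up satellite positions, not real receiver code:

```python
import math

C = 299_792_458.0  # speed of light, m/s

def solve4(A, b):
    """Solve a 4x4 linear system by Gaussian elimination with partial pivoting."""
    n = 4
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= f * M[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x

def gps_fix(sats, pranges, iters=12):
    """Gauss-Newton solve for receiver (x, y, z) and clock bias from pseudoranges."""
    x, y, z, cb = 0.0, 0.0, 0.0, 0.0   # start at Earth's centre, zero bias (metres)
    for _ in range(iters):
        J, r = [], []
        for (sx, sy, sz), pr in zip(sats, pranges):
            rng = math.dist((x, y, z), (sx, sy, sz))
            # Partial derivatives of (range + clock bias) w.r.t. the unknowns.
            J.append([(x - sx) / rng, (y - sy) / rng, (z - sz) / rng, 1.0])
            r.append(pr - (rng + cb))
        dx, dy, dz, dcb = solve4(J, r)
        x, y, z, cb = x + dx, y + dy, z + dz, cb + dcb
    return (x, y, z), cb / C   # position in metres, clock bias in seconds

# Synthetic check: four satellites ~20,000 km out with varied geometry, and a
# receiver on the surface whose clock is 1 ms fast.  The solver recovers both.
sats = [(2.02e7, 0.0, 7.0e6), (0.0, 2.02e7, 1.3e7),
        (-1.5e7, -1.5e7, 1.0e7), (1.0e7, -1.8e7, 2.0e7)]
truth = (1.2e6, 2.3e6, 6.0e6)
bias = 1e-3  # seconds
pranges = [math.dist(truth, s) + C * bias for s in sats]
pos, est_bias = gps_fix(sats, pranges)
```

The key point the comment makes shows up in the last column of the Jacobian: the clock bias is just a fourth unknown, which is why four satellites are the minimum.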


From https://en.wikipedia.org/wiki/GPS#Timekeeping :

GPS time is theoretically accurate to about 14 nanoseconds. However, most receivers lose accuracy in the interpretation of the signals and are only accurate to 100 nanoseconds.


For phones to achieve that, the GPS would have to be turned on and synced nearly constantly. Also, the accuracy goes down with mobile devices, and new strategies have to be applied given changes in altitude and greater temperature flux. It also depends on the phone's internal oscillator and the GPS receiver unit used.

Mobile phones use CDMA stuff to sync their clocks, not GPS


GSM phones are supposed to be able to pick up "NITZ" ("network information and time zone") messages, but these aren't reliably sent by most networks. In practice, smartphones get it over NTP like everyone else, and dumbphones have user-set time that may, if you're lucky, be synced by counting the base station's time slot intervals.


Yes, modern smartphones often hardcode NTP servers and probably just do crude hard resets of the clock. It would be good to know more about how they do it.


Things have changed a bit since the last time I had anything to with it, but you can read the Android source code: https://github.com/android/platform_frameworks_base/blob/mas...

NITZ is terrible. You only get one-second accuracy, a GMT offset, and a DST flag, and they can all be wrong at the whim of someone at the phone company.


Yeah, GPS doesn't really work indoors. You can certainly get such devices, or even do it with consumer GPS, but then you have to account for the latency of USB.


You can buy GPS receivers and install antennas. Those are pretty good but expensive.

For S2, ntpd can be configured to never jump. Instead you can set the time manually (run the equivalent of 'ntpdate <server>'), perhaps at boot. Then don't start your software until NTP has synchronized, and maintain a watchdog that will alert you if you fall to too high a stratum.
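A minimal sketch of that watchdog, assuming you parse `ntpq -c rv` output for the stratum (the threshold of 4 is an arbitrary example):

```python
import re
import subprocess

def stratum_from_rv(text):
    """Extract the stratum from `ntpq -c rv` output; treat missing as 16 (unsynced)."""
    m = re.search(r"\bstratum=(\d+)", text)
    return int(m.group(1)) if m else 16

def check(max_stratum=4):
    """Exit non-zero (so cron/monitoring can alert) if sync has degraded."""
    out = subprocess.run(["ntpq", "-c", "rv"], capture_output=True, text=True).stdout
    s = stratum_from_rv(out)
    if s > max_stratum:
        raise SystemExit(f"clock degraded: stratum {s} > {max_stratum}")

# Parsing demo against a canned `ntpq -c rv` response:
sample = "associd=0 status=0615 leap_none, sync_ntp, stratum=2, precision=-23,"
```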

Don't have an answer for S4 and S5. But I have heard that chrony is a good replacement for ntpd (same network protocol but a different clock discipline, with faster convergence).

http://chrony.tuxfamily.org/

For a whole other thing there is PTP (https://en.wikipedia.org/wiki/Precision_Time_Protocol). This is good if you want to synchronize machines to each other to within a couple hundred microseconds (or better with hardware timestamping in network cards and switches).
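As a sketch with linuxptp (the interface name is a placeholder; whether a NIC supports hardware timestamping can be checked with `ethtool -T eth0`):

```
# Run on each machine; the best clock on the segment becomes the master.
ptp4l -i eth0 -m

# Steer the system clock from the NIC's hardware (PHC) clock.
phc2sys -a -r
```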


The question is - how often do you reboot? For long running high uptime applications "ntpdate at boot" doesn't cut it... and the other solutions are nasty. In my experience, ntpd will ignore clocks almost at a whim even if they are stratum 0! Some stratum 6 clock decides to report a different time, and suddenly all your other sources are ignored until an ntpd restart. If your clock drifted too much... even ntpd restart with options to "force the time" can fail if the clock is too far out of whack. The you are back to forcing ntpdate - hope none of your processes are time dependent when system time jumps by 3 or 4 seconds.

Half of the GPS PCI devices require you to hack ntpd directly to get anything working, and the Ethernet sources have a strange tendency to go belly up after a year or 2 of running on non-120V sources. Not to mention that if you run the antenna cable too long, you might not notice that your GPS amplitudes are a little low until the next cloudy day. Or if someone adds a new hunk of metal on your roof, or blocks line of sight to some portion of the sky...

Long story short, time is a nightmare and I am glad other people think so too.


That is the nice thing about chrony. It keeps disciplining the clock even when offline, not just at reboot. If it works as advertised (and I tried it some years ago at home and it worked there, for what it's worth), it should fix all of the issues described in the article.
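For reference, a minimal chrony.conf sketch (server name is a placeholder):

```
# /etc/chrony.conf
server ntp.example.com iburst
driftfile /var/lib/chrony/drift
makestep 1.0 3    # step if off by more than 1 s, but only during the first 3 updates
rtcsync           # keep the hardware RTC in sync too
```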


Very nice - I will have to check it out! Thanks for the tip.


That is why I mentioned putting a watchdog on it. At some point, if your NTP server can't be reached, and you have clouds, and you don't want to invest in an expensive atomic or maser-based clock, you just don't get nice time sync.


With appropriate configuration, you can get ntpd to settle down and consider itself to be fully synchronized in a matter of seconds after startup.

You just have to configure an adequate number of upstream time servers, add judicious use of iburst, and you should be good to go.

On my laptop, I usually get good timesync within ten to fifteen seconds of startup.
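For reference, a minimal ntp.conf along those lines (the pool hostnames are the usual public defaults):

```
# /etc/ntp.conf -- enough servers that ntpd can out-vote a falseticker;
# iburst sends a quick burst of packets at startup so the first sync takes seconds.
server 0.pool.ntp.org iburst
server 1.pool.ntp.org iburst
server 2.pool.ntp.org iburst
server 3.pool.ntp.org iburst
driftfile /var/lib/ntp/ntp.drift
```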


I have a problem in my AWS Windows cluster with the clocks as well. We have a mobile game with a tournaments feature, and tournaments can be as short as 5 minutes. So, it's important that all the clients get the same "end time".

Normally, at least 10% of the VMs are between 5 and 20 seconds off from the average.

By default, the VMs are syncing their time to an NTP server, but it just doesn't seem to be enough. I haven't changed any settings as of yet.

What are some of the things I could do on Windows box to sync up the clocks in my cluster? Scheduled shell scripts to force more clock syncs? Or maybe something on the application level?


First off, Windows isn't the greatest client for trying to get good timesync. The way the OS usually handles interrupts isn't sufficiently precise and accurate. That said, it's not uncommon for real-hardware Windows clients to get decent timesync down into the double-digit millisecond range, usually around 40-50 ms or so.

Second, you're trying to run them in a virtualization environment, and that plays holy hell with whatever good timesync you might otherwise have been able to get. And AWS is probably about as bad as you could reasonably get in this regard -- VMWare has had a long time to get to where they are with regards to timesync for client OSes, and that's not particularly good.

So, your best bet may be to run NTP clients on the Windows machines, but to set them up to run at an artificially high rate of updates. Or maybe even run ntpdate periodically against good real-hardware time servers that are available to you.
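As a sketch of that setup (server names hypothetical), pointing the Windows time service at explicit NTP peers and tightening its poll interval looks something like:

```
:: Point w32time at explicit NTP servers (0x8 = client mode) and resync.
w32tm /config /manualpeerlist:"time1.example.com,0x8 time2.example.com,0x8" /syncfromflags:manual /update
w32tm /resync /force

:: Poll more often: MinPollInterval is in log2 seconds, so 6 = every 64 s.
reg add HKLM\SYSTEM\CurrentControlSet\Services\W32Time\Config /v MinPollInterval /t REG_DWORD /d 6 /f
net stop w32time && net start w32time
```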

You're starting off with both hands tied behind your back, and that's a hard place to come back from.


Err no mention of PTP at all?

There are hardly any oscillators which can be plugged into a host that have any decent holdover. One should never expect them to keep reasonable time on their own. A standard TCXO probably loses 100 ms a day; rubidium loses 0.01 ms a day. Time providers include the perceived accuracy of the clock in the protocol so that the client can make a decision about which time source to use.
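The back-of-envelope arithmetic behind those numbers: drift per day is just the fractional frequency error times the number of seconds in a day, so ~1 ppm (a typical TCXO) gives ~86 ms/day and ~1e-10 (rubidium) gives ~0.009 ms/day:

```python
SECONDS_PER_DAY = 86_400

def drift_ms_per_day(fractional_error):
    """Daily drift in milliseconds for a given fractional frequency error."""
    return fractional_error * SECONDS_PER_DAY * 1e3

tcxo = drift_ms_per_day(1e-6)       # ~1 ppm TCXO
rubidium = drift_ms_per_day(1e-10)  # good rubidium standard
```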

Relying on the oscillator is never done in time critical applications except in emergency states where GPS is out of service.

Of course a lot of people run relative clock domains, which is just a pretty bad idea for most applications.


I had an S5 motherboard (ASRock P55 Deluxe); after pulling out my hair I finally gave up on it and bought a new motherboard.

And I made a resolution to never ever buy the V1 of anything. (It was a brand new Intel chipset, and a brand new motherboard design to go with it.)

If everyone did like me and never bought V1 then there would never be a V2, I don't have a solution for that, but I refuse to be the guinea pig.


I had a desktop PC a few years back that exhibited S5. Can't remember the exact details, but I think it turned out that Linux was using a time source which wasn't stable on that particular hardware and switching to a different time source fixed it. There's been a lot of issues like that in the past, most of which are thankfully no more.


FreeBSD timecounter in detail:

http://phk.freebsd.dk/pubs/timecounter.pdf


Um... In large scale systems... vector clocks? Or am I not reading this correctly (I skimmed)


Not sure why this was downvoted; vector clocks are an interesting alternative. It's just that they won't fix your log timestamps or Kerberos clock-skew errors.



