Story 2015-05-30 A388 Software glitch disables LightSail spacecraft

Software glitch disables LightSail spacecraft

by
in space on (#A388)
After two days of sending data back to Earth, a suspected software glitch in the LightSail spacecraft's Linux-based flight software has silenced the craft. Every 15 seconds, LightSail transmits a telemetry beacon packet, and the software writes corresponding information to a file called beacon.csv. When that file reaches 32 megabytes, it can crash the flight system. The manufacturer corrected this glitch in later software revisions, but LightSail's software version doesn't include the update. A fix was scheduled to be uploaded, but before that happened, LightSail fell silent.
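The failure mode described above, an append-only log that grows until it hits a fatal size, is commonly avoided by capping the file. The following is a minimal sketch of that idea, not LightSail's actual fix: the function name append_beacon, the row format, and the choice to truncate rather than rotate are all assumptions made for illustration. Only the file name beacon.csv and the 32 megabyte figure come from the story.

```python
import os

# Assumed cap, taken from the 32 megabyte figure quoted in the story.
MAX_BYTES = 32 * 1024 * 1024


def append_beacon(path, row, max_bytes=MAX_BYTES):
    """Append one CSV row to the beacon log (hypothetical helper).

    If adding the row would push the file past max_bytes, the file is
    started afresh with just this row. Returns True when that happened.
    """
    line = (",".join(str(value) for value in row) + "\n").encode("utf-8")
    try:
        current_size = os.path.getsize(path)
    except FileNotFoundError:
        current_size = 0

    truncated = current_size + len(line) > max_bytes
    # "wb" discards the old contents; "ab" appends to them.
    with open(path, "wb" if truncated else "ab") as log_file:
        log_file.write(line)
    return truncated
```

Calling append_beacon("beacon.csv", [timestamp, voltage, temperature]) once per beacon keeps the file at or below the cap, provided no single row is larger than the cap itself. A real system might rotate old data to a second file instead of discarding it.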

A reboot should clear the contents of the problematic beacon.csv file, giving the team a couple of days to implement a fix. But the outcome of the freeze is "non-deterministic." Sometimes the processor will still accept a reboot command; other times, it won't. Multiple reboot commands were sent to the spacecraft. Nothing has happened yet. Therefore, we have to assume that LightSail is only going to respond to someone pushing the power button. Spacecraft, however, are susceptible to charged particles zipping through deep space. If one of these particles strikes an electronics component in just the right way, it can cause a reboot. This is not an uncommon occurrence. Cal Poly's experience with CubeSats suggests that most experience a reboot within the first three weeks. LightSail is capable of remaining in orbit for about six months.
6 comments

Sigh (Score: 0)

by Anonymous Coward on 2015-05-31 10:38 (#A3YR)

Stop hiring the people I work with... or at least promote them to a position that limits how much damage they can do

Re: Sigh (Score: 1)

by axsdenied@pipedot.org on 2015-06-01 04:48 (#A59Y)

Quite often the only way to get rid of those people is to promote them to a higher level. That's why we have so many incompetent managers and, unfortunately, they do even more damage then.

The three finger salute (Score: 2, Insightful)

by zafiro17@pipedot.org on 2015-06-01 12:20 (#A62N)

Sad story, and a neat project. I can only assume the engineers are frantically trying to figure out how they can remotely do a Ctrl-Alt-Delete.

I pity them, but I also fault them. Easy to armchair code, but it seems to me if you're building this kind of system, your operating policy must be that any error leads to a reboot under known good software conditions. I know, easier said than done, but ...
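One common way to get "any error leads to a reboot" is a watchdog timer: healthy code must check in periodically, and a missed check-in triggers a reset. The following is a minimal software model of that idea, not anything known about LightSail's design. The class name Watchdog, its methods, and the timeout value are all hypothetical, and a real spacecraft would normally use a hardware timer that resets the processor on its own.

```python
import time


class Watchdog:
    """Software model of a watchdog timer (hypothetical, for illustration)."""

    def __init__(self, timeout_s, clock=time.monotonic):
        self.timeout_s = timeout_s
        self._clock = clock  # injectable so the logic can be tested
        self._last_kick = clock()

    def kick(self):
        """Called by the main loop to signal that it is still alive."""
        self._last_kick = self._clock()

    def expired(self):
        """True when no kick has arrived within the timeout window."""
        return self._clock() - self._last_kick > self.timeout_s
```

A supervisor would poll expired() and force a reboot into a known good software image when it returns True. A main loop that hangs never calls kick(), so the timeout does the rest.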

Good news (Score: 4, Informative)

by axsdenied@pipedot.org on 2015-06-01 12:57 (#A65B)

It rebooted and contact has been established.

However:
- Its exact position remains fuzzy, complicating two-way communication.
- The communication is not stable enough to apply the patch. So, they have to keep rebooting it regularly.

i know... (Score: 0)

by Anonymous Coward on 2015-06-14 23:23 (#B9B1)

strlcat() ?

Tha D? (Score: 0)

by Anonymous Coward on 2015-06-15 05:58 (#B9ZS)

SystemD?