110 likes | 289 Views
SSED Application Example. Lessons Learned: 100 Questions That Should Be Asked during Technical Reviews Seminar on Aerospace Mishaps and Lessons Learned 2004 MAPLD Conference 7 September 2004 Paul Cheng (310) 336-8222 Paul.g.cheng@aero.org.
E N D
SSED Application Example Lessons Learned: 100 Questions That Should Be Asked during Technical Reviews Seminar on Aerospace Mishaps and Lessons Learned 2004 MAPLD Conference 7 September 2004 Paul Cheng (310) 336-8222 Paul.g.cheng@aero.org
Unclassified U.S. Government Satellite Failures, 1990–Present Engineering Technology Date Program Problem/Outcome Mistake Surprise X 04/90 H ubble A defect in the tool used both in manufacturing and in QA misshaped the mirror X 07/92 TSS - 1 Deployment mechanism jammed by a bolt added after I&T X 09/92 Mars Oxidizer reacted with braze, jamming regulator and bursting tank during Observer pre ssurization X 08/93 NOAA 13 The battery charger had low dimensional tolerance —shorted out by a screw X 10/93 Landsat F Pyrovalve ignited fuel nearby X X 01/94 Clementine CPU froze due to overload, allowing the thruster to deple te fuel X 0 5/94 MSTI 2 Contact lost, probably due to micro meteor oid/debris impact or charging — X 12/95 Skipper S olar arrays miswir ed on drawing I& T did not ascertain current direction X 02/96 TSS - 1R Contamin ation within the tether caused arcing X X saved due to inadequate monitoring 08/97 Lewis Flawed GN&C design caused tumbling — not X 10/97 STEP - 4 Damage by launch vibration. Ground test strategy improper X 10/98 STEX Solar array too hot, fatiguing solder joint s . A nalysis used wrong configuration X 12/98 MCO U nit mix - up in ground soft ware, coupled with vulnerable navigation scheme, caused trajectory error X 01/99 Mars Polar Requirement error prevented touchdown sensors from being protected against de - Lander ployment shock . Engine shut down premature ly X X 03/99 WIRE A start - up transient in the pyro electronics controller prematurely ejected the telescope cover X 08/01 Simplesat Transmitte r arcing X 07/02 Contour Plume analysis, based on similarity, misled by typo in an AIAA paper Count 14 6 Since 1995 9 3 Why Do Satellites Fail?
Remember Past Mistakes to Avoid Repetition Fools say that they learn by experience. I prefer to profit by others' experience. Otto Bismarck Like Susan Lee did for NEAR! • 100 Questions: “Driver’s Ed Movie” for Engineers • Based on lessons extracted from SSED data: • 79 catastrophic failures • 32 major events (e.g., loss of an instrument) • 21 ground problems (e.g., unit damaged during vibe) • 3 recoveries of “dead” missions • Examine: • How did the mistake occur? • What prevented its detection? • Why did a flaw bring down the system?
Hyperlinks explain the context The Thrust of Questions Questions are grouped in: • Requirements • Heritage and Qualification-by-Similarity • Analysis • Fault Management • Embedded Software and Database • Interface • Parts, Materials, and Manufacturing Process • Testing and Evaluation For example: Q 3-1 (Analysis): Have all critical analyses been placed under configuration control? See Lessons: 26 (STEX Failure) and 83 (AC 70/71 Failures)
Mars Polar Lander Failure Legs deployed; Unprotected sensors registered shock Software read stored sensor status; shut down engine This requirement did not flow down to software requirements Systems Requirement stated: The touchdown sensors shall be sampled at 100 Hz. The sample process shall initiate to keep processor demand constant. However, sensor data shall not begin until 12 m above the surface. One requirement, one statement Q 1-3 (Requirements): Are there lumped/nested requirements?
Forward Payload BW = Bridge Wire SFC = Squib Firing Circuit I/F = Interface Connection • A dual-payload launcher was used for a single payload. BW BW SFC SFC Aft P/L • Hardware engineers redlined spec drafted by software engineers to facilitate wiring, and designed harness based on redlines BW BW SFC SFC Mission Unique Generic Core • Redlines fell through mission spec’s cracks – S/W and H/W incompatible I/F I/F Generic Configuration P/L P/L BW BW BW BW SFC SFC SFC SFC I/F I/F I/F I/F Failed Mission Hard Wired Software Commanded Launch Vehicle X Failure • Systems engineer failed to verify - viewed mission spec as software document and not subject to configuration control • Generic test masked problem Q 8-15 (Testing): Does the system being tested represent the flight configuration?
Representative Questions for Electrical Engineers • Are units and tolerances specified? • See Mars Climate Orbiter failure* and Huygens launch pad damage • Do testing independently confirm development results? • See Hubble mirror aberration* • Are handover procedures between two sources of control well defined? • See START launch failure • Does the harness design preclude mismating? • See BP-TD launch failure *: Report available on klabs.org
Some Questions Specifically for Digital Engineers • Can a momentary glitch cause a crash (will logic devices improperly reset following a brief undervoltage, for example)? • See Delta 178 and Titan A-20 failures • How are databases verified? • See Centaur TC-14 failure • Will unexpected inputs cause the computer to freeze, without a way to autonomously reboot? • See Clementine failure and SPIRIT anomaly* • Can the fault protection logic be set off too easily (e.g., can phantom sensor readings spoof the fault management system into taking precipitous actions)? • See Ariane 501* and Atlas/Mariner 1 failure
More Items EEs Rarely Think of, but Should: • Ambiguous drawing instructions • Opposite engineering convention (right- or left- hand coordinates? Positive- or negative- ground?) • Wiring crossover between two drawings • Commandability after OBC faults disabled receivers • Revivability of solar array regulator after battery drain • Fratricide by pyro devices • In-rush current welding relays shut • FOD-caused shorting and arcing • ...
Using “100 Questions” in Practice • A satellite uses many low-shock deployment devices • Consisting of spools of tightly wound wires • Actuated by electrically severing restraining wires Logic Control Power Supply Releases Solar array, etc. Firing Relay Arming Relay Four problems found: • Constant-voltage firing circuit may fail (SAFER lesson) • Routing both arming and firing relays to one PLD (WIRE) • If deployed wires touch firing circuits, battery can drain; power distribution board may overheat (Deep Space 1) • Test circuits are constant-current (not flight-like)
In Closing Petroski’s Law of Design: To engineer is human Akin's Laws of Spacecraft Design: Space is a completely unforgiving environment. If you screw up the engineering, somebody dies! For additional interesting quotes, see klabs.org