Risk Factor

What Ever Happened to STEM Job Security?

Figuring out how to get more students drawn into the “STEM education pipeline” has been a major concern of those arguing that there exists an acute shortage of STEM workers, be it in the U.S., the U.K., Brazil, Australia, or almost any country you choose. Typically, the arguments made to encourage students to enter the STEM pipeline center on how interesting STEM careers are and especially on how much more money you can earn than in non-STEM careers.

However, others point out that many students aren’t interested in STEM careers because they see that the academic work needed at both the high school and university levels to pursue a STEM degree is just too hard in comparison to non-STEM degrees. Until this changes (for example, by increasing the readiness of a prospective STEM student by “redshirting” them), the argument goes, don’t expect a full STEM pipeline anytime soon.

Another little-discussed factor, one that I personally witnessed, is the changing social compact between STEM workers and employers over the past several decades and the effect it has had on convincing students today to pursue a STEM career. When my father, an electro-optical engineer, was laid off from his company late in the recession of 1957-1958, he assumed the company would be rehiring him a few months later when the economy got better. His wasn’t an unreasonable assumption, since that was the general practice in the 1950s. When he wasn’t soon rehired, and with a new house mortgage to pay and three children under age 5 to feed, my father left his temporary job of selling Electrolux vacuums door-to-door and found another electro-optical engineering job. He stayed with that company for another 25 years, until he retired with the usual gold desk pen-set, which now sits on my desk.


An Engineering Career: Only a Young Person’s Game?

If you are an engineer (or a computer professional, for that matter), the danger of becoming technologically obsolete is an ever-growing risk. To be an engineer is to accept the fact that at some future time—always sooner than one expects—most of the technical knowledge you once worked hard to master will be obsolete.

Everyone seems to agree that an engineer’s “half-life of knowledge,” an expression coined in 1962 by economist Fritz Machlup to describe the time it takes for half the knowledge in a particular domain to be superseded, has been steadily dropping. For instance, a 1966 story in IEEE Spectrum titled, “Technical Obsolescence,” postulated that the half-life of an engineering degree in the late 1920s was about 35 years; for a degree from 1960, it was thought to be about a decade.
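To make the half-life idea concrete, here is a minimal sketch that treats obsolescence as simple exponential decay. That decay model and the function name `fraction_still_current` are assumptions made for illustration; the 35-year and 10-year figures are the estimates quoted above, and the 20-year horizon is arbitrary.

```python
def fraction_still_current(years_since_degree: float, half_life_years: float) -> float:
    """Fraction of degree-era knowledge not yet superseded.

    Assumes plain exponential decay: half the remaining knowledge is
    superseded every `half_life_years`. This is an illustrative model,
    not a formula taken from the articles cited above.
    """
    if half_life_years <= 0:
        raise ValueError("half_life_years must be positive")
    return 0.5 ** (years_since_degree / half_life_years)


# Half-life estimates quoted in the text; the 20-year horizon is arbitrary.
for label, half_life in (("late-1920s degree", 35.0), ("1960 degree", 10.0)):
    share = fraction_still_current(20.0, half_life)
    print(f"{label}: {share:.0%} of knowledge still current after 20 years")
```

Under this model, the shorter the half-life, the faster a mid-career engineer's original training loses its value, which is the concern the 1966 authors were raising.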

Thomas Jones, then an IEEE Fellow and President of the University of South Carolina, wrote a paper in 1966 for the IEEE Transactions on Aerospace and Electronic Systems titled, “The Dollars and Cents of Continuing Education,” in which he agreed with the 10-year half-life estimate. Jones went on to roughly calculate what effort it would take for a working engineer to remain current in his or her field.


IT Hiccups of the Week: Sutter Health’s $1 Billion EHR System Crashes

After a torrid couple of months, last week saw a slowdown in the number of reported IT errors, miscalculations, and problems. We start off this week’s edition of IT Hiccups with the crash of a healthcare provider’s electronic health record system.

Sutter Health’s Billion Dollar EHR System Goes Dark

Last Monday, at about 0800 PDT, the nearly US $1 billion EPIC electronic health record (EHR) system used by Sutter Health of Northern California crashed. As a result, the Sacramento Business Journal reported, healthcare providers at seven major medical facilities, including Alta Bates Summit Medical Center facilities in Berkeley and Oakland, Eden Medical Center in Castro Valley, Mills Peninsula Health Services in Burlingame and San Mateo, Sutter Delta in Antioch, Sutter Tracy, Sutter Modesto and affiliated doctors’ offices and clinics, were unable to access patient medications or histories.

A software patch was applied Monday night, and EHR access was restored. Doctors and nurses no doubt spent most of the day Tuesday entering in all the handwritten patient notes they scribbled on Monday.

It still is unclear whether the crash was related to a planned system upgrade that was done the Friday evening before the crash, but if I were betting, I would lay some coin on that likelihood.

Nurses working at Sutter Alta Bates Summit Hospital have been complaining for months about problems with the EHR system, which was rolled out at the facility in April. Nurses at Sutter Delta Medical Center have also complained that hospital management there has threatened to discipline nurses for not using the EHR system; its system went live about the same time as Alta Bates Summit's, but for billing for chargeable items. Sutter management said that it was unaware of any of the issues the nurses were complaining about, and that any complaints they might have lodged were the result of an ongoing management-labor dispute.

Sutter is now about midway through its EHR system roll-out, an effort it first started in 2004 at a planned cost of $1.2 billion and completion date of 2013. It later backed off that aggressive schedule, and then “jump started” its EHR efforts once more in 2007. Sutter plans to complete the roll-out across all 15 of its hospitals by 2015 at a cost now approaching $1.5 billion.

Hospital management said in the aftermath of the incident, “We regret any inconvenience this may have caused patients.” It did not express regret to its nurses, however.

Computer Issue Scraps Japanese Rocket Launch

Last Tuesday, the launch of Japan’s new Epsilon rocket was scrubbed with 19 seconds to go because a computer aboard the rocket “detected a faulty sensor reading.” The Japan Aerospace Exploration Agency (JAXA) had spent US $200 million developing the rocket, which is supposed to be controllable from conventional desktop computers instead of massive control centers. This added convenience results from the extensive use of AI to let the rocket perform its own status checks.

The Japan Times reported on Thursday that the problem was traced to a “computer glitch at the ground control center in which an error was mistakenly identified in the rocket’s positioning.”

The Times stated that, “According to JAXA project manager Yasuhiro Morita, the fourth-stage engine in the upper part of the Epsilon that is used to put a satellite in orbit, is equipped with a sensor that detects positioning errors. The rocket’s computer system starts calculating the rocket’s position based on data collected by the sensor 20 seconds before a launch. The results are then sent to a computer system at the ground control center, which judges whether the rocket is positioned correctly. On Tuesday, the calculation started 20 seconds before the launch, as scheduled, but the ground control computer determined the rocket was incorrectly positioned one second later based on data sent from the rocket’s computer.”

The root cause(s) of the problem are still unknown, although it is speculated that it was a transmission issue. JAXA says that it will be examining “the relevant computer hardware and software in detail.” The Times reported on Wednesday that speculation centered on a “computer programming error and lax preliminary checks.”

JAXA President Naoki Okumura apologized for the launch failure, which he said brought “disappointment to the nation and organizations involved.” A new launch date has yet to be announced.

Nasdaq Blames Software Bug For Outage

Two weeks ago, Nasdaq suffered what it called at the time a “mysterious trading glitch.” The problem shut down trading for three hours. After pointing fingers at rival exchange NYSE Arca, it admitted last week that perhaps it wasn’t all Arca’s fault after all.

A Reuters News story quoted Bob Greifeld, Nasdaq's chief executive, as saying Nasdaq’s backup system didn’t work because, “There was a bug in the system, it didn't fail over properly, and we need to work hard to make sure it doesn't happen again.”

However, Greifeld didn’t fully let Arca off the hook. A story at the Financial Times said that in testing, Nasdaq’s Securities Information Processor (SIP), the system that receives all traffic on quotes and orders for stocks on the exchange, “was capable of handling around 500,000 messages per second containing trades and quotes. However, in practice, Nasdaq said repeated attempts to connect to the SIP by NYSE Arca, a rival electronic trading platform, and streams of erroneous quotes from its rival eroded the system’s capacity in a manner similar to a distributed denial of service attack. Whereas the SIP had a capacity of 10,000 messages per data port, per second, it was overwhelmed by up to more than 26,000 messages per port, per second.”
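As a rough illustration of how far past its rating the SIP was pushed, the sketch below works through the two per-port figures quoted by the Financial Times. The constant and function names are hypothetical, and the 26,000 figure is treated as a peak rate, since the report says "up to more than" that number.

```python
RATED_MSGS_PER_PORT_PER_SEC = 10_000     # per-port capacity quoted above
REPORTED_MSGS_PER_PORT_PER_SEC = 26_000  # reported load, treated here as a peak


def overload_factor(observed: float, rated: float) -> float:
    """How many times the rated capacity the observed load represents."""
    if rated <= 0:
        raise ValueError("rated capacity must be positive")
    return observed / rated


def excess_share(observed: float, rated: float) -> float:
    """Share of observed traffic that exceeds the rated capacity."""
    if observed <= 0:
        return 0.0
    return max(observed - rated, 0.0) / observed


factor = overload_factor(REPORTED_MSGS_PER_PORT_PER_SEC, RATED_MSGS_PER_PORT_PER_SEC)
share = excess_share(REPORTED_MSGS_PER_PORT_PER_SEC, RATED_MSGS_PER_PORT_PER_SEC)
print(f"Load was about {factor:.1f}x the rated per-port capacity")
print(f"Roughly {share:.0%} of that traffic was beyond what a port was rated to handle")
```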

Nasdaq said that it was now looking at design changes to make the SIP more resilient.

A detailed report looking into the cause of the failure will be released in about two weeks.

Of Other Interest…

Computer Error Causes False Weather Alert and Cancelled Classes at Slippery Rock University

UK’s HSBC Bank Suffers IT Glitch

NC Fast Computer System Can’t Shake Processing Problems

North Carolina DMV Computer System Now Back to Normal

Australia’s Telstra Faces Large Compensation Bill for Internet Problems

Data Glitch Hits CBOE Futures Exchange

China Fines Everbright Securities US $85 million Over Trading Error

Photo: iStockphoto

Is There a U.S. IT Worker Shortage?

Someone who is a data scientist today is said by Harvard Business Review to have the sexiest job alive. And if sexy isn’t enough, how about being a savior of the economy? According to a 2011 report by consulting company McKinsey & Company, “Big Data” is “the next frontier for innovation, competition and productivity.” That is, of course, if enough of those sexy data scientists can be found.

For also according to McKinsey’s report, “the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions,” by 2018.

However, Peter Sondergaard, senior vice president at Gartner and global head of research, asserts that the shortage situation is even more frightening than what McKinsey implies. Sondergaard stated in October 2012 that, “By 2015, 4.4 million IT jobs globally will be created to support Big Data, generating 1.9 million IT jobs in the United States. In addition, every big data‐related role in the U.S. will create employment for three people outside of IT, so over the next four years a total of 6 million jobs in the U.S. will be generated by the information economy.”

Wow. Not only will Big Data make a significant dent in the U.S. unemployment rate, but the U.S. IT technical workforce of 3.9 million or so needs to increase by almost 50 percent within the next two years.
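The "almost 50 percent" figure can be checked with a quick back-of-the-envelope calculation. This sketch uses only the two numbers quoted above (1.9 million new U.S. IT jobs against a workforce of roughly 3.9 million); the variable and function names are illustrative.

```python
CURRENT_US_IT_WORKFORCE = 3_900_000   # "3.9 million or so," per the text
PROJECTED_NEW_US_IT_JOBS = 1_900_000  # Gartner's projection for 2015


def required_growth_percent(new_jobs: int, current_workforce: int) -> float:
    """Percentage by which the workforce must grow to fill the new jobs."""
    if current_workforce <= 0:
        raise ValueError("current_workforce must be positive")
    return new_jobs / current_workforce * 100


growth = required_growth_percent(PROJECTED_NEW_US_IT_JOBS, CURRENT_US_IT_WORKFORCE)
print(f"Required workforce growth: {growth:.1f} percent")
```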

But wait, there’s more.


Chinese Internet Rocked by Cyberattack

China’s Internet infrastructure was temporarily rocked by a distributed denial of service attack that began at about 2 a.m. local time on Sunday and lasted for roughly four hours. The incident, which was initially reported by the China Internet Network Information Center (CNNIC), a government-linked agency, is being called the “largest ever” cyberattack targeting websites using the country’s .cn domain extension. Though details about the number of affected users have been hard to come by, CNNIC apologized to users for the outage, saying that “the resolution of some websites was affected, leading visits to become slow or interrupted.” The best explanation offered so far is that the attacks crippled a database that converts a website’s domain name into the series of numbers (its IP address) that servers and other computers read. The entire .cn network wasn’t felled because some Internet service providers store their own copies of these databases.
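The explanation above, that locally stored copies kept part of the network reachable, can be sketched as a toy simulation. Everything here is hypothetical: `ToyResolver`, `RegistryDown`, the `.example` names, and the documentation-range IP addresses are made up for illustration and do not model CNNIC's actual systems.

```python
class RegistryDown(Exception):
    """Raised when the simulated central registry cannot answer a lookup."""


class ToyResolver:
    """An ISP-side resolver that keeps its own copy of answers it has seen."""

    def __init__(self, registry):
        self.registry = registry   # central name -> IP address records
        self.cache = {}            # the ISP's local copy of past answers
        self.registry_up = True    # flipped to False to simulate the attack

    def resolve(self, name):
        if self.registry_up:
            address = self.registry[name]
            self.cache[name] = address   # remember the answer locally
            return address
        if name in self.cache:
            return self.cache[name]      # served from the local copy
        raise RegistryDown(name)


resolver = ToyResolver({
    "site-a.example": "192.0.2.10",
    "site-b.example": "192.0.2.20",
})
resolver.resolve("site-a.example")   # looked up before the attack, so cached
resolver.registry_up = False         # the central database is now unreachable

print(resolver.resolve("site-a.example"))   # still answers from the cache
try:
    resolver.resolve("site-b.example")      # never cached, so it fails
except RegistryDown as missing:
    print(f"cannot resolve {missing} while the registry is down")
```

In this toy model, users of an ISP with a warm cache keep reaching previously visited sites, while lookups that depend on the crippled database fail, which matches the partial outage described.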

A Wall Street Journal report notes that the attack made a serious dent in Chinese Web traffic. Matthew Prince, CEO of Internet security firm CloudFlare, told the WSJ that his company observed a 32 percent drop in traffic on Chinese domains. But Prince was quick to note that although the attack affected a large swath of the country, the entity behind it was probably not another country. “I don’t know how big the ‘pipes’ of .cn are,” Prince told the Wall Street Journal, “but it is not necessarily correct to infer that the attacker in this case had a significant amount of technical sophistication or resources. It may well have been a single individual.”

That reasoning stands in stark contrast to the standard China-blaming reaction to attacks on U.S. and Western European Internet resources or the theft of information stored on computers in those regions. In the immediate aftermath of the incident, there was an air of schadenfreude from some observers. Bill Brenner of cloud-service provider Akamai told the Wall Street Journal that “the event was particularly ironic considering that China is responsible for the majority of the world’s online ‘attack traffic.’” Brenner pointed to Akamai’s 2013 ‘State of the Internet’ report, which noted that 34 percent of global attacks originated from China, with the U.S. coming third with 8.3 percent.

For its part, the CNNIC, rather than pointing fingers, said it will be working with the Chinese Ministry of Industry and Information Technology to shore up the nation’s Internet “service capabilities.”

Photo: Ng Han Guan/AP Photo

IT Hiccups of the Week: Stock Exchange “Gremlins” Attack

We were blessed with another impressive week of IT-related burps, belches and eructs. This time, stock market officials are reaching for the antacid.

Nasdaq Suffers Three-Hour Trading “Glitch”

Well, opinions vary about whether it was or wasn’t a big deal. Last Thursday, the Nasdaq suffered what the AP called a “mysterious trading glitch” that suspended trading on the exchange from 12:15 p.m. to 3:25 p.m. EDT. After trading resumed, the market closed up 39 points.

The trading suspension was the longest in Nasdaq history and was a major embarrassment for the exchange, which is still trying to recover from its Facebook IPO screw-up. The exchange blamed the problem on a “connectivity issue” involving its Securities Information Processor, which Reuters describes as being “the system that receives all traffic on quotes and orders for stocks on the exchange.” When the SIP doesn’t work, stock quotations cannot be disseminated.

Nasdaq Chief Executive Robert Greifeld has refused to discuss the cause of the problem in public. Greifeld did darkly hint, however, that the problems were someone else’s fault. The Guardian quoted him as saying, “I think where we have to get better is what I call defensive driving. Defensive driving means what do you do when another part of the ecosystem, another player, has some bad event that triggers something in your system?”

He then went on to say, “We spend a lot of time and effort where other things happen outside our control and how we respond to it.”

Greifeld’s statement immediately triggered further speculation that the other “player” was rival exchange NYSE Arca, which was known to have had connectivity issues with Nasdaq. Greifeld refused, however, to elaborate further on his statements.

Today’s Wall Street Journal published a lengthy story that has shed more light on what happened—although why it happened is still being investigated. According to the WSJ, between 10:53 a.m. and 10:55 a.m. EDT Thursday, “NYSE Arca officials tried and failed to establish a connection with Nasdaq about 30 times, according to people familiar with the events of that day. Nasdaq, for its part, was having its own problems regarding its connectivity to Arca, the people said.”

The WSJ goes on to say that: “What remained unclear Sunday was how that connectivity problem—which Nasdaq officials privately have called ‘unprecedented’—could have had so catastrophic an effect on Nasdaq's systems that the exchange decided to stop trading in all Nasdaq-listed shares, causing ripples of shutdowns across the market and spreading confusion.”

The Journal said NYSE Arca went to a backup system, and after several failed attempts, finally re-established connection with Nasdaq at 11:17 a.m. However, once the two exchanges were reconnected, “Nasdaq's computers began to suffer from a capacity overload created by the multiple efforts to connect the two exchanges.”

As a result, other markets also started to report problems in receiving and sending quotes from Nasdaq. Officials at Nasdaq decided they had better pull the plug in order to figure out how to get back to a normal operating state, which they did at 12:15 p.m.

Many traders viewed the episode as a non-event, while other interested observers, like U.S. Securities and Exchange Commission Chairman Mary Jo White, were more concerned. Given the complexity of the systems involved, no one should be surprised to see more hiccups in the future. In Charles Perrow’s terminology, they are now just “normal accidents.”

The folks at Goldman Sachs and Everbright were probably very happy about the distraction created by Nasdaq’s difficulties. Last Tuesday, Goldman “accidentally sent thousands of orders for options contracts to exchanges operated by NYSE Euronext, Nasdaq OMX and the CBOE, after a systems upgrade that went awry. The faulty orders roiled options markets in the opening 17 minutes of the day’s trading and sparked reviews of the transactions,” the Financial Times reported.

Bloomberg News reported that Goldman had placed four senior technology specialists on administrative leave because of the programming error, but Goldman declined to discuss why. That was probably a smarter move than when Knight Capital Group CEO Thomas Joyce blamed “knuckleheads” in IT when a similar problem a year ago this month resulted in a loss of US $440 million in about 45 minutes. Goldman's losses were expected to be less than US $100 million.

Knight Capital was sold last December to Getco Holdings Co.

Everbright Securities, the state-controlled Chinese brokerage, is also likely happy at the timing of Nasdaq’s and Goldman’s problems. On 16 August, a trading error traced to the brokerage significantly disrupted the Shanghai market. The error, which cost the brokerage US $31.7 million and the brokerage’s president his job, was blamed originally on a “fat finger” trade. However, a “computer system malfunction” was the real cause, the Financial Times reported. Needless to say, the China Securities Regulatory Commission is investigating Everbright and says “severe punishments” might be in order.

Finally, a real fat finger incident hit the Tel Aviv Stock Exchange (TASE) yesterday. The Jerusalem Post reported that, “a trader from a TASE member intending to carry out a major transaction for a different company's stock accidentally typed in Israel Corporation, the third-largest company traded on the exchange. The disparity in prices caused the company's stock value to do a nose-dive, from an opening value of NIS 1690 down to NIS 2.10,” or a 99.9 percent loss. The trader quickly realized his typo, and requested the transaction be canceled. However, by then, the error had already triggered a halt in the exchange’s trading.
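The 99.9 percent figure follows directly from the two prices in the Jerusalem Post report. A minimal check, with an illustrative function name:

```python
def percent_drop(opening_price: float, low_price: float) -> float:
    """Percentage decline from the opening price to the low."""
    if opening_price <= 0:
        raise ValueError("opening_price must be positive")
    return (opening_price - low_price) / opening_price * 100


# Prices in new Israeli shekels (NIS), as quoted above.
drop = percent_drop(1690.0, 2.10)
print(f"Israel Corporation fell {drop:.1f} percent from its opening value")
```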

Helsinki’s Automated Metro Trains Rough First Half Day

Last week, Helsinki's Metro tried out its three driverless Siemens-built trains for the first time. However, in a bit of irony, after a few hours, problems developed with the ventilation system in the trains' drivers’ cabins, and the trains had to be taken out of service. Drivers were aboard the automated trains for safety reasons. The Metro didn’t indicate whether the trains would have been pulled out of service (or the problem even detected) if they had been running in full automatic mode without drivers.

Indian Overseas Bank Back to Normal

The Hindu Times reported on Friday that the Indian Overseas Bank announced that the problem with the bank’s central server had been finally fixed. For three days, hundreds of thousands of bank customers were unable to deposit checks or use the bank’s ATM network. A story at Business Standard said that the problem was related to annual system maintenance of the core banking system, which instead ended up creating what the bank said was a “complex technological malfunction.”

Of Other Interest…

Chrysler Delaying Launch of 2014 Cherokee Jeep Due to Transmission Software Issues

Network Issues Stop Marines from Using Unclassified Network

Tesco Pricing Glitch Lowers Price of Ice Cream by 88 Percent

Xerox Releases Scanner “Error” Software Patch

Photo: Seth Wenig/AP Photo

This Week in Cybercrime: Facebook Feels Backlash After Balking on Bug Bounty

This hasn’t been a week for major headline-making hacks, but a few interesting stories bubbled to the surface.

A Palestinian security researcher recently notified Facebook of a security vulnerability on its site by posting a message on the page of Facebook founder Mark Zuckerberg. The struggling researcher, Khalil Shreateh, was looking forward to receiving a reward under the social media site’s bug bounty program for reporting the problem, which would have allowed anyone to post messages to another user’s page, regardless of whether he or she is on the user’s Friends list.

But Facebook denied him, giving itself a PR black eye in the process. It seems Facebook fixed the bug but wouldn’t shell out any money to Shreateh. Why? The site’s security team reasoned that his method of notifying the company—first posting an Enrique Iglesias video to the page belonging to one of Zuckerberg’s college friends, then posting to Zuckerberg's page itself after the security team still insisted that the issue wasn't a bug—violated its terms of service.

Only after Marc Maiffret, CTO of network security firm BeyondTrust, heard about the snub and launched a page whose aim is to raise $10,000 for Shreateh did Facebook try to explain itself.

According to a Wired article, “Matt Jones, a member of Facebook’s security team, posted a note on the Hacker News web site saying a language barrier with Shreateh had been part of the problem for the company’s initial rejection of his submission…He also said that Shreateh had failed to provide any details about the bug that would help Facebook reproduce the problem and fix it.” But the bottom line, despite Jones’ attempts to rationalize a response that came off as miserly, is: 1) Facebook fixed the problem. 2) It was Shreateh who alerted them to it.

“Mistakes were made on both sides,” Jesse Kornblum, a network security engineer for Facebook, later told Wired. “We should have asked for more details rather than saying, ‘this is not a bug.’ But Khalil should have demonstrated the vulnerability on a test account, not a real person. We’ve made an interface for [researchers] to create multiple test accounts [for that purpose].”

But Maiffret, who has met his goal (with $3000 coming from his own pocket), says nixing the bounty for the Palestinian researcher sent the wrong message. “It was a good thing that he did,” Maiffret, who got his start as a teenage hacker, told Wired. “He might have done it slightly wrong, but ultimately it was a bug he got killed off before anyone did a bad thing [with it].” Maiffret pointed to his own beginnings, noting that he went from being a rudderless high school dropout to having a successful career after someone agreed to take a chance on him. “Ultimately, [Shreateh] was well-intentioned and hopefully he stays on the same track of doing research,” Maiffret says.

Google App Engine an Unwitting Conduit for Adware

The adware that floods a computer user’s browser with come-ons is nothing new. But purveyors of this pestilence have come up with a new way to spread it. Jason Ding, a research scientist at Barracuda Labs, posted a note on the company’s research blog this week alerting the world that two sites are lacing users’ machines with malware posing as legitimate application software on Google’s App Engine.

According to Ding, the sites, which appeared about a week ago, prey on the inexperienced or inattentive user. The first one (java-update[dot]appspot[dot]com), which passes itself off as a free Java download site, looks a lot like Oracle’s official Java site. But clicking links on this sinister page causes the download of “setup.exe,” which in turn tries to install the Solimba adware program. The endgame for the other site (updateplayer[dot]appspot[dot]com) also involves plaguing the user with Solimba. But instead of baiting users with a Java imposter, it tells them that their media player is outmoded and needs an update. And guess who’s generous enough to offer just the right fix? Clicking on any of the site’s links pulls down the same executable file that installs Solimba.

The people who set up these sites are using Google’s App Engine as an intermediary because it gives their pitches the air of credibility and hides URLs that would instantly put users on alert that something is fishy.

More People Affected by Outages from Cyberattacks than from Hardware Failures

A report released on Tuesday by the European Union Agency for Network and Information Security (ENISA) reveals some startling information about the reach and effectiveness of cybercrime. Last year, hardware failures accounted for about 38 percent of incidents that resulted in “severe outages of both mobile and fixed telephony and Internet services” in the E.U. These failures affected 1.4 million people, on average. Though cyberattacks made up only 6 percent of European outages last year, each incident affected an average of 1.8 million users.

“League of Legends” Maker Hacked

Marc Merrill and Brandon Beck, founders of video game maker Riot Games, said in a blog post this week that cybercrooks had hacked into the company’s network and gained access to usernames, email addresses, salted password hashes, some first and last names, and encrypted credit card numbers. The company, developer of the online multiplayer game “League of Legends,” says it is looking into just what details were gleaned in the unauthorized access of 120 000 transaction records dating back to 2011.

In Other Cybercrime News…

Reuters: Ex-Soviet hackers play outsized role in cyber crime world

ZDNet Reviews the New Book “Cyber Crime & Warfare”

IT Hiccups of the Week: You May Want That Burger Well-Cooked

The summertime streak of interesting IT snafus, tangles and general “oops” incidents continues unabated. We start off with a story that appeared in the New York Times over the weekend that may make you reconsider your meat-cooking preference for your next outdoor barbeque.

U.S. Agriculture Meat Inspection Computer Outage Means Meat and Poultry Left Uninspected

The U.S. Department of Agriculture (USDA) considers it to be a non-issue, since there have yet to be any documented instances of people having gotten sick. However, one wonders how long it will be before the continuing problems with a new $20 million computer system, upon which some 3000 meat and poultry inspectors working at 6300 packing and processing plants across the U.S. depend, contribute to a major food-borne illness outbreak.

According to a Saturday New York Times story, the computer system the USDA Food Safety and Inspection Service (FSIS) meat and poultry inspectors use has experienced several recent breakdowns. Earlier this month, for instance, it shut down for two days, putting “at risk [consumers of] millions of pounds of beef, poultry, pork and lamb that had left the plants before workers could collect samples to check for E. coli bacteria and other contaminants.”

What's the risk? Well, a USDA report (pdf) states that, “the Centers for Disease Control and Prevention estimate that E. coli O157:H7 causes about 73,000 cases of illness and 61 deaths annually in the U.S. The USDA's Economic Research Service estimates that the total costs associated with consuming E. coli-contaminated meat are about $488 million annually.”

The new computer system was installed in 2011 as a way to help hasten the meat and poultry inspection process. Previously, it could take days before inspected food flagged as being contaminated could be traced to the offending plant. The new system speeds up completion of the paperwork used to trace inspected meat and poultry, dramatically reducing the time needed to identify the source of any compromised food. That's under normal circumstances. The downside is that when the computer system isn’t working, which inspectors tell the Times happens frequently, meat and poultry sometimes go without being inspected at all.

Last year, computer system issues led to problems at 18 meat processing and packing plants. The Times stated that, “At one of the plants, auditors found that inspectors had not properly sampled some 50 million pounds of ground beef for E. coli over a period of five months. At another plant, which the report identified as among the 10 largest slaughterhouses in the United States, auditors found that computer failures had caused inspectors to miss sampling another 50 million pounds of beef products.”

But not to worry, the USDA says. Many of the highlighted problems have been corrected. Additionally, USDA officials claim, the problem wasn’t really with the computer system itself, but with balky wireless networks the computer system has to connect to in the rural areas where many meat processing and packing plants operate.

Does that mean that the USDA doesn’t include the communication system as part of an overall system test before it fields such a system? Regardless of the answer, questions abound, mainly because USDA field inspectors told the Times that even where wireless connections are first-rate, the computer system still keeps crashing.

Until the reliability of the USDA’s computer system improves, you may want to make sure your meat and poultry are thoroughly cooked.

U.K. Post Computer System Leads to False Theft Accusations

The Daily Mail published a story last week about the four-year fight between Tom Brown, of South Stanley, County Durham, and the U.K. Post Office over charges that Brown, a sub-postmaster, fiddled £85,426 from its accounts. Last week, the Mail reported, the Post Office decided to drop its two civil court charges of false accounting against Brown (the police decided two years ago not to pursue the case), and a judge has recorded not-guilty verdicts.

Brown is one of more than 100 people across the U.K. whom the Post Office has accused of theft since the introduction of its £1 billion Horizon computer system, used to record transactions across its 14 000-branch network, over a decade ago. However, Brown and the others claimed that the disappearance of the money they were accused of stealing was caused by computer problems with the Horizon system that created false shortfalls in the sub-postmasters’ financial accounts.

The U.K. paper Computer Weekly has been diligently following this story for years, noting that sub-postmasters were complaining about computer problems as far back as 2003. However, the U.K. Post Office steadfastly refused to believe that there was anything wrong with the Horizon system. It was convinced that those whom it had accused of stealing were merely using “computer glitches” as an excuse to hide their theft.

In fact, the Post Office was even able to convince some U.K. judges that the Horizon system was extremely reliable and didn’t make mistakes. As a result, some sub-postmasters were sent to jail and many lost their homes or went bankrupt in order to pay back the alleged shortfalls in their accounts.

However, as more and more sub-postmasters were accused, the Post Office finally succumbed to pressure to conduct an investigation into the Horizon system just this past year. In July of this year, as word filtered out that the system’s reliability wasn't as phenomenal as claimed, the Post Office finally admitted that the investigation did find defects in the system that caused accounting shortfalls at 76 branches. In light of those shortfalls, the Post Office stated that more investigation would be required into the system's operations, BBC News reported.

The Post Office also stated it would be looking into how to “take better account” of sub-postmaster complaints “going forward,” but did not directly address those lodged by the dozens who say they were falsely accused. Perhaps that is because many of them are looking to bring legal action against the Post Office.

Like all good bureaucracies, the Post Office also proposed to set up a working group to investigate further the problems so far uncovered.

Wall Street Journal Doesn’t Let Rival’s Crisis Go to Waste

Finally, last Wednesday morning, the New York Times website and mobile app suffered a two-hour outage, although new articles didn’t appear until about four hours after the site and the app first became unavailable. Speculation ran rampant that the Times was the victim of a cyberattack, but the Times said the incident was likely the result of a “scheduled maintenance update being pushed out.”  

However, soon after the outage began at 11:10 a.m., which is the start of the peak traffic time for the paper, the Wall Street Journal decided to try to capitalize on its rival’s misfortune by lowering its own pay wall for two hours. The Journal later said it lowered its pay wall not because of the New York Times outage, but because of the violent protests then happening in Egypt.


Google also suffered an outage last week, but only for a few minutes.  Reports were that Internet traffic dropped by some 40 percent because of it.

Microsoft, which has been taking very public potshots at Google, remained mum on the Google outage. Was it, perhaps, because Microsoft seems to be having significant problems of its own with Outlook.com over the past week?

Of Other Interest…

Moose Detector System Out For Extended Periods

Software Issue Delays Work for Months on Guernsey Airport’s New £3.5 million Radar

Everbright Securities Fat Finger Trading Error Roils Shanghai Stock Market

Computer Problem Forces Arizona Lottery to Issue New Pick 3 Tickets

BT Sport Issue Angers Premier League Fans


Photo: Remy Gabalda/AFP/Getty Images

This Week in Cybercrime: Computer Glitch Opens Prison Doors?

Florida prison officials are trying to figure out whether a computer glitch may be behind two recent, as yet unexplained incidents where all of the doors at a facility’s maximum-security wing opened simultaneously. In the latest occurrence, on 13 June, guards at the Turner Guilford Knight Correctional Center in Miami, Florida, had to rush to corral prisoners back into their cells after a “group release” button in the computerized system was triggered. The entire facility, including locks on cell doors, surveillance cameras, water and electricity, and other systems, has been automated. Anyone who gains full access to the network—whether from a touch-screen monitor in the guard tower, or from outside, via a security hole—can control any of these functions. Guards say they don’t know how it happened, and recently released surveillance footage does not pinpoint the source of the errant command.


IT Hiccups of the Week: Sabre Outage Hit Flights Worldwide

It’s been another eventful week in the land of IT snags, snarls and complications. We start with another major airline reservation system hiccup, but this time one with world-wide effects.

Sabre Reservation System Outage Affects 400 Airlines

The Sabre airline reservation system experienced a still-unexplained two-hour-plus “system issue” beginning at 8:40 p.m. PDT on Monday evening. The outage affected airlines around the world. (Sabre’s website states that about 380 airlines use its reservation system.) As a result, ticket agents needed to handwrite boarding passes, and the affected airlines’ websites couldn’t book or change reservations. How severely you felt the outage depended on where in the world you happened to be at the time.

In the U.S., a small number of late-night West Coast domestic and international flights experienced delays. In Europe and the Middle East, the effects were felt a bit more, as it was early to mid-morning. The greatest problems were felt in Australia, where it was early afternoon Sydney time. Virgin Australia got the worst of it.

Virgin Australia reportedly had to cancel 35 domestic and international flights and delay many others. For whatever reason, Virgin seems snake-bit when it comes to reservation system IT issues. This complication came on the heels of a router outage that occurred just a few weeks ago. You may remember its nearly two-week reservation meltdown back in 2010, as well. Virgin moved from Navitaire’s New Skies platform, which was at the heart of the 2010 meltdown, to the Sabre system earlier this year. The costs incurred in the switch helped send the company into the red.

Although the exact number of passengers affected is unknown, it was undoubtedly thousands worldwide given that Sabre says some 300 million passenger reservations are processed by its system every year. On Tuesday, in the wake of the outage, Sabre sent out the standard, “We apologize and regret the inconvenience caused.”

Denver International Airport officials were also apologizing to passengers on Tuesday, as all 1200 airport flight boards were out for most of the day. Maintenance had been performed on Monday night on the system that runs the flight boards. The boards were not restored to operation until late Tuesday evening.

False Emergency Warnings Sent in Japan, Virginia, California

The Japan Meteorological Agency sent an emergency message last Thursday warning most of Japan to expect “violent shaking” after detecting a magnitude-2.3 earthquake in Wakayama prefecture in western Japan at 4:56 p.m. local time, Bloomberg News reported. According to the story, the Wakayama earthquake prompted JMA’s warning system to predict that a magnitude-7.8 earthquake was possible.

As a result of the warning, Central Japan Railway suspended some bullet train operations, and a number of mobile phone networks became jammed as a multitude of people called friends and family.

However, it soon became clear that the prediction was in error. The JMA blamed the false warning on “electrical noise” on the ocean floor, and offered a televised apology. The JMA admitted that Thursday’s incorrect warning was the “biggest misreading” since the early warning system was launched in 2007.

On Wednesday morning, human error was blamed for a tornado alert mistakenly being sent to 500 people in the Charlottesville-Albemarle County area of Virginia, the AP reported. Apparently the notification was sent during a training session on how to use the local emergency alert system.

Also on Wednesday, a real emergency alert of a reported gas leak was sent to more people than intended. The automated wireless emergency alert message, which urged residents and businesses to evacuate immediately and take only essential belongings with them, was sent out across all of Contra Costa County, California, instead of only to the homes and businesses within a 1000-foot radius of a damaged gas pipe. County officials said that they would be working with the vendor the county uses to send alerts to ensure that the messages are better targeted, the San Jose Mercury News reported.

Xerox to Patch Scanner Feature

I doubt many people would think to look at a document they scanned to check whether what was scanned actually matches the original. It might be a good idea to do so in the future, however.

Last Tuesday, a BBC News story reported that German computer scientist David Kriesel “discovered” that the compression software used by several Xerox scanner models had the nasty habit of changing the characters in the scanned document from those on the original document. The Daily Mail published an article showing some of the changes that could result. The legal implications, a London lawyer told the BBC, were “interesting.”

Xerox played down the error, however, saying that the vast majority of scanner users would never experience the problem, since it happened only when the scanner’s default resolution setting was changed to low resolution in order to save smaller files. The character substitution issue was long known to be a possibility, and a warning about it was in all its user manuals, Xerox said. However, in light of the uproar the BBC News story generated, which was also fueled by Xerox’s nonchalant response to the issue, Xerox said it would be sending out a patch in the next few weeks to disable the highest compression mode, which it claimed would eliminate the problem.

Even so, it might be a good idea to routinely check over your scanned documents just in case, and also maybe read your scanner’s user manual. There may actually be something useful in it.

Also of Interest…

Navy Explains Why the USS Guardian Got Stuck on “Misplaced” Philippine Reef

BATS Exchange Experiences Another Outage

New Zealand Vodafone's Data Networks Problems Affect Mobile EFTPOS

X-Ray Computer System Problems Continue Unabated in Kent, England

Wisconsin State Government Recovering From Computer Crash

Los Angeles Fire Department 911 System Repeatedly Breaks Down

Japanese Luxury Toilet Has Computer Hardware Flaw

Photo: iStockphoto


Risk Factor

IEEE Spectrum's risk analysis blog, featuring daily news, updates and analysis on computing and IT projects, software and systems failures, successes and innovations, security threats, and more.

Willie D. Jones