Clinical Research Data Scientist, Belgium - Beerse   
Salary: Negotiable (GBP)
Benefits:
Location: Belgium, Beerse
Company: CK Clinical
Posted: 30 June 2017 15:31:35
Expiry: 28 July 2017 00:00:00
          Data Scientist - Cisco - San Jose, CA   
We Are Cisco. Excellent written and oral communication skills, able to communicate with all levels of SSO internal technology teams and business....
From Cisco Systems - Tue, 18 Apr 2017 00:04:32 GMT - View all San Jose, CA jobs
          Sr. Data Scientist - Cisco - San Jose, CA   
We Are Cisco. Excellent written and oral communication skills, able to communicate with all levels of SSO internal technology teams and business....
From Cisco Systems - Thu, 23 Mar 2017 23:32:26 GMT - View all San Jose, CA jobs
          Comment on creating a TrueType font from your handwriting with your scanner, your printer, and FontForge by Playing with Randall Munroe’s XKCD handwriting | Open Data Science   
[…] already out there on this topic, particularly one from 2010 regarding the creation of fonts from a hand-written sample. Notably though, there isn’t much that is automated – this is a rub for me, as without […]
          Greenfield Advisors Adds Principal Data Scientist to Its Staff   

Dr. Andy Krause will be the lead for several new products aimed at data collection and analysis.

(PRWeb June 29, 2017)

Read the full story at http://www.prweb.com/releases/2017/07/prweb14474891.htm


          UPS Sr. Data Scientist   
NJ-Mahwah, Senior Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes and how!” We are UPS. We are the United Problem Solvers. About Information Management at UPS Technology: Our Information Management teams are responsible for designing and supporting data solutions to meet UPS’s rapidly changing business needs
          UPS Lead Data Scientist   
NJ-Mahwah, Lead Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes and how!” We are UPS. We are the United Problem Solvers. About Information Management at UPS Technology: Our Information Management teams are responsible for designing and supporting data solutions to meet UPS’s rapidly changing business needs.
          2 Research Associates (PostDoc), Research and Teaching in Social Data Science   
2 Research Associates (PostDoc), Research and Teaching in Social Data Science - RWTH AACHEN UNIVERSITY
          What Is Automatic Data Capture? How Hedge Funds Can Trade On Heaven-Sent Data   
Delegates at Newsweek and International Business Times' data science in capital markets event were mesmerised by a video of shoe box-sized satellites, known as "cube sats" being released into the earth's atmosphere. They were then shown the speeded up footage that was being captured: giant oil tanks with floating lids, rising like tides; ships being built from scratch in dry docks; burning flares from steel mills; a picture of the agricultural yields of the entire planet.
          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Learn Data Science in Python with theDevMasters   
          Global Knowledge Hosts a Free Webinar on the Data Scientist Profession on Thursday, July 6, 2017   
Global Knowledge is hosting a free webinar on the Data Scientist profession
on Thursday, July 6, 2017

Although the Harvard Business Review named data scientist the "sexiest job of the 21st century," there is still no consensus on its definition or on how it is perceived. Some critics consider data science a superfluous label or a mere buzzword that exists only to spice up a CV and attract recruiters' attention....
          (USA-TN-Chattanooga) Data Science Analyst-Enterprise Modeling & Governance Support   
**Description:** This position will focus on the use of information and analytics to improve health and optimize customer and business processes. It also requires the ability to pair technical and analytical skills to perform day-to-day job duties. May also assist in the design and implementation of systems and use programming skills to analyze and report insights that drive action. Assist in the research, validation and development of Predictive models and Identification algorithms.

**Responsibilities:**

  • Identify, extract, manipulate, analyze and summarize data to deliver insights to business stakeholders.
  • Source data can consist of medical and pharmacy claims, program activity and participation data, as well as demographic, census, biometric, marketing and health risk assessment data.
  • Perform Model Governance duties such as maintaining a library of Predictive Models and monitoring the accuracy and performance of these models, as well as other required model governance activities.

**Qualifications:**

  • **Bachelor’s degree in Math, Statistics or Public Health or professional analytical experience. Qualifying backgrounds include: epidemiologists, quantitative MBAs, quantitative sociologists, data miners, behavioral economists, qualitative researchers, economists, statisticians, or biostatisticians.**
  • Strong analytical, communication and technical skills
  • Problem solving and critical thinking skills
  • Experience extracting and manipulating large data (i.e., a minimum of 1 million records) across multiple data platforms.
  • Familiarity with healthcare claims data
  • At least 1 year of coding experience in SAS and/or SQL
  • Familiarity with Hadoop and Teradata coding highly desired.

**US Candidates Only:** Qualified applicants will be considered for employment without regard to race, color, religion, national origin, sex, sexual orientation, gender identity, disability, veteran status.
If you require a special accommodation, please visit our Careers website or contact us at SeeYourself@cigna.com.

**Primary Location:** Bloomfield-Connecticut **Other Locations:** United States-North Carolina-Raleigh, United States-Colorado-Greenwood Village, United States-Tennessee-Chattanooga, United States-Pennsylvania-Philadelphia **Work Locations:** 900 Cottage Grove Road Wilde Bloomfield 06152 **Job:** Bus Ops--Operations Mgmt (Bus) **Schedule:** Regular **Shift:** Standard **Employee Status:** Individual Contributor **Job Type:** Full-time **Job Level:** Day Job **Travel:** Yes, 25 % of the Time **Job Posting:** Jun 29, 2017, 10:35:24 AM
          Data Scientist - Unilever - Singapore   
Over half (57%) of the company’s footprint is in developing and emerging markets. Unilever is one of the world’s leading suppliers of Food, Home and Personal...
From Unilever - Tue, 27 Jun 2017 11:40:22 GMT - View all Singapore jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Machine Learning - Expert. Our Information Management teams are responsible for designing and supporting data solutions to meet UPS’s rapidly changing business...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Cyber Computer Scientist II/ Data Scientist - Battelle - Columbus, OH   
Experience with high level software languages (Python preferred, or demonstrated Java, C#, C++, Go, Haskell, Rust). Battelle is guided by a founding mission....
From Battelle - Wed, 14 Jun 2017 10:30:07 GMT - View all Columbus, OH jobs
          TECHNOLOGY: DATA SCIENCE DEVELOPER   
Diversity Recruitment - Johannesburg, Gauteng - There is a new and exciting opportunity to join a collaborative, engaged and passionate technology team. We are looking for colleagues who share our conviction that fact-based decision-making makes a difference in the lives of others. If you are inspired by solutions that empower posit...
          Manager, Quantitative Analytics/Data Science - Scotiabank - Toronto, ON   
Assess the impact of GL reconciliation results on capital and critical risk reports. Join the Global Community of Scotiabankers to help customers become better...
From Scotiabank - Thu, 29 Jun 2017 18:49:25 GMT - View all Toronto, ON jobs
          Scientific Data Scientist: VECTOR RECRUITMENT LIMITED   
£Excellent + Benefits package + Relocation package + Bonus: VECTOR RECRUITMENT LIMITED
For more latest jobs and jobs in East of England visit brightrecruits.com
          Senior Data Scientist - USA-CA-Menlo Park   
Who You Are Making sense out of complex data is core to our business at ******* and we’re looking for talented Data Scientists to join the team. At ******* , we look to integrate large amounts ...
          Agile Data Science 2.0 Building Full-Stack Data Analytics Applications with Spark   
Agile Data Science 2.0: Building Full-Stack Data Analytics Applications with Spark by Russell Jurney. English | 7 Jun. 2017 | ASIN: B072MKL34K | 352 Pages | AZW3 | 5.91 MB
          Director, Data Scientist - KPMG - Atlanta, GA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Tue, 16 May 2017 08:29:26 GMT - View all Atlanta, GA jobs
          Director, Data Scientist - KPMG - Santa Clara, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Santa Clara, CA jobs
          Director, Data Scientist - KPMG - Irvine, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:26 GMT - View all Irvine, CA jobs
          Data Scientist - IBM - Austin, TX   
Opportunities to implement machine learning into support business processes. Business process optimization....
From IBM - Tue, 06 Jun 2017 21:03:26 GMT - View all Austin, TX jobs
          Director, Data Scientist - KPMG - Seattle, WA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Seattle, WA jobs
          How to replace yourself with a very small shell script   

Data scientist Hilary Mason talks through her astoundingly useful collection of small shell scripts that automate all the choresome parts of her daily communications: processes that remind people when they owe her an email; that remind her when she accidentally drops her end of an exchange; that alert her when a likely important email arrives (freeing her from having to check and recheck her email to make sure that nothing urgent is going on). It's a hilarious and enlightening talk that offers a glimpse into the kinds of functionality that users can provide for themselves when they run their own infrastructure and aren't at the mercy of giant webmail companies. (via Clive Thompson)
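
To make the reminder idea concrete, here is a minimal sketch of the "who owes whom a reply" check. It is not Mason's code: her scripts are shell scripts, and this illustration uses Python instead. The `Message` record, the `stale_threads` function, the three-day threshold, and the assumption that mail has already been parsed into thread ids, senders, and timestamps are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Message:
    # Hypothetical record; a real script would parse these fields from a mailbox.
    thread_id: str
    sender: str
    sent_at: datetime


def stale_threads(messages, me, now, max_age=timedelta(days=3)):
    """Split stale threads into those where I owe a reply and those where others do.

    A thread is stale when its most recent message is at least max_age old.
    If I sent that last message, the other party owes a reply; otherwise I do.
    """
    # Find the most recent message in each thread.
    last = {}
    for m in messages:
        if m.thread_id not in last or m.sent_at > last[m.thread_id].sent_at:
            last[m.thread_id] = m

    i_owe, they_owe = [], []
    for thread_id, m in sorted(last.items()):
        if now - m.sent_at < max_age:
            continue  # recent activity, no reminder needed
        if m.sender == me:
            they_owe.append(thread_id)
        else:
            i_owe.append(thread_id)
    return i_owe, they_owe
```

Run on a schedule, the first list could drive a reminder to oneself and the second a nudge to correspondents; how those notifications are sent is left out here.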

          Why I Am Not Enthusiastic About Ed Tech   

Writing at EdSurge, Jason Palmer, a General Partner at an outfit called New Markets Venture Partners, has declared himself “optimistic about the next wave of education technology.” 

Palmer believes the time is finally fully ripe for the education technology industry as the right combination of regulatory change, advances in technology, and philanthropic support have put “all the key building blocks in place.”

Palmer's optimism is the reason for my pessimism. The mix of companies in the New Markets Venture Portfolio reveals an emphasis on credentialing, and perhaps my least favorite ed tech trend, “personalized learning” software. Palmer believes the relative financial health of these ed tech start-ups is a good sign for education technology.

Maybe so, but I believe it’s a bad sign for education.

As Audrey Watters shows in her talk “Pigeons and Personalization: The Histories of ‘Personalized Learning,’” “personalized learning” dates to the earliest teaching machines, and underneath those promises of individualization is an undermining of institutions and the people who work within them. When education can be delivered “personally” by machines, the only relationship needs to be with the corporation serving up the software. 

This is, of course, big business. In my home state of South Carolina, the Post & Courier newspaper recently reported that over a 10-year period, $350 million of funding had been funneled to “online charter schools,” to “dismal results.” 

Since a 2007 bill paved the way for taxpayer money to be funneled their way, the five virtual charter schools have seen enrollment increase from 2100 to 10,000 students. Our current Secretary of Education, Betsy DeVos, is a strong supporter of increasing and expanding the use of vouchers, including to online schools.

All but one of these schools are for-profit entities. But even the schools themselves often merely serve as fronts for larger corporations. Of the $24 million S.C. Connections Academy received from the state in 2015-16, $22 million went to Connections Education LLC, a subsidiary of Pearson.

An online personalized virtual charter school in South Carolina comes courtesy of a massive educational publishing and assessment conglomerate headquartered in the United Kingdom.

While $350 million of taxpayer money was going to these corporations, South Carolina is “desperate” for teachers. 6500 teachers did not return for the 2016-17 year, an increase of 21% over the previous year. South Carolina ranks 38th in the nation for teacher salaries and is now turning to an alternative certification program through Teachers of Tomorrow (a corporation), which offers a set of virtual online courses that qualify participants for a provisional license to teach. 

In 2014 the National Council on Teacher Quality gave A+ Texas Teachers (the entity from which Teachers of Tomorrow was born) an “F” rating for their credentialing program. Teachers of Tomorrow claims they’ve changed a lot since then.

In sum:

  • A state that ranks poorly in education embraced virtual charters as a way to serve students who have fallen through the cracks of their under-resourced schools to the tune of $350 million, money primarily funneled to out-of-state (out-of-country) corporations.
  • The poorly paid, under-resourced teachers are fleeing their jobs in record numbers, necessitating the use of alternative credentialing, which happens through virtual education provided by corporations.

There’s a certain cruel beauty to the formula. The state of South Carolina is now at least partially captive to outside corporations to educate their students and train their teachers. As more students enter the virtual space, even more resources are drained from public schools, which drives even greater need for these virtual credentialing services.

They’ve got us coming and going. South Carolina is not unique in this pattern. Meanwhile, I wonder what kind of difference an extra $350 million might’ve made to teacher salaries and teacher retention.

As a teacher, I am a fan of integrating technology into the classroom and have relied on dedicated ed tech professionals to help me explore these possibilities. I know of many of these people who want nothing more than to figure out the best ways to use technology to help students learn.

But these good motives mean very little up against the might of industry, particularly where industry is encouraged to colonize previously public spaces in order to lap up our taxpayer dollars. Even if you are not a believer in education as a public good as I am, surely the waste and inefficiency should bother you. As reported by the Post & Courier, the S.C. Virtual Charter School, with an enrollment of 3275, has received over $150 million in taxpayer funding since 2008 for an on-time graduation rate of 37% and a dropout rate near 25%. 

IHE resident ed tech blogger Josh Kim recently wrote about Courtney Maum’s satirical novel, Touch, and asked if we could see a future where there is “an inverse relationship between the exclusivity (and cost) of an education and the amount of technology incorporated into the experience?” 

In fact, I can, because we’re living it now. While I have no doubt you will find technology in elite private schools such as Sidwell Friends (alma mater of Malia Obama), or Lakeside Prep (where Bill Gates’ children go to school), I feel confident there is no personalized learning software substituting for a teacher in these educational spaces.

The students in these schools get to use technology as tools for learning, as it should be. These young people get to imagine futures as “data scientists,” while Silicon Valley pushes students in public education towards “coding,” and demands we be thankful for it. 

As long as more and more children who are born into less fortunate circumstances are subjected to “Let them eat pixels” futures, I will be hard-pressed to feel enthusiastic about education technology.

          Data Scientist Data Analyst   
Düsseldorf. Permanent position, full-time. Introduction: Want to help change the world of recruitment …
          Data Scientist – Data Engineer   
You are a data scientist with a strong background in machine learning and software technologies? …
          Data Scientist Data Lab   
Responsibilities You work at the leading edge of the energy revolution for our business customers …
          DATA SCIENTIST DATA ENGINEER FOR BIG DATA   
Grow with technical challenges. Take your next career step as a DATA SCIENTIST / …
          Data scientist - data source management   
Madrid Energía sin fronteras
          Senior Data Scientist - Microsoft - Redmond, WA   
Office 365 is the locomotive engine that uplifts enterprise & consumer monetization with millions of active users in our datacenters around the globe....
From Microsoft - Tue, 06 Jun 2017 13:26:20 GMT - View all Redmond, WA jobs
          Developing and Evaluating Digital Interventions to Promote Behavior Change in Health and Health Care: Recommendations Resulting From an International Workshop   
Devices and programs using digital technology to foster or support behavior change (digital interventions) are increasingly ubiquitous, being adopted for use in patient diagnosis and treatment, self-management of chronic diseases, and in primary prevention. They have been heralded as potentially revolutionizing the ways in which individuals can monitor and improve their health behaviors and health care by improving outcomes, reducing costs, and improving the patient experience. However, we are still mainly in the age of promise rather than delivery. Developing and evaluating these digital interventions presents new challenges and new versions of old challenges that require use of improved and perhaps entirely new methods for research and evaluation. This article discusses these challenges and provides recommendations aimed at accelerating the rate of progress in digital behavior intervention research and practice. Areas addressed include intervention development in a rapidly changing technological landscape, promoting user engagement, advancing the underpinning science and theory, evaluating effectiveness and cost-effectiveness, and addressing issues of regulatory, ethical, and information governance. This article is the result of a two-day international workshop on how to create, evaluate, and implement effective digital interventions in relation to health behaviors. It was held in London in September 2015 and was supported by the United Kingdom’s Medical Research Council (MRC), the National Institute for Health Research (NIHR), the Methodology Research Programme (PI Susan Michie), and the Robert Wood Johnson Foundation of the United States (PI Kevin Patrick). Important recommendations to manage the rapid pace of change include considering using emerging techniques from data science, machine learning, and Bayesian approaches and learning from other disciplines including computer science and engineering. 
With regard to assessing and promoting engagement, a key conclusion was that sustained engagement is not always required and that for each intervention it is useful to establish what constitutes “effective engagement,” that is, sufficient engagement to achieve the intended outcomes. The potential of digital interventions for testing and advancing theories of behavior change by generating ecologically valid, real-time objective data was recognized. Evaluations should include all phases of the development cycle, designed for generalizability, and consider new experimental designs to make the best use of rich data streams. Future health economics analyses need to recognize and model the complex and potentially far-reaching costs and benefits of digital interventions. In terms of governance, developers of digital behavior interventions should comply with existing regulatory frameworks, but with consideration for emerging standards around information governance, ethics, and interoperability.

          Hacker News Suggestions for Engineering Team Blogs   
See this comment page on Hacker News for suggestions for engineering team blogs. Some of the suggested blogs: Google Research Blog, Giant Robots Smashing into Other Giant Robots, Airbnb Engineering and Data Science, Riot Games Engineering, Google Testing Blog, Netflix Technology …
          Bonsai Expands TensorFlow Support with Gears, Extending Functionality of AI Platform for Enterprises Building Industrial Applications   
Bonsai, provider of an AI platform that empowers enterprises to build and deploy intelligent systems, released Gears, a top feature requested by customers in the Bonsai Early Access Program. Gears further extends the value of Bonsai to data scientists, providing them with a tool to manage, deploy and scale previously developed machine learning models, including those built with TensorFlow, within the Bonsai Platform.
          Software Developer - Data Science/Machine Learning - Leidos - Hanover, MD   
Java/JEE, JavaScript, Java Expression Language (JEXL), J1BX, Flex, EXT - JS, JSP, .NET, AJAX, SEAM, C, C++, PHP, Ruby / Ruby-on-Rails, SQL, MS SQL Server, MySQL...
From Leidos - Thu, 22 Jun 2017 10:40:48 GMT - View all Hanover, MD jobs
          Hiring for SBA/ Assistant Manager - SAS Analytics in Delhi/NCR(National Capital Region), Gurgaon for (Delhi Job)   
Job Description: Job Location: Gurgaon - SAS, SQL, Unix. Responsibilities: - Conceptualize, design and execute analytical models to embed within the Analytics portfolio, working with a team of data scientists - Test model performance against competitive a...
          The Ultimate Data Infrastructure Architect Bundle for $36   
From MongoDB to Apache Flume, This Comprehensive Bundle Will Have You Managing Data Like a Pro In No Time
Expires June 01, 2022 23:59 PST
Buy now and get 94% off

Learning ElasticSearch 5.0


KEY FEATURES

Learn how to use ElasticSearch in combination with the rest of the Elastic Stack to ship, parse, store, and analyze logs! You'll start by getting an understanding of what ElasticSearch is, what it's used for, and why it's important before being introduced to the new features of ElasticSearch 5.0.

  • Access 35 lectures & 3 hours of content 24/7
  • Go through each of the fundamental concepts of ElasticSearch such as queries, indices, & aggregation
  • Add more power to your searches using filters, ranges, & more
  • See how ElasticSearch can be used w/ other components like LogStash, Kibana, & Beats
  • Build, test, & run your first LogStash pipeline to analyze Apache web logs

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Ethan Anthony is a San Francisco-based Data Scientist who specializes in distributed data-centric technologies. He is also the Founder of XResults, where the vision is to harness the power of data to innovate and deliver intuitive customer-facing solutions, largely to non-technical professionals. Ethan has over 10 combined years of experience in cloud-based technologies such as Amazon Web Services and OpenStack, as well as the data-centric technologies of Hadoop, Mahout, Spark and ElasticSearch. He began using ElasticSearch in 2011 and has since delivered solutions based on the Elastic Stack to a broad range of clientele. Ethan has also consulted worldwide, speaks fluent Mandarin Chinese and is insanely curious about human cognition, as related to cognitive dissonance.

Apache Spark 2 for Beginners


KEY FEATURES

Apache Spark is one of the most widely-used large-scale data processing engines and runs at extremely high speeds. It's a framework that has tools that are equally useful for app developers and data scientists. This book starts with the fundamentals of Spark 2 and covers the core data processing framework and API, installation, and application development setup.

  • Access 45 lectures & 5.5 hours of content 24/7
  • Learn the Spark programming model through real-world examples
  • Explore Spark SQL programming w/ DataFrames
  • Cover the charting & plotting features of Python in conjunction w/ Spark data processing
  • Discuss Spark's stream processing, machine learning, & graph processing libraries
  • Develop a real-world Spark application

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Rajanarayanan Thottuvaikkatumana, Raj, is a seasoned technologist with more than 23 years of software development experience at various multinational companies. He has lived and worked in India, Singapore, and the USA, and is presently based out of the UK. His experience includes architecting, designing, and developing software applications. He has worked on various technologies including major databases, application development platforms, web technologies, and big data technologies. Since 2000, he has been working mainly in Java related technologies, and does heavy-duty server-side programming in Java and Scala. He has worked on very highly concurrent, highly distributed, and high transaction volume systems. Currently he is building a next generation Hadoop YARN-based data processing platform and an application suite built with Spark using Scala.

Raj holds one master's degree in Mathematics, one master's degree in Computer Information Systems and has many certifications in ITIL and cloud computing to his credit. Raj is the author of Cassandra Design Patterns - Second Edition, published by Packt.

When not working on the assignments his day job demands, Raj is an avid listener to classical music and watches a lot of tennis.

Designing AWS Environments


KEY FEATURES

Amazon Web Services (AWS) provides trusted, cloud-based solutions to help businesses meet all of their needs. Running solutions in the AWS Cloud can help you (or your company) get applications up and running faster while providing the security needed to meet your compliance requirements. This course leaves no stone unturned in getting you up to speed with administering AWS.

  • Access 19 lectures & 2 hours of content 24/7
  • Familiarize yourself w/ the key capabilities to architect & host apps, websites, & services on AWS
  • Explore the available options for virtual instances & demonstrate launching & connecting to them
  • Design & deploy networking & hosting solutions for large deployments
  • Focus on security & important elements of scalability & high availability

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Wayde Gilchrist started moving customers of his IT consulting business into the cloud and away from traditional hosting environments in 2010. In addition to consulting, he delivers AWS training for Fortune 500 companies, government agencies, and international consulting firms. When he is not out visiting customers, he is delivering training virtually from his home in Florida.

Learning MongoDB


KEY FEATURES

Businesses today have access to more data than ever before, and a key challenge is ensuring that data can be easily accessed and used efficiently. MongoDB makes it possible to store and process large sets of data in ways that drive up business value. Learning MongoDB will give you the flexibility of unstructured storage, combined with robust querying and post-processing functionality, making you an asset to enterprise Big Data needs.

  • Access 64 lectures & 40 hours of content 24/7
  • Master data management, queries, post processing, & essential enterprise redundancy requirements
  • Explore advanced data analysis using both MapReduce & the MongoDB aggregation framework
  • Delve into SSL security & programmatic access using various languages
  • Learn about MongoDB's built-in redundancy & scale features, replica sets, & sharding

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Daniel Watrous is a 15-year veteran of designing web-enabled software. His focus on data store technologies spans relational databases, caching systems, and contemporary NoSQL stores. For the last six years, he has designed and deployed enterprise-scale MongoDB solutions in semiconductor manufacturing and information technology companies. He holds a degree in electrical engineering from the University of Utah, focusing on semiconductor physics and optoelectronics. He also completed an MBA from the Northwest Nazarene University. In his current position as senior cloud architect with Hewlett Packard, he focuses on highly scalable cloud-native software systems.

Learning Hadoop 2


KEY FEATURES

Hadoop emerged in response to the proliferation of masses and masses of data collected by organizations, offering a strong solution to store, process, and analyze what has commonly become known as Big Data. It comprises a comprehensive stack of components designed to enable these tasks on a distributed scale, across multiple servers and thousands of machines. In this course, you'll learn Hadoop 2, introducing yourself to the powerful system synonymous with Big Data.

  • Access 19 lectures & 1.5 hours of content 24/7
  • Get an overview of the Hadoop component ecosystem, including HDFS, Sqoop, Flume, YARN, MapReduce, Pig, & Hive
  • Install & configure a Hadoop environment
  • Explore Hue, the graphical user interface of Hadoop
  • Discover HDFS to import & export data, both manually & automatically
  • Run computations using MapReduce & get to grips working w/ Hadoop's scripting language, Pig
  • Siphon data from HDFS into Hive & demonstrate how it can be used to structure & query data sets

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Randal Scott King is the Managing Partner of Brilliant Data, a consulting firm specializing in data analytics. In his 16 years of consulting, Scott has amassed an impressive list of clientele from mid-market leaders to Fortune 500 household names. Scott lives just outside Atlanta, GA, with his children.

ElasticSearch 5.x Cookbook eBook


KEY FEATURES

ElasticSearch is a Lucene-based distributed search server that allows users to index and search unstructured content with petabytes of data. Through this ebook, you'll be guided through comprehensive recipes covering what's new in ElasticSearch 5.x as you create complex queries and analytics. By the end, you'll have an in-depth knowledge of how to implement the ElasticSearch architecture and be able to manage data efficiently and effectively.

  • Access 696 pages of content 24/7
  • Perform index mapping, aggregation, & scripting
  • Explore the modules of Cluster & Node monitoring
  • Understand how to install Kibana to monitor a cluster & extend Kibana for plugins
  • Integrate your Java, Scala, Python, & Big Data apps w/ ElasticSearch

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Alberto Paro is an engineer, project manager, and software developer. He currently works as a freelance trainer/consultant on big data technologies and NoSQL solutions. He loves to study emerging solutions and applications mainly related to big data processing, NoSQL, natural language processing, and neural networks. He began programming in BASIC on a Sinclair Spectrum when he was eight years old, and to date, has collected a lot of experience using different operating systems, applications, and programming languages.

In 2000, he graduated in computer science engineering from Politecnico di Milano with a thesis on designing multiuser and multidevice web applications. He assisted professors at the university for about a year. He then came in contact with The Net Planet Company and loved their innovative ideas; he started working on knowledge management solutions and advanced data mining products. In summer 2014, his company was acquired by a big data technologies company, where he worked until the end of 2015 mainly using Scala and Python on state-of-the-art big data software (Spark, Akka, Cassandra, and YARN). In 2013, he started freelancing as a consultant for big data, machine learning, Elasticsearch and other NoSQL products. He has created or helped to develop big data solutions for business intelligence, financial, and banking companies all over the world. A lot of his time is spent teaching how to efficiently use big data solutions (mainly Apache Spark), NoSQL datastores (Elasticsearch, HBase, and Accumulo) and related technologies (Scala, Akka, and Playframework). He is often called to present at big data or Scala events. He is an evangelist on Scala and Scala.js (the transcompiler from Scala to JavaScript).

In his spare time, when he is not playing with his children, he likes to work on open source projects. When he was in high school, he started contributing to projects related to the GNOME environment (gtkmm). One of his preferred programming languages is Python, and he wrote one of the first NoSQL backends on Django for MongoDB (Django-MongoDBengine). In 2010, he began using Elasticsearch to provide search capabilities to some Django e-commerce sites and developed PyES (a Pythonic client for Elasticsearch), as well as the initial part of the Elasticsearch MongoDB river. He is the author of Elasticsearch Cookbook as well as a technical reviewer of Elasticsearch Server-Second Edition, Learning Scala Web Development, and the video course, Building a Search Server with Elasticsearch, all of which are published by Packt Publishing.

Fast Data Processing with Spark 2 eBook


KEY FEATURES

Compared to Hadoop, Spark is a significantly simpler way to process Big Data at speed. It is increasing in popularity with data analysts and engineers everywhere, and in this ebook you'll learn how to use Spark with minimum fuss. Starting with the fundamentals, it will help you take your Big Data analytical skills to the next level.

  • Access 274 pages of content 24/7
  • Get to grips w/ some simple APIs before investigating machine learning & graph processing
  • Learn how to use the Spark shell
  • Load data & build & run your own Spark applications
  • Discover how to manipulate RDD
  • Understand useful machine learning algorithms w/ the help of Spark MLlib & R

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Krishna Sankar is a Senior Specialist - AI Data Scientist with Volvo Cars focusing on Autonomous Vehicles. His earlier stints include Chief Data Scientist at http://cadenttech.tv/, Principal Architect/Data Scientist at Tata America Intl. Corp., Director of Data Science at a bioinformatics startup, and Distinguished Engineer at Cisco. He has spoken at various conferences, including ML tutorials at Strata SJC and London 2016, Spark Summit, Strata-Spark Camp, OSCON, PyCon, and PyData. He writes about Robots Rules of Order, Big Data Analytics - Best of the Worst, predicting NFL, Spark, Data Science, Machine Learning, and Social Media Analysis, and has been a guest lecturer at the Naval Postgraduate School. His occasional blogs can be found at https://doubleclix.wordpress.com/. His other passions are flying drones (he is working towards a Drone Pilot License (FAA UAS Pilot)) and Lego Robotics; you will find him at the St. Louis FLL World Competition as Robots Design Judge.

MongoDB Cookbook: Second Edition eBook


KEY FEATURES

MongoDB is a high-performance, feature-rich, NoSQL database that forms the backbone of the systems that power many organizations. Packed with easy-to-use features that have become essential for a variety of software professionals, MongoDB is a vital technology to learn for any aspiring data scientist or systems engineer. This cookbook contains many solutions to the everyday challenges of MongoDB, as well as guidance on effective techniques to extend your skills and capabilities.

  • Access 274 pages of content 24/7
  • Initialize the server in three different modes w/ various configurations
  • Get introduced to programming language drivers in Java & Python
  • Learn advanced query operations, monitoring, & backup using MMS
  • Find recipes on cloud deployment, including how to work w/ Docker containers alongside MongoDB

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Amol Nayak is a MongoDB certified developer and has been working as a developer for over 8 years. He is currently employed with a leading financial data provider, working on cutting-edge technologies. He has used MongoDB as a database for various systems at his current and previous workplaces to support enormous data volumes. He is an open source enthusiast and supports it by contributing to open source frameworks and promoting them. He has made contributions to the Spring Integration project, and his contributions are the adapters for JPA, XQuery, MongoDB, Push notifications to mobile devices, and Amazon Web Services (AWS). He has also made some contributions to the Spring Data MongoDB project. Apart from technology, he is passionate about motor sports and is a race official at Buddh International Circuit, India, for various motor sports events. Earlier, he was the author of Instant MongoDB, Packt Publishing.

Cyrus Dasadia has liked tinkering with open source projects since 1996. He has been working as a Linux system administrator and part-time programmer for over a decade. He works at InMobi, where he loves designing tools and platforms. His love for MongoDB started in 2013, when he was amazed by its ease of use and stability. Since then, almost all of his projects have been written with MongoDB as the primary backend. Cyrus is also the creator of an open source alert management system called CitoEngine. He likes spending his spare time trying to reverse engineer software, playing computer games, or increasing his silliness quotient by watching reruns of Monty Python.

Learning Apache Kafka: Second Edition eBook


KEY FEATURES

Apache Kafka is simple to describe at a high level but has an immense amount of technical detail when you dig deeper. This step-by-step, practical guide will help you take advantage of the power of Kafka to handle hundreds of megabytes of messages per second from multiple clients.

  • Access 120 pages of content 24/7
  • Set up Kafka clusters
  • Understand basic blocks like producer, broker, & consumer blocks
  • Explore additional settings & configuration changes to achieve more complex goals
  • Learn how Kafka is designed internally & what configurations make it most effective
  • Discover how Kafka works w/ other tools like Hadoop, Storm, & more

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Nishant Garg has over 14 years of software architecture and development experience in various technologies, such as Java Enterprise Edition, SOA, Spring, Hadoop, Hive, Flume, Sqoop, Oozie, Spark, Shark, YARN, Impala, Kafka, Storm, Solr/Lucene, NoSQL databases (such as HBase, Cassandra, and MongoDB), and MPP databases (such as GreenPlum).

He received his MS in software systems from the Birla Institute of Technology and Science, Pilani, India, and is currently working as a technical architect for the Big Data R&D Group with Impetus Infotech Pvt. Ltd. Previously, Nishant has enjoyed working with some of the most recognizable names in IT services and financial industries, employing full software life cycle methodologies such as Agile and SCRUM.

Nishant has undertaken many speaking engagements on big data technologies and is also the author of HBase Essentials, Packt Publishing.

Apache Flume: Distributed Log Collection for Hadoop: Second Edition eBook


KEY FEATURES

Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It's used to stream logs from application servers to HDFS for ad hoc analysis. This ebook starts with an architectural overview of Flume and its logical components, and pulls everything together into a real-world, end-to-end use case encompassing simple and advanced features.

  • Access 178 pages of content 24/7
  • Explore channels, sinks, & sink processors
  • Learn about sources & channels
  • Construct a series of Flume agents to dynamically transport your stream data & logs from your systems into Hadoop

PRODUCT SPECS

Details & Requirements

  • Length of time users can access this course: lifetime
  • Access options: web streaming, mobile streaming
  • Certification of completion not included
  • Redemption deadline: redeem your code within 30 days of purchase
  • Experience level required: all levels

Compatibility

  • Internet required

THE EXPERT

Steve Hoffman has 32 years of experience in software development, ranging from embedded software development to the design and implementation of large-scale, service-oriented, object-oriented systems. For the last 5 years, he has focused on infrastructure as code, including automated Hadoop and HBase implementations and data ingestion using Apache Flume. Steve holds a BS in computer engineering from the University of Illinois at Urbana-Champaign and an MS in computer science from DePaul University. He is currently a senior principal engineer at Orbitz Worldwide (http://orbitz.com/).

          Data Scientist   

          Data Scientist / Data Science / Data Analyst / Big Data   
My Financial Client is looking to take on board a few Data Scientists for an upcoming project. The contract length is 6 months with possible extension. They are looking for ...
          Data Science Platform Developer - Iodine Software - Austin, TX   
Team player DNA with a desire to solve for the interests of our internal clients and of our business. Passion for exploring, applying and following the...
From Iodine Software - Fri, 30 Jun 2017 00:28:17 GMT - View all Austin, TX jobs
          High paying entry level positions for recent IT graduates   

IT compensation and new hiring are up according to the latest IT Salary Survey by Janco

The top 7 highest-paying IT tech jobs for recent graduates, along with their median starting salary and the current number of open, entry-level positions across the nation.

  1. Data scientist - Median starting salary: $93,500
  2. Hardware engineer - Median starting salary: $90,000
  3. Software engineer - Median starting salary: $80,000
  4. Technology analyst - Median starting salary: $76,000
  5. Security engineer - Median starting salary: $74,200
  6. Process development engineer - Median starting salary: $73,000
  7. User experience design - Median starting salary: $72,000

          Top Paying IT Jobs   

2017 IT Salary Survey

There isn’t a lot of pay disparity due to location. Geographic location no longer dictates a significant increase or decrease in salary range. Candidates with in-demand tech skills can net top salaries regardless of region.

JOB TITLE: SALARY RANGE

  • CIO/CTO: $170,000 - $285,000
  • Demandware developer: $150,000 - $250,000
  • Chief information security officer: $145,000 - $250,000
  • DevOps lead/engineer: $115,000 - $250,000
  • Chief data officer: $162,000 - $228,000
  • Director PMO: $125,000 - $225,000
  • Data scientist: $130,000 - $210,000
  • Data architect: $130,000 - $210,000
  • Application security engineer: $125,000 - $210,000
  • Solutions architect: $140,000 - $200,000
  • Project manager: $90,000 - $200,000
  • Android developer: $90,000 - $200,000
  • iOS developer: $90,000 - $200,000

          Data Scientist, Advanced Analytics - LogRhythm - Boulder, CO   
Working hands-on/embedded with engineering teams to develop and rapidly deliver scalable, reliable, performant product that provides security value to end-users...
From LogRhythm - Fri, 24 Mar 2017 21:05:24 GMT - View all Boulder, CO jobs
          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Coterminal CS/Data Science Student Spicknall Selected as Microsoft Civic Tech Fellow   
Soren Spicknall, a coterminal student who is earning a B.S. in computer science and M.A.S. in data science, has been selected as a Microsoft Civic Tech fellow.
          Machine Learning/Data Scientist   

          Senior Fullstack JavaScript Developer - DataCamp - Leuven   
Job description Join our team Datacamp is building the future of data science education. Our students get real hands-on experience by completing self-paced, interactive data science courses from the best instructors in the world, right in the browser. In fact, over 1 million students around the world have completed nearly 50 million Datacamp exercises to date! The role We are looking for a talented Full-Stack Software Engineer (JavaScript) to help us craft web apps...
          Free Online Research Tools: Casetext   
 Casetext is one of a growing number of free online sources for legal research. Developed by attorneys, data scientists, and engineers, Casetext offers free access to over 10 million cases, statutes, and regulations, plus articles and commentary from leading litigators. Coverage includes all United States Supreme Court decisions, Circuit Court and District Court decisions from […]
          Manager, Quantitative Analytics/Data Science - Scotiabank - Toronto, ON   
Assess the impact of GL reconciliation results on capital and critical risk reports. Join the Global Community of Scotiabankers to help customers become better...
From Scotiabank - Thu, 29 Jun 2017 18:49:25 GMT - View all Toronto, ON jobs
          Data Scientist - SEI - Oaks, PA   
This team is responsible for extracting value from the vast IMS data store to improve efficiency and decision-making across a broad client base....
From SEI - Fri, 30 Jun 2017 17:36:16 GMT - View all Oaks, PA jobs
          Senior Data Architect - ASRC Federal - Moorestown, NJ   
Senior Data Architect. ASRC Federal Mission Solutions (AFMS) is seeking a Senior Data Scientist/ Architect to join its professional engineering team developing...
From ASRC Federal - Sat, 03 Jun 2017 01:36:31 GMT - View all Moorestown, NJ jobs
          We are looking for a colleague for a data scientist position. | Responsibilities: Interact with customers to underst...   
We are looking for a colleague for a data scientist position. | Responsibilities: Interact with customers to understand their requirements and identify emerging opportunities. • Take part in high-level and detailed solution design to propose solutions and translate them into functional and technical specifications. • Convert large volumes of structured and unstructured data using advanced analytical solutions into actionable insights and business value. • Work independently and provide guidance to less experienced colleagues/employees. • Participate in projects, closely work and collaborate effectively with onsite and offsite teams at different worldwide locations in Hungary/China/US while delivering and implementing solutions. • Continuously follow data scientist trends and related technology evolutions in order to develop knowledge base within team. | What we offer: To be a member of dynamically growing site and enthusiastic team. • Professional challenges and opportunities to work with prestigious multinational companies. • Competitive salary and further career opportunities. | Requirements: Bachelor's/Master's Degree in Computer Science, Math, Applied Statistics or a related field. • At least 3 years of experience in modeling, segmentation, statistical analysis. • Demonstrated experience in Data Mining, Machine Learning, additionally Deep Learning Tensorflow or Natural Language Processing is an advantage. • Strong programming skills using Python, R, SQL and experience in algorithms. • Experience working on big data and related tools Hadoop, Spark • Open to improve his/her skills, competencies and learn new techniques and methodologies. • Strong analytical and problem solving skills to identify and resolve issues proactively • Ability to work and cooperate with onsite and offsite teams located in different countries (Hungary, China, US) and time zones. • Strong verbal and written English communication skills • Ability to handle strict deadlines and multiple tasks. 
| More info and application here: www.profession.hu/allas/1033284
          Developing and Evaluating Digital Interventions to Promote Behavior Change in Health and Health Care: Recommendations Resulting From an International Workshop   
Devices and programs using digital technology to foster or support behavior change (digital interventions) are increasingly ubiquitous, being adopted for use in patient diagnosis and treatment, self-management of chronic diseases, and in primary prevention. They have been heralded as potentially revolutionizing the ways in which individuals can monitor and improve their health behaviors and health care by improving outcomes, reducing costs, and improving the patient experience. However, we are still mainly in the age of promise rather than delivery. Developing and evaluating these digital interventions presents new challenges and new versions of old challenges that require use of improved and perhaps entirely new methods for research and evaluation. This article discusses these challenges and provides recommendations aimed at accelerating the rate of progress in digital behavior intervention research and practice. Areas addressed include intervention development in a rapidly changing technological landscape, promoting user engagement, advancing the underpinning science and theory, evaluating effectiveness and cost-effectiveness, and addressing issues of regulatory, ethical, and information governance. This article is the result of a two-day international workshop on how to create, evaluate, and implement effective digital interventions in relation to health behaviors. It was held in London in September 2015 and was supported by the United Kingdom’s Medical Research Council (MRC), the National Institute for Health Research (NIHR), the Methodology Research Programme (PI Susan Michie), and the Robert Wood Johnson Foundation of the United States (PI Kevin Patrick). Important recommendations to manage the rapid pace of change include considering using emerging techniques from data science, machine learning, and Bayesian approaches and learning from other disciplines including computer science and engineering. 
With regard to assessing and promoting engagement, a key conclusion was that sustained engagement is not always required and that for each intervention it is useful to establish what constitutes “effective engagement,” that is, sufficient engagement to achieve the intended outcomes. The potential of digital interventions for testing and advancing theories of behavior change by generating ecologically valid, real-time objective data was recognized. Evaluations should include all phases of the development cycle, designed for generalizability, and consider new experimental designs to make the best use of rich data streams. Future health economics analyses need to recognize and model the complex and potentially far-reaching costs and benefits of digital interventions. In terms of governance, developers of digital behavior interventions should comply with existing regulatory frameworks, but with consideration for emerging standards around information governance, ethics, and interoperability.
          Director, Data Scientist - KPMG - Atlanta, GA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Tue, 16 May 2017 08:29:26 GMT - View all Atlanta, GA jobs
          Director, Data Scientist - KPMG - Santa Clara, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Santa Clara, CA jobs
          Director, Data Scientist - KPMG - Irvine, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:26 GMT - View all Irvine, CA jobs
          Data Scientist - IBM - Austin, TX   
Opportunities to implement machine learning into support business processes. Business process optimization....
From IBM - Tue, 06 Jun 2017 21:03:26 GMT - View all Austin, TX jobs
          Director, Data Scientist - KPMG - Seattle, WA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Seattle, WA jobs
          Data Scientist - Data Insights & Analytics - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 17 Jun 2017 08:59:04 GMT - View all São Paulo, SP jobs
          Data Scientist - Growth - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:08 GMT - View all São Paulo, SP jobs
          Data Scientist - RA - CSI - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:08 GMT - View all São Paulo, SP jobs
          Data Scientist - Mkt Place Routing - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:05 GMT - View all São Paulo, SP jobs
          Data Science with Python: Part Two   
The most complete book on the most promising discipline of the moment and the most mainstream programming language.
          Vision Zero Labs: Using Data Science to Improve Traffic Safety   
The central idea behind the global Vision Zero movement is that traffic crashes are preventable. At Microsoft, we believe that data science and complex machine learning can aid cities in their life-saving Vision Zero commitment. That’s why we partnered with Datakind in 2015, and since then, we’ve worked with them to use city-specific data to […]
          Financial Data Science Event In London Includes Hedge Fund Quantitative Analysts, Software Engineers   
International Business Times and Newsweek host their first Data Science in Capital Markets event this week (1st and 2nd March) at the Barbican in the City of London. A global audience of data scientists, quantitative analysts, and software engineers from hedge funds and investment banks is attending. Speakers include Wes McKinney, the open source Pandas libraries guru; Professor David Hand, chief scientific advisor at Winton; and Professor Steve Roberts, director of the Oxford-Man Institute.
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
The Senior Data Scientist provides leadership in implementation of advanced analytics models and solutions to yield predictive and prescriptive insights from...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Data Scientist   
Specializes in data science, analytics and architecture. Strong experience/knowledge on framing and conducting complex analyses and experiments using large volumes of complex (not always well-structured, highly variable) data. Ability to source, scrub, and join varied data sets from public, commercial, and proprietary sources and review relevant academic and industry research to identify useful algorithms, techniques, libraries, etc. Assists in efforts to centralize data collection and develop an analytics platform that drives data science and analytics capabilities. Deep domain experience in Apache Hadoop, data analysis, machine learning and scientific programming. Understands how to integrate multiple systems and data sets. Able to link and mash up distinctive data sets to discover new insights. Designing and developing statistical procedures and algorithms around data sources, recommending and building models for various data studies, data discovery and predictive analytics tasks, implementing any software required for accessing and handling data appropriately, working with developers to integrate and preprocess data for inputs into models and recommending tools and libraries for data science that are appropriate for the project.
Required Qualifications:
• 5-10 years of platform software development experience
• 3-5 years of experience with, understanding and knowledge of the Hadoop ecosystem and building analytic jobs in MapReduce, Pig, Hive, etc.
• 5 years of experience in SAS, R, Perl, Python, Java, or other languages appropriate for large scale analysis of numerical and textual data
• Experience developing static and interactive data visualizations.
• Strong knowledge of technical design and architecture principles.
• Creating large scale data processing systems.
• Driving design and code review process.
• Ability to develop and program databases, query databases and perform statistical analysis.
• Working with large scale warehouse and databases, sound knowledge of tuning and query processing.
• Excellent understanding of entire development process, including specification, documentation, quality assurance, debugging practices and source control systems.
• Ability to understand business issues as they impact the software development project.
• Solves complex, critical problems related to significant and unique issues.
• Ability to delve into large data sets to identify useful trends in business and develop methods to leverage that knowledge.
• Strong skills in predictive analytics, conceptual modeling, planning, statistics, visualization capabilities, identification of best data sources, hypothesis testing and data analysis.
• Familiar with disciplines such as natural language processing (the interactions between computers and humans) and machine learning (using computers to improve as well as develop algorithms).
• Writing data extraction, transformation, munging etc. algorithms.
• Developing end-to-end data flow from data consumption, organization and making it available via dashboard and/or APIs.
• Bachelor's degree in software engineering, computer science, information systems or equivalent.
• 5 years of related experience; 10 years of overall experience.
• Ability to perform activities, tasks and responsibilities described in the Position Description above.
• Demonstrated track record of architecting and delivering solutions with enterprise customers.
• Excellent people and communication skills.
• Processing complex, large scale data sets used for modeling, data mining, and research
• Designing and implementing statistical data quality procedures for new data sources
• Understanding the principles of experimental testing and design, including population selection and sampling
• Performing statistical analyses in tools such as SAS, SPSS, R or Weka
• Visualizing and reporting data findings creatively to provide insights to the organization
• Agile Methodology Experience
• Masters Degree or PhD
We are an equal employment opportunity employer and will consider all qualified candidates without regard to disability or protected veteran status.
          Data Visualization Analyst (Tableau) job, San Francisco   
<span>Modis is currently recruiting Data Visualization Analysts with Tableau for a very exciting opportunity in San Francisco, CA. &nbsp;This is a contract to hire opportunity with a stellar company! <br>&nbsp;<br>&bull; You should have at least 2 years of analytics experience <br>&bull; 1 year of Tableau experience. <br>&nbsp;<br>My client is seeking a Data Visualization (Tableau) analyst/developer to join their dynamic team. &nbsp;You&rsquo;ll work directly with the business and technology teams to develop compelling visual analytics that tell my client&rsquo;s story and drive business objectives. &nbsp;&nbsp;<br>&bull; This position will give you invaluable direct experience with some of the most exciting new technologies and practices in the fast paced and competitive world of data science and analytics. <br>&bull; Qualified candidates will be creative, methodical in their approach to problem solving, and detail oriented. Applicants should be excellent at assessing business partner&rsquo;s needs and finding ways to meet them, and can handle multiple projects simultaneously. <br>&nbsp;<br>** If this opportunity is for you please apply directly to this posting** <br>&nbsp;<br>Required Qualifications:<br>&bull; 1+ years experience in an analytical role<br>&bull; Bachelor&rsquo;s degree in a quantitative field or a social science field with a &nbsp;quantitative emphasis<br>&bull; Experience in data analysis. 
&nbsp;An emphasis in retail, marketing, e-commerce, internet advertising, SEO or SEM.<br>&bull; Basic familiarity with SQL<br>Preferred Qualifications:<br>&bull; 2+ years in an analytical role<br>&bull; 1+ years experience with Tableau<br>&bull; Working knowledge of SQL and database architecture<br>Essential Functions:<br>&bull; Build compelling, interactive dashboards in Tableau that answer key business questions.<br>&bull; Query Hadoop/Hive, Vertica, DB2, Teradata and other data sources directly to get the data you need to build visualizations.<br>&bull; Communicate requests to development teams to support appropriate staging and configuration of data to make dashboards more performant.<br>&bull; Meet directly with business users to understand and clarify reporting requirements.<br>&bull; Provide Tableau training to inexperienced business users.<br>Expectations:<br>&bull; You must be experienced with framing and attempting to solve analytical questions using data in order to further organizational goals.<br>&bull; You should be both technical and design oriented: you have experience working with data and you understand why color matters.<br>&bull; You should understand how to communicate effectively with multiple audiences, such as data architects and business leaders. &nbsp;You can modify your message according to your audience.<br>&bull; You have a demonstrated commitment to servicing your client.<br>&bull; You are a self-starter, driven and accountable.<br>&bull; You have excellent attention to detail.<br>&nbsp;<br>** If this opportunity is for you please apply directly to this posting**<br>&nbsp;<br>Thank you for your time and attention!<br>Andrea<br>&nbsp;<br></span>
          Data Analyst - Long term Position - Well-known Tech Co.   
This Data Analyst Position Features:
• Great Pay to $60K

Immediate need for a data analyst who can create daily, weekly, monthly, and annual trends for the company and all it encompasses. The ideal candidate will also be able to provide feedback to the Trends product and engineering team and support data scientists on larger data stories. A successful candidate will have at least two years of experience working directly with data to determine trends. A good understanding of SQL-based languages and experience with editorial work will help an individual succeed in this position. This position is like no other; it takes a special type of professional to execute what is needed. This organization is well-known, high-tech, and highly sought after. If you feel you are qualified and fit the mold, apply today!!

Minimum Qualifications:
BA/BS with a background in analytics and/or data journalism


We are an equal employment opportunity employer and will consider all qualified candidates without regard to disability or protected veteran status.
          Executive Director Engagement Activation - Verizon - Basking Ridge, NJ   
This discipline works closely with other internal teams (centralized media functions – i.e. SEM, Data Sciences), Demand Activation Leads, Experience, Creative...
From Verizon - Thu, 29 Jun 2017 10:58:38 GMT - View all Basking Ridge, NJ jobs
          Data Scientist / Machine Learning   
<span>Our client is ready to pay for an awesome Machine Learning person. &nbsp;My client is in San Francisco and is growing quickly. &nbsp;This is an opportunity for you to join a team of crazy smart experienced founders who have exited a total of 7 companies. &nbsp;Now is the time for you to be a part of something exciting!<br>&nbsp;<br>As a software engineer in Machine Intelligence you will be part of building a small team (3-4) collaborating closely with an experienced team in San Francisco. &nbsp;Their mission is to unlock the data and simplify the processes that are buried in legacy and SaaS enterprise software.<br>&nbsp;<br><B>Responsibilities</B><br><ul>
<li>Participate in cutting edge research in machine learning applications.</li><li>Develop solutions for B2B applications</li><li>Work on hierarchical rule system</li><li>Use graph theory </li></ul>
&nbsp;<br><B>Minimum qualifications</B><br><ul>
<li>BA/BS in Computer Science, related technical field or equivalent practical experience.</li></ul>
&nbsp;<br><B>Preferred qualifications</B><br><ul>
<li>MS or PhD degree in Computer Science, Artificial Intelligence, Machine Learning, or related technical field.</li><li>Strong background in Natural Language Processing or Computer Vision.</li><li>Experience coding in C, C++, Java, or Python.</li><li>Strong background in Machine Learning</li></ul>
&nbsp;<br>Apply now for immediate consideration!<br></span>
          Alibaba: Building a retail ecosystem on data science, machine learning, and cloud   
What does it take to compete in a global arena in which retail and cloud are increasingly intertwined? Domain-specific data science and machine learning for the masses, according to Alibaba.
          Data Scientist Principal Engineer -RELO to OH   
Akron, If you are a Data Scientist Principal Engineer with experience, please read on! Top Reasons to Work with Us 1. An awesome opportunity to work with Big Data & be part of high profile aviation projects for government & defense 2. We are privately held company, we continue to grow our Data Engineering team well into 2017 What You Will Be Doing Our Engineering team is developing multiple products for
          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          (USA-WA-Kent) Marketing Data Analyst   
Marketing Data Analyst Posted Date:Jun-30-2017 Job ID:7561 Job Type:Full Time Job Function:Marketing City:Kent State:Washington ------------------------------------------------------------------------ What's cool about this job This position also conducts specific analytics related to customer purchasing, lifecycle, and engagement behavior. As a member of the marketing analytics team, this position will be responsible for contributing to an environment of fact-based decision making, utilizing REI’s extensive supply of customer, campaign, site, and marketing channel data. This combination of sophisticated analytics with key reporting enables the Senior Marketing Analyst to contribute to the bottom line success of REI as a co-op. • Partner with internal stakeholders to ensure accurate reporting of marketing campaigns, including creation of detailed technical specifications containing relevant information such as versioning, testing, and back-end instructions. • Curate data sets to enable self-service BI within other marketing groups • Partner with statisticians and data scientists to make fact based decisions on marketing programs to increase profitability, retention, and engagement. • Work closely with Senior Statistical Analyst to give input on personalized marketing model development and improving model performance. • Provide accurate and timely reporting of campaign performance to workgroup and cross-divisional teams. • Perform ad-hoc analytics related to customer purchasing, engagement, and marketing attribution as needed • Act as subject matter expert on REI’s data warehouse. Perform system and data testing as necessary. 
• Identify strategic opportunities to maximize returns in pursuit of REI’s financial and brand goals • Utilize SQL and other enterprise tools to execute customer segmentation strategies aimed at increasing customer acquisition, retention, and reactivation as well as report and analyze campaign performance • Provide analytics support for forecasting business trends and guidance on risks to plans Bring your passion and expertise • BS or MS/MBA in Analytics, Marketing, Statistics, or equivalent work experience • 2-4 years’ experience in a CRM or analytics environment. Experience in retail or ecommerce is a plus • Strong mathematical / analytical proficiency • Passionate about data and discovering new trends and insights across customers, merchandise, and customer engagement • Deep SQL experience and strong ability to comprehend data-warehouse structure, Netezza experience is a plus • Familiarity with econometrics and other advanced statistical concepts, including application of regression-based modeling, is a plus • Report development and Business Intelligence application experience • Self-disciplined and motivated in a marketing environment that requires creativity and strong attention to detail • Ability to prioritize and work multiple concurrent requests from different business teams • Tableau desktop and server experience is highly desired • Excellent communication skills and ability to quickly and effectively communicate complex quantitative results to internal stakeholders • Experience with R or Python is preferred At REI we offer an enviable work environment that has been recognized on the "100 Best Companies to Work For" list since the award's inception – 20 years in a row! Sure, we work hard, but it’s balanced with time off to play—a strategy that works for us as we continue to grow and thrive. Want to enjoy a workplace where you can be yourself, be heard and be respected while having a job that challenges you? This is the place. 
With more than 140 retail locations (and growing), REI offers unique competitive benefits to its more than 12,000 employees, including healthcare, gear and apparel discounts, free equipment rentals and challenge grants to help employees reach personal outdoor goals, generous retirement plan contributions, public transit subsidy, adoptions assistance, paid sabbaticals, and more. REI is an Equal Opportunity Employer
          (USA-WA-Bellevue) Data Scientist   
Bing’s mission: To deliver the most relevant knowledge to our customers by being more than just a search engine – Bing’s goal is to be the decision engine. Data is critical to achieving that mission. At Bing, we have an enormous wealth of data, ranging from user interaction logs to web documents, from user feedback to system performance data. The Bing Advertiser Sciences Team is hiring extremely talented, highly motivated and productive individuals with expertise in the areas of: Computer Science, Machine Learning, Econometrics, Statistics, Modeling, Simulation and Data Mining. The team develops and applies advanced techniques to turn our petabytes of data into insights; and to drive actions based on those insights. The team works closely with partners across Microsoft’s Online Services Division to enable rigorous, effective, and data-driven decision making. Some examples of the challenges we face: •Modeling the dynamics of the paid search market •Understanding Advertiser value, lifecycle, opportunity and marketing objectives •Designing and analyzing the results of large-scale online experiments •Prototyping algorithms fundamental to managing and optimizing demand generation activities to support our search marketplace. At Bing, we offer a strong team environment, exciting applied research challenges, and a fun place to work. The work environment empowers you to have a real impact on Microsoft’s business, our advertiser partners, and millions of end users. This role is a unique opportunity to work with a world-class, interdisciplinary group of researchers, analysts, and developers. Job Responsibilities include: •Develop and manage analyses and algorithms that generate actionable insights and programs to improve Bing Ads demand generation activities including: increasing both long-term revenue and relevance. •Research and develop solutions for improving profits for Microsoft and returning value to the audience, advertisers and publishers (e.g. 
Ecosystem health, marketplace performance measurement, advertiser health, outlier detection, etc.). •Specific responsibilities include the following: Work with key business stakeholders to understand the underlying business needs and formulate, communicate and create buy-in for analytics approaches and solutions •Influence stakeholders to make product/service improvements that yield customer/business value by effectively making compelling cases through story-telling, visualizations, and other influencing tools. •Effectively communicate and translate Bing Ads business strategy and goals into discrete, manageable problems with well-defined measurable objectives and outcomes on which the Advertiser Sciences team can execute. •Transform formulated problems into implementation plans for experiments by developing data sources, applying/creating the appropriate methods, algorithms, and tools, as well as delivering statistically valid and reliable results •Contribute to an environment of scientific inquiry which reinforces team standards for analytic rigor that is consistent with the broader Microsoft data sciences community and strives to apply the simplest viable approach for experiments and analysis Qualifications: •A Bachelor’s degree in Data Science, Computer Science, Electrical Engineering, Machine Learning/AI or related fields. •Demonstrated experience in all phases of managing data science engagements including: problem definition, solution formulation and delivering measurable impact. •Experience with online data; experience with online-advertising data strongly preferred. •Knowledge and experience in at least three of the following areas: machine learning, data mining, user modeling, information retrieval (interrogation of log files and very large databases), economic modeling, econometrics, game theory, statistics, data analysis, e-metrics/measurement. 
•2+ years of experience in at least three of the following areas: machine learning, data mining, user modeling, information retrieval (interrogation of log files and very large databases), economic modeling, econometrics, game theory, statistics, data analysis, or e-metrics/measurement; 4+ years are preferred. •Experience with data analysis and statistical tools (e.g. Python, R, SAS, Matlab or SPSS). •Solid communications skills, both verbal and written. •Hands-on approach to data analysis and a strong focus on quality. •Ability to work independently and collaboratively in an interdisciplinary team environment Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to askstaff@microsoft.com. Data & applied sciences (engineering)
          (USA-WA-Redmond) PROGRAM MANAGER II   
The Universal Store Team is chartered with bringing to life a key component of the One Microsoft vision for the future – a Store to support scenarios across consumer and commercial, digital and physical, via multiple channels and storefronts. We are the in the midst of seismic changes in the Microsoft commerce landscape as we create an amazing experience for customers of all types. This is your opportunity to get in early to make waves and help Microsoft leap ahead to the future! The Store is powered by services, and our Marketplace Services team owns delivery of highly-scalable services across the Microsoft ecosystem, providing scale monetization, transaction and digital licensing capabilities across our most critical client experiences and content publishers. Marketplace Services powers commerce functionality for all digital content types (games, apps, video, music, subscriptions, etc.) across all Microsoft clients (Windows, Phone, Xbox, Web, Office). As we continue to transition toward One Microsoft and take a more holistic view of the opportunities for the Microsoft businesses and partners across all our channels (consumer and enterprise), a best-in-class marketplace is critical to our on-going success. This role is focused on driving key scenarios across the Universal Store to help enable all the above. You’ll be the front door and playmaker to many of the most critical scenarios for Xbox, Office, physical goods, and much more! You’ll own specific features and help drive the end-to-end experiences that are necessary to grow our business. 
To be successful in this role you must have the following qualifications: • Strong and demonstrated customer and partner empathy • Outstanding organizational and interpersonal skills demonstrated by previously working successfully across group boundaries, especially with engineering and business groups • Proven track record of delivering complex, high-scale systems that meet evolving business requirements • Strong written and verbal communication skills and a desire to create an open and collaborative team culture • You must thrive in a fast-paced work environment and demonstrate an ability to quickly come up to speed with new technologies Basic Qualifications: • 3+ years of program management experience • A BA or BS degree in computer science, engineering, math, physics, or business Preferred Qualifications: • Experience with data science or a degree in data science We’re a high-powered team with significant impact. If you can think big, want to join a fast-moving team that is breaking new ground in Universal Store, and you meet the qualifications above, we would like to meet you! Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. 
If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to askstaff@microsoft.com. Program management (engineering)
          Data Scientist - RBC - Toronto, ON   
Data Science allows us to better understand the implications of what information means, identify trends, anticipate future behaviours, perform pattern matching,...
From RBC - Fri, 30 Jun 2017 19:20:41 GMT - View all Toronto, ON jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Senior Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Lead Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Senior Cloud Engineer/Big Data Architect / Data Science - Corporate Technology - New York, NY   
Perform data analysis with business, understanding business process and structuring both relational and distributed data sets....
From Corporate Technology - Wed, 14 Jun 2017 16:40:06 GMT - View all New York, NY jobs
          Data Scientist - Consultants 2 Go - United States   
As Data Scientists, we work with business leaders to solve clients’ business challenges and improve clients’ marketing results....
From Consultants 2 Go - Tue, 27 Jun 2017 02:58:08 GMT - View all United States jobs
          Sr. Data Science Engineer - Adobe - San Jose, CA   
Develop predictive models on large-scale datasets to address various business problems through leveraging advanced statistical modeling, machine learning, or...
From Adobe - Fri, 26 May 2017 06:25:59 GMT - View all San Jose, CA jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
This position leads junior team members involved in advanced analytics activities and tasks. Senior Data Scientist....
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          CareSet Labs Releases Next-Gen Medicare Doctor Referral Dataset   
CareSet Labs has announced the release of Root NPI Graph, a new version of the “shared Medicare patients in time” provider teaming dataset. The Root NPI graph is the next-generation version of the Doctor Referral teaming dataset commonly available from Medicare. The dataset can be used by data scientists, researchers and innovators including pharmaceutical companies ...
          Data Scientist - Signature Science, LLC - Austin, TX   
MapReduce, Hadoop) as well as databases (Amazon AWS, MongoDB, Cassandra). 15-0904-01_CHO/DC Data Scientist....
From Signature Science, LLC - Thu, 29 Jun 2017 06:24:23 GMT - View all Austin, TX jobs
          What we learned in A Coruña: conclusions on Data Science   
We have just held in A Coruña, at the Afundación headquarters, the first Data Science Summit of the CorBI Foundation (Coruña Biomedical Research Institute), with which the CorBI Foundation sought to open a forum for discussion on the relevance of Big Data processing in the fields of neuroscience and change [...]
          Data Science Apprentice - Predictive Science - United States   
Having additional data entry job experience, being a fast and accurate typist, and familiarity with MS Office and other data programs is a great plus....
From Predictive Science - Sat, 11 Mar 2017 08:05:32 GMT - View all United States jobs
          Making Data Governance "Agile" Again   

By Dennis D. McDonald

Introduction

In "Can data governance be agile?", Nancy Couture identifies the most important first step in establishing an "agile" data governance program:

An alternative, and more agile approach, is to identify smaller data governance initiatives based on strategic projects or business needs, and build from there. In this way, the organization can keep everyone apprised of progress and decisions, but the work effort is limited and focused. And the planned business value can be realized more quickly, thus increasing interest. 

The First Step

The importance of this first step, especially the part about "... based on strategic projects or business needs," shouldn't be underestimated.

Since beginning my own research into big data project management over a year ago, I have realized that data governance shares a lot with many other organizational priorities that generate a call for a comprehensive and strategic approach. Data, when looked at as a resource, permeates the organization. When examining how best to manage data comprehensively, there's a tendency to suggest a wide-ranging solution that might be needed but that unfortunately might be doomed to failure.

No More Ocean Boiling

The prospect of failure exists partly because the day of the "boil the ocean" approach to massive enterprise level projects is long gone, especially in organizations that are legacy-system dependent and heavily siloed. Managing comprehensive programs that involve significant changes to technology, culture, and business processes requires a great deal of support and management skill, time, attention -- and money.

Also needed is the realization that change is inevitable and plans that attempt to plan too much inevitably have to be changed -- this is one of the reasons for the popularity of the "agile" movement which focuses on delivering value more quickly in manageable chunks.

Needed: Both Tactics and Strategy

As Couture suggests, it makes more sense to start off with something that's (a) important and (b) definable.

This doesn't mean you shouldn't think strategically when faced with "upping" your data governance game. When thinking about data governance it pays to think both strategically and tactically. We want to deliver value in the short term while at the same time providing a foundation for the future. But how?

An Approach

The following is based on ongoing discussions with consulting colleagues William Moore and Mark Bruscke. Will has evolved a structured approach to data governance that emphasizes how language and terminology are used. Mark is an experienced data architect with extensive enterprise-level experience. My own focus is on IT strategy and project management, especially in data-intensive projects.

What follows are some of the actions we recommend to address both the tactics and strategy associated with improved data governance:

  1. Establish Data Governance Management Structure
  2. Understand Language and Terminology
  3. Manage Metadata
  4. Perform Data Stewardship

1. Establish Data Governance Management Structure

Even for "tactical" projects focusing on data governance applied to selected high-importance problems or issues, key stakeholders must be represented including not only IT but potentially all groups impacted by how data are managed. This might include groups as diverse as Legal, Marketing, Customer Service, Research, Data Science, Sales, and Administration. Any group that might somehow be directly or indirectly associated with producing, managing, or using data should be considered for some sort of stakeholder role.

Sometimes management of the data governance process is formalized in a "data governance council" which oversees policies and practices regarding data and metadata. This may not be appropriate for short term or tactically focused projects managed following an agile process. Eventually, though, such a "council" will have to set policy and even resolve disputes when different groups or systems use different terminology to refer to what appears to be the same concept. Starting such a group requires settling on a data governance council structure (e.g., centralized, federated, hub-and-spoke, etc.), establishing and sharing a group charter, recruiting members, and overseeing the organized data governance operation.

Initially a "data advisory council" should include, even in a tactically focused short term project, those who are best able to articulate requirements and user stories concerning data associated with the problem or issue being targeted.

Given the potential need for experimentation at this stage, the team must be ready to change as requirements (and user stories) evolve. Flexibility and collaboration are needed. The more focused the initial targets are, the better able the team will be to share information and make decisions.

2. Understand Language and Terminology

Whether dealing with a short term tactically-focused project or a longer term project with strategic implications for how the organization manages and uses data, participants need to understand how language and terminology are managed and used in relation to the specific problems or issues being addressed.

A structured and open process will be needed to identify and document the most important concepts, regardless of the variety of ways people and machines refer to them.

Moore calls these "key business attributes." The process of discovering, defining, and modeling them is "business attribute analysis."

The basic approach for identifying key business attributes involves not only reviewing how data and metadata are currently defined but also analyzing how important business concepts are referred to in the organization's documentation, emails, reports, and other communication media. This is one of the reasons why initial steps need to focus on well-defined projects or issues: scope control.

One output of this collaborative business attribute analysis, carried out with support of the data governance team, is a high level model of key business attributes and how they relate to the problem or issue that's initially being targeted. Two important questions to address during this process are:

  1. What do we know now about solving this problem or making this decision?
  2. What do we need to know to solve this problem or make this decision?

The business attribute model that emerges from this analysis will surely evolve but will establish the basis for the next step by identifying what data need to be managed.

3. Manage Metadata

Our goal here is to create a metadata repository that catalogs key data concepts (see above) and how these are expressed and used throughout the organization’s business processes, systems, and databases.

For a particular problem domain defined by the process or issue being addressed, we create an inventory and catalog of target data related to the key business attributes identified above. We document where and how these data are used and who is responsible for them.

Our basic approach is to review all relevant business attributes, data, definitions, decision rules, data models, and the manner in which data are linked, expressed, and used throughout the targeted problem domain.

Process and information are what matter here, not the tools used. To proceed quickly at this stage, we recommend using existing collaboration, database, and document management tools. Accessibility and ease of use are major concerns. We want to move quickly and deliberately but not in secret.

The primary deliverable of this process is an evolving catalog that supports data governance, consistency, data transformation requirements, and (where necessary and justified) intelligent standardization.
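As a rough illustration of what one entry in such a catalog might hold, here is a minimal Python sketch. It assumes each key business attribute is recorded with a definition, a responsible steward, known aliases, and the places the data appear. The class names, field names, and sample values are all hypothetical and are not taken from any particular tool.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """One key business attribute and where it lives (all names hypothetical)."""
    attribute: str    # agreed business term
    definition: str   # plain-language definition
    steward: str      # person or group responsible for the data
    aliases: set = field(default_factory=set)      # other terms for the same concept
    locations: list = field(default_factory=list)  # systems/fields where the data appear


class MetadataCatalog:
    """A tiny in-memory catalog keyed by attribute name."""

    def __init__(self):
        self._entries = {}

    def add(self, entry):
        self._entries[entry.attribute.lower()] = entry

    def resolve(self, term):
        """Return the entry whose name or alias matches term, or None."""
        key = term.lower()
        for entry in self._entries.values():
            names = {entry.attribute.lower()} | {a.lower() for a in entry.aliases}
            if key in names:
                return entry
        return None


catalog = MetadataCatalog()
catalog.add(CatalogEntry(
    attribute="Customer",
    definition="A person or organization that has purchased at least once.",
    steward="Sales Operations",
    aliases={"client", "account holder"},
    locations=["crm.accounts.acct_id", "billing.customers.customer_no"],
))

# Different groups may use different words for the same concept.
entry = catalog.resolve("Client")
print(entry.attribute, "->", entry.steward)
```

In practice the same information could just as well live in a shared spreadsheet or wiki page; the point of the sketch is only the shape of the record, not the technology.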

Most of all we want to clearly and transparently document how data are used in machine-to-machine, machine-to-human, and human-to-human communication. And we want the systems and processes used here to be reusable and, where possible, scalable.

4. Perform Data Stewardship

We need to establish a dedicated and staffed data stewardship process that works across and supports the systems and processes described in (1), (2), and (3).

The Data Steward provides the "boots on the ground" in the data governance process. Our role as consultants is to help establish the above systems and processes and to function initially as the client's data stewards while simultaneously mentoring others so the client can manage its own data governance and stewardship processes and systems.

Once a specific problem or issue is addressed and improvements identified for how data and metadata should be governed and used to address that problem, the processes and procedures we have gone through are represented as documented processes that can then be adapted for the next problem.

Conclusions

In summary we recommend:

  1. Start with a well defined data-dependent problem or issue.
  2. Move quickly but stay disciplined.
  3. Be collaborative and transparent.
  4. Treat this as a learning process; knowing in advance what benefits better data and analytics will bring is impossible.
  5. Keep track of costs.
  6. Don't just focus on existing technology and structured data.
  7. Keep management informed and involved.
  8. Document what can be done better next time.
  9. Focus on detail but keep the big picture in mind.
  10. Use collaboration and transparency to overcome resistance, not hierarchy and authority.

Copyright (c) 2017 by Dennis D. McDonald. Interested in applying these ideas to your own organization's data governance? Contact me in Alexandria Virginia at 703-402-7382 or by email at ddmcd@ddmcd.com.

          Data Scientist - Data Insights & Analytics - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 17 Jun 2017 08:59:04 GMT - View all São Paulo, SP jobs
          Data Scientist - Growth - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:08 GMT - View all São Paulo, SP jobs
          Data Scientist - RA - CSI - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:08 GMT - View all São Paulo, SP jobs
          Data Scientist - Mkt Place Routing - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:05 GMT - View all São Paulo, SP jobs
          Data Scientist - ACI Worldwide - India   
The Data Scientist & Data Warehouse Expert will have experience in Oracle Business Intelligence Enterprise Edition and will lead initiatives to develop and...
From ACI Worldwide - Fri, 30 Jun 2017 08:47:09 GMT - View all India jobs
          Data Science Intern - Fall 2017 - NVIDIA - Santa Clara, CA   
Carry out independent research with the goal of furthering the understanding of gaming behavior and development of quantitative tools for marketing planning and...
From NVIDIA - Sat, 24 Jun 2017 01:33:58 GMT - View all Santa Clara, CA jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
The Senior Data Scientist provides leadership in implementation of advanced analytics models and solutions to yield predictive and prescriptive insights from...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Data Scientist, Data & Analytics - KPMG LLP - Toronto, ON   
Working at KPMG allows you to gain valuable on-the-job experience while building your professional network and business acumen. Being part of the KPMG Global
From KPMG LLP - Tue, 13 Jun 2017 21:52:29 GMT - View all Toronto, ON jobs
          Data Architect - Data Scientist - JP Morgan Chase - Columbus, OH   
Build the firm’s business metadata repository in partnership with the Chief Data Office. Adopting industry leading technologies to support best-in-class...
From JPMorgan Chase - Wed, 21 Jun 2017 13:42:24 GMT - View all Columbus, OH jobs
          Data Scientist   
NSW-Sydney CBD, My client is an established analytical services unit within an organisation focusing on medical research and patient care in Sydney, partnering with some of the largest medical organisations. They are currently transforming their predictive analytics capability to build a "fit-for-purpose" solution to their growing needs. We are looking for a Data Scientist to help push the boundaries in a space that
          Software Developer - Data Science/Machine Learning - Leidos - Hanover, MD   
Java/JEE, JavaScript, Java Expression Language (JEXL), J1BX, Flex, EXT - JS, JSP, .NET, AJAX, SEAM, C, C++, PHP, Ruby / Ruby-on-Rails, SQL, MS SQL Server, MySQL...
From Leidos - Thu, 22 Jun 2017 10:40:48 GMT - View all Hanover, MD jobs
          Data Scientist   

          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Data Scientists and Hadoop Engineers (multiple levels)   

          Senior Analyst, Data Science - Prudential - Newark, NJ   
And a passion for generating business impact. Develop and maintain consultative relationships with key business stakeholders....
From Prudential - Wed, 28 Jun 2017 23:45:06 GMT - View all Newark, NJ jobs
          Senior Specialist, Data Science - Prudential - Newark, NJ   
And a passion for generating business impact. Develop and maintain consultative relationships with key business stakeholders....
From Prudential - Tue, 30 May 2017 20:29:44 GMT - View all Newark, NJ jobs
          Specialist, Data Science - Prudential - Newark, NJ   
Identify analytical solutions for business problems. And a passion for generating business impact. Develop and maintain consultative relationships with key...
From Prudential - Tue, 30 May 2017 20:29:44 GMT - View all Newark, NJ jobs
          Developing and Evaluating Digital Interventions to Promote Behavior Change in Health and Health Care: Recommendations Resulting From an International Workshop   
Devices and programs using digital technology to foster or support behavior change (digital interventions) are increasingly ubiquitous, being adopted for use in patient diagnosis and treatment, self-management of chronic diseases, and in primary prevention. They have been heralded as potentially revolutionizing the ways in which individuals can monitor and improve their health behaviors and health care by improving outcomes, reducing costs, and improving the patient experience. However, we are still mainly in the age of promise rather than delivery. Developing and evaluating these digital interventions presents new challenges and new versions of old challenges that require use of improved and perhaps entirely new methods for research and evaluation. This article discusses these challenges and provides recommendations aimed at accelerating the rate of progress in digital behavior intervention research and practice. Areas addressed include intervention development in a rapidly changing technological landscape, promoting user engagement, advancing the underpinning science and theory, evaluating effectiveness and cost-effectiveness, and addressing issues of regulatory, ethical, and information governance. This article is the result of a two-day international workshop on how to create, evaluate, and implement effective digital interventions in relation to health behaviors. It was held in London in September 2015 and was supported by the United Kingdom’s Medical Research Council (MRC), the National Institute for Health Research (NIHR), the Methodology Research Programme (PI Susan Michie), and the Robert Wood Johnson Foundation of the United States (PI Kevin Patrick). Important recommendations to manage the rapid pace of change include considering using emerging techniques from data science, machine learning, and Bayesian approaches and learning from other disciplines including computer science and engineering. 
With regard to assessing and promoting engagement, a key conclusion was that sustained engagement is not always required and that for each intervention it is useful to establish what constitutes “effective engagement,” that is, sufficient engagement to achieve the intended outcomes. The potential of digital interventions for testing and advancing theories of behavior change by generating ecologically valid, real-time objective data was recognized. Evaluations should include all phases of the development cycle, be designed for generalizability, and consider new experimental designs to make the best use of rich data streams. Future health economics analyses need to recognize and model the complex and potentially far-reaching costs and benefits of digital interventions. In terms of governance, developers of digital behavior interventions should comply with existing regulatory frameworks, but with consideration for emerging standards around information governance, ethics, and interoperability.

          Data Scientist - get PAID to work on passion projects!   
Seattle, Would you like the opportunity to get paid to enhance your data science skills with remote work for a portion of the year? This role is a mix of 50% co-teaching (on-site), 25% supporting business operations, and 25% enhancing your data scientist skills in any number of ways (write a book, enroll in courses, pursue a passion project, etc.)! This is a full time/direct hire opportunity to work for a
          Nonnegative Factorization of a Data Matrix as a Motivational Example for Basic Linear Algebra. (arXiv:1706.09699v1 [math.HO])   

Authors: Barak A. Pearlmutter, Helena Šmigoc

We present a motivating example for matrix multiplication based on factoring a data matrix. Traditionally, matrix multiplication is motivated by applications in physics: composing rigid transformations, scaling, shearing, etc. We present an engaging modern example which naturally motivates a variety of matrix manipulations, and a variety of different ways of viewing matrix multiplication. We exhibit a low-rank non-negative decomposition (NMF) of a "data matrix" whose entries are word frequencies across a corpus of documents. We then explore the meaning of the entries in the decomposition, find natural interpretations of intermediate quantities that arise in several different ways of writing the matrix product, and show the utility of various matrix operations. This example gives the students a glimpse of the power of an advanced linear algebraic technique used in modern data science.
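To make the idea of a low-rank non-negative factorization concrete, the following is a minimal sketch in Python with NumPy. It uses Lee-Seung multiplicative updates, one common way to compute an NMF; the abstract does not say which algorithm the authors use, so this is an assumption, and the small word-by-document matrix is made-up data for illustration only.

```python
import numpy as np


def nmf(V, rank, iters=500, seed=0, eps=1e-9):
    """Approximate nonnegative V (words x documents) as W @ H with W, H >= 0.

    Uses Lee-Seung multiplicative updates for the Frobenius-norm objective.
    """
    rng = np.random.default_rng(seed)
    n_words, n_docs = V.shape
    W = rng.random((n_words, rank)) + eps   # word-by-topic weights
    H = rng.random((rank, n_docs)) + eps    # topic-by-document weights
    for _ in range(iters):
        # Each update rescales entries by a nonnegative ratio, so W and H
        # stay nonnegative throughout.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Toy word-count matrix (hypothetical): rows are words, columns are documents.
# Documents 0-1 lean toward one topic, documents 2-3 toward another.
V = np.array([
    [4.0, 3.0, 0.0, 0.0],   # "matrix"
    [2.0, 2.0, 0.0, 1.0],   # "vector"
    [0.0, 0.0, 5.0, 4.0],   # "goal"
    [0.0, 1.0, 3.0, 3.0],   # "team"
])

W, H = nmf(V, rank=2)
rel_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print("W (words x topics):\n", W.round(2))
print("H (topics x documents):\n", H.round(2))
print("relative reconstruction error:", round(float(rel_error), 3))
```

Each column of W can be read as a "topic" expressed as word weights, and each column of H says how much of each topic a document contains, which is the kind of interpretation the abstract refers to.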


          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Ch. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Principal Data Scientist - (Wellesley Hills)   
Working at Aetna: The Value to You. What does it mean to work at Aetna? A lot. From programs and benefits that support your financial, physical, and emotional health to opportunities to build your knowledge and expand your career, the company makes working here a valuable experience in many ways.

Aetna's Data Analytics team is focused on delivering strategically impactful products to our internal customers by building analytically based solutions that integrate a wide range of internal and external large datasets within a cutting-edge Hadoop parallel processing environment. We are currently seeking a Principal Data Scientist in our Hartford, CT; Wellesley, MA; or New York, NY location. This position will be responsible for leveraging advanced statistical predictive modeling to evaluate scenarios and make predictions on future outcomes. Analyzes very large data sets in real-time databases and develops and implements mathematical approaches.

Position Summary: This is a unique opportunity to develop new approaches leveraging the latest cutting-edge big data technologies. The successful candidate will provide strategic leadership for the development, validation, and delivery of algorithms, statistical models, and reporting tools. Acts as the analytic team lead for highly complex projects involving multiple resources and tasks, providing individual mentoring in support of company objectives.

Fundamental Components: Leads development and execution of new and/or highly complex algorithms and statistical predictive models and determines analytical approaches and modeling techniques to evaluate potential future outcomes. Establishes analytical rigor and statistical methods to analyze large amounts of data using advanced statistical techniques and mathematical analyses. Methods will be implemented in Hadoop and R using advanced technologies. Manage highly complex analytical projects from data exploration, model building, performance evaluation, testing. Applies in-depth knowledge of systems and products to consult and advise on additional efforts across organization/enterprise. Motivates team members and probes into technical details and mentors others to do the same. Provides thought leadership and direction for analytic solutions, tools, and studies. Anticipates and solves strategic and high-risk business problems with broad impact on the business area by applying leading-edge theories and techniques to investigate problems, detect patterns, and recommend solutions. Provides guidance to develop enterprise-wide analytics strategy and roadmap. Interacts with internal and external peers and management to share highly complex information/solutions related to areas of expertise and/or to gain acceptance of new or enhanced technology/business solutions.

Required Skills, Background/Experience: years of progressively complex related experience. Technical background with modeling and programming. Experience in SAS or SQL or other programming languages. Advanced, in-depth specialization in mathematical analysis methods, predictive modeling, statistical analyses, machine learning, and big data technologies such as Python, R, and Hadoop. Demonstrated ability to communicate technical ideas and results to non-technical clients in written and verbal form. Comprehensive knowledge of health care industry products, systems, business strategies, and products, and experience in the healthcare industry is preferred. Strong organizational, management, and leadership skills.

Education: Master's degree; Ph.D. preferred.

Additional Job Information: Aetna continues to build a world-class Data Science organization to capture data, understand context, generate insights, and react in real time. We engage our business partners, providing solutions to improve the consumer experience, increase efficiencies, and optimize health outcomes for our members through leveraging cutting-edge technology. Aetna is about more than just doing a job. This is our opportunity to re-shape healthcare for America and across the globe. We are developing solutions to improve the quality and affordability of healthcare. What we do will benefit generations to come. We care about each other, our customers, and our communities. We are inspired to make a difference, and we are committed to integrity and excellence. Together we will empower people to live healthier lives.

Aetna is an equal opportunity/affirmative action employer. All qualified applicants will receive consideration for employment regardless of personal characteristics or status. We take affirmative action to recruit, select, and develop women, people of color, veterans, and individuals with disabilities. We are a company built on excellence. We have a culture that values growth, achievement, and diversity and a workplace where your voice can be heard.

How to apply: Please use the Apply link to apply to this position. If you cannot view this link, you can find this position on our website by visiting www.aetna.com/working, click on "apply online," "search openings," and enter BR in the keyword field. Additional information on what it's like to work for Aetna and more can also be found on our website. We value leadership, creativity, and initiative. If you share those values and a commitment to excellence and innovation, consider a career with Aetna. Aetna does not permit the use of tobacco-related products or drugs in the workplace. Aetna is an EO/AA Employer: Minorities/Women/Veterans/Disability. No search firms, please. You will not be asked for personal information until you have been fully evaluated through Aetna's screening processes. Aetna will never ask applicants for money. If you want to verify the identity of someone who contacts you from Aetna, call AETNAHR.

Source: http://www.juju.com/jad/000000009qxk2z?partnerid=af0e5911314cbc501beebaca7889739d&exported=True&hosted_timestamp=0042a345f27ac5dc0413802e189be385daf54a16310431f6ff8f92f7af39df48
          Data Science Engineer - Performance Advertising - A9.com - Palo Alto, CA   
At least 2 years applying Machine Learning techniques to solve business problems. Be a member of the Amazon-wide Machine Learning Community, participating in...
From A9.com - Tue, 20 Jun 2017 05:00:34 GMT - View all Palo Alto, CA jobs
          Data Science Performance Engineer, Search Platform - A9.com - Palo Alto, CA   
At least 2 years of experience applying Machine Learning techniques to solve business problems. Work with Amazon Web Services to improve their machine learning...
From A9.com - Thu, 15 Jun 2017 20:40:02 GMT - View all Palo Alto, CA jobs
          Data Sciences Engineer - Sponsored Products - A9.com - Palo Alto, CA   
Be a member of the Amazon-wide Machine Learning Community, participating in internal and external Meetups, Hackathons and Conferences....
From A9.com - Wed, 03 May 2017 01:36:05 GMT - View all Palo Alto, CA jobs
          Offer - Urgent Oracle SQL and PLSQL Developer for Trivandrum - INDIA   
VINIRMA Consulting Pvt. Ltd. is a 360-degree Human Resource Management Consulting and Staffing Services Organization with operations in UAE, Qatar, Bahrain, Australia, USA, Singapore & India. VINIRMA Consulting is currently looking for an Oracle SQL and PLSQL Developer for one of our clients, a leading organization in Trivandrum, with the following skill set and terms and conditions. Skill set required: extensive knowledge of Oracle PL/SQL; extensive knowledge of SQL; hands-on experience in performance tuning of SQL and PL/SQL. In addition to the above, at least one of the following skills is also required: exposure to Data Warehousing and related tools (OWB, Golden Gate, Pentaho, etc.); work experience with any Data Warehousing reporting tool (Business Objects, Cognos, Qlik, Micro Strategy, etc.); work experience in data migration; exposure to data science (Python and R). We are looking only for candidates from Trivandrum who can attend a face-to-face interview with the client. Experience required: 2 - 5 years. Terms and conditions: Joining time frame: 2 weeks (maximum 1 month). The selected candidates shall be direct employees of one of the leading organizations in Trivandrum. Should you be interested in this opportunity, please send your latest resume in MS Word format at the earliest to sreejith.murali@vamsystems.com or call +91 471 2766011.
          Cyber Computer Scientist II/ Data Scientist - Battelle - Columbus, OH   
Experience with high level software languages (Python preferred, or demonstrated Java, C#, C++, Go, Haskell, Rust). Battelle is guided by a founding mission....
From Battelle - Wed, 14 Jun 2017 10:30:07 GMT - View all Columbus, OH jobs
          Instacart Data Science with Jeremy Stanley   
Instacart is a grocery delivery service. Customers log onto the website or mobile app and pick their groceries. Shoppers at the store get those groceries off the shelves. Drivers pick up the groceries and drive them to the customer. This is an infinitely complex set of logistics problems, paired with a rich data set given by the popularity of Instacart. Jeremy Stanley is the VP of data science for Instacart.



          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Senior Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Lead Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Summary of the SQLDay 2017 Conference   

On 15-17 May 2017, the SQLDay 2017 conference took place at the Wrocław Conference Center. It was organized by the SQL Server Users Association PLSSUG (new name: Data Community).

More than 800 people attended the event, and several dozen speakers presented more than 60 technical sessions on Microsoft Data Platform solutions.
The speakers included lecturers from WSZiB's postgraduate programs: Tomasz Libera, Michał Sadowski, Grzegorz Stolecki, and Marcin Szeliga.

A new feature of this year's SQLDay, already the ninth edition of the conference, was the APPLIED DATA SCIENCE scientific track. Our school was a patron of the event, and the Dean, Dr. Bartosz Banduła, sat on the Program Council alongside representatives of several other universities, including Politechnika Poznańska, Politechnika Lubelska, Akademia Górniczo-Hutnicza, and Uniwersytet Jagielloński.

As in previous years, WSZiB students worked as volunteers during the conference.

WSZiB supports the Data Community organization by providing lecture rooms for its monthly meetings, during which participants share with one another.

Conference website:
http://www.sqlday.pl

Scientific track:
http://science.sqlday.pl


          Comment on The Big Data Continuum: From Data Scientists to Empowered Business People by Goodbye Don Draper, Hello Big Data: An EMA Report on Modern Analytics   
[…] and be understood by line-of-business professionals, not just data scientists. (See more on this subject from RTInsight’s expert Connie […]
          Senior Cloud Engineer/Big Data Architect / Data Science - Corporate Technology - New York, NY   
Perform data analysis with business, understanding business process and structuring both relational and distributed data sets....
From Corporate Technology - Wed, 14 Jun 2017 16:40:06 GMT - View all New York, NY jobs
          Seattle's Tech Community Is Fighting Trump One Piece of Code at a Time   
From the Airport Lawyer app to hackathons, Seattle's programmers are resisting the best ways they know how. by Amber Cortes

If you happened to be walking through Westlake Park on the afternoon of March 1, you may have missed the Seattle-area "Tech Employees for Diversity and Inclusion" protest. It was only a handful of people holding signs and chanting: "No hate, no fear, everyone is welcome here!"

On Facebook, 334 people said they were interested in the event, 74 marked themselves as "going," but there couldn't have been more than a couple dozen people who actually showed up. And compared to the Women's March, which drew 175,000 people in Seattle alone, and the immigration rights protests that had sprung up recently in response to the new administration, it was sort of a sorry sight.

"The problem with this industry," explains Raine Dargis, "is that there are too many introverts. We need extroverts to organize a protest." Dargis, one of the organizers (and a self-described introvert), is a software engineer at what she calls "a big company"—but she won't say its name.

The big tech companies, for their part, have been fighting Trump on the legal front over immigration and visas—two issues that affect their ability to hire skilled workers in the tech field. Apple, Twitter, Facebook, Microsoft, and Google were among the 127 companies that filed court papers against Trump's executive order on immigration February 6. And tech companies like Microsoft are looking for ways to find exceptions to Trump's crackdown on H-1B visas. But oftentimes, tech employees themselves can't associate any political causes with the companies where they work (for instance, workers were asked not to wear any company logos to the protest).

Despite the disappointing turnout, Dargis says, the tech community—not the companies themselves, but the people who work there—is well-positioned to fight Trump, if they could ever get out from behind their computers. But maybe for them, the best way to fight Trump is from their keyboards.

In Seattle, a decidedly liberal-minded sanctuary city, resistance has come from many quarters, some of which have been people coding up a storm—making apps to connect immigrants with legal help, saving valuable data from being destroyed, and helping make sense of the latest onslaught of confusing, rapid-fire Trump news.

But can a community made up of the same people who brought you delivery drones and perfected the art of getting everything you need with a push of a button without ever having to leave your house deal with the messy realities of going beyond the screen to make things happen IRL? Can the tech community, with its privilege and disposable income, actually connect with grassroots movements to make the tools we need to help defeat Trump?



Among the waves of weary travelers making their way through Sea-Tac Airport on an early Saturday morning stands a well-dressed but unassuming man named Hussain Rachou. The look on his face ranges from excited to deeply anxious, as he switches between scanning who is coming up the escalators to checking his phone.

"My mother. My sister from Germany. Friend from Eugene," he says, almost flinching every time his phone alerts him to a new message, and then sighing when it's not the text he is expecting. "They all want to know what is happening."

Rachou is a Syrian refugee who is just minutes away from reuniting with his family—a wife and two sons, ages 8 and 9—after two years apart. By his side is lawyer Takao Yamada, who has been at the airport since 7 a.m. and is now digging through his contacts to find out what the delay is. He is reassuring Rachou that a more than two-hour wait for Syrian refugees coming out of customs with seven suitcases is, in fact, to be expected.

In the days and weeks after the smoke cleared from the so-called "Muslim ban" protests at Sea-Tac, Yamada noticed that it was difficult to match lawyers with travelers who arrived at the airport and were at risk of being detained and sent back.

Even though it looks like this particular refugee story will have a happy ending, many don't. Which is why Yamada invented Airport Lawyer—an app where travelers or their families can submit contact and flight-arrival information ahead of time so on-site lawyers can be arranged. Yamada had been wanting to do something substantial for the struggle since election night—when he had been serving as election protection for the Clinton campaign, monitoring the polls as a legal field volunteer. When the polls closed, Yamada drove back to the hotel thinking Clinton had won. When he got to his room, alone, it was a different story. "Needless to say, I had a lot of time to think," he says.

He teamed up with the International Refugee Assistance Project and assembled a small group to help build the app over a weekend. Now Airport Lawyer is used in 30 airports across the country, and he's working with Sea-Tac Airport to display more permanent signage.

For Yamada, who had members of his family put into Japanese internment camps, the struggle against Trump's immigration ban is personal. "Singling out people is very dangerous to me," he says.

In addition to being a lawyer, Yamada is an entrepreneur with a background in politics and policy work—he was the deputy campaign manager for Judge Mary Yu, once owned a restaurant in Philadelphia, and is the cofounder of a tech start-up that is creating a digital trading platform for the cannabis industry.

Airport Lawyer, Yamada says, is a "crisis agnostic tool"—perfect for the resistance because it was built quickly and is easy to replicate over and over in different situations—things the tech industry does well and can use as ammunition against Trump.

"Tech workers will be building the weapons we're going to use to fight this administration," he says, "and being able to marshal that force is something that I think the start-up community can do really effectively." But one challenge, Yamada says, is tech's penchant for perfectionism.

"Engineers and developers want to build really great things. But sometimes we don't need really great things," he says. "Sometimes we need really simple, shitty things that are going to work."

Yamada is turning his attention to other projects as well—creating "tools that will help turn interest into action through technology," like Democratizer, an app that helps activists find protests and actions around issues like civil rights, women's rights, and the environment. These tools, he says, maximize the impact that the tech community can wield in the opposition landscape.

"Take the dozen people who helped make Airport Lawyer in 48 hours—they'll make no difference at a protest. Not to say they shouldn't go. They should, everybody should," Yamada says. "But the impact those people had by using their tech skills in a politically-oriented way wildly maximized the impact they have. And I think that should be an aspirational goal for everyone."


It's a Saturday afternoon, and all the orange seats in the "Active Learning Classroom" at Odegaard Library are filled with people staring at laptops. There's some low-level chatter and the quiet hum of computers as application engineers, web developers, scientists, librarians, and "regular" people sit at round tables and try to save data sets from disappearing under the trigger fingers of a climate-change-denying Trump administration.

The Data Refuge Project, a nationwide collaborative effort in tandem with the Internet Archive (an online archive of more than 286 billion web pages), seeks to preserve and protect federal data and government information that supports environmental research. The project has generated large data-saving events like this one in satellite cities across the country.

Seattle co-organizer Will Smith is stunned at the turnout. "I figured it would be like nerds only," he jokes, "but you know, right on." Smith describes himself as a "hobbyist," not a programmer. "I just do this shit at home."

Smith, who also moonlights as the sound guy for experimental hiphop duo Shabazz Palaces, informs me that there's more to the layout of this room than meets the eye—from end to end, it operates like a well-oiled, online-data-preserving machine.

There are the "seeders": tables populated by "interested citizens that are not particularly programming-centric," Smith explains, who nominate websites to download from government servers in order to back up elsewhere. At a nearby table, "harvesters" do the actual downloading and repackaging of the data found by the seeders, adding custom metadata to organize the information. Finally, "checkers" and "baggers" look at what the harvesters have grabbed, making sure that the data set is complete.
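As a rough illustration of the harvester and checker roles described above, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the function names `harvest` and `check`, the `manifest.json` file name, and the choice of SHA-256 checksums are hypothetical and are not taken from the Data Refuge tooling. The downloaded files are passed in as bytes so the sketch runs without network access.

```python
import hashlib
import json
from pathlib import Path
from typing import Dict, List


def harvest(files: Dict[str, bytes], out_dir: Path, source_url: str) -> Path:
    """Save already-downloaded files and record metadata about them.

    `files` maps a flat file name to its raw bytes (hypothetical input shape).
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    manifest = {"source_url": source_url, "files": {}}
    for name, payload in files.items():
        (out_dir / name).write_bytes(payload)
        # Store a checksum so a later pass can detect damage or omissions.
        manifest["files"][name] = hashlib.sha256(payload).hexdigest()
    manifest_path = out_dir / "manifest.json"
    manifest_path.write_text(json.dumps(manifest, indent=2, sort_keys=True))
    return manifest_path


def check(out_dir: Path) -> List[str]:
    """Return the names of files that are missing or no longer match."""
    manifest = json.loads((out_dir / "manifest.json").read_text())
    problems = []
    for name, expected in manifest["files"].items():
        path = out_dir / name
        if not path.is_file():
            problems.append(name)
        elif hashlib.sha256(path.read_bytes()).hexdigest() != expected:
            problems.append(name)
    return problems
```

A checker would call `check(out_dir)` on each harvested folder and treat an empty list as "complete"; any name it returns points to a file that needs to be downloaded again.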

By the end of the day, they will have pored over hundreds of websites from all over the world, and thousands of gigs of data will have been cataloged, saved, and backed up on an independent server and stored with the Internet Archive.

"I believe that data is a public good," says co-organizer Mary Gifford. Gifford moved to Seattle last year—she works in the private sector as the vice president of content and strategic partnerships for Silicon Valley start-up Tribal Planet ("Such a start-up title!" she laughs)—but prior to that was at the United Nations doing climate-change work.

She admits she's "not really the activist type," but after reading about how the Trump administration instructed the Environmental Protection Agency to take down the climate-change page from its website, she felt that "there are certain things that you need to stand up for."

"Regardless of your politics," she continues, "there's data that could be crucial for other things further on that we don't even know about. To me, it's the equivalent of modern-day book burning, really."

Gifford also volunteers her time with Data for Good, a Seattle group that connects hundreds of data scientist volunteers to issues in Seattle that could use some good old-fashioned data (like finding collision risk factors at key intersections in Seattle, for example). Along with Data for Democracy, which recently hosted a hackathon at Ada's Technical Books on 15th Avenue in partnership with the National Immigration Law Center, there are more than enough nerds in this town to get something done when it comes to fighting Trump.

"I think it shows tremendous promise," she says. "If you had the skills to develop a bunch of tools that would sell sneakers online, why wouldn't you use those skills for other things, too—right?"

Specifically, programmers are using their skills today to work on saving records of experiments: tables charting sea level rise, tidal information from measurement stations, and more. For the last few hours, Will Weatherford, a software engineer who specializes in using the Python programming language, and Dylan Hutchison, a computer-science graduate student at the University of Washington, have been writing simple Python scripts to mass-archive data sets from places like the National Renewable Energy Laboratory, NASA, and the EPA.

"You could say," Hutchison muses, "that this is a form of nonviolent resistance. So if the policy is to revoke information related to climate change, then we resist that policy by making that information perpetually available, so that we can continue to study and act on climate change."

While it's maybe not as flashy as taking to the streets in a roar of mass civil unrest in pink pussy hats, Hutchison insists writing Python scripts on a Saturday in a library is an equally valuable use of their time and skills.

"There are many venues of activism," he says, "and they don't just have to be the visible 'Let's get out on the streets with signs and have sit-ins' kinds of actions."

"This is kind of a sit-in," Weatherford jokes. "We've been sitting here for a really long time."


Every day since Trump took office, Matt Kiser has woken up at six in the morning. He grabs a coffee and scans thousands of news sources on the administration's latest exploits. It sounds like a shitty way to start the day. But Kiser is dedicated to the cause: aggregating news for his website, whatthefuckjusthappenedtoday.com, and its corresponding daily newsletter (now at more than 80,000 subscribers).

What the Fuck Just Happened Today (WTFJHT) tells you exactly that—it's a daily aggregated record "logging the shock and awe" of every executive order, weird POTUS tweet, ongoing Russian intrigue, and controversy surrounding the state of national politics.

The website boasts a clean, navigable design (days are listed in the header) packed with easily digestible information. Each day starts off with a short title (Day 71: Tumultuous; Day 22: Denials; Day 4: The Upside Down) and lists news items broken down as bullet points, with the occasional embedded tweet or quote. Kiser set up the blog so that it is open-source and hosted on GitHub, so that others could make CSS tweaks and pull requests to edit the content.

"I thought it would be a really interesting idea to take a blog, but treat it the way software development works, make it really agile, use version control, and open it up to collaborate with other developers on the project," he explains.

A former "super-political skate punk kid," Kiser was active in the mid-2000s during the Bush years, but, like many leftist-progressive types, grew a little complacent and took a back seat during Obama's term. After trying to become a music journalist in New York City, Kiser focused on reinventing his career instead, moving out to Seattle and taking coding classes at General Assembly. But Trump's election was a wake-up call, and an opportunity to use his newly developed coding skills.

"I guess I find it funny that the flash point for this project had to be the outcome of the election. But I think that also speaks to the environment we're in right now, culturally," he said.

Which is the reason he invented WTFJHT—to keep up with the "daily atrocities" coming out of the White House and Congress—and the feeling of confusion that comes with how to stay informed amid all the chaos.

"Like, US national politics suck right now, the guy in the White House sucks right now. I'm upset, and I can't even keep up with what the fuck is going on! And I'm kind of a news junkie. So how could anyone else possibly keep up with this if they're a 'normal' person?" he says.

Kiser's original plan was to document Trump's first 100 days in office, as a kind of personal challenge to himself. "I mean, it was definitely supposed to just be a side project," Kiser says. But as of last week, it's become more than that—Kiser just quit his position as a project manager at a tech start-up to manage WTFJHT full-time.

In his efforts to help people not be so overwhelmed, Kiser admits, he feels... overwhelmed. Not only by the demands of keeping up with WTFJHT's growth, but by the surfeit of civic-engagement-friendly tech products created since the election that, ironically, are meant to make it easier for the average person to get involved.

"There are like a million uncoordinated projects running around. There are so many daily-action-app-text-message-e-mail-website-blogs going around, and all those spreadsheets. Have you seen all the spreadsheets?" he asks, exasperated.

Of course, any decent tech resistance staying true to its open-source roots is going to be decentralized—relying on online forums and Slack message boards to plan actions and brainstorm prototypes among large groups of people living in different places. On one hand, you could say it's a plus—giving the movement a flexible, resilient edge (if one system goes down, there are others to quickly take its place), making it not reliant on place and time (like a protest), and giving individuals the power that comes with a sense of anonymity.

Who better than hackers and tech people, for example, to teach us how to wipe our phones before getting searched by a customs agent, or how to set up our VPN networks when the corporate powers that be come calling after Congress votes to sell our private online information to the highest bidder?

But the fragmentation of a base always comes with a price—in this case, it's the silos that come with trying to build savvy tech solutions in a bubble.

"So I'm a tech person, and I'm good at this thing, and I want to do something, so I decide I'm going to make a web app or whatever, or get some data, or build this experience. And I think what's missing is a connection back to people. What do people actually want? What's the problem we are actually solving?" Kiser asks.

Takao Yamada agrees, saying engineers need to connect with activists to build accurate "user stories" around their immediate needs.

"I think the tech and the start-up community will be most effective within the activist community," he says. "Smaller organizations, which have really narrowly tailored needs that can be met by tools built in a weekend that can dramatically change their work."

It's clear that the tech resistance, whatever it is, needs to put down roots with the grassroots communities leading the charge against Trump or all those well-intentioned civic good projects will go the way of Apple's G4 cube, Ello (remember them?), or Microsoft's doomed media player, Zune.

"I think it's a really big miss on our part," Kiser says. "We should be aligning people who want to build digital products with people who have real experience organizing communities. And it's like, how do you find them? Where are all the organizers for all of this?"


{{ image:2 }}

One of those organizers may be Tiffany Chan. She's young, motivated, and eager to build bridges between the tech world and many of the communities she supports as a grassroots community organizer. Chan has recently joined the leadership team at Open Seattle—another group, like Data for Good, that builds technology-focused projects and "prototype solutions" for local civic issues.

The Open Seattle meetings take place at Socrata, a company that provides cloud-based data visualization and analysis tools for working with government data. The well-lit, carpeted hallways are lined with top-notch Mac desktops and busy-looking whiteboards. The space has the slick hipster playfulness of your typical tech company; in the lounge, there's a full kitchen, a fancy coffee maker, an entire shelf with an almost obscene variety of hot sauces, and a basket full of rubber duckies and other toys set up on a long stainless-steel table.

Chan was invited to join Open Seattle through her environmental and racial-justice work—she liked the concept of civic engagement, so she went to a meeting.

"And for me," she says, "I was just wondering like, who was in the room? Because as a community organizer, one of the things you always look out for is 'What is being asked? What's the goal?' I saw a lot of tech people, and their intent was good. I heard a pitch about homelessness, which was cool, but there wasn't really anybody from the homeless community there. I didn't really see any direct connection to the people they were trying to help."

It solidified Chan's resolve to connect eager-to-help tech workers with the communities they live in—and, ironically, push out of town when they gentrify a neighborhood. But Chan insists it's important to "take into account the individual actions, but also the institutional, systemic ones." For example, she says, she drives a car sometimes, but she opposes the oil industry.

"The reality is that we live in a system that's oppressive. So finding ways to collaborate together, to ally, I feel like that's my approach to dismantling it," Chan explains. "And I believe there's room for everybody in the movement."

As a lifelong resident of Beacon Hill, Chan has seen that gentrification firsthand. "But so far," she says, "we still know all of our neighbors, luckily." When she was a teenager, Chan was bused to Roosevelt High School in the North End as part of an exchange program. She volunteered at the Woodland Park Zoo and started hearing about environmentalism and sustainability, "which just created a different perspective and lens for looking at things."

She got involved in environmental-justice work, first at the zoo and then for Earth Corps. Right now, she's working with Facebook to organize a hackathon for the environment on April 28. In her work, she wants to invite tech workers to collaborate, while also making sure they confront their privilege. At Open Seattle, for example, she brought in food from businesses owned by people of color to replace the usual boxes of pizza.

"Because one thing I find in the tech community is that certain things are always just kind of done, like little magical elves or owls come in, and the free sodas are always stocked and the floors are always magically clean," she says. "And I was like, 'Yeah, those are people doing that work for you.' And I think with grassroots organizing, it's always us doing that work. We don't have free food all the time."

Chan has had a lot of "tough conversations" about race and privilege, and her advice for tech workers who want to help is simple: Show up. "Not only do we then have the emotions and the passion behind the movement, but also the data and the information to make our arguments and actions more objective." Resistance-minded tech products, Chan says, should be designed around real-life experiences. "I think pairing those two things makes our movement to resist the Trump era that much stronger."

The March for Science on Saturday, April 22, was a chance to do just that: In Seattle, thousands of scientists, techies, and concerned citizens showed up to protest the Trump administration's growing siege on the EPA, its refusal to acknowledge climate-change data, and the funding cuts for research programs.

"You know things are serious," one sign said, "when the introverts arrive."



          Introducing our Postdoctoral Fellow, Dr. Dan Sholler   
We are pleased to welcome our Postdoctoral Fellow, Dr. Dan Sholler. Dan is an expert in qualitative research (yes, you read that correctly) and studies digital infrastructure creation, growth, and maintenance efforts. Through this research interest, he was drawn to the open science community and its ongoing development of tools and communities to support sustainable, reproducible, high-quality research. With rOpenSci, he intends to investigate what drives scientists to engage with or resist open science tools and communities.

Dan will be the first postdoc for the rOpenSci project, based at UC Berkeley and the Berkeley Institute for Data Science, supervised by Karthik Ram and co-supervised by Carl Boettiger and Daniel Katz.

We interviewed Dan to introduce you to him and his research (a fascinating conversation!). This short introduction can’t do him justice, but he'll share his research plan in another post in Fall 2017.

Q: Tell us a bit about your background

I'm a (mostly) qualitative, ethnographic researcher who studies user acceptance, resistance, and adaptation in digital infrastructure development programs. In other words, I like to study the factors that motivate people to engage with or resist new technologies, with the goal of helping to improve design and implementation strategies for eliciting engagement.

I completed my B.A. in Science, Technology and Society at University of Pennsylvania. During that time, I learned that technologies don’t just succeed or fail based on their merit; instead, a host of social and political factors play influential roles in determining technological outcomes. My Ph.D. research at the University of Texas School of Information involved looking at contemporary deployments of new information technologies in developing countries, in government agencies, and in healthcare clinics.

Alongside my research, I took courses in organizational theory, organizational behaviour, and information systems, sparking my interest in studying how organizational IT systems persist and become robust, or fail in the face of resistance from users during the implementation and use phases.

Sometimes, the unanticipated actions users take actually help an IT implementation to succeed. In the first study I participated in at UT, we looked at a new branchless banking system intended to allow rural Brazilians to receive welfare benefits and pay utility bills without journeying to bank branches in major cities (think a network of simplified ATMs). The plan was to place point-of-service machines in places like grocery stores and post offices and allow clients to access self-service features. However, most of the clients were unable to use the machines themselves due to low technical literacy and they often encountered technical errors they couldn’t resolve. The owners and clerks in the shops took on new roles to fill the gap between the design of the technology and the reality of the situation: Elderly clients often handed over PINs; customers facing issues asked the shopkeepers to get in contact with the bank or utility companies; and some shop owners even borrowed money from their own registers to cover benefit checks when the system was down. All of these role-expanding actions ensured that the branchless system persisted.

In other cases, users’ reactions to a new implementation can lead to implementation failure. In my dissertation work, I found that doctors actively resisted and impeded a federally-mandated implementation of electronic medical records (EMR) in the U.S. healthcare industry. Clinics implemented “certified” EMR and had their costs partially subsidized by the federal government. In the study, I found that doctors were frustrated with the extra time they spent using EMR, particularly because it added little perceived benefit to patient treatment. Through interviews and observations, I learned that doctors could not affect any technological change within their local organizations because of strict federal policies. Through actions like lobbying Congress, holding town hall meetings, and voicing the medical community’s concerns in public outlets, the American Medical Association effectively stalled the progression of the federal program, which remains in jeopardy today.

I want to leverage my experience to explore how and why scientists engage with or resist open science communities and technologies, focusing on how particular communities like rOpenSci manage engagement and resistance to ensure positive outcomes. In turn, I hope to contribute to our understanding of best practices for open science infrastructure development, including what community leaders, users, universities, government agencies, and other actors can do to develop robust infrastructures.

Q: What would you like to accomplish with your postdoc?

Just like any other academic postdoc, I plan to publish papers - in my case, examining the development of open science communities - and draw upon theories that might help to understand engagement and resistance. I intend to apply what I learn from the project and advise rOpenSci (and other open science communities) about strategies for anticipating, dealing with, and overcoming issues related to user engagement and/or resistance.

I’ll strive to gain a deeper understanding of the open science movement, focusing first on general questions like: “What drives community leaders to devote time and resources to building a robust infrastructure to support open science? What social and technical circumstances support or stand in the way of infrastructure development? What managerial strategies might be applied to get scientists to engage or to help them deal with the perceived detriments of participating in the open science community?”

To answer these questions, I’ll conduct a qualitative, comparative study of multiple open science communities, beginning with rOpenSci. In the study, I’ll interview and observe the leaders of these communities and the scientists who (a) actively engage with the open science tools produced by the communities and (b) might stand to benefit from the use of the tools, but instead resist use. I plan to evaluate the managerial strategies used across the communities. Focusing on multiple communities will assist the effort to generalize my findings to the broader open science movement.

Throughout the study, I’ll consider the following prominent issues identified in the literature on digital infrastructure development, open science, open data, and related areas of study:

  • Organizational motivations for building open science infrastructures and communities
  • The need for balance between flexibility and standardization in managing users and their behaviors
  • Best practices for creating infrastructures and managing their growth, including both technical and social elements

Q: How did you arrive at doing qualitative research in an environment full of quantitative researchers?

I first heard about rOpenSci through a member of my dissertation committee, James Howison, while at UT-Austin. James is a renowned researcher of scientific software issues, including topics like software citation and attribution in academic journals. He worked with members of the rOpenSci community and clued me into the ongoing development of open science infrastructures. I had also heard from my peers about rOpenSci’s annual unconference and its novel approach to building technical and social capacity in the R community.

The open science community has no shortage of exciting, innovative software tools to support scientific research. However, time and again, authors recognize that perhaps the biggest impediment to the widespread use of these tools is eliciting engagement from users who aren’t software developers themselves or who are hesitant to open their methods and data to the broader scientific community. I think my research will support efforts to expand infrastructure participation to these scientists by uncovering what their concerns and hesitations may be and considering how we might begin to address them, both through technical design and social approaches.

The rOpenSci community, to me, is an ideal place to carry out a qualitative research project with quantitative researchers. Although many other communities exist and have their own merits, rOpenSci has quickly and effectively engaged an interdisciplinary community of researchers and produced tools with immediate impacts. Studying this community and comparing it to related communities will aid in understanding what makes rOpenSci’s approach so effective. I believe that drawing out lessons about managing engagement can be applied to other communities and benefit the open science community as a whole.

Want to learn more about Dan Sholler?

  • Read Dan’s publications on Google Scholar
  • Follow Dan on Twitter
          Data Scientist - Wink - New York, NY   
Hands-on experience with supervised and unsupervised machine learning algorithms for regression, classification, and clustering....
From Wink - Thu, 18 May 2017 06:17:27 GMT - View all New York, NY jobs
          Data Scientist - Drop - Toronto, ON   
Through our mobile app, users supercharge their debit and credit cards to automatically earn points on their every day spending at places such as Starbucks, Tim...
From Drop - Thu, 01 Jun 2017 02:38:43 GMT - View all Toronto, ON jobs
          Senior Cloud Engineer/Big Data Architect / Data Science - Corporate Technology - New York, NY   
Perform data analysis with business, understanding business process and structuring both relational and distributed data sets....
From Corporate Technology - Wed, 14 Jun 2017 16:40:06 GMT - View all New York, NY jobs
          Data Scientist   

          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Lev. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Enveritas: Head Engineer   

(New York)

Head of Engineering

At Enveritas

New York, NY

About Us:

Enveritas aims to sustainably verify 100% of the world’s coffee production by 2020 and end poverty in the coffee sector by 2030. We are doing this by creating a low-cost, high-fidelity platform that helps coffee companies verify the sustainability practices of the products they source with complete transparency. As a social enterprise, we will channel the cost-savings that result from a re-engineered verification process into technical assistance funding to assist the poorest farmers out of poverty.

The Role:

You will design, implement, and maintain our technology stack from end to end. Our core product consists of a mobile survey application, a data ingestion and validation system, and a reporting interface for clients (coffee companies). We are also building tools for machine learning to optimize our verification work on the ground with farmers and for data visualization to enhance our ability to deliver insights for clients.

Your instincts and experience will drive our product design and the growth of our engineering team. We expect you to develop a vision for the “big picture” architecture of our system and create a roadmap for implementing it. You will have the opportunity to build out a team of engineers below you, so you should be comfortable managing people with different skill sets and coordinating multiple complex, time-sensitive work streams simultaneously.



Our Ideal Candidate Has:

  • Bachelor’s in computer science or related field from a top university
  • 5 years’ experience scaling systems at a tech-driven company
  • Strong knowledge of agile project management and software development techniques
  • Proven experience managing teams of engineers across a technology stack
  • Excellent communication skills
  • Experience with modern devops workflows and tools (e.g., Heroku, Docker, AWS)
  • A passion for social change



Bonus Points For:

  • Master’s degree in computer science or related field
  • Experience developing software for physical world applications (e.g., delivery/routing systems, physical modeling and analysis)
  • Experience with modern web development frameworks (e.g. Angular.js, React)
  • Experience with the Android SDK as well as third-party libraries and APIs
  • Large company + Startup experience highly preferred



More about Enveritas:

We are recruiting a global team of socially-motivated professionals for roles in software engineering, data science, business development, and operations (on the ground in Africa, Asia and Latin America). We are currently a team of 20 people. We have secured funding for platform development, testing and scaling, and already have significant revenues from clients. Our full-time positions offer flexible hours and excellent benefits, including paid holidays and time off, insurance, 401k with matching, etc.




Enveritas is an equal opportunity employer and values diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status or disability status.

Apply:


          Data Scientist   

          Tutorial: Infinite Skills Oreilly Mining the Social Web - Twitter   

In this training course, you learn how to read live data and continuous data streams on Twitter and analyze them. Along the way, Python, the best language for Data Science work, is used ...


          Tutorial: Coursera Introduction to Data Science in Python   

By watching this very valuable training course, you become familiar with the fundamentals of Data Science and then learn to work with data and analyze it in the Python programming language. The course instructor presents all the topics in simple language and ...


          Tutorial: Coursera Applied Plotting Charting & Data Representation in Python   

To draw conclusions from data and carry out various Data Science tasks, you need to be able to visualize data properly. In this highly practical training series, visualization, displaying data as charts and plots, and other kinds of visualization ...


          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Senior Data Scientist - Scotiabank - Toronto, ON   
Lead the development of strategy optimization frameworks to further enhance credit limits setting, risk-based pricing, and customer targeting throughout credit...
From Scotiabank - Tue, 27 Jun 2017 06:46:37 GMT - View all Toronto, ON jobs
          Fighting biases with dynamic boosting. (arXiv:1706.09516v1 [cs.LG])   

Authors: Anna Veronika Dorogush, Andrey Gulin, Gleb Gusev, Nikita Kazeev, Liudmila Ostroumova Prokhorenkova, Aleksandr Vorobev

While gradient boosting algorithms are the workhorse of modern industrial machine learning and data science, all current implementations are susceptible to a non-trivial but damaging form of label leakage. It results in a systematic bias in pointwise gradient estimates that leads to reduced accuracy. This paper formally analyzes the issue and presents solutions that produce unbiased pointwise gradient estimates. Experimental results demonstrate that our open-source implementation of gradient boosting that incorporates the proposed algorithm produces state-of-the-art results, outperforming popular gradient boosting implementations.


          Data Scientist - Data Insights & Analytics - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 17 Jun 2017 08:59:04 GMT - View all São Paulo, SP jobs
          Data Scientist - Growth - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:08 GMT - View all São Paulo, SP jobs
          Data Scientist - RA - CSI - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:08 GMT - View all São Paulo, SP jobs
          Data Scientist - Mkt Place Routing - 99 TAXIS - São Paulo, SP   
Has worked with big data technologies like Storm, Samza, Hadoop, Presto, Airflow; 99's mission is to make transportation cheaper, faster and more efficient...
From 99 TAXIS - Sat, 10 Jun 2017 05:51:05 GMT - View all São Paulo, SP jobs
          Data Scientist   

          Switzerland-based research fills Wikipedia’s language gaps   
A professor at Lausanne’s Federal Institute of Technology (EPFL) has helped develop a platform to identify and generate key Wikipedia pages that are currently missing in minority languages. The new tool combines machine learning with human expertise to make the crowdsourced online information platform available to more people. As an English speaker, chances are you can rely on Wikipedia to provide information when you want to do some light research, check a fact, or prove a point. But despite the online encyclopaedia’s 40 million pages in nearly 300 languages, significant gaps in coverage exist for less-spoken tongues – like Switzerland’s Romansh – which can lack basic or essential entries like “climate change” or “the universe”. For Wikipedia editors, it can be difficult to know how to prioritise the translation of pages into different languages based on the relative cultural importance of each topic. Robert West, head of the EPFL’s Data Science Lab, decided to fix that using ...
          Data Scientist - Verizon - Basking Ridge, NJ   
What you’ll be doing... If you are curious about new technology and the possibilities it creates, then this job may be perfect for you. As part of the
From Verizon - Thu, 29 Jun 2017 10:58:49 GMT - View all Basking Ridge, NJ jobs
          Big Data and the Stalker Economy   
Crack is being served in Silicon Valley. An enthusiastic crowd of geeks and suits -- all of them "data scientists" -- just spent three days at the O'Reilly Strata conference (#strataconf) in Santa Clara. All over the event's menu is the crack cocaine of our day: big data. A couple decades ago, [...]
          (USA-FL-Tampa) Products Go To Market Senior Manager   
Title: Products Go To Market Senior Manager Location: USA-Southeast Job Number: 00491094 Organization: Accenture Analytics Travel: Travel Required Position: Accenture Analytics – Products Go To Market Senior Manager The digital revolution is changing everything. It’s everywhere – transforming how we work and play. Are you leading the way as a digital disrupter? Accenture Digital is driving these exciting changes and bringing them to life across 40 industries in more than 120 countries. At the forefront of digital you’ll create it, own it, and make it a reality for clients looking to better serve their connected customers and operate always-on enterprises. Join us and become an integral part of our experienced digital team with the credibility, expertise, and insight clients depend on. Accenture Digital is powered by three practices –Mobility, Interactive, and Analytics. As part of our Analytics practice, you’ll deliver analytically-informed, issue-based solutions that help clients make faster, smarter decisions. You’ll play a critical role in helping them tackle complex business issues. Join Accenture and help transform leading organizations and communities around the world. The sheer scale of our capabilities and client engagements and the way we collaborate, operate and deliver value provides an unparalleled opportunity to grow and advance. Choose Accenture, and make delivering innovative work part of your extraordinary career. People in our Client and Market career track drive profitable growth by developing market-relevant insights to increase market share or create new markets. They progress through required promotion into market-facing roles that have a direct impact on sales. Analytics professionals create new insights from predictive statistical modeling activities that target and deliver value to our clients. 
The Functional and Industry Analytics Senior Manager: Works with clients to provide primary and secondary research within an industry or functional area. Assesses industry trends, creates reports, prepares forecasts and develops industry/functional models. Uses advanced statistical techniques to find relationships between variables. Solves organizational problems for the client by analyzing requirements, providing valuable market research linked insights and developing tools and processes to aid decision making. Job Description Analytics professionals use quantitative methods to derive actionable insights and outcomes from data. The Products Go to Market Analytics team comprises professionals with diverse experience and backgrounds including industry experience, consulting, statistics, engineering, economics and other quantitative disciplines. Employees are encouraged and expected to build their expertise as data scientists, and deploy analytics to business problems. We also provide access to an experienced group of professionals across disciplines, including Financial Services, Communication Media and Technology, Products - CPG/Retail, Supply Chain & Operations. As a result, we are able to apply a multi-disciplinary approach to projects and formulate a more comprehensive, best in class and effective strategy. A professional at this position level within Accenture has the following responsibilities: Identifies, assesses and solves complex business problems for area of responsibility, where analysis of situations or data requires an in-depth evaluation of variable factors. Closely follows the strategic direction set by senior management when establishing near term goals. Interacts with senior management at a client and/or within Accenture on matters where they may need to gain acceptance on an alternate approach. Has some latitude in decision-making. Acts independently to determine methods and procedures on new assignments.
Decisions have a major day to day impact on area of responsibility. Manages medium to large sized teams and/or work efforts (if in an individual contributor role) at a client or within Accenture. Key Responsibilities of the role include: The candidate will work closely with client teams and/or clients directly and will be responsible for specific responsibilities in one or more of the following areas: Excellent understanding of Products industry/Domain; Working with one of the sub-industries within Products – retail, consumer packaged goods or health; Understanding of the data sets within these sub-industries – example: commercial, point of sales, clinical, claims data etc.; Deliver data and BI/analytics driven engagements; Creating solutions around one or more of the below areas: Analytics strategy (roadmap, maturity assessment), Commercial Analytics, Consumer Analytics, Campaign management, Digital Marketing Analytics, Analytics enablement (dash-boarding, tool development); People and project management; Actively engage in knowledge learning, sharing and storing.
Steep learning curve and adapting to new client situations quickly and work under a competitive environment Excellent communication skills Ability to summarize the essence in speech & in written communication Ability to influence and lead others Adhere to the firm’s standards on ethics, code of conduct and values Qualifications: YOUR EXPERIENCE: Basic Qualifications Bachelor’s Degree required Minimum of 7 years of client facing/consulting experience Minimum of 7 years of experience in working on advance analytics projects involving statistical models Minimum 7 years of experience with BI and Analytics add-on/cross sales and deal conversion to grow revenue at assigned clients Minimum of 7 years of experience using advanced analytics tools Minimum of 7 years of demonstrated experience with written and verbal communication skills including formal and informal presentations, building client relationships and providing excellent client service Minimum of 4 years of experience in requirements gathering and design for BI/analytics Minimum 7 years of experience working with cross functional teams across the US and/or European markets Ability to travel up to 80-100% SET YOURSELF APART: Preferred Skills Bachelor’s Degree, Engineering, Computer Science or Statistics Master’s Degree, Business Administration, Statistics or Operations Research 2 years of experience using data visualization tools such as tableau, spotfire and/or qlikview Strong work experience in Products industry and statistical analysis Exceptional business acumen Ability to build, maintain and own relationships Strong presentation skills Strong conceptual knowledge and understanding of databases and statistics Solid Microsoft Office (Excel and power point) skills Strong business and technical accountability Professional Skill Requirements Proven ability to build, manage and foster a team-oriented environment Proven ability to work creatively and analytically in a problem-solving environment Desire to work in 
an information systems environment Excellent communication (written and oral) and interpersonal skills Excellent leadership and management skills All of our consulting professionals receive comprehensive training covering business acumen, technical and professional skills development. You'll also have opportunities to hone your functional skills and expertise in an area of specialization. We offer a variety of formal and informal training programs at every level to help you acquire and build specialized skills faster. Learning takes place both on the job and through formal training conducted online, in the classroom, or in collaboration with teammates. The sheer variety of work we do, and the experience it offers, provide an unbeatable platform from which to build a career. Applicants for employment in the US must have work authorization that does not now or in the future require sponsorship of a visa for employment authorization in the United States and with Accenture (i.e., H1-B visa, F-1 visa (OPT), TN visa or any other non-immigrant status). Candidates who are currently employed by a client of Accenture or an affiliated Accenture business may not be eligible for consideration. Accenture is a Federal Contractor and an EEO and Affirmative Action Employer of Females/Minorities/Veterans/Individuals with Disabilities. Equal Employment Opportunity All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law. Job candidates will not be obligated to disclose sealed or expunged records of conviction or arrest as part of the hiring process. Accenture is committed to providing veteran employment opportunities to our service men and women. Job: Analytics
          (USA-FL-Tampa) Business Intelligence Director   
**PwC/LOS Overview** PwC is a network of firms committed to delivering quality in assurance, tax and advisory services. We help resolve complex issues for our clients and identify opportunities. Learn more about us at www.pwc.com/us. At PwC, we develop leaders at all levels. The distinctive leadership framework we call the PwC Professional (http://pwc.to/pwcpro) provides our people with a road map to grow their skills and build their careers. Our approach to ongoing development shapes employees into leaders, no matter the role or job title. Are you ready to build a career in a rapidly changing world? Developing as a PwC Professional means that you will be ready - to create and capture opportunities to advance your career and fulfill your potential. To learn more, visit us at www.pwc.com/careers. It takes talented people to support the US firm of the largest professional services organization in the world. Not all of us work directly with external clients. Some of our best people choose to apply their talents inside PwC. As part of Internal Firm Services, you're serving an organization on par with many of our external clients. Our Internal Firm Services team consists of first-rate marketers, human resource professionals, computer technologists, knowledge managers, accountants, financial planners, administrators and leaders. Internal Firm Services staff are the people who make it work for the people who make it work for our clients. **Job Description** The Office of the Chief Data Officer is charged with being the voice of data and generally representing data as a strategic business asset. The primary role of this organization is to champion the use of data and information across the firm, and drive changes and improvements in data related operations. This office will help to enable the business, as well as provide insights related to attendant risks. Significant effort will be dedicated to determining PwC data related needs, and developing proposed solutions. 
The Chief Data Officer team identifies where, when, how, and why the business uses data, and transforms it into information that serves clients. Fathom is a data-driven enterprise intelligence platform that provides a one-stop search to data to help win and deliver work. The team collaborates with a broad range of knowledge and systems stewards to curate, retrieve/index, and monitor content. The Fathom team manages the technical landscape to stay relevant to customers and competitive with the external environment. The team provides expertise in leading search capabilities, data indexing, and analyzing structured and unstructured data. **Position/Program Requirements** Minimum Year(s) of Experience: 8 Minimum Degree Required: High School Diploma or GED Degree Preferred: Bachelor's degree in Computer Science/Information Management, Data Science, Business Intelligence, or Data Visualization Certification(s) Preferred: Information Governance Professional; Project Management Professional; Certified Information Professional; Certified Information Management Professional; Certified Information Privacy Professional; Certified Records Manager; Certified Information Systems Security Professional; Certified Information Security Manager Knowledge Preferred: Demonstrates thought-leader level knowledge with, and/or a proven record of success, directing efforts in the following areas: - Utilizing and applying enterprise-wide leadership and strategic vision for a search platform that scans a multitude of internal and external data sources to expose data and deliver insights; - Creating and leading a firm’s data strategy across multiple lines of service and functions; - Implementing and directing policies, processes, technology solutions; - Collaborating with a broad range of knowledge and systems stewards to curate, retrieve, and monitor information; - Managing a technical landscape to stay relevant and competitive; - Directing a team providing expertise in leading search 
capabilities, data indexing and analyzing data; and, - Utilizing and applying many facets of data and information with a high level of thought leadership and recognized proficiency. Skills Preferred: Demonstrates thought-leader level abilities as a team leader, emphasizing the following areas: - Bringing vision and creative thinking to define enterprise level strategies then implementing based on business needs; - Partnering with the business to apply in-depth knowledge of the business operations, strategies, priorities and information requirements to establish technical direction and an enterprise view; - Establishing data standards and procedures which are defined and applied to extend business intelligence capabilities through search;. - Establishing industry leading practices which are adhered to in adopting new data assets; - Researching, evaluating and selecting from existing and emerging technologies to help with our strategic needs; - Reporting, interpreting data, extracting trends, and identifying insights or opportunities for product and business decisions; - Managing relationships with external data solution suppliers; - Identifying problems/issues with data quality and recommending remediation strategies; - Identifying and recommending opportunities for development of more efficient data repositories; - Recommending taxonomy and/or data classification to promote more efficient enterprise search; - Leading teams to generate a vision and motivate members, create an atmosphere of trust, leverage diverse views, coach staff, encourage improvement and innovation; - Overseeing operations, project workflow, budgets, and/or coordinating complex written materials; and, - Overseeing implementation of the plan for the future state technical design in alignment with broader data management strategy. 
All qualified applicants will receive consideration for employment at PwC without regard to race; creed; color; religion; national origin; sex; age; disability; sexual orientation; gender identity or expression; genetic predisposition or carrier status; veteran, marital, or citizenship status; or any other status protected by law.
          (USA-FL-Tampa) Data Scientist, Senior Associate   
**PwC/LOS Overview** PwC is a network of firms committed to delivering quality in assurance, tax and advisory services. We help resolve complex issues for our clients and identify opportunities. Learn more about us at www.pwc.com/us. At PwC, we develop leaders at all levels. The distinctive leadership framework we call the PwC Professional (http://pwc.to/pwcpro) provides our people with a road map to grow their skills and build their careers. Our approach to ongoing development shapes employees into leaders, no matter the role or job title. Are you ready to build a career in a rapidly changing world? Developing as a PwC Professional means that you will be ready - to create and capture opportunities to advance your career and fulfill your potential. To learn more, visit us at www.pwc.com/careers. It takes talented people to support the US firm of the largest professional services organization in the world. Not all of us work directly with external clients. Some of our best people choose to apply their talents inside PwC. As part of Internal Firm Services, you're serving an organization on par with many of our external clients. Our Internal Firm Services team consists of first-rate marketers, human resource professionals, computer technologists, knowledge managers, accountants, financial planners, administrators and leaders. Internal Firm Services staff are the people who make it work for the people who make it work for our clients. **Job Description** PwC's US Finance organization is a strategic business advisor responsible for managing the firm's financial risk, including: financial planning and reporting, data analysis, and assisting leadership with strategic and tactical matters. Services include: budget management, cost benefit analysis, forecasting needs, data and analytics, shared services and financing. Finance works daily with US Leadership, engagement partners and managers on managing the profitability of engagements. 
Finance has assisted other PwC Network firms regionalize their financial operations. Finance also analyzes potential acquisitions, assisting with the integration (including system needs) and educating partners/managers on how to navigate our various financial systems. The Cross Line of Service Management Reporting team is responsible for developing the Reporting strategy and producing and analyzing all reports related to Finance operations, Clients, Markets and Sectors, Line of Service (LoS) Finance and Leadership & Management reports. This team is responsible for maintaining a strong relationship with LoS Finance Leadership, Firmwide Finance Leadership, US Financial Services Leadership, and the Budget & Analysis Team to assess and deliver on the needs of the organizations and is responsible for developing a sustainable governance structure and standardized reporting across all reporting segments and LoS. **Position/Program Requirements** Minimum Year(s) of Experience: 2 Minimum Degree Required: Bachelor's degree or more than 4 years of experience in Finance or IT Technology Knowledge Preferred: Demonstrates thorough knowledge of and/or proven record of success in business intelligence (BI) solutions, preferably for a global network of professional services firms, including in the following areas: - Utilization of QlikView, QlikSense, Tableau; - Performing data analyses from collection through reporting and recommendations; - Utilization of financial systems, reporting systems, research tools and standard industry practices; and, - Reporting results and providing insightful results analyses. 
Skills Preferred: Demonstrates thorough level of ability and/or proven record of success in anticipating a range of possible solutions and opportunities, using research, analysis and consultation to reach sound conclusions, preferably for a global network of professional services firms, including in the following areas: - Partnering with reporting leadership and stakeholders to develop and execute various reporting packages and ad hoc reporting requests; - Designing and developing consistent reporting packages and dashboards using various Business Intelligence (BI) solutions; and, - Seeking assignments appropriately to broaden individual experience, and sharing knowledge with others. All qualified applicants will receive consideration for employment at PwC without regard to race; creed; color; religion; national origin; sex; age; disability; sexual orientation; gender identity or expression; genetic predisposition or carrier status; veteran, marital, or citizenship status; or any other status protected by law.
          New Approaches to Ethno-Linguistic Maps   

humanswhoreadgrammars:

This post originates from the HWRG-blog. Please note that there are multiple authors of HWRG and that the most updated version of this blogpost can be found here: http://ift.tt/2ubG9BB.
___________________________________________
New Approaches to Ethno-Linguistic Maps

I’m excited to give a guest blog post here at humans who read grammars on new methods in language geography. I’m a geographer by trade, and I am currently a PhD student at the University of Maryland. I also work for an environmental nonprofit - Conservation International - doing data science on agriculture and environmental change in East Africa. Before ending up where I am now, I lived for some time in West Africa and the Philippines. During my time in both of those linguistically rich areas, I became quite interested in language geographies and linguistics more generally. Spurred on by curiosity and my disappointment in available resources, I’ve done some side projects mapping languages and language groups, which I’ll talk about here.

Problems with Current Language Maps

A map of tonal languages from WALS.  Fascinating at a global scale, but unsatisfying if you zoom in to smaller regions.
One major issue with most modern maps of languages is that they often consist of just a single point for each language - this is the approach that WALS and Glottolog take. This works pretty well for global-scale analyses, but single points are quite uninformative for regional-scale studies of languages. Points also have a hard time spatially describing languages that have disjoint distributions, like English, or languages that overlap spatially. See here for a more in-depth discussion of these issues from Humans Who Read Grammars.

One reason that most language geographers go for the one-point-per-language approach is that using a single point is simple, while mapping languages across regions and areas is very difficult. An expert must decide where exactly one language ends and another begins. The problem with relying on experts, however, is that no expert has uniform experience across an entire region, and thus will have to rely on other accounts of which language is prevalent where. This is how, for example, the Murdock Map of African ethno-linguistic groups was created. As a continental-scale map, it is rich and fascinating. However, look more closely at a specific region, and the map seems to have problems - how did Murdock know exactly the shape of each little wiggle identifying the boundary between two groups? What about areas where two different groups overlap? Other issues arise when trying to separate distinct groups, because the on-the-ground reality is often that a language exists as a dialect continuum, something that subjectively drawn polygons do not readily account for.

These maps can have real import when they form the foundation of other analyses. Researchers have examined whether ethnic diversity in developing countries, and in Africa in particular, can hamper economic development and lead to conflict. Scientists disagree, although many analyses use the Murdock map. See some of this research here, here and here. Another study, recently published in Science, looked at Internet penetration in areas where politically excluded ethnic groups live. They found that groups without political power were often marginalized in terms of internet service provision. However, their data for West Africa, which came from the Ethnic Power Relations database, was quite rough: all of southern Mali was one ethnic group labeled “blacks” while the north was labeled as “Tuaregs” or “Arabs”, while there was no data at all for Burkina Faso.  While their findings were important and they did the best that they could with available datasets, a less informed analysis from the same data could end up looking like linguistics done horribly wrong.  We need better ethno-linguistic maps simply to do good social science and address these critical questions.

New Methods and Datasets

I believe that, thanks to the greater computational efficiency offered by modern computers and new datasets available from social media, it is increasingly possible to develop better maps of language distributions using geotagged text data rather than an expert’s opinion. In this post, I’ll cover two projects I’ve done to map languages - one using data from Twitter in the Philippines, and another using computationally-intensive algorithms to classify toponyms in West Africa.

I should note that for all its hype, big data can be pretty useless without real-world experience.  The Philippines and West Africa are two parts of the world where I have spent a good amount of time and have some on-the-ground familiarity with the languages.  Thus, I was able to use my local knowledge to inform how I conducted the analyses, as well as to evaluate their issues and shortcomings.

Case Study 1: Social Media From The Philippines

Many fascinating language maps from Twitter have been created at global scales - see here, and here. However, to explore the distribution of understudied languages that don’t show up in maps of global languages, one must use more bespoke methods. This is especially true of Austronesian languages like those found in the Philippines, which don’t have a lot of phonemic variability, and therefore aren’t easily classified using the methods that Google Translate uses. These methods, which rely on slices of the sample text, often confuse Austronesian languages like Tagalog and Bahasa - just look at the maps I mentioned above. Thus, I had to use a word-list method, and created word lists from corpora offered by SEAlang, and by scraping from local-language Wikipedia articles. The resulting maps show where minority languages are used in comparison with English and Tagalog in the Philippines, and likely underestimate the prevalence of minority languages because the corpora used (Wikipedia and the Bible) are quite different from the Twitter data that was classified.
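
As a rough illustration of what a word-list method can look like, here is a minimal Python sketch. It is not the code used for the maps: the word lists below are tiny hypothetical samples rather than lists built from SEAlang or Wikipedia, and the names `WORD_LISTS`, `classify`, and `min_hits` are assumptions made for this example. Each text is simply assigned to the language whose list matches the most tokens.

```python
import re

# Tiny hypothetical word lists, for illustration only. Real lists would be
# built from much larger corpora.
WORD_LISTS = {
    "english": {"the", "is", "and", "you", "this", "with"},
    "tagalog": {"ako", "hindi", "salamat", "mahal", "ngayon"},
    "cebuano": {"dili", "unsa", "kaayo", "gihapon", "karon"},
}


def classify(text, word_lists=WORD_LISTS, min_hits=1):
    """Return the language whose word list matches the most tokens.

    Returns None when no list reaches `min_hits` matches. Ties go to the
    language listed first in `word_lists`.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    hits = {
        lang: sum(1 for token in tokens if token in words)
        for lang, words in word_lists.items()
    }
    best_lang = max(hits, key=hits.get)
    return best_lang if hits[best_lang] >= min_hits else None


if __name__ == "__main__":
    for sample in ["Dili ko kaayo ganahan", "This is the place", "12345 ???"]:
        print(repr(sample), "->", classify(sample))
```

A classifier like this only sees words that appear in its lists, which is one way to understand why lists built from Wikipedia and the Bible would miss much of the informal vocabulary used in tweets.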

Languages of Tweets in the Philippines.
The resulting map shows about 125,000 tweets in English, Tagalog, Taglish (using Tagalog and English in the same tweet), and the local languages Cebuano, Ilocano, Hiligaynon, Kapampangan, Bikol, and Waray. This map offers more nuance than traditional language maps of the Philippines. For example, most maps would show Ilocano over the entire northern part of Luzon, but this map shows that the use of Ilocano is much more robust on the northwest coast than in the rest of the north. This analysis also allowed me to test a hypothesis that I frequently heard locals assert when in the Philippines - that English is more common in the south, because southerners would rather use English than Tagalog, which is seen as a northern language. I found this to be the case, and I was only able to confirm it because I had such a large sample size. Without newer datasets like those offered by social media, this hypothesis would be untestable.

To see a more in-depth description of this analysis, you can see my original blog post here.

Case Study 2: West African Toponyms

Another project I did used toponyms, or place names, from West Africa. Toponym databases like geonames.org have relatively high spatial resolution - with a name for every populated place in an area. And while a place name is not as long as a tweet or other linguistic dataset, toponyms do encode ethno-linguistic information. It would be easy for someone familiar with Europe to distinguish whether a toponym is associated with the French or German linguistic group - a French name would likely begin with “Les” and end with “-elle”, while a German name could begin with “Der” and end with “-berg”. Similar differences exist between toponyms from different ethnic groups all over the world, and are quite evident to locals. What if you could train an algorithm to detect these differences, and then had it classify every single toponym throughout a region? That is what I tried to do in this analysis.

I used toponyms for six countries in French West Africa. I decided to focus on French West Africa for several reasons. For one, I have worked there, and have some familiarity with the ethnic groups of the region and their distributions, and it is an area I am very curious about. For another thing, this is a relatively poorly documented part of the world as far as ethno-linguistic groups go, and it is an area with significant region-scale ethnic diversity. Finally, the countries I selected were colonized by one group, meaning that all of the toponyms were transliterated the same way and could be compared even across national borders. In all, I used 35,785 toponyms.

First, I got a list of every possible set of three letters (called a 3-gram) from the toponyms.   Then, I tested for spatial autocorrelation in the locations that contained each 3-gram using a Moran’s I test, and selected only those 3-grams that had significant clustering.
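
As a rough illustration of the first step, here is a minimal Python sketch of pulling 3-grams out of place names and indexing which toponyms contain each one. The function names, the choice to lowercase names and drop non-letter characters, and the three sample names are assumptions for illustration; they are not taken from the original analysis.

```python
from collections import defaultdict


def char_ngrams(name, n=3):
    """Return the set of contiguous n-letter substrings in a place name."""
    # Assumption: lowercase and keep letters only, so hyphens and spaces
    # do not create spurious n-grams.
    letters = "".join(ch for ch in name.lower() if ch.isalpha())
    return {letters[i:i + n] for i in range(len(letters) - n + 1)}


def index_by_ngram(toponyms, n=3):
    """Map each n-gram to the set of toponyms that contain it."""
    index = defaultdict(set)
    for name in toponyms:
        for gram in char_ngrams(name, n):
            index[gram].add(name)
    return index


# Illustrative sample names only, not the real dataset.
sample = ["Yaokro", "Kouassikro", "Bamako"]
index = index_by_ngram(sample)
print(sorted(index["kro"]))  # ['Kouassikro', 'Yaokro']
print(sorted(index["ama"]))  # ['Bamako']
```

With the real data, each entry of such an index would give the set of locations to feed into the spatial autocorrelation test.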

To illustrate why this was necessary, here are two examples of the spatial distribution of 3-grams. One 3-gram - “ama” - occurs roughly evenly throughout the regions in this study. The other 3-gram - “kro” - is very common in toponyms in south-east Côte d'Ivoire, and virtually nonexistent in other areas. Thus, “kro” has significant spatial autocorrelation whereas “ama” does not.

Here are all of the toponyms that contain the 3-gram “kro”:


And here are all of the toponyms that contain the 3-gram “ama”:

Thus, the 3-gram “ama” doesn’t tell us much about which ethnic group a toponym belongs to, because that 3-gram is found evenly distributed throughout West Africa - it is just noise. The 3-gram “kro”, on the other hand, carries information about which ethnic group a toponym belongs to, because it is clearly clustered in a group in Southeast Côte d'Ivoire.
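
To make the contrast between “kro” and “ama” concrete, here is a small sketch of Moran’s I computed on a 0/1 indicator (does the place name contain the 3-gram or not). It assumes binary distance-band weights, planar coordinates, and a permutation test for significance; the original analysis may have used different weights or a different test, and the eight points below are made-up toy data.

```python
import math
import random


def morans_i(values, coords, band):
    """Moran's I with binary weights: w_ij = 1 if points i and j lie within `band`."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    num = 0.0  # sum of w_ij * dev_i * dev_j
    s0 = 0.0   # sum of all weights
    for i in range(n):
        for j in range(n):
            if i != j and math.dist(coords[i], coords[j]) <= band:
                num += dev[i] * dev[j]
                s0 += 1.0
    denom = sum(d * d for d in dev)
    if s0 == 0 or denom == 0:
        return 0.0  # no neighbours or no variation: treat as uninformative
    return (n / s0) * (num / denom)


def permutation_p_value(values, coords, band, n_perm=999, seed=0):
    """One-sided pseudo p-value: share of shuffles with I >= the observed I."""
    rng = random.Random(seed)
    observed = morans_i(values, coords, band)
    shuffled = list(values)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if morans_i(shuffled, coords, band) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Toy data: eight places along a line, one unit apart.
coords = [(float(x), 0.0) for x in range(8)]
clustered = [1, 1, 1, 1, 0, 0, 0, 0]  # a "kro"-like 3-gram
scattered = [1, 1, 0, 0, 1, 0, 0, 1]  # an "ama"-like 3-gram

print(round(morans_i(clustered, coords, 1.0), 3))  # 0.714
print(round(morans_i(scattered, coords, 1.0), 3))  # -0.143
print(permutation_p_value(clustered, coords, 1.0) < 0.1)
```

The clustered indicator gets a strongly positive Moran’s I, while the scattered one sits at the value expected under no autocorrelation, -1/(n-1). Only 3-grams of the first kind would be kept as features.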

I then calculated the lexical distance between all of the toponyms based on the number of shared 3-grams that had significant spatial autocorrelation.  To add a spatial component, I also linked any two toponyms that were less than 25 kilometers apart. Thus, I had a graph where every toponym was a vertex, and undirected edges connected toponyms that had spatial or lexical affinity.  Finally, I used a fast greedy modularity-optimizing algorithm to detect communities, or clusters, in this graph.
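
The graph-building and clustering steps could be sketched roughly as follows. This is only an illustration under stated assumptions: the threshold of one shared 3-gram, the haversine distance, and the place names and coordinates are all hypothetical, and the community step is a naive greedy modularity merge that follows the same idea as a fast greedy algorithm without its efficient data structures.

```python
import math
from itertools import combinations


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def build_edges(places, informative, max_km=25.0, min_shared=1):
    """places: {name: (lat, lon)}. Link pairs that are close or share 3-grams."""
    grams = {name: {name.lower()[i:i + 3] for i in range(len(name) - 2)} & informative
             for name in places}
    edges = set()
    for u, v in combinations(sorted(places), 2):
        near = haversine_km(places[u], places[v]) < max_km
        similar = len(grams[u] & grams[v]) >= min_shared  # assumed threshold
        if near or similar:
            edges.add((u, v))
    return edges


def greedy_modularity(nodes, edges):
    """Merge communities greedily while a merge still raises modularity."""
    m = len(edges)
    comm = {v: {v} for v in nodes}   # community id -> members
    label = {v: v for v in nodes}    # vertex -> community id
    degree = {v: 0 for v in nodes}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    while True:
        between = {}                 # edge counts between community pairs
        for u, v in edges:
            cu, cv = label[u], label[v]
            if cu != cv:
                key = (min(cu, cv), max(cu, cv))
                between[key] = between.get(key, 0) + 1
        a = {c: sum(degree[v] for v in members) for c, members in comm.items()}
        best, best_gain = None, 0.0
        for (c, d), e_cd in sorted(between.items()):
            # Change in modularity if communities c and d were merged.
            gain = e_cd / m - a[c] * a[d] / (2 * m * m)
            if gain > best_gain + 1e-12:
                best, best_gain = (c, d), gain
        if best is None:
            return sorted(sorted(members) for members in comm.values())
        c, d = best
        comm[c] |= comm.pop(d)
        for v in comm[c]:
            label[v] = c


# Hypothetical names and coordinates, for illustration only.
places = {
    "Yaokro": (6.5, -4.0), "Kouassikro": (6.6, -4.1), "Konankro": (7.5, -5.0),
    "Ndioum": (16.5, -14.6), "Ndiagne": (15.3, -15.7), "Dodel": (16.6, -14.7),
}
informative = {"kro", "ndi"}  # 3-grams assumed to have passed the Moran's I filter
edges = build_edges(places, informative)
communities = greedy_modularity(list(places), edges)
print(communities)
# [['Dodel', 'Ndiagne', 'Ndioum'], ['Konankro', 'Kouassikro', 'Yaokro']]
```

In this toy example, "Dodel" joins its cluster purely through spatial proximity, while "Konankro" and "Ndiagne" join theirs purely through a shared 3-gram, which is the reason for combining both kinds of edges in one graph.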

Results
The algorithm found seven distinct communities, which definitely correspond to ethnic groups and ethnic macro-groups in West Africa.


The red cluster includes Wolof, Serer, and Fulfulde place names, which makes sense, as all of these groups are Senegambian languages. This group of languages is the primary group in Senegal and Mauritania, which my classification picked up on. It also caught the large Fulfulde presence in central Guinea, throughout an area known as the Fouta-Djallon. This cluster also has a significant presence throughout the Sahel, stretching into Burkina Faso and dotted throughout the rest of West Africa, much like the migrant Fulfulde people.

The green cluster captures most of the area where Mandé languages are spoken, including most of Mali, where the Bambara are found, as well as Eastern Guinea and Northern Côte d'Ivoire, where Malinké is found. Interestingly, most of the toponyms in Western Mali fell into the Senegambian/Fulfulde cluster, and were not in the Mandé cluster, even though there are Mandé groups like the Soninké and Khassonké in Western Mali. Southern Guinea is densely green, representing the presence of Mandé groups there, like the Kuranko. Surprisingly, much of central and southern Côte d'Ivoire also fell into the green cluster, even though there are a couple of different groups there which are not in any way related to the Mandé groups that were most represented in the green cluster. This is also true of areas in Western Burkina Faso and Eastern Mali, where there are many languages unrelated to the broader Mandé group, such as Dogon, Bobo, Minianka, and Senufo/Syempire. However, I know that Dyula, a Mandé language closely related to Bambara, is spoken as a trade language in both of these areas (Côte d'Ivoire and Western Burkina Faso). It could be that Dyula has had a long enough presence in these areas to leave an imprint on the toponyms there.

The purple group pretty clearly captured two disjoint groups that are both in the broader Mandé group - the Susu, in far Western Guinea, and the Dan, in Western Côte d'Ivoire. These groups are normally classified as being on quite separate branches of the Mandé language family, with the Susu being Northern Mandé and Dan being Eastern Mandé. However, the fact that the algorithm put them in the same group, even though they were too far apart to have edges/connections based on spatial affinity, shows that Dan and Susu toponyms have several 3-grams in common.

The yellow cluster seems to have caught two sub-groups within the broader green/Mandé cluster. Many of the yellow toponyms in central Mali are in what you could call the Bambara homeland, between Bamako and Segou. However, a second cluster stands out quite distinctly in southern Guinea. It’s unclear to me what group this could represent and why it would have toponymic features distinct enough from its neighbors that the algorithm put it in a different cluster. Some maps say that a group called the Konyanka lives here and speaks a language closely related to Malinké.

The turquoise cluster quite clearly captures the Mossi people and their toponyms, as well as the Gurunsi, a related group (both Mossi and Gurunsi are classified as Gur languages).

The black cluster in southern Burkina Faso captured a group that most national ethno-linguistic maps call the Lobi, although this part of West Africa is known for its significant ethno-linguistic heterogeneity. Another group of villages in Eastern Burkina Faso also fell into the black cluster, although I could not find any significant ethnic group there.

Finally, the blue cluster captured both the Baoulé/Akan languages as well as the Senufo. It captured the Senufo especially in Côte d'Ivoire and somewhat in Burkina Faso, but not much in Mali, where I know the Senufo have a significant presence. This could represent a Bambarization of previously Senufo toponyms due to the fact that the government of Mali is predominantly Bambara, or it could pre-date the Malian state, as this area was part of Samori Toure’s Wassoulou Empire, in which the Malinké language was strongly enforced. The classification of the Senufo languages has always been controversial, but this toponymic analysis suggests that they are more related to Kwa toponyms to the south rather than to Gur toponyms to the northeast.

Caveats

There are some caveats to this work and its interpretation. For one, this only shows toponymic affinities. Those affinities usually correspond to ethnic distributions, but not always. There is a lot of migration in West Africa today, and place names don’t usually change as quickly as the distributions of people. Thus, toponyms can sometimes encode historic ethnic distributions. For example, many toponyms in the United States come from Native American languages, and many toponym suffixes in England reflect a historic Nordic presence. For this reason, this and similar maps are most informative when interpreted in combination with on-the-ground information and knowledge.

Another issue with classifying toponyms in West Africa in particular is that West African toponyms are transcribed using the Latin alphabet, which does not capture all of the sounds that exist in West African languages. Different extensions of the Latin alphabet, as well as an indigenous alphabet, are often used to transcribe these languages, but these language-specific writing systems are not used in the geonames dataset. Thus, the Fulfulde bilabial implosive (/ɓ/ in IPA) is written the same way as a pulmonic bilabial plosive, as a “b”, so this distinction is lost in the dataset, even though it carries a lot of information about which ethnic group a given toponym belongs to. However, some other sounds and sound combinations that are very indicative of specific languages are captured by the Latin alphabet: for example, prenasalized consonants (/mb/) common in Senegambian languages, labial velars (/gb/ and /kp/) common in coastal languages, or the lack of a ‘v’ in Mandé languages. Issues also arise because different colonizers transcribed sounds differently: for example, ‘ny’ and ‘kwa’ in English would be ‘gn’ and ‘coua’ in French. This did not apply in this analysis, which only used Francophone countries, and I believe it could be dealt with in a larger analysis.

Conclusion


This is an exciting time to be at the intersection of geography and linguistics!  New datasets and computational methods are giving researchers the ability to ask newer and better questions about who belongs to what group, and where.  I hope new developments in this research can yield new linguistic results about phylogeny, migration, and the spread of linguistic phenomena.  Outside of the field of linguistics, better language maps could have broad applications, from improving disaster response planning to helping to answer critical questions about the origins of ethnic conflict.


          HR Professionals Say These Big Data Jobs are in High Demand   

Brands in every industry are utilizing big data to improve efficiency and deliver higher quality solutions. However, big data logistics can’t be entirely automated. They rely heavily on the input and expertise of data scientists. Jeanne Harris, one of the senior executives for Accenture Institute for High Performance, has stated that data analytics is an […]

The post HR Professionals Say These Big Data Jobs are in High Demand appeared first on SmartData Collective.


          (USA-FL-Tampa) Senior Statistician   
Job Description
General Dynamics Health Solutions/ ARMA has an opening supporting the DOD POTFF Program:
+ The Senior Statistician/Data Scientist II shall be proficient at entering, cleaning, and conducting advanced data manipulation and statistical analysis.
+ Operating in consultation with POTFF program staff and the government's POTFF bio-statistician, the Senior Statistician/ Data Scientist II shall provide subject matter expertise in the areas of program evaluation, research methodologies and data analysis.
+ The contractor shall provide consultation and assistance to supported units and POTFF staff to identify opportunities and methods for capturing data relating to POTFF programs and initiatives.
Education
+ Masters (MA/MS) or Doctoral (Ph.D.) in quantitative science, social science or related field
Qualifications
1. TS/SCI Clearance
2. The contractor shall possess a master's or doctoral degree in quantitative science, social science or related discipline.
3. The contractor shall be proficient with the suite of Microsoft Office programs, including Word, Excel and Access.
4. The contractor shall have an advanced proficiency with commonly used statistical software application (e.g. SPSS, SAS, and R).
5. The contractor shall possess excellent communication skills and shall be highly detail oriented and organized and have at least three (3) years of research experience in an academic, social services, government, health-care or laboratory setting.
#POTFF
GDIT is an Equal Opportunity/Affirmative Action Employer - Minorities/Females/Protected Veterans/Individuals with Disabilities
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Develops large scale Advanced Analytics Projects through business and analytical problem framing, data acquisition and creation model approach definition, model...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Designs and develops Advanced Analytics Projects through business and analytical problem framing, data acquisition and creation, model approach definition,...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Cryptocurrency Trading Algorithm using Machine Learning - Upwork   
I'm developing a machine learning trading algorithm for cryptocurrencies like Bitcoin and Ethereum. I'm looking for technical people to help in building this algorithm. It will do medium-frequency trades (not every second, possibly every minute) and use a large data set to outperform cryptocurrency indexes. kw: Trading, stocks, foreign exchange, currency.

I'm looking for applicants who can help build this and who have the required background in machine learning algorithms, as well as applicants who can analyze large data sets and have trading experience.


Posted On: July 01, 2017 05:51 UTC
Category: Data Science & Analytics > Machine Learning
Skills: Algorithms, Bitcoin, C, C#, Data Analytics, Data Science, Ethereum, Foreign Exchange Trading, Machine Learning, Predictive Analytics, Python, Stock Management
Country: United States
          Data Scientist, Data & Analytics - KPMG LLP - Toronto, ON   
Working at KPMG allows you to gain valuable on-the-job experience while building your professional network and business acumen. Being part of the KPMG Global
From KPMG LLP - Tue, 13 Jun 2017 21:52:29 GMT - View all Toronto, ON jobs
          Data Scientist - Rapidly Growing Start-Up   

          Data Scientist, Advanced Analytics - LogRhythm - Boulder, CO   
LogRhythm is a leading provider of unified security intelligence and analytics solutions that empower organizations to automate the detection, prioritization...
From LogRhythm - Fri, 24 Mar 2017 21:05:24 GMT - View all Boulder, CO jobs
          Phd Candidate - Public Administration With A Particular Focus On Data Science   
PhD Candidate - Public Administration with a particular focus on Data Science . . Posted on 29 Jun 2017 . Leiden University · Institute of Public Administration . Netherlands, Leiden . 2 Job openings at this institution . Apply for this...
          Data Scientist - integrate.ai - Toronto, ON   
Founded by Steve Irvine, former Facebook executive, we are proud to be based in Toronto, Canada at the center of an exciting AI ecosystem....
From Integrate.ai - Sat, 10 Jun 2017 01:09:24 GMT - View all Toronto, ON jobs
          Data Scientist - Data Mining, Analytic Software, Agile   
Little Rock, If you are a Data Scientist with experience, please read on! Located in Little Rock, AR we are one of the largest fashion apparel, cosmetics and home furnishings retailers that focuses on delivering quality customer service to shoppers in over 300 stores nationwide. A growing and strategic focus of our company is Advanced Analytics and Testing, so we are looking for a Data Scientist to join to our
          Data Scientist - Relocation Assistance Provided   
Little Rock, If you are a Data Scientist with experience, please read on! Located in Little Rock, AR we are one of the largest fashion apparel, cosmetics and home furnishings retailers that focuses on delivering quality customer service to shoppers in over 300 stores nationwide. A growing and strategic focus of our company is Advanced Analytics and Testing, so we are looking for a Data Scientist to join to our
          Data Scientist - Data Mining, Analytic Software, Agile   
AR-Little Rock, If you are a Data Scientist with experience, please read on! Located in Little Rock, AR we are one of the largest fashion apparel, cosmetics and home furnishings retailers that focuses on delivering quality customer service to shoppers in over 300 stores nationwide. A growing and strategic focus of our company is Advanced Analytics and Testing, so we are looking for a Data Scientist to join to our
          How to replace yourself with a very small shell script   

Data scientist Hilary Mason talks through her astoundingly useful collection of small shell scripts that automate all the choresome parts of her daily communications: processes that remind people when they owe her an email; that remind her when she accidentally drops her end of an exchange; that alert her when a likely important email arrives (freeing her up from having to check and check her email to make sure that nothing urgent is going on). It's a hilarious and enlightening talk that offers a glimpse into the kinds of functionality that users can provide for themselves when they run their own infrastructure and aren't at the mercy of giant webmail companies. (via Clive Thompson)

          Data Scientist (m/f) // PubNative   

PubNative is a global mobile publisher platform fully focused on native advertising for apps and mobile web. PubNative understands each app, its business model and what matters in terms of user experience to create non-intrusive, innovative and highly performing ad integrations. The platform’s programmatic exchange gives publishers access to 400+ demand partners worldwide. The company […]

Check out all open positions at http://BerlinStartupJobs.com


          Data Scientist - Drop - Toronto, ON   
Through our mobile app, users supercharge their debit and credit cards to automatically earn points on their every day spending at places such as Starbucks, Tim...
From Drop - Thu, 01 Jun 2017 02:38:43 GMT - View all Toronto, ON jobs
          (USA-IL-Bloomington) Data Scientist   
This job was posted by https://illinoisjoblink.illinois.gov : For more information, please see: https://illinoisjoblink.illinois.gov/ada/r/jobs/5065178 Data Scientist, Bloomington, IL: Develop, interpret, implement, & support several types of statistical modeling & data mining techniques. Analyze needs for analytic information & convert into action items, requests, or projects. Use domain knowledge of business areas & research skills to address a diverse range of real-world business issues & to solve business problems. Develop analytic development databases, strategies, & methodologies to validate model performance following implementation. Must have MS in a quantitative field & 2 yrs work experience performing statistical analysis & data mining in a business environment to solve problems & identify trends. To apply on-line, go to Statefarm.com/careers and apply to req153.
          Data Scientist - General Electric - Wisconsin   
Have an ability to work with process experts and data engineers to build data science models using mathematical, Advanced Statistics and physics based packages...
From GE Careers - Fri, 30 Jun 2017 10:24:38 GMT - View all Wisconsin jobs
          Data Scientist - McAfee - Santa Clara, CA   
Data Scientist will play a role in helping understand and optimize business performance across McAfee consumer and Mobile business segments....
From McAfee - Tue, 04 Apr 2017 10:07:16 GMT - View all Santa Clara, CA jobs
          Dataquest: Web Scraping with Python and BeautifulSoup   

To source data for data science projects, you’ll often rely on SQL and NoSQL databases, APIs, or ready-made CSV data sets.

The problem is that you can’t always find a data set on your topic, databases are not kept current and APIs are either expensive or have usage limits.

If the data you’re looking for is on a web page, however, then the solution to all these problems is web scraping.

In this tutorial we’ll learn to scrape multiple web pages with Python using BeautifulSoup and requests. We’ll then perform some simple analysis using pandas and matplotlib.

You should already have some basic understanding of HTML, a good grasp of Python’s basics, and a rough idea about what web scraping is. If you are not comfortable with these, I recommend this beginner web scraping tutorial.

Scraping data for over 2000 movies

We want to analyze the distributions of IMDB and Metacritic movie ratings to see if we find anything interesting. To do this, we’ll first scrape data for over 2000 movies.

It’s essential to identify...


          Data Scientist: the unicorn that companies need to grow and cut costs   

Everyone knows what a unicorn is, but no one has ever seen one. Almost the same could be said of the data scientist, one of the most in-demand professional profiles

The post Data Scientist: the unicorn that companies need to grow and cut costs appeared first on MuyComputerPRO.


          (USA-VA-Herndon) Systems Architect 6, GLM Mature Technology Group   
Systems Architect 6, GLM Mature Technology Group
Requisition ID: 17015265
Location(s): United States-Virginia-Herndon
US Citizenship Required for this Position: Yes
Relocation Assistance: No relocation assistance available
Travel: Yes, 25 % of the Time
Are you interested in the opportunity to work for an industry-leading company whose work with cutting-edge technology is driven by something human: the lives our technology protects? If so, Northrop Grumman may be the place for you. It’s not the systems that drive us: it’s the soldier our systems bring home. It’s not just the equipment that motivates us: it’s the people our equipment protects. It’s not the innovation that gets us up in the morning: it’s whom those innovations serve. We’re united by our work to help people and protect the world. And that mission makes our team even stronger. When you join Northrop Grumman, you’ll have the opportunity to connect with coworkers in an environment that’s uniquely caring, diverse, and respectful. Employees share experiences, insights, perspectives, and creative solutions with some of the best minds in the industry. We collaborate through integrated product teams, cross-functional teams, and employee resource groups, while thriving through the support of training and development, mentors and every day coaching, along with extensive health and work/life benefits. We’re committed to our employees’ professional and personal development and success. Northrop Grumman recruits top talent with traditional and non-traditional backgrounds in order to ensure our team is united, connected, skilled, focused and innovative. An inclusive workplace of people with diverse backgrounds, experiences, and perspectives is the key to our performance. At Northrop Grumman, we want our employees to bring their whole self to work. All your different sides are welcome here, as we believe they make our team, our products and our services, that much better.
Job Description
Northrop Grumman Technology Services is seeking a System Architect to join a mature technology evaluation group, design and develop system architectures, and define key capabilities and performance requirements. Defines design and technology maturity constraints of the system in accordance with customer specifications. Develops thorough definition of system external interfaces. Defines system implementation approach and operational concept. Ensures requirements are met and evaluates performance with customer. Monitors and evaluates technology trends in disciplines relevant to sustainment and modernization of aviation, space and missile platforms and payloads.
Basic Qualifications:
+ BS Degree in Engineering or related technical discipline and 20 years of relevant experience.
+ Systems engineering, research & development and business development experience.
+ Experience devising, bidding and/or implementing services and/or product solutions.
+ Experience with generating DoD Architecture Framework (DoDAF) views.
+ Ability to obtain and maintain a Top Secret clearance with counter intelligence polygraph.
+ Must be able to travel CONUS and OCONUS.
Preferred Qualifications:
+ MS Degree in engineering or related technical discipline and 18 years of experience.
+ PhD Degree in engineering or related technical discipline and 15 years of experience.
+ Experience within a $100M Aerospace & Defense (A&D) profit & loss (P&L) business.
+ Experience with system-of-systems modeling & simulation.
+ Experience with delivering performance based logistics (PBL) services.
+ Experience with operations research, data science and/or systems analysis.
+ Experience developing ISR sensors, processing electronics and RF technologies.
+ Experience developing applied flight technologies.
+ Experience developing airframe technologies.
+ Experience developing air vehicle survivability technologies.
+ Experience developing data analytics, cognitive and autonomy technologies.
+ Experience developing cyber resilience and embedded software support technologies.
+ Experience leading mature technology IR&D and CR&D projects (TRL >6).
Northrop Grumman is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action Employer, making decisions without regard to race, color, religion, creed, sex, sexual orientation, gender identity, marital status, national origin, age, veteran status, disability, or any other protected class. For our complete EEO/AA and Pay Transparency statement, please visit www.northropgrumman.com/EEO. U.S. Citizenship is required for most positions.
Title: Systems Architect 6, GLM Mature Technology Group
Location: Virginia-Herndon
Requisition ID: 17015265
          Senior Data Scientist - Tailored Brands - Fremont, CA   
Tailored Brands features leading menswear Men’s Wearhouse, Jos. Tailored Brands provides a personal, convenient, one-of-a-kind shopping experience with...
From Indeed - Tue, 06 Jun 2017 22:57:45 GMT - View all Fremont, CA jobs
          Data Science Training in Hyderabad   
Analytics Path is the most reputed Data Science Training Institute in Hyderabad, offering comprehensive training sessions for aspirants who want to build a career in a trending field.
          User behaviour is the key to combatting evolving cyber threats   

Derek Lin, Chief Data Scientist at Exabeam discusses the important role that user and entity behaviour analytics will play in the development of security solutions for today’s cyber threat landscape.

The post User behaviour is the key to combatting evolving cyber threats appeared first on Computer Business Review.


          Data Scientist - Wink - New York, NY   
Hands-on experience with supervised and unsupervised machine learning algorithms for regression, classification, and clustering....
From Wink - Thu, 18 May 2017 06:17:27 GMT - View all New York, NY jobs
          Beginning Data Science with R   

Beginning Data Science with R by Manas A. Pathak
English | 31 Dec. 2014 | ISBN: 3319120654 | 172 Pages | PDF | 3.86 MB
"We live in the age of data. In the last few years, the methodology of extracting insights from data or "data science" has emerged as a discipline in its own right. The R programming language has become one-stop solution for all types of data analysis. The growing popularity of R is due its statistical roots and a vast open source package library.


          Associate Data Science Business & Operations Management Director - Astellas Pharmaceuticals - Northbrook, IL   
Astellas offers an environment where our employees can make a real difference. Establishes, manages and updates the training curricula for vendor staff in line...
From Astellas Pharmaceuticals - Mon, 19 Jun 2017 20:03:12 GMT - View all Northbrook, IL jobs
          Logo, 3 columns layout, plus header and footer template design. by sushantt   
I need an HTML/CSS + SVG combo for a 3 columns + header + footer layout of a website. The site is going to focus on Data Science topics. Its name is treedepth.com. (Budget: $30 - $250 USD, Jobs: Graphic Design, Logo Design)
          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Ch. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Director, Data Scientist - KPMG - Atlanta, GA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Tue, 16 May 2017 08:29:26 GMT - View all Atlanta, GA jobs
          Director, Data Scientist - KPMG - Santa Clara, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Santa Clara, CA jobs
          Director, Data Scientist - KPMG - Irvine, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:26 GMT - View all Irvine, CA jobs
          Data Scientist - IBM - Austin, TX   
Opportunities to implement machine learning into support business processes. Business process optimization....
From IBM - Tue, 06 Jun 2017 21:03:26 GMT - View all Austin, TX jobs
          Director, Data Scientist - KPMG - Seattle, WA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Seattle, WA jobs
          (Associate) Data Scientist for Deep Learning Center of Excellence - SAP - Sankt Leon-Rot   
Build Machine Learning models to solve real problems working with real data. Software-Design and Development....
From SAP - Fri, 23 Jun 2017 08:50:58 GMT - View all Sankt Leon-Rot jobs
          Innovation Fellow - Data Scientist - Manulife Financial - Toronto, ON   
Design and build customized data scrapers. Are you looking for unlimited opportunities to develop and succeed?...
From Manulife Financial - Mon, 10 Apr 2017 22:03:25 GMT - View all Toronto, ON jobs
          Data Scientist, Decision Analytics - EXL - Jersey City, NJ   
EXL serves the insurance, healthcare, banking and financial services, utilities, travel, transportation and logistics industries....
From EXL - Thu, 25 May 2017 11:14:52 GMT - View all Jersey City, NJ jobs
          Data Scientist with NLP Experience - EXL - Jersey City, NJ   
EXL Analytics serves clients across industries including healthcare, banking, insurance, retail and logistics. EXL Analytics is developing solutions for some of...
From EXL - Thu, 04 May 2017 23:18:32 GMT - View all Jersey City, NJ jobs
          Vincent Granville posted a blog post   

          (USA-WA-Kent) Marketing Data Analyst   
Marketing Data Analyst
Posted Date: Jun-30-2017
Job ID: 7561
Job Type: Full Time
Job Function: Marketing
City: Kent
State: Washington
What's cool about this job
This position also conducts specific analytics related to customer purchasing, lifecycle, and engagement behavior. As a member of the marketing analytics team, this position will be responsible for contributing to an environment of fact-based decision making, utilizing REI’s extensive supply of customer, campaign, site, and marketing channel data. This combination of sophisticated analytics with key reporting enables the Senior Marketing Analyst to contribute to the bottom line success of REI as a co-op.
• Partner with internal stakeholders to ensure accurate reporting of marketing campaigns, including creation of detailed technical specifications containing relevant information such as versioning, testing, and back-end instructions.
• Curate data sets to enable self-service BI within other marketing groups.
• Partner with statisticians and data scientists to make fact-based decisions on marketing programs to increase profitability, retention, and engagement.
• Work closely with the Senior Statistical Analyst to give input on personalized marketing model development and improving model performance.
• Provide accurate and timely reporting of campaign performance to workgroup and cross-divisional teams.
• Perform ad-hoc analytics related to customer purchasing, engagement, and marketing attribution as needed.
• Act as subject matter expert on REI’s data warehouse. Perform system and data testing as necessary.
• Identify strategic opportunities to maximize returns in pursuit of REI’s financial and brand goals.
• Utilize SQL and other enterprise tools to execute customer segmentation strategies aimed at increasing customer acquisition, retention, and reactivation, as well as report and analyze campaign performance.
• Provide analytics support for forecasting business trends and guidance on risks to plans.
Bring your passion and expertise
• BS or MS/MBA in Analytics, Marketing, Statistics, or equivalent work experience
• 2-4 years’ experience in a CRM or analytics environment. Experience in retail or ecommerce is a plus
• Strong mathematical / analytical proficiency
• Passionate about data and discovering new trends and insights across customers, merchandise, and customer engagement
• Deep SQL experience and strong ability to comprehend data-warehouse structure; Netezza experience is a plus
• Familiarity with econometrics and other advanced statistical concepts, including application of regression-based modeling, is a plus
• Report development and Business Intelligence application experience
• Self-disciplined and motivated in a marketing environment that requires creativity and strong attention to detail
• Ability to prioritize and work multiple concurrent requests from different business teams
• Tableau desktop and server experience is highly desired
• Excellent communication skills and ability to quickly and effectively communicate complex quantitative results to internal stakeholders
• Experience with R or Python is preferred
At REI we offer an enviable work environment that has been recognized on the "100 Best Companies to Work For" list since the award's inception – 20 years in a row! Sure, we work hard, but it’s balanced with time off to play, a strategy that works for us as we continue to grow and thrive. Want to enjoy a workplace where you can be yourself, be heard and be respected while having a job that challenges you? This is the place.
With more than 140 retail locations (and growing), REI offers unique competitive benefits to its more than 12,000 employees, including healthcare, gear and apparel discounts, free equipment rentals and challenge grants to help employees reach personal outdoor goals, generous retirement plan contributions, public transit subsidy, adoption assistance, paid sabbaticals, and more. REI is an Equal Opportunity Employer.
          (USA-WA-Redmond) Software Engineer   
The PROSE research and engineering team (https://microsoft.github.io/prose/) develops APIs for program synthesis or programming-by-examples for many task domains. These APIs ship within multiple Microsoft products including Excel (Flash Fill), PowerShell (ConvertFrom-String and Convert-String cmdlets), and OMS (Custom Field feature), and are set to change the user experience in fundamental ways for many more. See this 5-minute video https://www.youtube.com/watch?v=w-k9WjRJvIY for a short demo.
Program synthesis is a new frontier in AI wherein the computer programs itself: the user provides input-output examples and the computer synthesizes an intended script. This is significant because 99% of people who own computers do not know programming. Even for programmers, this can provide a 10-100x productivity increase for many task domains, especially in the areas of data wrangling/preparation and code refactoring. The former is very useful for data scientists, who end up spending 80% of their time wrangling data.
Our team builds an SDK which enables both high-level use of APIs already created for program synthesis scenarios in a variety of areas as well as lower-level creation of program synthesis solutions for new domains. We bring cutting-edge programming language and machine learning research together with solid engineering practices to deliver high-impact solutions for teams across Microsoft. Our codebase is primarily written in C# targeting both desktop .NET and CoreCLR on Windows, Mac and Linux, but we incorporate a broad set of technologies including SQL and other data systems, web technologies, Python and R. Our SDK is used in all sorts of places, from front-end user productivity software to backend server systems, business applications, developer tools and much more.
The engineer will work closely with researchers and other engineers on the team to create robust implementations and the infrastructure to support them. This will include work on the SDK itself, creating proof-of-concept demos, collaborating with partner teams to build solutions using the SDK, and helping to extend and maintain the build and test systems the team depends on.
Qualifications:
• Bachelor's in Computer Science or a related field.
• 1-2 years of experience with software development, preferably using C#.
• Must be able to work closely with other engineers and researchers in the team.
• Experience with compiler construction a plus.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to askstaff@microsoft.com.
Development (engineering)
          (USA-WA-Bellevue) Data Scientist   
Bing’s mission: To deliver the most relevant knowledge to our customers by being more than just a search engine – Bing’s goal is to be the decision engine. Data is critical to achieving that mission. At Bing, we have an enormous wealth of data, ranging from user interaction logs to web documents, from user feedback to system performance data. The Bing Advertiser Sciences Team is hiring extremely talented, highly motivated and productive individuals with expertise in the areas of: Computer Science, Machine Learning, Econometrics, Statistics, Modeling, Simulation and Data Mining. The team develops and applies advanced techniques to turn our petabytes of data into insights, and to drive actions based on those insights. The team works closely with partners across Microsoft’s Online Services Division to enable rigorous, effective, and data-driven decision making.
Some examples of the challenges we face:
• Modeling the dynamics of the paid search market
• Understanding Advertiser value, lifecycle, opportunity and marketing objectives
• Designing and analyzing the results of large-scale online experiments
• Prototyping algorithms fundamental to managing and optimizing demand generation activities to support our search marketplace.
At Bing, we offer a strong team environment, exciting applied research challenges, and a fun place to work. The work environment empowers you to have a real impact on Microsoft’s business, our advertiser partners, and millions of end users. This role is a unique opportunity to work with a world-class, interdisciplinary group of researchers, analysts, and developers.
Job Responsibilities include:
• Develop and manage analyses and algorithms that generate actionable insights and programs to improve Bing Ads demand generation activities, including increasing both long-term revenue and relevance.
• Research and develop solutions for improving profits for Microsoft and returning value to the audience, advertisers and publishers (e.g. ecosystem health, marketplace performance measurement, advertiser health, outlier detection, etc.).
• Specific responsibilities include the following: Work with key business stakeholders to understand the underlying business needs and formulate, communicate and create buy-in for analytics approaches and solutions
• Influence stakeholders to make product/service improvements that yield customer/business value by effectively making compelling cases through story-telling, visualizations, and other influencing tools.
• Effectively communicate and translate Bing Ads business strategy and goals into discrete, manageable problems with well-defined measurable objectives and outcomes on which the Advertiser Sciences team can execute.
• Transform formulated problems into implementation plans for experiments by developing data sources, applying/creating the appropriate methods, algorithms, and tools, as well as delivering statistically valid and reliable results
• Contribute to an environment of scientific inquiry which reinforces team standards for analytic rigor that is consistent with the broader Microsoft data sciences community and strives to apply the simplest viable approach for experiments and analysis
Qualifications:
• A Bachelor’s degree in Data Science, Computer Science, Electrical Engineering, Machine Learning/AI or related fields.
• Demonstrated experience in all phases of managing data science engagements including: problem definition, solution formulation and delivering measurable impact.
• Experience with online data; experience with online-advertising data strongly preferred.
• Knowledge and experience in at least three of the following areas: machine learning, data mining, user modeling, information retrieval (interrogation of log files and very large databases), economic modeling, econometrics, game theory, statistics, data analysis, e-metrics/measurement.
• 2+ years of experience in at least three of the following areas: machine learning, data mining, user modeling, information retrieval (interrogation of log files and very large databases), economic modeling, econometrics, game theory, statistics, data analysis, or e-metrics/measurement; 4+ years are preferred.
• Experience with data analysis and statistical tools (e.g. Python, R, SAS, Matlab or SPSS).
• Solid communications skills, both verbal and written.
• Hands-on approach to data analysis and a strong focus on quality.
• Ability to work independently and collaboratively in an interdisciplinary team environment
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to askstaff@microsoft.com.
Data & applied sciences (engineering)
          (USA-WA-Redmond) PROGRAM MANAGER II   
The Universal Store Team is chartered with bringing to life a key component of the One Microsoft vision for the future – a Store to support scenarios across consumer and commercial, digital and physical, via multiple channels and storefronts. We are in the midst of seismic changes in the Microsoft commerce landscape as we create an amazing experience for customers of all types. This is your opportunity to get in early to make waves and help Microsoft leap ahead to the future!
The Store is powered by services, and our Marketplace Services team owns delivery of highly-scalable services across the Microsoft ecosystem, providing scale monetization, transaction and digital licensing capabilities across our most critical client experiences and content publishers. Marketplace Services powers commerce functionality for all digital content types (games, apps, video, music, subscriptions, etc.) across all Microsoft clients (Windows, Phone, Xbox, Web, Office). As we continue to transition toward One Microsoft and take a more holistic view of the opportunities for the Microsoft businesses and partners across all our channels (consumer and enterprise), a best-in-class marketplace is critical to our on-going success.
This role is focused on driving key scenarios across the Universal Store to help enable all the above. You’ll be the front door and playmaker to many of the most critical scenarios for Xbox, Office, physical goods, and much more! You’ll own specific features and help drive the end-to-end experiences that are necessary to grow our business.
To be successful in this role you must have the following qualifications:
• Strong and demonstrated customer and partner empathy
• Outstanding organizational and interpersonal skills demonstrated by previously working successfully across group boundaries, especially with engineering and business groups
• Proven track record of delivering complex, high-scale systems that meet evolving business requirements
• Strong written and verbal communication skills and a desire to create an open and collaborative team culture
• You must thrive in a fast-paced work environment and demonstrate an ability to quickly come up to speed with new technologies
Basic Qualifications:
• 3+ years of program management experience
• A BA or BS degree in computer science, engineering, math, physics, or business
Preferred Qualifications:
• Experience with data science or a degree in data science
We’re a high-powered team with significant impact. If you can think big, want to join a fast-moving team that is breaking new ground in Universal Store, and you meet the qualifications above, we would like to meet you!
The ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to askstaff@microsoft.com.
Program management (engineering)
          (USA-WA-Bellevue) Sr Data Engineer   
Expedia Media Solutions builds innovative media partnerships for travel advertisers, enabling them to use Expedia's network of leading travel brands and global sites. We have revolutionized the way brands reach and connect with online travel consumers, emerging as a leader in online advertising among travel and e-commerce brands. With a growing product portfolio offering a multitude of advertising and sponsorship opportunities, the Media Solutions team at Expedia has built a marketing platform for advertising partners to reach the 78 million worldwide monthly unique visitors like you that frequent Expedia Inc. sites.
You will shape the future of the Business Intelligence team as we continue to build critical aggregate datasets on our on-premises platform and work towards creating granular datasets on our developing cloud infrastructure that will serve as the foundation for business analysts and data scientists to create meaningful insights. We are seeking a Sr Data Engineer to lead the transition of our nimble team of data warehousing experts from using traditional ETL techniques to more flexible languages and tools that were created to enable advanced analytics on large datasets.
**Key Responsibilities:**
* You will optimize and automate ingestion processes for a variety of data sources such as: click stream, audience, transactional, and advertising
* You will drive data investigations across organizations and deliver resolution of technical, procedural, and/or operational issues to completion
* You will build/extend toolsets, create/maintain batch jobs, and create systems documentation
* You will guide and mentor other data engineers and influence large scale projects
**Minimum Qualifications:**
* 7+ years of hands-on experience designing and operating large data platforms
* 2+ years of experience with Java, Python or similar programming language
* 2+ years leading data warehousing and analytics projects, including using AWS technologies (Redshift, S3, EC2) and other big data technologies
* Expert in SQL and experienced with common and emerging tools in the Hadoop ecosystem (e.g. Hive, Spark, Presto, Qubole)
* Experience in implementing SDLC best practices and Agile methods
* BS in Computer Science, Mathematics, Statistics, or related field
**Preferred Qualification:**
* Background in web analytics, online advertising, e-commerce, business measurement or a comparable reporting and analytics role
Expedia is committed to creating an inclusive work environment with a diverse workforce. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. This employer participates in E-Verify. The employer will provide the Social Security Administration (SSA) and, if necessary, the Department of Homeland Security (DHS) with information from each new employee's I-9 to confirm work authorization.
* Posted Yesterday
* Full time
* R-19505
          New design for Jenkov.com   
Jenkov.com has been redesigned as part of our refocusing process in Jenkov Aps. Jenkov Aps has chosen to focus on the intersection of data science, cloud platforms, IoE and collaboration tools.
          Mathematical Analysis   
We have started a tutorial about mathematical analysis, which is a core part of data science.
          Chief Data Scientist - Predictive Science - United States   
Develop innovative approaches to linking internal systems with external data. Experience in creating and implementing machine learning algorithms and advanced...
From Predictive Science - Sat, 11 Mar 2017 08:06:17 GMT - View all United States jobs
          Senior Advisor Product Management - Threat Detection, Security - Dell - Reston, VA   
Supports the Business Line (IS Level) of strategic planning. It also requires some knowledge of Data science and machine learning technologies used for...
From Dell - Thu, 25 May 2017 05:26:48 GMT - View all Reston, VA jobs
          Senior Data Scientist, NLP - Demandbase - San Francisco, CA   
Work with customers and internal stakeholders to define hypotheses and models. Proven ability to solve problems using NLP and machine learning in an industry...
From DemandBase - Sat, 22 Apr 2017 02:02:09 GMT - View all San Francisco, CA jobs
          Machine Learning Software Engineer - Demandbase - San Francisco, CA   
Build out new applications and business solutions as part of a combined data scientist / machine learning / engineering team....
From DemandBase - Wed, 05 Apr 2017 01:04:05 GMT - View all San Francisco, CA jobs
          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Statistical Workforce Architect   
WPS - Madison, WI - Job Summary The Statistical Workforce Architect will be the Principal Statistician, Data Science & Predictive Analytics role provides...
          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Ch. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Data Scientist - integrate.ai - Toronto, ON   
Founded by Steve Irvine, former Facebook executive, we are proud to be based in Toronto, Canada at the center of an exciting AI ecosystem....
From Integrate.ai - Sat, 10 Jun 2017 01:09:24 GMT - View all Toronto, ON jobs
          Data Science Python Developer   

          Machine Learning and Predictive Analytics: Data Science – More than Standard Reporting and Self-Service BI   
Hadoop clusters, time-series forecasting, conjoint analysis… Those for whom these terms are not foreign words are probably well on their way to a job that the Harvard Business Review describes as the "Sexiest Job of the 21st Century": the data scientist. For everyone who would rather file all this under "nerd talk", we want to make the case for the concept of data science and show how the right technology leads to more precise figures and data and to better implementation in the company.
          Financial Market Data Manager/ Analyst - SQL   
New York, If you are a Data Manager with Financial Market Data experience please send me a resume and read on! We are a highly respected fundamental hedge firm with a rapidly growing Data Engineering / Data Science team. That said, we have an amazing opportunity to have a very high profile role. Your work will have a direct impact on discovery of investment signals that will generate billions of dollars. To
          Data Science Apprentice - Predictive Science - United States   
In many ways this is like you are starting your own venture as a freelance data scientist owning your own business....
From Predictive Science - Sat, 11 Mar 2017 08:05:32 GMT - View all United States jobs
          Data Science, Part One – Guiding Research on Heart Health, Cancer Care and Pediatrics   
This is the first part of a two-part series on data science, adapted from a feature from the Northwestern University News Center and edited for the Breakthroughs in Care audience. Data science is transforming ...
          Inspiring women: Dr Angie Ma   
COO at ASI Data Science, Ma’s goal of bringing artificial intelligence to the general public has impressed investors to the tune of £1.5m…

In Women entrepreneurs


          Data Scientist - Data Mining & Analysis   
4it Recruitment - Leeds - Data Scientist - Data Mining & Analysis Leeds, West Yorkshire - £50,000 basic + excellent benefits package An exciting opportunity... data, information, and analytics team, the Data Scientist will undertake data mining and analysis in order to help steer global pricing...
          Data Scientist Data Mining & Analysis   
4it Recruitment - Leeds - of Leeds city centre. Joining the data, information, and analytics team, the Data Scientist will undertake data mining and analysis in order... working in a senior, data focused, analytical role Experience of data mining, extraction, and multivariate analysis Data modelling experience...
          Data Scientist - Data Mining & Analysis   
Leeds - Data Scientist - Data Mining & Analysis Leeds, West Yorkshire - £50,000 basic + excellent benefits package An exciting opportunity... data, information, and analytics team, the Data Scientist will undertake data mining and analysis in order to help steer global pricing...
          Data Scientist - McAfee - Santa Clara, CA   
Data Scientist will play a role in helping understand and optimize business performance across McAfee consumer and Mobile business segments....
From McAfee - Tue, 04 Apr 2017 10:07:16 GMT - View all Santa Clara, CA jobs
          Principal Data Scientist   

          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Lev. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Data Scientist - Consultants 2 Go - United States   
As Data Scientists, we work with business leaders to solve clients’ business challenges and improve clients’ marketing results....
From Consultants 2 Go - Tue, 27 Jun 2017 02:58:08 GMT - View all United States jobs
          Senior Data Scientist - SiriusXM - New York, NY   
Visualization tools such as Tableau, Qlik, Adobe Analytics. 5 + years experience using machine learning and algorithms:....
From Sirius XM Radio - Tue, 27 Jun 2017 15:12:48 GMT - View all New York, NY jobs
          Group Director Data Science - Innocean - New York, NY   
Business and Client Development:. Must have implemented Adobe DMP. Excellent understanding of machine learning techniques and algorithms....
From INNOCEAN - Sat, 24 Jun 2017 00:29:04 GMT - View all New York, NY jobs
          Senior Analyst, Data Science - Prudential - Newark, NJ   
And a passion for generating business impact. Develop and maintain consultative relationships with key business stakeholders....
From Prudential - Wed, 28 Jun 2017 23:45:06 GMT - View all Newark, NJ jobs
          Senior Specialist, Data Science - Prudential - Newark, NJ   
And a passion for generating business impact. Develop and maintain consultative relationships with key business stakeholders....
From Prudential - Tue, 30 May 2017 20:29:44 GMT - View all Newark, NJ jobs
          Specialist, Data Science - Prudential - Newark, NJ   
Identify analytical solutions for business problems. And a passion for generating business impact. Develop and maintain consultative relationships with key...
From Prudential - Tue, 30 May 2017 20:29:44 GMT - View all Newark, NJ jobs
          [Translation] Building a data science portfolio: telling a story with data   
Translator's preface

This translation happens to fit in nicely with the current run of other data science tutorials on Habr. :)
This one was written by Vik Paruchuri, founder of Dataquest.io, which specializes in exactly this kind of interactive data science teaching and preparation for real work in the field. There is no exclusive know-how here, but the process from collecting the data to drawing first conclusions about it is described in great detail, which may be of interest not only to those who want to put together a data science résumé, but also to those who simply want to try their hand at practical analysis and don't know where to start.


Data science companies increasingly look at portfolios when making hiring decisions. One reason is that a portfolio is the best way to judge practical skills. The good news is that it is entirely in your hands: if you put in the effort, you can build an excellent portfolio that will impress many companies.

Read more →
          Software Sales Hunter, Data Science Solutions - EXPERIAN - Texas   
SAS, FICO, Fiserv, Oracle, SAP, IBM, Amdocs, Pegasystems, Sungard, TIBCO, etc…) is preferred. Job Description - Software Sales Hunter, Data Science Solutions ...
From Experian - Sat, 22 Apr 2017 11:04:53 GMT - View all Texas jobs
          Data Scientist - SEI - Oaks, PA   
This team is responsible for extracting value from the vast IMS data store to improve efficiency and decision-making across a broad client base....
From SEI - Fri, 30 Jun 2017 17:36:16 GMT - View all Oaks, PA jobs
          Data Scientist 6 month contract Madison, WI - See the USA!   

          Data Scientist/Engineer - Wealthsimple - Toronto, ON   
Being a Swiss Army knife. Wealthsimple is Canada's largest and fastest growing online investment manager....
From WealthSimple - Fri, 30 Jun 2017 16:26:07 GMT - View all Toronto, ON jobs
          Data Scientist - Drop - Toronto, ON   
Through our mobile app, users supercharge their debit and credit cards to automatically earn points on their everyday spending at places such as Starbucks, Tim...
From Drop - Thu, 01 Jun 2017 02:38:43 GMT - View all Toronto, ON jobs
          Data Scientist - ACI Worldwide - India   
The Data Scientist & Data Warehouse Expert will have experience in Oracle Business Intelligence Enterprise Edition and will lead initiatives to develop and...
From ACI Worldwide - Fri, 30 Jun 2017 08:47:09 GMT - View all India jobs
          Data Scientist - Machine Learning, Python, C   
New York, If you are a Data Scientist with experience, please read on! What You Need for this Position More Than 5 Years of experience and knowledge of: - Machine Learning - Python - C So, if you are a Data Scientist with experience, please apply today!
          Senior Data Scientist - Tailored Brands - Fremont, CA   
Tailored Brands features leading menswear Men’s Wearhouse, Jos. Tailored Brands provides a personal, convenient, one-of-a-kind shopping experience with...
From Indeed - Tue, 06 Jun 2017 22:57:45 GMT - View all Fremont, CA jobs
          Manager, Quantitative Analytics/Data Science - Scotiabank - Toronto, ON   
Assess the impact of GL reconciliation results on capital and critical risk reports. Join the Global Community of Scotiabankers to help customers become better...
From Scotiabank - Thu, 29 Jun 2017 18:49:25 GMT - View all Toronto, ON jobs
          Learning Salesforce Einstein   

Incorporate the power of Einstein in your Salesforce application
About This Book
• Make better predictions of your business processes using prediction and predictive modeling
• Build your own custom models by leveraging PredictionIO on the Heroku platform
• Integrate Einstein into various cloud services to predict sales, marketing leads, insights into news feeds, and more
Who This Book Is For
This book is for developers, data scientists, and Salesforce-experienced consultants who want to explore Salesforce Einstein and its current offerings. It assumes some prior experience with the Salesforce platform.
What You Will Learn
• Get introduced to AI and its role in CRM and cloud applications
• Understand how Einstein works for the sales, service, marketing, community, and commerce clouds
• Gain a deep understanding of how to use Einstein for the analytics cloud
• Build predictive apps on Heroku using PredictionIO, and work with Einstein Predictive Vision Services
• Incorporate Einstein in the IoT cloud
• Test the accuracy of Einstein through Salesforce reporting and Wave analytics
In Detail
Dreamforce 16 brought forth the latest addition to the Salesforce platform: an AI tool named Einstein. Einstein promises to provide users of all Salesforce applications with a powerful platform to help them gain deep insights into the data they work on. This book will introduce you to Einstein and help you integrate it into your respective business applications based on the Salesforce platform. We start off with an introduction to AI, then move on to look at how AI can make your CRM and apps smarter. Next, we discuss various out-of-the-box components added to the sales, service, marketing, and community clouds from Salesforce to add artificial intelligence capabilities. Further on, we teach you how to use Heroku, PredictionIO, and the force.com platform, along with Einstein, to build smarter apps. The core chapters focus on developer content and introduce PredictionIO and Salesforce Einstein Vision Services. We explore Einstein Predictive Vision Services, along with analytics cloud, the Einstein Data Discovery product, and IoT core concepts. Throughout the book, we also focus on how Einstein can be integrated into CRM and various clouds such as sales, services, marketing, and communities. By the end of the book, you will be able to embrace and leverage the power of Einstein, incorporating its functions to gain more knowledge. Salesforce developers will be introduced to the world of AI, while data scientists will gain insights into Salesforce’s various cloud offerings and how they can use Einstein’s capabilities to enhance applications.
Style and approach
This book takes a straightforward approach to explain Salesforce Einstein and all of its potential applications. Filled with examples, the book presents the facts along with seasoned advice and real-world use cases to ensure you have all the resources you need to incorporate the power of Einstein in your work.
Downloading the example code for this book
You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com . If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the code file.


          (USA-NY-Brooklyn) Project Edison Project Manager   
JPMorgan Chase & Co. (NYSE: JPM) is a leading global financial services firm with assets of $2.5 trillion and operations in more than 60 countries. The firm is a leader in investment banking, financial services for consumers, small business and commercial banking, financial transaction processing, asset management, and private equity.

Senior, hands-on development manager who is passionate about technology and has experience developing high performance transaction or reporting systems with large databases and multi-tier architectures; knows how to build and motivate a talented and committed technology team.

The firm is on a journey to transform the way we deliver technology, and to drive business value from our data. As such, the Corporate and Investment Bank (CIB) is looking for an experienced Project Manager for our Edison program. Edison is a scalable decision support system that utilizes Big Data technology, micro-service oriented architecture, and “inner sourcing” development model. It will store master reference data, support reporting, BI, deep learning, and data science and provide data governance.

+ Develop project plans and execution approaches for a number of initiatives that are defined by client requirements and specifications.
+ Work with other cross-functional managers to select a core team for each project, present a plan and gain approval from the business and IT leadership.
+ Monitor ongoing projects to evaluate progress and take action if any issues arise.
+ Work with QA Manager and testing team to ensure thorough testing of modules/applications/interfaces.
+ See projects to completion within terms of SLA and supervise accurate turnover to Operations staff.
+ Oversee performance management of assigned team, giving feedback, writing appraisals and approving development plans with senior team members.

**What Education/Experience do I Need?**

+ You have at least 5 years of combined business, project management, team leadership and IT experience.
+ You should have a college degree or equivalent specialized training also.
+ Must possess experience managing geographically distributed and culturally diverse workgroups and demonstrate solid team management, leadership and coaching skills.
+ Excellent communication, analytical and writing skills; can handle immense amounts of work and tight deadlines; able to develop strong client relationships; confident when working with professional services firms.

**Other Requirements**

CMM and Six Sigma methodologies and standards required along with Microsoft Project, Project Workbench and PMI required.

**Competitive Rewards**

Competitive pay; excellent benefits; a performance-based incentive program; opportunity for company advancement.

**Work Environment**

Dynamic, no-nonsense business center.

+ College degree, specialized training or equivalent work experience
+ Minimum five years of combined business, project management, team leadership and IT experience required
+ Experience with projects in multiple technologies, functions (e.g. transaction management, risk management etc.) and industries
+ Knowledge of CMM and Six Sigma Methodologies and Standards
+ Knowledge and experience using project management software such as Microsoft Project, Project Workbench, PMI, etc.
+ Experience managing geographically distributed and culturally diverse work-groups with strong team management, leadership and coaching skills
+ Project Management Certification a plus
+ Knowledge of outsourcing methodologies and operating models, and working with professional services firms
+ Excellent written and verbal communication skills
+ Ability to develop strong client relationships

JPMorgan Chase offers an exceptional benefits program and a highly competitive compensation package. JPMorgan Chase is an Equal Opportunity and Affirmative Action Employer, M/F/D/V. JPMorgan Chase is an equal opportunity and affirmative action employer Disability/Veteran.
          (USA-WA-Seattle) Sr Financial Analyst - Consumer Marketing Analytics   
Amazon Consumer Marketing Finance seeks a Sr Financial Analyst to support the Consumer Marketing Analytics (CMA) team, which is responsible for generating customer understanding and insights, Customer Targeting applications, and Marketing Attribution. Products that CMA develop are built from the ground up, cutting edge technologies that fuse Big Data with concepts from Machine Learning, Economics and other fields like Data Science. These innovations help make strategic investment decisions and define the customer engagement metrics by which Amazon runs its business globally. This role will influence the metrics that make customer messaging more personal.

The Sr. Financial Analyst would partner with the CMA business/tech teams. S/he has a passion for learning about complex econometric models and communicating to a non-technical audience in a crisp manner. The successful candidate will have strong analytic and communication skills, and have a passion for using data to drive business decisions. We are looking for a self-starter that attacks complex business problems with curiosity and dives deep to identify root causes. This individual thrives by providing data-driven decision support and business intelligence that is timely, accurate, and actionable.

+ Bachelor’s degree in finance, accounting, business, economics, or related field
+ 5+ years of relevant Finance experience
+ Master’s degree in finance, accounting, business, economics, or related field
+ Advanced knowledge of Excel
+ Strong analytical, financial modeling and reporting skills
+ Persuasive oral/written communication skills
+ Knowledge of SQL is desirable
+ Must be comfortable working in cross-functional teams
+ Ability to demonstrate strong leadership
+ Excellent communication and persuasion skills

Amazon is an Equal Opportunity-Affirmative Action Employer - Female/Minority/Disability/Veteran/Gender Identity/Sexual Orientation
AMZR Req ID: 550570
External Company URL: www.amazon.com
          Data Scientist - Data Scientist, Hadoop, Python   
Lexington Park, If you are a Data Scientist with experience, please read on! Top Reasons to Work with Us 1. An awesome opportunity to work with Big Data & be part of high profile aviation projects for government & defense 2. We are a privately held company, and we continue to grow our Data Engineering team well into 2017 What You Will Be Doing Our Engineering team is developing multiple products for the marketplace. Th
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
The Senior Data Scientist provides leadership in implementation of advanced analytics models and solutions to yield predictive and prescriptive insights from...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          We Do Community: Machine Learning at SacTown Women in Data Science Group Draws Crowd   

Tuesday night I had the opportunity to visit Hacker Lab, where the Sacramento Women in Data Science hosted Dr. Ian Brooks, one of our super-talented Solutions Engineers. SWDS has over 500 members and regularly hosts data science events to help grow the data science and analytics community for, but not exclusive to, women. Groups such as […]

The post We Do Community: Machine Learning at SacTown Women in Data Science Group Draws Crowd appeared first on Hortonworks.


          Principal Data Scientist / Machine Learning Engineer - Intel - San Diego, CA   
GraphLab, Spark, Hadoop) Inside this Business Group. Cross functional support with teams in various business groups....
From Intel - Sat, 17 Jun 2017 10:23:08 GMT - View all San Diego, CA jobs
          Director, Data Scientist - KPMG - Atlanta, GA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Tue, 16 May 2017 08:29:26 GMT - View all Atlanta, GA jobs
          Director, Data Scientist - KPMG - Santa Clara, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Santa Clara, CA jobs
          Director, Data Scientist - KPMG - Irvine, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:26 GMT - View all Irvine, CA jobs
          Data Scientist - IBM - Austin, TX   
Opportunities to implement machine learning into support business processes. Business process optimization....
From IBM - Tue, 06 Jun 2017 21:03:26 GMT - View all Austin, TX jobs
          Director, Data Scientist - KPMG - Seattle, WA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Seattle, WA jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Develops large scale Advanced Analytics Projects through business and analytical problem framing, data acquisition and creation model approach definition, model...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Designs and develops Advanced Analytics Projects through business and analytical problem framing, data acquisition and creation, model approach definition,...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Data Scientist - Health insurance domain   

          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Chief Data Scientist - Predictive Science - United States   
Develop innovative approaches to linking internal systems with external data. Experience in creating and implementing machine learning algorithms and advanced...
From Predictive Science - Sat, 11 Mar 2017 08:06:17 GMT - View all United States jobs
          Senior Advisor Product Management - Threat Detection, Security - Dell - Reston, VA   
Supports the Business Line (IS Level) of strategic planning. It also requires some knowledge of Data science and machine learning technologies used for...
From Dell - Thu, 25 May 2017 05:26:48 GMT - View all Reston, VA jobs
          Senior Data Scientist, NLP - Demandbase - San Francisco, CA   
Work with customers and internal stakeholders to define hypotheses and models. Proven ability to solve problems using NLP and machine learning in an industry...
From DemandBase - Sat, 22 Apr 2017 02:02:09 GMT - View all San Francisco, CA jobs
          Machine Learning Software Engineer - Demandbase - San Francisco, CA   
Build out new applications and business solutions as part of a combined data scientist / machine learning / engineering team....
From DemandBase - Wed, 05 Apr 2017 01:04:05 GMT - View all San Francisco, CA jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Our ability to organize and design the wealth of data we receive each day provides the foundation which enables many of UPS’ core processes....
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Our ability to organize and design the wealth of data we receive each day provides the foundation which enables many of UPS’ core processes....
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Senior Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Lead Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          AMD Launches the World’s Fastest Graphics Card for Machine Learning Development and Advanced Visualization Workloads, Radeon Vega Frontier Edition, Available Now   

–  Radeon Vega Frontier Edition fuels pioneers with the power to pursue new frontiers in AI, advanced game design and photorealistic visualization –


Singapore – June 29, 2017 AMD (NASDAQ: AMD) today unleashed the first product based on its highly anticipated “Vega” graphics processing unit (GPU) architecture: Radeon™ Vega Frontier Edition. Radeon Vega Frontier Edition is the world’s first graphics card designed to empower the next generation of data scientists, game designers and visualization professionals, with up to 172 percent faster rendering performance than the comparable competitor card¹. Through its disruptive High Bandwidth Cache Controller, the cornerstone of the world’s most advanced GPU memory architecture – HBM2 – Radeon Vega Frontier Edition expands the capacity of traditional GPU memory to 256TB, allowing users to tackle massive datasets with ease, and scored up to 33 percent faster than the competition in the DeepBench benchmark that measures the performance of basic operations involved in training deep neural networks².

“We’re dedicating Radeon Vega Frontier Edition to all the visionaries and trailblazers who embrace new technologies to propel their industries forward to help solve mankind’s greatest problems,” said Ogi Brkic, senior director and general manager, Radeon Pro business, Radeon Technologies Group, AMD. “With this powerful solution, we’ve brought the full weight of our new ‘Vega’ GPU architecture to bear, offering unmatched³ performance in the most demanding design, rendering, and machine intelligence workloads so that the world’s top creators, data scientists and game developers can reach new frontiers in their fields.”

Radeon Vega Frontier Edition Board Design

“AMD did a stunning job on the industrial design of the Radeon Vega Frontier Edition. The blue-anodized brushed aluminum shroud and lit Radeon inlays are downright elegant,” said Kelt Reeves, president of Falcon Northwest. “The high-airflow I/O bracket and vented anodized backplate are a beautifully executed example of how form can follow function and still make for a beautiful product.”

Unmatched² Performance and TCO in Machine Learning Applications


Together with AMD’s open-source, fully scalable ROCm software platform, Radeon Vega Frontier Edition paves the way for pioneers to continue pushing boundaries in fields like artificial intelligence (AI). Developers can now use the power of the “Vega” architecture for machine learning algorithm development on the Radeon Vega Frontier Edition faster than with any other GPU on the market², before deploying it out to massive servers equipped with Radeon Instinct accelerators. This powerful new solution also delivers a disruptive performance per dollar equation, solidifying AMD’s leadership in compute total cost of ownership (TCO).

Advanced Photorealistic Rendering Performance

Radeon Vega Frontier Edition delivers the horsepower required for design and manufacturing firms to drive increasingly large and complex models and to deploy real-time visualization and physically-based rendering. The Radeon Vega Frontier Edition’s revolutionary memory engine also allows professionals to achieve photorealistic detail in computer-generated imagery. A visualization powerhouse, the Radeon Vega Frontier Edition GPU offers exceptional multi-GPU scaling, with 91 percent faster rendering using two Radeon Vega Frontier Edition GPUs⁴.

Accelerating Game Design and Immersive Workflows

The Radeon Vega Frontier Edition graphics card simplifies and accelerates game creation by providing a single GPU that is optimized for every stage of a game developer’s workflow. This includes everything from asset production to playtesting and performance optimization. With the Radeon Pro Settings user interface, users can seamlessly switch between “Radeon Pro Mode” and “Gaming Mode” to alternate between development on animation applications like Autodesk® Maya and performance optimizations with free, open source tools available through AMD’s GPUOpen initiative.

The compute power in Radeon Vega Frontier Edition and its support for an open software ecosystem also give a new breed of developers and filmmakers the ability to break new ground in virtual reality (VR) and 360-degree video content. AMD’s fastest Radeon VR Ready Creator graphics card ever, Radeon Vega Frontier Edition achieves the maximum possible score in the SteamVR benchmark, up to 21 percent higher than the multi-GPU Radeon™ Pro Duo solution⁵. Combined with Radeon™ Loom, AMD’s revolutionary 360-degree video stitching technology, creators can stitch high-resolution video in real time.

Radeon Vega Frontier Edition Availability

Radeon Vega Frontier Edition graphics cards are available from etailers in select regions today with an SEP of $999 USD for the air-cooled edition. The water-cooled edition is expected to launch in Q3 with an SEP of $1499.

Supporting Resources

       Learn more about Radeon Vega Frontier Edition at Pro.Radeon.com/Frontier


       Learn more about ROCm

       Become a fan of AMD on Facebook

       Follow Radeon Graphics on Twitter @Radeon


About AMD


For more than 45 years AMD has driven innovation in high-performance computing, graphics, and visualization technologies ― the building blocks for gaming, immersive platforms, and the datacenter. Hundreds of millions of consumers, leading Fortune 500 businesses, and cutting-edge scientific research facilities around the world rely on AMD technology daily to improve how they live, work, and play. AMD employees around the world are focused on building great products that push the boundaries of what is possible. For more information about how AMD is enabling today and inspiring tomorrow, visit the AMD (NASDAQ: AMD) website, blog, Facebook and Twitter pages.



Cautionary Statement

This press release contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including the features, functionality, availability, timing and expected benefits of Radeon™ Vega Frontier Edition products, which are made pursuant to the Safe Harbor provisions of the Private Securities Litigation Reform Act of 1995. Forward-looking statements are commonly identified by words such as "would," "intends," "believes," "expects," "may," "will," "should," "seeks," "intends," "plans," "pro forma," "estimates," "anticipates," or the negative of these words and phrases, other variations of these words and phrases or comparable terminology. Investors are cautioned that the forward-looking statements in this document are based on current beliefs, assumptions and expectations, speak only as of the date of this document and involve risks and uncertainties that could cause actual results to differ materially from current expectations. Such statements are subject to certain known and unknown risks and uncertainties, many of which are difficult to predict and generally beyond AMD's control, that could cause actual results and other future events to differ materially from those expressed in, or implied or projected by, the forward-looking information and statements. Material factors that could cause actual results to differ materially from current expectations include, without limitation, the following: Intel Corporation’s dominance of the microprocessor market and its aggressive business practices may limit AMD’s ability to compete effectively; AMD has a wafer supply agreement with GF with obligations to purchase all of its microprocessor and APU product requirements, and a certain portion of its GPU product requirements, from GLOBALFOUNDRIES Inc. (GF) with limited exceptions. 
If GF is not able to satisfy AMD’s manufacturing requirements, its business could be adversely impacted; AMD relies on third parties to manufacture its products, and if they are unable to do so on a timely basis in sufficient quantities and using competitive technologies, AMD’s business could be materially adversely affected; failure to achieve expected manufacturing yields for AMD’s products could negatively impact its financial results; the success of AMD’s business is dependent upon its ability to introduce products on a timely basis with features and performance levels that provide value to its customers while supporting and coinciding with significant industry transitions; if AMD cannot generate sufficient revenue and operating cash flow or obtain external financing, it may face a cash shortfall and be unable to make all of its planned investments in research and development or other strategic investments; the loss of a significant customer may have a material adverse effect on AMD; AMD’s receipt of revenue from its semi-custom SoC products is dependent upon its technology being designed into third-party products and the success of those products; global economic uncertainty may adversely impact AMD’s business and operating results; the markets in which AMD’s products are sold are highly competitive; AMD may not be able to generate sufficient cash to service its debt obligations or meet its working capital requirements; AMD has a large amount of indebtedness which could adversely affect its financial position and prevent it from implementing its strategy or fulfilling its contractual obligations; the agreements governing AMD’s notes and the Secured Revolving Line of Credit impose restrictions on AMD that may adversely affect its ability to operate its business; AMD's issuance to West Coast Hitech L.P. 
(WCH) of warrants to purchase 75 million shares of its common stock, if and when exercised, will dilute the ownership interests of its existing stockholders, and the conversion of the 2.125% Convertible Senior Notes due 2026 may dilute the ownership interest of its existing stockholders, or may otherwise depress the price of its common stock; uncertainties involving the ordering and shipment of AMD’s products could materially adversely affect it; the demand for AMD’s products depends in part on the market conditions in the industries into which they are sold. Fluctuations in demand for AMD’s products or a market decline in any of these industries could have a material adverse effect on its results of operations; AMD’s ability to design and introduce new products in a timely manner is dependent upon third-party intellectual property; AMD depends on third-party companies for the design, manufacture and supply of motherboards, software and other computer platform components to support its business; if AMD loses Microsoft Corporation’s support for its products or other software vendors do not design and develop software to run on AMD’s products, its ability to sell its products could be materially adversely affected; and AMD’s reliance on third-party distributors and AIB partners subjects it to certain risks. Investors are urged to review in detail the risks and uncertainties in AMD's Securities and Exchange Commission filings, including but not limited to AMD's Quarterly Report on Form 10-Q for the quarter ended April 1, 2017.




1 Radeon™ Vega Frontier Edition delivers up to 172% faster performance in Maya 2017 GPGPU tests than NVIDIA GeForce Titan Xp. Testing conducted by AMD Performance Labs as of May 12th, 2017 on a test system comprising of Intel E5-1650 v3 @ 3.50 GHz, 16GB DDR4 physical memory, Windows 10 Enterprise 64-bit, Radeon™ Vega Frontier Edition / NVIDIA GeForce Titan Xp, AMD graphics driver 17.20/NVIDIA graphics driver 382.05 and Samsung 850 PRO 512G SSD.



Benchmark Application: AMD Internal Benchmark for Autodesk Maya 2017. Radeon™ Vega Frontier Edition score: 10.38. NVIDIA GeForce Titan Xp score: 3.81. Performance Differential: (10.38-3.81)/3.81 = ~172.44% faster performance on Radeon™ Vega Frontier Edition. PC manufacturers may vary configurations, yielding different results. Performance may vary based on use of latest drivers. RPVG-008.

2 Testing conducted by AMD Performance Labs as of May 15th 2017 with the Radeon™ Vega Frontier Edition graphics card, Intel® Xeon E5 2640v4 2.4Ghz 10C/20T, Dual Socket, 32GB per socket, 64GB Total, Ubuntu 16.04 LTS, ROCm 1.5, and OpenCL™ 1.2. The Nvidia Tesla P100, was tested on a system comprising of Intel® Xeon E5 2640v4 2.4Ghz 10C/20T, Dual Socket, 32GB per socket, 64GB Total, Ubuntu 16.04 LTS with CuDNN 5.1, Driver 375.39 and Cuda version 8.0.61. When using the DeepBench Benchmark, Radeon™ Vega Frontier Edition completed in 88.7 ms and the Nvidia Tesla P100 completed in 133.1 ms. PC manufacturers may vary configurations, yielding different results. Performance may vary based on use of latest drivers. VG-9.


3 Testing conducted by AMD Performance Labs as of May 12th, 2017 on a test system comprising of Intel E5-1650 v3 @ 3.50 GHz, 16GB DDR4 physical memory, Windows 7 Professional 64-bit, Radeon™ RX Vega Frontier Edition / NVIDIA Geforce TitanXp, AMD graphics driver 17.20/NVIDIA graphics driver 382.05 and LITEON 512GB SSD.

Benchmark Application: SPECViewperf 12.1 catia-04 viewset, Radeon™ Vega Frontier Edition score: 135.78 and NVIDIA GeForce Titan Xp score: 107.29 for ~26.55% faster performance on Radeon™ Vega Frontier Edition;
Benchmark Application: SPECViewperf 12.1 creo-01 viewset, Radeon™ Vega Frontier Edition score: 83.94 and NVIDIA GeForce Titan Xp score: 65.20 for ~28.74% faster performance on Radeon™ Vega Frontier Edition;
Benchmark Application: SPECViewperf 12.1 sw-03 viewset, Radeon™ Vega Frontier Edition score: 114.88 and NVIDIA GeForce Titan Xp score: 67.75 for ~69.56% faster performance on Radeon™ Vega Frontier Edition.

Benchmark Application: SPECapc Siemens NX 10, Radeon™ Vega Frontier Edition score: 4.08 and NVIDIA GeForce Titan Xp score: 2.93 for ~39.25% faster performance on Radeon™ Vega Frontier Edition.

Benchmark Application: Cinebench, Radeon™ Vega Frontier Edition FPS: 183.28 and NVIDIA GeForce Titan Xp FPS: 169.72 for ~7.99% faster performance on Radeon™ Vega Frontier Edition. Scores are estimates based on AMD internal lab measurements/modelling and may vary. SPEC® and the benchmarks named SPECviewperf® and SPECapc are registered trademarks or service marks of the Standard Performance Evaluation Corporation. For more information about SPECviewperf or SPECapc, see www.spec.org. PC manufacturers may vary configurations, yielding different results. Performance may vary based on use of latest drivers. RVFE-001.
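
The score-based differentials quoted in these footnotes follow the form (score - baseline) / baseline, as spelled out for the Maya test. A minimal Python sketch, using only the scores printed above, reproduces them. The function name `percent_faster` and the `SCORES` table are illustrative names chosen here, not part of AMD's published methodology.

```python
def percent_faster(score, baseline):
    """Relative gain of score over baseline, in percent (higher scores are better)."""
    return (score - baseline) / baseline * 100.0


# (Radeon Vega Frontier Edition, GeForce Titan Xp) pairs as printed in the footnotes.
SCORES = {
    "Maya 2017 (AMD internal benchmark)": (10.38, 3.81),
    "SPECviewperf 12.1 catia-04": (135.78, 107.29),
    "SPECviewperf 12.1 creo-01": (83.94, 65.20),
    "SPECviewperf 12.1 sw-03": (114.88, 67.75),
    "SPECapc Siemens NX 10": (4.08, 2.93),
    "Cinebench (FPS)": (183.28, 169.72),
}

for name, (vega, titan) in SCORES.items():
    print(f"{name}: ~{percent_faster(vega, titan):.2f}% faster")
```

Each printed value can be compared with the percentage stated next to the corresponding benchmark.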


Testing conducted by AMD Performance Labs as of May 15th 2017 with the Radeon™ Vega Frontier Edition graphics card, Intel® Xeon E5 2640v4 2.4Ghz 10C/20T, Dual Socket, 32GB per socket, 64GB Total, Ubuntu 16.04 LTS, ROCm 1.5, and OpenCL™ 1.2. The Nvidia Tesla P100, was tested on a system comprising of Intel® Xeon E5 2640v4 2.4Ghz 10C/20T, Dual Socket, 32GB per socket, 64GB Total, Ubuntu 16.04 LTS with CuDNN 5.1, Driver 375.39 and Cuda version 8.0.61. When using the DeepBench Benchmark, Radeon™ Vega Frontier Edition completed in 88.7 ms and the Nvidia Tesla P100 completed in 133.1 ms. PC manufacturers may vary configurations, yielding different results. Performance may vary based on use of latest drivers. VG-9.

4 2x Radeon™ Vega Frontier Edition is up to 91% faster rendering than 1x Radeon™ Vega Frontier Edition when using Maya with the Radeon™ ProRender plug-in. Testing conducted by AMD Performance Labs as of May 26th, 2017 on a test system comprising Ryzen™ 7 1800X @ 3.60 GHz, 32GB DDR4 physical memory, Windows 10 Enterprise 64-bit, Radeon™ Vega Frontier Edition, AMD graphics driver 17.20 and Samsung 850 PRO 512GB SSD.

Benchmark Application: Maya Radeon ProRender plug-in GPU rendering option. Measurement: Render time for the Helmet scene with 8x AA, HD720 output and 100 pass limit. 2 x Radeon™ Vega Frontier Edition render time (seconds): 135. Single Radeon™ Vega Frontier Edition render time (seconds): 258. Performance differential: (258-135)/135 = ~91.11% faster rendering on 2 x Radeon™ Vega Frontier Edition. Scores are estimates based on AMD internal lab measurements/modelling and may vary. PC manufacturers may vary configurations, yielding different results. Performance may vary based on use of latest drivers. RPSW-002.
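
The "faster performance" percentages in these footnotes can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function name `percent_faster` and its `lower_is_better` flag are hypothetical, and the formula for render times simply mirrors the "(258-135)/135" differential quoted above.

```python
def percent_faster(baseline: float, improved: float, lower_is_better: bool = False) -> float:
    """Return how much faster `improved` is than `baseline`, in percent.

    For scores (higher is better): (improved - baseline) / baseline.
    For times (lower is better):   (baseline - improved) / improved,
    which is the form used in the render-time footnote.
    """
    if lower_is_better:
        return (baseline - improved) / improved * 100.0
    return (improved - baseline) / baseline * 100.0


# Render times from the footnote: 258 s on one card, 135 s on two.
render_gain = percent_faster(258, 135, lower_is_better=True)

# SPECviewperf catia-04 scores from the footnote: 107.29 vs 135.78.
catia_gain = percent_faster(107.29, 135.78)

print(f"render: ~{render_gain:.2f}%  catia-04: ~{catia_gain:.2f}%")
```

Both results round to the figures quoted in the footnotes (about 91.11% and 26.55%).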

5 Testing conducted by AMD Performance Labs as of May 24th, 2017 on a test system comprising Intel E5-1650 v3 @ 3.50 GHz, 16GB DDR4 physical memory, Windows 7 Professional 64-bit, Radeon™ Vega Frontier Edition / Radeon™ Pro Duo (Polaris) / Radeon™ Pro WX 7100, AMD graphics driver 17.20 and LITEON 512GB SSD.

Benchmark Application: SteamVR Performance Test/VRMark. Radeon™ Vega Frontier Edition SteamVR Performance Test Score: 11. Radeon™ Pro Duo (Polaris) SteamVR Performance Test Score: 9.1. Radeon™ Pro WX 7100 SteamVR Performance Test Score: 6.4. Radeon™ Vega Frontier Edition VRMark–Orange Room Score: 8157. Radeon™ Pro Duo (Polaris) VRMark–Orange Room Score: 6596. Radeon™ Pro WX 7100 VRMark–Orange Room Score: 6588. Scores are estimates based on AMD internal lab measurements/modelling and may vary. PC manufacturers may vary configurations, yielding different results. Performance may vary based on use of latest drivers. RPVG-009.



          Data Scientist - SME level - Praxis Engineering - Reston, VA   
No red tape. Ability to identify and/or develop business opportunities. Thorough knowledge of Agency business operations and associated processes....
From Praxis Engineering - Fri, 09 Jun 2017 07:13:27 GMT - View all Reston, VA jobs
          Data Scientist - Praxis Engineering - Reston, VA   
No red tape. Utilizing complex mathematical, statistical or other data-driven problem solving analysis to identify significant intelligence issues or trends in...
From Praxis Engineering - Fri, 09 Jun 2017 07:13:27 GMT - View all Reston, VA jobs
          Data Scientist - Mid - Praxis Engineering - Reston, VA   
No red tape. Conducting mathematical, statistical or other data-driven problem solving analysis to identify business operations or intelligence questions by...
From Praxis Engineering - Fri, 09 Jun 2017 07:13:26 GMT - View all Reston, VA jobs
          Sr Data Scientist - Praxis Engineering - Reston, VA   
No red tape. Ability to identify and/or develop business opportunities. Utilizing complex mathematical, statistical or other data-driven problem solving...
From Praxis Engineering - Fri, 09 Jun 2017 07:12:41 GMT - View all Reston, VA jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Senior Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Lead Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Principal Data Scientist - Microsoft - Redmond, WA   
We are a horizontal team and need each DS to lead and drive their projects. The Microsoft Support Engineering Group (MSEG) is part of Universal Store Team (UST)...
From Microsoft - Sat, 13 May 2017 05:34:59 GMT - View all Redmond, WA jobs
          Belarus and Russia sign contracts worth $450 million at the IV Forum of Regions   

The agreements are global in nature and expand cooperative ties between the two countries.

Ilona Krasutskaya reports.

While the world kept a close eye on the weather forecasts in Russia, the Expocentre in Moscow City was the most cloudless place around. Here, indeed, everything is perfectly clear. The IV Forum of Regions. Non-stop negotiations. 600 top managers from Belarus and Russia found common ground with senators, ministers, and governors. Contracts worth hundreds of millions of dollars. For the first time, agreements were signed between councils of deputies. The congress was initiated by the upper house of the Belarusian parliament and Russia's Federation Council. The chief senators were in great demand today. Journalists were interested in the first results and in new projects within the framework of Union State integration.

Over the past year, Russian direct investment in our country grew by 8% to $3.5 billion. Today 210 organizations with Belarusian capital operate in Russia. The countries are bound by 70 intergovernmental agreements. Undoubtedly, a great deal of work stands behind this, and the Forum of Regions deserves part of the credit. For four years now it has been a negotiating platform for potential partners from the two countries: they can discuss cooperation and conclude profitable deals right on the spot.

Naturally, it is too early to sum up the final results of the forum. Many businesspeople are superstitious; money, after all, loves silence. But here are some figures. The Belneftekhim concern has a contract worth $100 million. Our footwear holding has one for the same amount. Another 5 million went to the Baranovichi garment factory. In all, our country concluded dozens of such commercial export contracts at the Moscow forum.

The political dividends of this congress proved no less significant. Presidents Alexander Lukashenko and Vladimir Putin discussed integration and cooperation with the forum participants. Their meetings at the Forums of Regions have already become a good tradition.

The main theme of the IV Forum is cooperation in high technology. Indeed, over recent decades Belarus has earned a reputation in IT.

EPAM, Wargaming, Viber, Masquerade. Our IT specialists have made it into the top 100 outsourcers in the world. This week Alexander Lukashenko announced radical measures to develop the IT industry: a breakthrough decree that significantly changes the conditions under which IT companies operate.

In short, a green light for new projects of the Union IT state.

In Moscow, our IT specialists showed new projects in data science. Put simply, these people use mathematical calculations to predict accurately whether any company's business plan will succeed or fail.

The Belarusian head of state is proposing concrete steps to shape not only a common industrial policy but also a common science policy in the Union State.

For our country the issue is a priority, all the more so in the Year of Science. Some 26,000 people and more than 400 organizations are engaged in research in Belarus. The average age of a scientist in Belarus is 46. It is time to make the statistics younger.

For the first time, young stars of Belarusian science are full participants in the Forum of Regions, demonstrating their work as part of the new project "100 Ideas for the Union State". One of the authors is Maxim Kiryanov, a third-year student from Gomel. The young developer devised a training device for rehabilitation after a stroke. He combined a rubber glove with a very simple computer and a battery. The device puts the fingers through exercises. It costs a fraction of what foreign equivalents do.

Today 80 constituent entities of Russia maintain direct ties with Belarus. Since the beginning of the year, about a dozen regional delegations from the neighboring state have visited our country. According to Vladimir Putin, a solid foundation for cooperation has been built. Both states intend to continue along the path of developing a comprehensive partnership.

Cooperation, integration, innovation, import substitution. And that is far from a complete list of the topics discussed in Moscow today. While the heads of state talked with forum guests, active negotiations continued at the stands of the specialized exhibition. For example, this Polesie corn harvester will go straight from Moscow to the Kuban. About 60% of its components are Russian-made, while the labor and technology are Belarusian. After the Forum of Regions, cooperation will become even closer.

Two days of the forum. Hundreds of enterprises and developments, individual and joint. Fields ranging from mechanical engineering and light industry to space research. All the regions of Belarus and 30 regions of Russia. Covering thousands of kilometers, some come to the forum for the first time, others for the fourth. The mutual intention to deepen cooperation is evident from the packed visit programs and the return invitations, which we invariably accept. After all, regardless of geographic distance, all Russian regions are close to us. In effect, preparations for the next meeting begin tomorrow. The next Forum of Regions will be hosted by Belarus.


          Comment on Discovering My Inner Data Scientist And Other Key Revelations by The Journey To Modern Services Marketing Starts Here   
[…] data science: The geeks won; it’s time to finally embrace advanced analytics. Business intelligence is seeing unprecedented […]
          PlayerUnknown's Battlegrounds Has Used the Banhammer 25,000 Times in 3 Months   

Cheating in online games is a foregone conclusion these days as players try to beat the system for better ranks and gear. In turn, devs continue to ban cheaters and update the game's anti-cheat measures. PlayerUnknown's Battlegrounds, which has already sold four million units and made $100 million, has faced the same issue, as it has already banned 25,000 cheaters in the last three months. 

"This is an ongoing battle, but one we are committed to fighting," developer Bluehole Studios said in a blog post on Steam. "[We] work daily with BattlEye to add new protections and detections for cheats appearing on the market."

The dev team also has continued to improve server performance for the game, and says it will continue to monitor progress. "Our data science team is spending much of their time preparing reports on how you play the game. We plan to make continuing changes to improve play area size & speed, loot balance, and gunplay properties in order to provide an exciting and fair gaming experience to everyone."

Yesterday, it was announced that loot changes would be coming in today's update. Here are the full patch notes for update 3:

Early Access - Month 3 - Patch Notes

Server Performance

  • Improved network performance by reducing the amount of data being sent from the server to the client.
  • Reduced network lag by preventing a large amount of data being sent from the server to the client simultaneously.

Client Performance

  • Fixed an issue of frame drop when other characters were around by optimizing nearby characters.
  • Fixed an issue of frame drop when vehicles were around by optimizing vehicles.
  • Optimized the starting airplane and Care Package airplanes.
  • Improved rendering performance of weapons from far-off.
  • Made improvements to the weapon animations.
  • Optimized many in-game effects, including the red zone bombing effect.
  • Improved many UI features.
  • Improved features regarding rainy weather.

New Items

  • Added Groza. Groza is an AR chambered for 7.62mm ammo, and can only be acquired in Care Packages.
  • Added P18C. P18C is a pistol chambered for 9mm ammo with a full auto fire-mode.

Gameplay

  • Vector and UMP now support burst mode.
  • You can now pick up items while moving. The interaction animation will not force you to stop anymore, but make you walk slowly.
  • You may interact with doors, items, or vehicles while reloading. Reloads will be canceled with such interactions.
  • The screen will be gradually desaturated based on remaining health during the REVIVE state.
  • Blood effect does not appear during the REVIVE state anymore.
  • Adjusted kill count system in Duo and Squad modes. A person knocking an opponent out will receive a kill count regardless of the actual killer.
  • Adjusted F key (interaction) to prioritize the REVIVE action in certain cases.
  • You can now pull out pistols faster.
  • Fire mode can no longer be switched during reload.
  • Adjusted the play area to spawn more evenly within a circle, so that the play area does not appear in the center so frequently.
  • Adjusted default quantity to be selected at 1 when pressing CTRL key at inventory to partially drop or pick up items.
  • Adjusted motorcycle and motorcycle with sidecar to move more smoothly.
  • You can no longer switch to/from prone while picking up items.
  • Red Dot Sight is now attachable to pistols, except for the revolver.
  • Increased the recoil on UMP.
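
The patch notes do not say how the play area is now placed "more evenly within a circle". One common way to get an even spread, shown here purely as an illustrative assumption and not as the game's actual code, is to take the square root of the random radius so that points are uniform by area. The function names below are hypothetical.

```python
import math
import random


def sample_center_uniform(radius: float, rng: random.Random) -> tuple:
    """Pick a point uniformly by area inside a circle of the given radius."""
    # sqrt() compensates for the fact that outer rings have more area.
    r = radius * math.sqrt(rng.random())
    theta = rng.uniform(0.0, 2.0 * math.pi)
    return r * math.cos(theta), r * math.sin(theta)


def sample_center_naive(radius: float, rng: random.Random) -> tuple:
    """Naive version: a uniform radius clusters points toward the center."""
    r = radius * rng.random()
    theta = rng.uniform(0.0, 2.0 * math.pi)
    return r * math.cos(theta), r * math.sin(theta)


def fraction_near_center(sampler, radius: float = 1.0, n: int = 20000, seed: int = 0) -> float:
    """Share of sampled points that land within half the radius of the center."""
    rng = random.Random(seed)
    hits = sum(1 for _ in range(n) if math.hypot(*sampler(radius, rng)) < radius / 2)
    return hits / n


print("uniform:", fraction_near_center(sample_center_uniform))
print("naive:  ", fraction_near_center(sample_center_naive))
```

With area-uniform sampling roughly a quarter of the points fall in the inner half-radius disc; with the naive version about half do, which is the "appears in the center so frequently" effect.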

World

  • Added two new weather settings: Sunset and Clear Skies.
  • Added destructible cabins.
  • Added new animation for when a character is at the speed of taking fall damage.

Items & Vehicles

  • Adjusted vehicles to face random directions at spawn.
  • Adjusted loot balance for certain items.
  • VSS will no longer be found in Care Packages. It will remain to be spawned in the map.
  • At a low probability, you will be able to acquire AR Silencers, SR Silencers, and 4x Scopes in Care Packages.
  • Spawn rate of UMP was slightly decreased.
  • Spawn rate of UZI was slightly increased.
  • Level 1 Helmet was being spawned at a much higher rate than the Level 1 Vest, and the spawn rate was adjusted so that both items will be spawned at a similar rate.
  • Increased spawn rate of items in regions and buildings with relatively low spawn rate.
  • Changed the names of certain weapon attachments.
  • Adjusted the spawn time of vehicles and speedboats, so that they can be seen from farther away.
  • Improved the starting airplanes.
  • Cargo door opens when participants get ejected from the airplane. 
  • Optimized lighting inside and outside of the airplane.

UI

  • Adjusted the direction of teammate icons to the direction the teammate faces.
  • Added teammate list on World Map.
  • Improved character’s recoil animation.
  • Removed death marks of teammates after a certain period of time or distance away from the place of death.
  • Added three new languages: Thai, Indonesian, and Vietnamese.

Bug Fixes

  • Fixed an issue when the voice chat volume blasted momentarily after getting on an airplane.
  • Fixed an issue where the character was not at the center of the screen while on a parachute.
  • If you reload into game while in a parachute, you will still be in the parachute.
  • Prevented users from removing outer walls of buildings by deleting certain files.
  • Partially fixed a bug that caused a character to be stuck in terrain.
  • Fixed a bug that caused a character to be misplaced after getting in a vehicle.
  • Fixed a bug that caused effects to look identical underwater and outside of water.
  • Fixed a bug that caused a character to make an interaction motion trying to pick up an item when there is no inventory space.
  • Fixed a client crash that occurred when fences were destroyed.
  • Fixed an issue of invisible fences even after destruction.

          Data Scientist Principal Engineer -RELO to OH   
Akron, If you are a Data Scientist Principal Engineer with experience, please read on! Top Reasons to Work with Us 1. An awesome opportunity to work with Big Data & be part of high profile aviation projects for government & defense 2. We are privately held company, we continue to grow our Data Engineering team well into 2017 What You Will Be Doing Our Engineering team is developing multiple products for
          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Ch. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Data Scientist - General Electric - Wisconsin   
Have an ability to work with process experts and data engineers to build data science models using mathematical, Advanced Statistics and physics based packages...
From GE Careers - Fri, 30 Jun 2017 10:24:38 GMT - View all Wisconsin jobs
          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Data Scientist - Signature Science, LLC - Austin, TX   
MapReduce, Hadoop) as well as databases (Amazon AWS, MongoDB, Cassandra). 15-0904-01_CHO/DC Data Scientist....
From Signature Science, LLC - Thu, 29 Jun 2017 06:24:23 GMT - View all Austin, TX jobs
          Data Scientist - Consultants 2 Go - United States   
As Data Scientists, we work with business leaders to solve clients’ business challenges and improve clients’ marketing results....
From Consultants 2 Go - Tue, 27 Jun 2017 02:58:08 GMT - View all United States jobs
          Sr. Data Science Engineer - Adobe - San Jose, CA   
Develop predictive models on large-scale datasets to address various business problems through leveraging advanced statistical modeling, machine learning, or...
From Adobe - Fri, 26 May 2017 06:25:59 GMT - View all San Jose, CA jobs
          Data Scientist - ACI Worldwide - India   
The Data Scientist & Data Warehouse Expert will have experience in Oracle Business Intelligence Enterprise Edition and will lead initiatives to develop and...
From ACI Worldwide - Fri, 30 Jun 2017 08:47:09 GMT - View all India jobs
          Senior Analyst, Data Science - Prudential - Newark, NJ   
And a passion for generating business impact. Develop and maintain consultative relationships with key business stakeholders....
From Prudential - Wed, 28 Jun 2017 23:45:06 GMT - View all Newark, NJ jobs
          Senior Specialist, Data Science - Prudential - Newark, NJ   
And a passion for generating business impact. Develop and maintain consultative relationships with key business stakeholders....
From Prudential - Tue, 30 May 2017 20:29:44 GMT - View all Newark, NJ jobs
          Specialist, Data Science - Prudential - Newark, NJ   
Identify analytical solutions for business problems. And a passion for generating business impact. Develop and maintain consultative relationships with key...
From Prudential - Tue, 30 May 2017 20:29:44 GMT - View all Newark, NJ jobs
          Big Dive - Hacking Development, Visualization & Data Science   
none
          Data Scientist   

          Jambaar Announces a New Video and Data Science Platform   
...one of the best models for any middle-market company wanting to adopt a data-driven strategy for their B2B video marketing needs," said Franz Dill, a leading Data Scientist formerly of GE and P&G and Adjunct Professor ...
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
The Senior Data Scientist provides leadership in implementation of advanced analytics models and solutions to yield predictive and prescriptive insights from...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Director, Data Scientist - KPMG - Atlanta, GA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Tue, 16 May 2017 08:29:26 GMT - View all Atlanta, GA jobs
          Director, Data Scientist - KPMG - Santa Clara, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Santa Clara, CA jobs
          Director, Data Scientist - KPMG - Irvine, CA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:26 GMT - View all Irvine, CA jobs
          Data Scientist - IBM - Austin, TX   
Opportunities to implement machine learning into support business processes. Business process optimization....
From IBM - Tue, 06 Jun 2017 21:03:26 GMT - View all Austin, TX jobs
          Director, Data Scientist - KPMG - Seattle, WA   
Statistics, data mining, machine learning, statistics, operations research, econometrics, natural language processing, and/or information retrieval;...
From KPMG LLP - Fri, 19 May 2017 08:26:37 GMT - View all Seattle, WA jobs
          Research Associate: Data Science in Learning Analytics (1 year fixed term)   
Imperial College London
Salary: £36,070 to £43,350 per annum
          Data Scientist (Artificial Intelligence & Deep Learning) - Microsoft - Redmond, WA   
We help drive actionable business intelligence through advanced statistical modeling and business analytics across Microsoft....
From Microsoft - Sat, 13 May 2017 05:34:57 GMT - View all Redmond, WA jobs
          Innovation Technical Data Scientist, Developer   
TX-Irving, Job Summary : Technical development of software prototypes, betas, and technologies for Innovation projects. Engage with technical subject matter experts (SMEs), Geoscientists and Engineers to adopt and drive scientific / technical innovations that require advanced data science and scientific software development. Construct models, simulations, prototypes, data models and codes for Geoscience, Eng
          Innovation Technical Data Scientist, Modeling and Simulations   
TX-Irving, Our client is looking for a Technical Data Scientist for a direct hire position out of their Irving, TX location. Job Summary : Engage with technical subject matter experts (SMEs), Geoscientists and Engineers to develop scientific / technical innovations using advanced data modeling and simulations. Create technical models, prototypes and simulations to develop innovative solutions. Provide suppor
          Data Scientist - SEI - Oaks, PA   
This team is responsible for extracting value from the vast IMS data store to improve efficiency and decision-making across a broad client base....
From SEI - Fri, 30 Jun 2017 17:36:16 GMT - View all Oaks, PA jobs
          APAC Senior Recruitment Executive - Essence - Singapore   
Clients include Google, FrieslandCampina, Tesco Mobile and the Financial Times. Essence is a global digital agency that blends data science, objective media and...
From Essence - Fri, 30 Jun 2017 06:26:22 GMT - View all Singapore jobs
          Data Science For Missionaries; A Colorado Poet Is Back In Print; 'Post-Modern' Bluegras...   
A Colorado Springs firm uses mapping to determine where missionaries can best do their work, and data to help Evangelicals spread their message. Then, how the city of Aspen transitioned to 100 percent renewable energy. Plus, with a new book, the poetry of Belle Turnbull gets new life. Turnbull and her lesbian partner lived in Breckenridge in the first part of the 20th century, where the poet’s work focused on the mountains and mining. Also, Fort Collins band Head for the Hills offers “post-modern” bluegrass on its new album, “Potions and Poisons.”
          Data Scientist, Advanced Analytics - LogRhythm - Boulder, CO   
Working hands-on/embedded with engineering teams to develop and rapidly deliver scalable, reliable, performant product that provides security value to end-users...
From LogRhythm - Fri, 24 Mar 2017 21:05:24 GMT - View all Boulder, CO jobs
          Data Science Engineer - Performance Advertising - A9.com - Palo Alto, CA   
At least 2 years applying Machine Learning techniques to solve business problems. Be a member of the Amazon-wide Machine Learning Community, participating in...
From A9.com - Tue, 20 Jun 2017 05:00:34 GMT - View all Palo Alto, CA jobs
          Data Science Performance Engineer, Search Platform - A9.com - Palo Alto, CA   
At least 2 years of experience applying Machine Learning techniques to solve business problems. Work with Amazon Web Services to improve their machine learning...
From A9.com - Thu, 15 Jun 2017 20:40:02 GMT - View all Palo Alto, CA jobs
          Data Sciences Engineer - Sponsored Products - A9.com - Palo Alto, CA   
Be a member of the Amazon-wide Machine Learning Community, participating in internal and external Meetups, Hackathons and Conferences....
From A9.com - Wed, 03 May 2017 01:36:05 GMT - View all Palo Alto, CA jobs
          Data Science Apprentice - Predictive Science - United States   
Having additional data entry job experience, being a fast and accurate typist, and familiarity with MS Office and other data programs is a great plus....
From Predictive Science - Sat, 11 Mar 2017 08:05:32 GMT - View all United States jobs
          Senior Analyst, Predictive Modeling & Data Science - BMO Financial Group - Toronto, ON   
Lev. The Advanced Analytics & Journey Science group partners with internal Personal and Commercial Banking Canada partners, and various lines of business across...
From BMO Financial Group - Sat, 24 Jun 2017 00:46:45 GMT - View all Toronto, ON jobs
          Senior ML Data Scientist - Leadership Role!   

          How to replace yourself with a very small shell script   

Data scientist Hilary Mason (previously) talks through her astoundingly useful collection of small shell scripts that automate all the choresome parts of her daily communications: processes that remind people when they owe her an email; that remind her when she accidentally drops her end of an exchange; that alert her when a likely important email arrives (freeing her up from having to check and check her email to make sure that nothing urgent is going on). It's a hilarious and enlightening talk that offers a glimpse into the kinds of functionality that users can provide for themselves when they run their own infrastructure and aren't at the mercy of giant webmail companies. (via Clive Thompson)

          SQL Server Digest #14: SQL Server 2017 Performance Improvements, BI and DWH, materials from SQLSaturday Kyiv   

SQLSaturday is over, long live SQLSaturday! :) Just as the session materials from SQLSaturday Kyiv became available, preparations for SQLSaturday Lviv are in full swing. There is plenty of news about the automatic query optimization features in the new version of SQL Server. New articles by Paul Randal, Dmitry Pilugin, and Paul White, plus the now-traditional BI section from Evgeny Polonichko. All this and much more in this digest.

SQLSaturday

The session materials from SQLSaturday Kyiv 2017 are available for download. Thank you all for taking part! Our wonderful speakers have uploaded all their session materials to the site. Download and enjoy :)

SQLSaturday Lviv 2017: our favorite conference, SQLSaturday, recently took place in Kyiv, and here is great news from Sergey Lunyakin, leader of the Lviv community: SQLSaturday is back in Ukraine! The SQLSaturday conference in Lviv takes place on August 20. If you want to hear quality talks on SQL Server, BI, and databases, register right now! Seats are limited. If you want to speak at SQLSaturday, you have that chance: session submissions are still open. If you or your company want to support the conference, apply as a sponsor; the organizers will be grateful and will help you reach the goals you set for taking part. See you at SQLSaturday Lviv!

SQL Server 2017 Performance Improvements

So many details about the performance improvements in the new SQL Server have been published that they deserve a section of their own.

"Introducing Batch Mode Adaptive Joins": a new query plan operator, Adaptive Join. The operator has three inputs instead of two. Depending on how many rows actually arrive on the first input, it chooses which join type to use: Hash or Nested Loops (the second and third inputs, respectively).
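
The choice described above can be illustrated with a small Python sketch. It is not SQL Server's implementation: the function names, the `threshold` parameter, and the sample rows are all hypothetical, and the real operator works on batches inside the engine. The sketch only shows the idea of deciding between a hash join and a nested loops join after the actual row count of the first input is known.

```python
from collections import defaultdict


def hash_join(build_rows, probe_rows, key):
    """Build a hash table on the first input, then probe it with the second input."""
    table = defaultdict(list)
    for row in build_rows:
        table[row[key]].append(row)
    return [(b, p) for p in probe_rows for b in table.get(p[key], [])]


def nested_loops_join(outer_rows, inner_lookup, key):
    """For each outer row, fetch matching inner rows (think of an index seek)."""
    return [(o, i) for o in outer_rows for i in inner_lookup(o[key])]


def adaptive_join(first_input, hash_input, loop_lookup, key, threshold):
    """Pick the join algorithm from the actual row count of the first input.

    `threshold` is a made-up tuning knob used only for this illustration.
    """
    rows = list(first_input)  # the real row count is only known at this point
    if len(rows) >= threshold:
        return "hash", hash_join(rows, hash_input, key)
    return "nested_loops", nested_loops_join(rows, loop_lookup, key)


# Hypothetical sample data.
orders = [{"cust": 1, "id": 10}, {"cust": 2, "id": 11}]
customers = [{"cust": 1, "name": "A"}, {"cust": 2, "name": "B"}]
index = {c["cust"]: [c] for c in customers}


def lookup(value):
    """Stand-in for an index seek on the customers table."""
    return index.get(value, [])


small = adaptive_join(orders, customers, lookup, "cust", threshold=100)
large = adaptive_join(orders, customers, lookup, "cust", threshold=2)
```

With two rows on the first input, a threshold of 100 selects nested loops and a threshold of 2 selects the hash join; both strategies return the same matches.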

"SQL Server 2017: Adaptive Join Internals": a detailed breakdown of how the adaptive join works, by Dmitry Pilugin.

"Introducing Batch Mode Adaptive Memory Grant Feedback": another adaptive query processing feature in SQL Server 2017 is adaptive memory grant feedback. A memory grant is used by operators such as Sort and Hash Match, and it is estimated once, when the query plan is built. What is new is that if the query turns out to need more memory than estimated or, conversely, uses less than half of the estimate, the memory grant information in the cached plan is corrected.
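
The feedback rule described above can be sketched as a small function. This is only an illustration under stated assumptions: the function name is hypothetical, and the growth factor (doubling) and the padding (one quarter on top of actual usage) are invented numbers, not the values SQL Server uses. Only the two trigger conditions, a spill and usage below half of the grant, come from the text.

```python
def adjust_memory_grant(granted_kb, used_kb, spilled):
    """Return the grant to keep in the cached plan for the next execution.

    The multipliers below are illustrative assumptions, not SQL Server's numbers.
    """
    if spilled:
        # The query needed more memory than was granted: ask for more next time.
        return granted_kb * 2
    if used_kb * 2 < granted_kb:
        # Less than half of the grant was used: shrink it, keeping some headroom.
        return used_kb + used_kb // 4
    # The estimate was close enough; leave the cached grant unchanged.
    return granted_kb


after_spill = adjust_memory_grant(granted_kb=1000, used_kb=1000, spilled=True)
after_overgrant = adjust_memory_grant(granted_kb=1000, used_kb=300, spilled=False)
unchanged = adjust_memory_grant(granted_kb=1000, used_kb=600, spilled=False)
```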

"SQL Server 2017: Sort, Spill, Memory and Adaptive Memory Grant Feedback": a detailed breakdown of adaptive memory grant feedback, by Dmitry Pilugin.

"SQL Server vNext: Interleaved Execution for mTVF": SQL Server 2017 greatly improves the performance of multi-statement table-valued functions through the Interleaved Execution feature. As a reminder, by default SQL Server assumes that a multi-statement table-valued function returns one row. Interleaved Execution changes this less-than-ideal behavior for the better. The idea is that, since the function is executed during query execution anyway, it makes sense to execute it at the very start and use the number of rows it actually returned to build the rest of the plan. And where do we read the details of how it works? That's right, on Dmitry Pilugin's blog :) That is what I always do :)
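
A toy comparison may make the difference clearer. Everything here is hypothetical: `choose_join`, its threshold, and `big_tvf` are invented for the illustration, and the one-row guess is simply the figure quoted in the text above. The sketch contrasts planning with a fixed guess against running the function first and planning with the real row count.

```python
def choose_join(estimated_rows, threshold=100):
    """Toy optimizer decision; the threshold is a made-up number."""
    return "hash" if estimated_rows >= threshold else "nested_loops"


def plan_with_fixed_guess(tvf):
    """Plan first using a fixed one-row guess, run the function afterwards."""
    strategy = choose_join(estimated_rows=1)
    return strategy, tvf()


def plan_with_interleaved_execution(tvf):
    """Run the function first, then plan the rest with the actual row count."""
    rows = tvf()
    return choose_join(estimated_rows=len(rows)), rows


def big_tvf():
    """Stand-in for a multi-statement table-valued function returning many rows."""
    return [{"id": i} for i in range(5000)]


fixed_strategy, fixed_rows = plan_with_fixed_guess(big_tvf)
interleaved_strategy, interleaved_rows = plan_with_interleaved_execution(big_tvf)
```

Both variants return the same 5000 rows, but only the interleaved variant picks the strategy suited to that volume.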

"Automatic plan correction in SQL Server 2017": did a query run fast and then start running slowly because its plan changed? The new SQL Server can detect this and force the faster plan for subsequent executions of the query.

"Automatic index management in Azure SQL database": we have already seen a prototype of sorts in DTA (Database Engine Tuning Advisor) but, to put it mildly, it was far from ideal back then. It should be better now. Besides working out what is best for your database, SQL Server (Azure SQL Database) can dynamically drop and create indexes in it :) If you allow it to, that is.

Worth reading

"Query optimization": I recently came across a very interesting blog by a Microsoft employee. The presentation is rather unusual and the blog is old, but the author explains important things remarkably simply and concisely. In this post he describes what a query's Duration is made up of, where to look when a query runs longer than it should, and which standard actions apply in each situation.

"Entity Framework Performance and What You Can Do About It": a must-read for everyone who uses Entity Framework in their projects. It explains very accessibly why EF is inefficient in particular cases and how to improve things.

"Extended Events - Configuring Session Options": a reference article on Extended Events session options. It describes the options and what they do simply and clearly.

"SQL Server In-Memory OLTP Internals for SQL Server 2016": the release of SQL Server 2017 is just around the corner, and In-Memory OLTP is about to appear in its third SQL Server release. Why do I mention it? Because it is time to start using this technology, or at least to take a closer look at it. The best place to start is this article (or rather mini-book :)) by Kalen Delaney.

"How are default column values stored?": how are DEFAULT values for a column actually stored in SQL Server? Are they just metadata, or are they stored in the table row like ordinary values? I won't give everything away :) Read Paul Randal's post for a detailed analysis with proof.

"What is a Page Split? What happens? Why does it happen? Why worry?": despite the apparent simplicity of the concept, a page split is a fairly hard process to understand. How it really happens, in detail, is covered in a rather old but excellent article by Tony Rogerson.

"SQL Server Mysteries: The Case of the Not 100% RESTORE...": you have surely seen this. You restore a backup and see a message that 100% of the data has been restored, yet the task keeps running for a while longer. There is a perfectly logical explanation. The 100% message means that all data pages have been copied; the time spent waiting after 100% goes to restoring the portion of the transaction log contained in the backup. For more detail, read the excellent-as-always article by Bob Ward.

"SQL Server Temporary Object Caching": did you know that temporary tables (some of their pages) can be cached? If not, read Paul White's article. You knew but want to learn more? Read Paul White's article.

"The Myth that DROP and TRUNCATE TABLE are Non-Logged": are DROP and TRUNCATE operations logged? Yes. Paul Randal explains how and why in his article.

BI and DWH by Eugene Polonichko

Summer is here, which means it is time for some self-study. So, courses on BI and everything related to it:

Peter Myers and his wonderful course Developing a SQL Server Analysis Services Tabular Data Model. Peter is a top professional in BI and, most importantly, he explains things very accessibly. By the way, he has already come to Kyiv and spoken at our events twice.

The next course, from SSAS maestros Alberto Ferrari and Marco Russo, is Introducing DAX. It covers the basics of the DAX language, which is used in SSAS Tabular, Power BI, and so on.

This one is a Data Science course from Microsoft.

As a follow-up to the previous one, Microsoft has an entire program for training Data Scientists.

And now, things you can simply read or watch:

Power BI dashboard best practices by Marco Russo.

The updated Power BI architecture from Dustin Ryan. By the way, there is a lot of other interesting Power BI material there.

An interesting article on the Hortonworks site about very, very fast OLAP analytics.

A very interesting post about building an Azure Analysis Services model on top of Azure Blob Storage, that is, with blobs as the data source.

Want to see what kind of beast Azure SSAS is? Microsoft has even prepared samples for you here.

Of course, a DWH section would not be complete without ETL. Interested? Come in and meet Azure Data Factory.


← Previous issue: SQL Server Digest #13


          Data Scientist - General Electric - Wisconsin   
Have an ability to work with process experts and data engineers to build data science models using mathematical, Advanced Statistics and physics based packages...
From GE Careers - Fri, 30 Jun 2017 10:24:38 GMT - View all Wisconsin jobs
          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          Learning Salesforce Einstein   

Incorporate the power of Einstein in your Salesforce application

About This Book
- Make better predictions of your business processes using prediction and predictive modeling
- Build your own custom models by leveraging PredictionIO on the Heroku platform
- Integrate Einstein into various cloud services to predict sales, marketing leads, insights into news feeds, and more

Who This Book Is For
This book is for developers, data scientists, and Salesforce-experienced consultants who want to explore Salesforce Einstein and its current offerings. It assumes some prior experience with the Salesforce platform.

What You Will Learn
- Get introduced to AI and its role in CRM and cloud applications
- Understand how Einstein works for the sales, service, marketing, community, and commerce clouds
- Gain a deep understanding of how to use Einstein for the analytics cloud
- Build predictive apps on Heroku using PredictionIO, and work with Einstein Predictive Vision Services
- Incorporate Einstein in the IoT cloud
- Test the accuracy of Einstein through Salesforce reporting and Wave analytics

In Detail
Dreamforce 16 brought forth the latest addition to the Salesforce platform: an AI tool named Einstein. Einstein promises to provide users of all Salesforce applications with a powerful platform to help them gain deep insights into the data they work on. This book will introduce you to Einstein and help you integrate it into your respective business applications based on the Salesforce platform. We start off with an introduction to AI, then move on to look at how AI can make your CRM and apps smarter. Next, we discuss various out-of-the-box components added to sales, service, marketing, and community clouds from Salesforce to add Artificial Intelligence capabilities. Further on, we teach you how to use Heroku, PredictionIO, and the force.com platform, along with Einstein, to build smarter apps.
The core chapters focus on developer content and introduce PredictionIO and Salesforce Einstein Vision Services. We explore Einstein Predictive Vision Services, along with analytics cloud, the Einstein Data Discovery product, and IOT core concepts. Throughout the book, we also focus on how Einstein can be integrated into CRM and various clouds such as sales, services, marketing, and communities. By the end of the book, you will be able to embrace and leverage the power of Einstein, incorporating its functions to gain more knowledge. Salesforce developers will be introduced to the world of AI, while data scientists will gain insights into Salesforce’s various cloud offerings and how they can use Einstein’s capabilities and enhance applications.

Style and approach
This book takes a straightforward approach to explain Salesforce Einstein and all of its potential applications. Filled with examples, the book presents the facts along with seasoned advice and real-world use cases to ensure you have all the resources you need to incorporate the power of Einstein in your work.

Downloading the example code for this book
You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com . If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the code file.


          Senior Data Scientist - Scotiabank - Toronto, ON   
Lead the development of strategy optimization frameworks to further enhance credit limits setting, risk-based pricing, and customer targeting throughout credit...
From Scotiabank - Tue, 27 Jun 2017 06:46:37 GMT - View all Toronto, ON jobs
          Data Scientist, Advanced Analytics - LogRhythm - Boulder, CO   
LogRhythm is a leading provider of unified security intelligence and analytics solutions that empower organizations to automate the detection, prioritization...
From LogRhythm - Fri, 24 Mar 2017 21:05:24 GMT - View all Boulder, CO jobs
          Data Scientist - Verizon - Basking Ridge, NJ   
What you’ll be doing... If you are curious about new technology and the possibilities it creates, then this job may be perfect for you. As part of the
From Verizon - Thu, 29 Jun 2017 10:58:49 GMT - View all Basking Ridge, NJ jobs
          Junior Data Scientist - Wabco - København   
The Role As Data Scientist, you will participate in the construction and exploitation of an analytics platform within a large company using various
From Wabco - Thu, 29 Jun 2017 00:59:28 GMT - View all København jobs
          Sr. Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Senior Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:45 GMT - View all Mahwah, NJ jobs
          Lead Data Scientist - UNITED PARCEL SERVICE - Mahwah, NJ   
Lead Data Scientist We’re the obstacle overcomers, the problem get-arounders. From figuring it out to getting it done… our innovative culture demands “yes...
From UPS - Fri, 30 Jun 2017 14:14:41 GMT - View all Mahwah, NJ jobs
          Chances at Harvard, other Ivies, MIT, etc.   
Hello, I am an international student, originally from Italy and studying in Azerbaijan, and I am seeking admission to the following US colleges:

Intended Major: Double/Major + Minor in Computer Science and Economics/Finance

First Choice:
- Harvard EA

Regular:
- MIT
- UPenn
- Columbia
- Brown
- Princeton
- UCB
- UCLA
- Gtech
- Harvey Mudd
- Cornell

I plan on focusing on finance, business and technology, as I am very attracted to becoming an entrepreneur. Below are my stats:

Academics:
GPA: 3.97 UW / 4.93 W
IB: Current score of 42, with 3 sevens and 3 sixes, but I am predicted 44: 7 in HL Physics, HL History, HL Economics, SL English, SL Spanish, and 6 in HL Math.
SAT: 1450 (first sitting). I plan on taking this again, along with the ACT and the Math and Physics Subject Tests.

Major-Related Extracurriculars:
- Founder and President of the Investment Club --> a club where students interested in business and investing were brought together to discuss these topics. The club also hosted lessons at public Azeri schools, teaching the less fortunate the basics of investing and its power to change their lives.
- Founder and President of Technology of TutorMe.Co --> a non-profit organisation where, via a website I designed myself, younger students and parents can schedule tutoring sessions with older high-achieving students in all academic subjects, ranging from math to Russian, etc. Sessions cost 15 AZN, or about $8, with all profits donated to charity.
- Founder and CEO of www.investingteen.com --> my own blog with hundreds of followers, started about a year and a half ago, sharing valuable resources for young investors to learn business, entrepreneurship, and finance.
- Applied Python and Data Science --> an online course from the University of Michigan, where I learned how to apply Python code to data analysis. The course was accredited.
- Microeconomics --> an online course from the University of California, Irvine, which covered advanced microeconomics, game theory, and real-life applications. The course was accredited.
- Entrepreneurship and Finance Lab --> a summer program at Bocconi University, the most prestigious university in Italy. The course included work experience, company visits, and networking. It was accredited too.
- Ebook --> I have been working on an ebook about the basics of investing for teenagers. Essentially a rundown of the most important principles to apply for the long-term, principled investing that young entrepreneurs are looking for.
- Work Experience --> I will be working at BP, most likely in the finance department. Not sure if I will include it in my application.
- Reading --> I read a lot of investing and computer science books. These sparked my passions, and thus will probably be part of my essays.
- Research Project --> I ran a research project investigating how computer capacitors are affected by the salinity of the materials they are made of. The project included building capacitors out of apples, which I thought was pretty cool. Not sure I will include this in my application either.

Non-Major Related Extracurriculars:
Not sure which of these I will include in my application...
- Varsity football --> I have been playing since I was 4 years old. Played in Azerbaijan's first division.
- MUN --> Model United Nations. I was awarded the Distinguished Delegate Award and attended international conferences, including one in Portugal.
- Global Issues Network --> a club that uses technology to raise awareness of global issues or problems affecting our school community.
- Local paper --> I am sometimes asked to contribute to research projects for my local paper. Probably will not be included in my application.

Thanks Guys!!!
          Data Scientist - Wink - New York, NY   
Hands-on experience with supervised and unsupervised machine learning algorithms for regression, classification, and clustering....
From Wink - Thu, 18 May 2017 06:17:27 GMT - View all New York, NY jobs
          Data Scientist - Drop - Toronto, ON   
Through our mobile app, users supercharge their debit and credit cards to automatically earn points on their every day spending at places such as Starbucks, Tim...
From Drop - Thu, 01 Jun 2017 02:38:43 GMT - View all Toronto, ON jobs
          Establishing a distributed national research infrastructure providing bioinformatics support to life science researchers in Australia   
Abstract
EMBL Australia Bioinformatics Resource (EMBL-ABR) is a developing national research infrastructure, providing bioinformatics resources and support to life science and biomedical researchers in Australia. EMBL-ABR comprises 10 geographically distributed national nodes with one coordinating hub, with current funding provided through Bioplatforms Australia and the University of Melbourne for its initial 2-year development phase. The EMBL-ABR mission is to: (1) increase Australia’s capacity in bioinformatics and data sciences; (2) contribute to the development of training in bioinformatics skills; (3) showcase Australian data sets at an international level and (4) enable engagement in international programs. The activities of EMBL-ABR are focussed in six key areas, aligning with comparable international initiatives such as ELIXIR, CyVerse and NIH Commons. These key areas—Tools, Data, Standards, Platforms, Compute and Training—are described in this article.

          (USA-NV-Elko - Shared Business Center) Analytics and Unified Operations Center Specialist   
Analytics and Unified Operations Center Specialist
**Elko - Shared Business Center, Nevada, United States**
Operations
Requisition #: 081356
Post Date: May 16, 2017

**Join Barrick Nevada, a twenty-first century mining company.** The gold mining sector is ripe for disruption, and at Barrick we are restless. We will be a twenty-first century mining company, generating wealth for our owners, our people, and the countries and communities with which we partner, with conviction and the courage to be different. We have the most attractive assets in the entire gold industry. At Barrick you’ll get the opportunity to be creative and to turn great ideas into reality. Free thinking team members with an aspiration to change the world should apply immediately.

**RESPONSIBILITIES:** The successful candidate will be accountable for supporting Barrick Nevada Operations and analytical capacity by means of an Analytics and Unified Operations Center (Au Operations Center) that supports visualization tools, hardware, software and sensor connectivity/integration to provide a real-time version of the truth. The Specialist of the Analytics and Unified Operations Center will be responsible for supporting all Operations personnel by providing standard reporting and visualization tools that enable leaders and technical specialists to make sound decisions based on data.
The reports will be made available to all required personnel as per current standards: daily, weekly, monthly, annually, or as requested by leadership. The Au Operations Center team provides support for Barrick Nevada mine sites 24 hours a day, 7 days a week. The Specialist will also support the Au Operations Center leadership and technical team by developing and sustaining internal performance reports, SOPs and other relevant documents. This role will also develop and maintain the performance dashboards that are displayed in the Au Operations Center and made available to Barrick Nevada personnel on site, with the capacity to test and use advanced analytics tools, customize reports, and complete analyses that yield non-intuitive insights into production trends.

**BASIC REQUIREMENTS:** To be considered for this job, applicants must meet these basic requirements:
+ Bachelor’s degree in a technical discipline, business, statistics, economics, math, IT, data analysis, or related field, or equivalent work experience **required**
+ Minimum of three (3) years’ experience in data sciences or related fields **required,** mining experience **preferred**
+ Knowledge of IT servers, networking, and collaboration processes **required**
+ Understanding of data structures/Cisco Networking Methodology/Knowledge of VoIP/SIP **required**
+ Experience with Microsoft Office Suite, data warehousing, data and reporting software, data analytics software, visualization software, and graphics **required**
+ Ability to meet tight deadlines **required**
+ Demonstrated ability in operational execution and project management **required**
+ Innovative, creative, and strategic thinking skills **required**
+ Ability to gather data and present it **required**
+ Ability to safely perform the essential functions of the position **required**

**BARRICK IS AN EQUAL OPPORTUNITY EMPLOYER**
          (USA-NV-Elko) Lead Data Scientist   
Lead Data Scientist
Skills Required - Data Science, Big Data, Analytics, Data Mining, Hadoop, Data Scientist, Python, IOT, Visualization, Business Intelligence

If you are a Data Scientist Lead with experience building enterprise Analytics Dashboards, please read on! We are one of the world's largest mineral companies, looking to bring automation and analytics to our vast enterprise operation. This role will be based out of the Las Vegas area, with frequent travel to production facilities throughout Nevada. To help advance our infrastructure, we are looking for a Data Scientist Lead who can guide us on our path toward next generation technologies, including Predictive and Preventative Analytics, Data Mining, and various other cutting edge projects. Must have a deep understanding of building and enabling big data analytical solutions. In this role you'll spend 50% of your time doing hands-on technical development and 50% of your time doing Project Coordination and Leadership. We are an equal opportunity employer and are willing to relocate the right candidate for the role. We are offering an excellent compensation package including a base salary of $170K+, generous bonuses, and a great benefit package.

**What You Will Be Doing**
- Building Predictive & Preventative Analytics Dashboard
- Designing Cloud based Machine Learning production pipelines
- Data mining, data warehousing, data sampling and creating predictive models
- Working alongside business and end-users to strategically define requirements and build applications

**What You Need for this Position**
More Than 5 Years of experience and knowledge of:
- Either Master's or Ph.D. degree (BS degree & solid experience is fine)
- IoT experience
- Taking business/end-user requirements and creating Data models
- Experience with Hadoop, AWS, Data mining and either Python or Java
- MapReduce, Sqoop/Spark, Hive, Pig
- Visualization, creating reports and cleaning data sets for modeling purposes

**What's In It for You**
- Competitive base salary and annual bonus (up to 200k)
- Ownership of your work - leading our analytics platform!!!
- Great benefits package with exceptional 401K match.
- Opportunity to be a part of ground breaking technology within industry.
- Opportunity for growth within all locations of company in the Americas.

So, if you are a Data Scientist Lead with experience building large scale enterprise systems, please apply today! We're currently conducting interviews and look forward to speaking with you ASAP! Applicants must be authorized to work in the U.S.

**CyberCoders, Inc is proud to be an Equal Opportunity Employer** All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status, or any other characteristic protected by law.

**Your Right to Work** – In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

*Lead Data Scientist* *NV-Elko* *KO-1367072*
          [Translation] Building a data science portfolio: telling a story through data   
Translator's preface

This translation happened to land at a good moment, alongside the other data science tutorials on Habr. :)
This one was written by Vik Paruchuri, the founder of Dataquest.io, which does exactly this kind of interactive data science teaching and preparation for real work in the field. There is no exclusive know-how here, but the process is described in great detail, from collecting the data to drawing first conclusions from it. That may interest not only those who want to build a data science resume, but also those who simply want to try hands-on analysis and do not know where to start.


Data science companies increasingly look at portfolios when making hiring decisions. One reason is that a portfolio is the best way to judge practical skills. The good news is that it is entirely in your hands: if you put in the effort, you can assemble an excellent portfolio that will impress many companies.

          Attend Free Data Science Demo on This weekend   
Attend a free Data Science demo on “Careers at Our Prestigious Institute” this weekend. Data Science is considered a revolutionary field of technology that offers strong opportunities for a successful career. Our Institute is now offering a free Data Science demo this weekend. Overview of Data Science Free Demo: Data Science Free Demo is must...
          Data Scientist - Consultants 2 Go - United States   
As Data Scientists, we work with business leaders to solve clients’ business challenges and improve clients’ marketing results....
From Consultants 2 Go - Tue, 27 Jun 2017 02:58:08 GMT - View all United States jobs
          Sr. Data Science Engineer - Adobe - San Jose, CA   
Develop predictive models on large-scale datasets to address various business problems through leveraging advanced statistical modeling, machine learning, or...
From Adobe - Fri, 26 May 2017 06:25:59 GMT - View all San Jose, CA jobs
          Innovation Fellow - Data Scientist - Manulife Financial - Toronto, ON   
Design and build customized data scrapers. Are you looking for unlimited opportunities to develop and succeed?...
From Manulife Financial - Mon, 10 Apr 2017 22:03:25 GMT - View all Toronto, ON jobs
          Senior NodeJS Backend Engineer - Knock - Georgia   
TRLA, acquired by Zillow for $3.5B), Knock is an online home selling platform that uses data science to price homes accurately, technology to sell them quickly...
From Knock - Sat, 18 Mar 2017 23:14:22 GMT - View all Georgia jobs
          Vice President of Engineering - Knock - California   
TRLA, acquired by Zillow for $3.5B), Knock is an online home selling platform that uses data science to price homes accurately, technology to sell them quickly...
From Knock - Fri, 19 May 2017 16:33:54 GMT - View all California jobs
          manager, data science and data management, Starbucks Technology Center - Phoenix, AZ   

          TensorFlow: report on the second meeting of the "Machine-Learning e Data Science" Meetup in Rome (19 February 2017)   

On 16 February 2017 the second meeting of the "Machine Learning e Data Science" Meetup was held in Rome, at the Talent Garden in Cinecittà. Organized together with the Google Developer Group Roma Lazio Abruzzo, the meeting was devoted to presenting TensorFlow, Google's solution for deep learning within machine learning.

A brief summary of the 16 February 2017 meeting follows.

Before we begin:

•    What is machine learning? Interview with Simone Scardapane on the Machine Learning Lab (1 December 2016)
•    What is TensorFlow? Andrea Bessi, “TensorFlow CodeLab”, Nov 16, 2016

Resoconto

Premessa

Il 15 Febbraio 2017 a Mountain View (USA) si è tenuto il “TensorFlow Dev Summit” , il primo evento ufficiale di Google dedicato a TensorFlow, la soluzione di deep learning rilasciato circa un anno fa e giunta adesso alla versione 1.0 “production ready”.
Il “TensorFlow Dev Summit” è stato aperto da un keynote di Jeff Dean (wiki, web, lk) - Google Senior Fellow - Rajat Monga (tw) - TensorFlow leader nel Google Brain team - e Megan Kacholia (lk) Engineering Director del TensorFlow/Brain team.

Per approfonsire il #TFDevSummit 2017:

L’incontro del Meetup "Machine-Learning” di Roma è stata un’occasione per rivedere insieme e commentare il video e fare anche una breve presentazione di TensorFlow.
Alla fine dell'evento c’è stato un piccolo rinfresco aperto a tutti gli appassionati di deep learning a Roma.

Simone Scardapane: “TensorFlow and Google, one year of exciting breakthroughs”

Simone opened the meeting with a short presentation on deep learning and TensorFlow.
“TensorFlow”, Simone said, “has made the neural networks that underpin deep learning and artificial intelligence accessible to everyone. Before TensorFlow, managing and training large, complex neural networks was genuinely difficult”.
Modern deep learning architectures do use very complex neural networks: in imaging applications, for example, models have tens of millions of parameters.

170219-reteneurale.jpg

(Neural network; image taken from http://joelouismarino.github.io/blog_posts/blog_googlenet_keras.html)

One of TensorFlow's big advantages is that it lets you define a neural network symbolically; the tool also provides an efficient compiler that handles back-propagation automatically.
TensorFlow can also be used through a simplified interface such as Keras, which comes close to a form of declarative programming.
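The idea of describing a computation symbolically and letting the tool derive the gradients can be illustrated with a minimal sketch in plain Python. It does not use TensorFlow or its API: the `Node` class and the `sigmoid` and `backward` helpers are hypothetical names introduced only for this illustration, and the "network" is a single neuron.

```python
import math

class Node:
    """A value in a computation graph that remembers how it was produced."""
    def __init__(self, value, parents=()):
        self.value = value
        self.parents = parents  # sequence of (parent_node, local_gradient)
        self.grad = 0.0

    def __add__(self, other):
        return Node(self.value + other.value, ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        return Node(self.value * other.value,
                    ((self, other.value), (other, self.value)))

def sigmoid(x):
    s = 1.0 / (1.0 + math.exp(-x.value))
    return Node(s, ((x, s * (1.0 - s)),))

def backward(output):
    """Back-propagate d(output)/d(node) through the graph in reverse order."""
    order, seen = [], set()

    def visit(node):
        if id(node) not in seen:
            seen.add(id(node))
            for parent, _ in node.parents:
                visit(parent)
            order.append(node)

    visit(output)
    output.grad = 1.0
    for node in reversed(order):
        for parent, local in node.parents:
            parent.grad += node.grad * local

# One neuron, written as an expression: y = sigmoid(w * x + b)
w, x, b = Node(2.0), Node(3.0), Node(-5.0)
y = sigmoid(w * x + b)
backward(y)  # fills w.grad, x.grad and b.grad without hand-written derivatives
```

The user only writes the forward expression; the gradients come from the recorded graph. Real frameworks follow the same principle at a much larger scale and with compiled, optimized kernels.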
The first release of TensorFlow, Simone recalled, came out in November 2015 under the open Apache 2.0 license. On 15 February 2017, during the TFDevSummit, version 1.0 was announced, the first “production ready” release.
The availability of an open and “user-friendly” deep learning environment has fostered a large community of experts, researchers and enthusiasts, and the release of high-impact software applications. Simone showed some examples.

1) Neural image captioning: software that can recognize the content of an image and describe or caption it.

2) Google Neural Machine Translation (GNMT), which made it possible to rebuild Google Translate on deep learning: instead of translating word by word, the system can now analyze the text as a whole, capturing meaning and context with a level of accuracy close to human translation.

3) Generative Adversarial Networks (GANs): systems that generate new data after learning from an enormous training set, and that work with a pair of neural networks. The first produces new data, the second checks the “quality” of the result. These systems have already been used to generate artificial images, to create video-game scenery and to enhance low-quality images and video footage.

4) AlphaGo: deep learning is also behind the recent spectacular successes of AI in board games, such as the victory of Google's AlphaGo over the world Go champion.

5) WaveNet, a generative model for raw audio, which can generate speech that imitates a human voice far more “naturally” than the best existing Text-to-Speech systems. WaveNet has also been used to create artificial music.

Simone closed his talk by noting that deep learning and ML would also be covered in a dedicated track at Data Driven Innovation Roma 2017, to be held at Roma Tre University on 24 and 25 February 2017.

Summary of the keynote by Jeff Dean, Rajat Monga and Megan Kacholia on TensorFlow

The opening keynote of the TF DevSummit 2017, given by Jeff Dean, Rajat Monga and Megan Kacholia, covered:

  • the origins and history of TensorFlow
  • progress since the first open-source release of TensorFlow
  • TensorFlow's growing open-source community
  • TensorFlow performance and scalability
  • TensorFlow applications
  • exciting announcements!

Jeff Dean

Jeff said that Google's goal with TensorFlow is to build a “machine learning platform” that anyone can use.
TensorFlow was released about a year ago, but Google's work on machine learning and deep learning began five years ago.
The first system built, in 2012, was DistBelief, a proprietary neural network system suited to a production environment like Google's, which is based on distributed systems (see “Large Scale Distributed Deep Networks”). DistBelief has been used in many successful Google products, such as Google Search, Google Voice Search, advertising, Google Photos, Google Maps, Google Street View, Google Translate and YouTube.
But DistBelief had many limitations. “We wanted a much more flexible, general-purpose system”, Jeff said, “one that was open source, that would be adopted and developed by a large worldwide community, and that was aimed at research as well as production. So in 2015 we announced TensorFlow, a solution that runs in many environments, including iOS, Android and Raspberry Pi, that runs on CPUs, GPUs and TPUs as well as on Google Cloud, and that can be used from languages such as Python, C++, Go, Haskell and R”.

TensorFlow also has sophisticated tools for data visualization, which has helped a large open-source community grow around it.

Rajat Monga

Rajat Monga officially announced the release of TensorFlow 1.0 and described its new features.

Rajat then presented the new TensorFlow APIs.

TensorFlow 1.0 supports IBM's PowerAI distribution, the Movidius Myriad 2 accelerator and the Qualcomm Snapdragon Hexagon DSP. Rajat also announced the availability of XLA, an experimental TensorFlow compiler specialized in just-in-time compilation and linear algebra computations.

Megan Kacholia

Megan Kacholia went deeper into the performance of TensorFlow 1.0.

In production many architectures can be used: server farms, CPU-GPU-TPU, low-latency serving (as on mobile), because TensorFlow 1.0 is optimized to deliver good performance in all of these environments.

Megan then showed examples of TensorFlow in cutting-edge research and in practical mobile applications.

Conclusion of the keynote

At the end of the keynote Jeff spoke again to thank everyone who contributes to the TensorFlow community from outside Google.
“From the community”, Jeff said, “come suggestions, requests and even brilliant solutions that we at Google had not yet thought of”, citing the case of a Japanese farmer who built a TensorFlow application on a Raspberry Pi to recognize crooked cucumbers and discard them at the packing stage.
In medicine, Jeff recalled, TensorFlow has been used to diagnose diabetic retinopathy and, at Stanford University, to detect skin cancer.


          Report on the first meeting of the "Machine-Learning e Data Science" Meetup (3 February 2017)   

On 2 February 2017 the first meeting of the "Machine-Learning e Data-Science" Meetup took place in Rome, in the Sala Storica of LUISS ENLABS.

Agenda

  • Simone Scardapane and Gabriele Nocco, introduction to the "Machine-Learning e Data Science" Meetup
  • Gianluca Mauro (AI-Academy): Artificial intelligence for your business
  • Lorenzo Ridi (Noovle): Serverless Data Architecture at scale on Google Cloud Platform

Simone Scardapane and Gabriele Nocco, introduction to the "Machine-Learning e Data Science" Meetup

161022-simone-scardapane.jpg

(Simone Scardapane)

Simone quickly outlined the aims of the "Machine-Learning e Data-Science" Meetup.
“We want to create a community of ML, AI and Data Science enthusiasts and professionals”, Simone said, “a place to find answers to questions such as:

  1. I'm passionate about ML, where can I find other experts?
  2. We're looking for someone with ML expertise, do you know anyone?
  3. I'd like to get into ML, how do I start?”

gabrielenocco.jpeg

(Gabriele Nocco)

Gabriele Nocco announced that the second Meetup event would be held in Rome on 16 February 2017 at the Talent Garden in Cinecittà, in collaboration with the Google Dev Group of Rome. Attendance requires (free) registration on EventBrite.
“We will talk about TensorFlow and screen the keynote and some highlights of the first TensorFlow Dev Summit for all deep learning enthusiasts and, thanks also to Google's kind sponsorship, we will have our second networking session in the beautiful spaces at our disposal”, Gabriele said.
innocenzo-sansone.jpg

(Innocenzo Sansone)

Innocenzo Sansone, one of the organizers and sponsors, also spoke, noting that Codemotion would take place in Rome on 24 and 25 March 2017, with a dedicated Machine Learning track among others.

Gianluca Mauro (AI-Academy): Artificial intelligence for your business

170203-gianluca-mauro2.jpg

(Gianluca Mauro)

Gianluca, an engineer, entrepreneur and expert in AI, ML and Data Science, is also one of the three founders, together with Simone Totaro and Nicolò Valigi, of AI-Academy, a startup that aims to promote the use of Artificial Intelligence in business processes (see the AI Academy manifesto).

A brief history of Artificial Intelligence

In the first part of his talk Gianluca traced the historical development of Artificial Intelligence.
AI began with the conference held at Dartmouth (USA) in 1956, organized by John McCarthy, Marvin Minsky, Nathaniel Rochester and Claude Shannon. The term “artificial intelligence” was used there for the first time, and the goal was set of “building a machine that fully simulates human intelligence”, with research topics that would see great development in later years: neural networks, computability theory, creativity and natural language processing.

dartmouth.jpg
 
Image source: http://www.scaruffi.com/mind/ai.html

AI research was generously funded by the US government until the mid-1960s; in the absence of concrete results the funding stopped, leading to the first “AI winter” (1966-1980).
In the 1980s AI regained momentum thanks to a change of paradigm: instead of chasing the goal of artificially reproducing the whole of human intelligence, the field fell back on building “expert systems” able to simulate knowledge in narrow domains.
This second attempt also had little success, causing a new “AI winter” that lasted until the early 1990s, when a new discipline began to establish itself: Machine Learning.

What is Machine Learning?

AI.jpg

Image source: http://www.thebluediamondgallery.com/tablet/a/artificial-intelligence.html

Machine Learning, Gianluca explained, is a branch of Artificial Intelligence that aims to build algorithms which adapt automatically to the data they receive as input, so as to produce “intelligent” results such as predictions and recommendations.
Gianluca gave the example of a child learning to walk: there is no need to know the law of gravity; it is enough to watch how mum walks and keep trying until you find your balance.

What is Deep Learning?

Deep learning is a subset of Machine Learning concerned with the design, development, testing and above all the training of neural networks and other technologies for automatic learning.
AI's spectacular successes in board games include the chess victory of IBM's Deep Blue over the reigning world champion, Garry Kasparov (a result based on search rather than deep learning), and the victory of Google's AlphaGo, which does rely on deep learning, over the world Go champion.

This is the golden age of Artificial Intelligence

According to Gianluca Mauro this is a magic moment for AI, because we finally have the tools (algorithms, data, computing power) needed to build ML applications at ever lower cost.
The algorithms are by now well tested, thanks to work published over the years, starting with that of Corinna Cortes (“Support-vector networks”) and David Rumelhart (“Learning representations by back-propagating errors”).
Computing power is represented mainly by the large amount of open-source technology available.
The combination of all these factors is revolutionary, as Chris Dixon, one of the best-known figures in US venture capital, puts it:
“Most of the research papers, tools and software related to ML are open source. This has had a democratizing effect that allows small companies and even individuals to build really powerful applications. WhatsApp was able to build a global messaging system serving 900 million users with only 50 engineers, compared with the thousands of engineers needed for previous messaging systems. This "WhatsApp effect" is now happening in Artificial Intelligence. Software tools such as Theano and TensorFlow, combined with cloud data centers for training and low-cost GPUs for deployment, now allow small teams of engineers to build innovative and competitive artificial intelligence systems”.
According to Gianluca, AI will soon be a necessity for any company or, to quote Pedro Domingos: “A company without Machine Learning can’t keep up with one that uses it”.
According to Andrew Ng, chief scientist at Baidu, AI and ML are already transforming companies, because they will force them to overhaul their production processes, just as happened in the 1800s when cheap electricity first became available.
This cultural shift can already be felt in venture capital and in mergers and acquisitions: large companies are looking not only for startups doing pure ML research but also for startups that build services and products with ML embedded.
“We are at the dawn of a new era”, Gianluca concluded, “that of Machine Learning as a feature”.

Lorenzo Ridi (Noovle): Serverless Data Architecture at scale on Google Cloud Platform

lorenzo-ridi.jpeg

(Lorenzo Ridi)

Lorenzo Ridi presented a concrete use case (available in full, also on SlideShare) to show the advantages of building applications with embedded Machine Learning on Google Cloud Platform using only serverless components.
The use case concerns a company (ACME) that, as Black Friday approaches, decides to commission a study of social media, Twitter in particular, to capture insights useful for positioning its products correctly before and during the event. This is all the more crucial given the enormous size of the company's catalogue, because misdirecting the advertising and promotional campaign would be a fatal mistake.
To handle the heavy traffic expected during the event, the ACME engineers decide to abandon traditional technologies and implement this architecture on Google Cloud Platform using only serverless components:

170203-google-architecture-ml1.jpg
 
Ingestion

To retrieve the data, a simple Python application is implemented that uses the TweePy library to access the Twitter Streaming API and retrieve the stream of messages about Black Friday and related topics.
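The talk's own ingestion script is not reproduced here. As a rough sketch of the filtering idea only, the snippet below works on newline-delimited JSON strings and keeps the messages that mention a tracked keyword. It deliberately does not call TweePy or Twitter; the keyword list, the function names and the sample messages are all assumptions made for illustration.

```python
import json

KEYWORDS = ("black friday", "blackfriday")  # hypothetical tracking terms

def matches(tweet, keywords=KEYWORDS):
    """Return True if the tweet text mentions any keyword (case-insensitive)."""
    text = tweet.get("text", "").lower()
    return any(keyword in text for keyword in keywords)

def relevant_tweets(raw_lines):
    """Yield parsed tweets that match the keywords, skipping blank lines."""
    for line in raw_lines:
        line = line.strip()
        if not line:
            continue
        tweet = json.loads(line)
        if matches(tweet):
            yield tweet

# Stand-in for the live stream: a few hand-written messages.
sample_stream = [
    '{"id": 1, "text": "Best #BlackFriday deals so far"}',
    '',
    '{"id": 2, "text": "Nothing to see here"}',
    '{"id": 3, "text": "Queueing for Black Friday already"}',
]
kept_ids = [tweet["id"] for tweet in relevant_tweets(sample_stream)]
```

In a real deployment the streaming endpoint can do the keyword filtering server-side; the local filter here only makes the selection rule explicit.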
To make sure this component also meets the required reliability standards, it is run inside a Docker container on Google Container Engine, the Kubernetes implementation on Google Cloud Platform. This way we need not worry about outages or malfunctions: everything is managed (and, when necessary, automatically restarted) by Kubernetes.

First we create the Docker image that we will use for the deployment. All it takes is a suitably written Dockerfile containing the instructions to install the required libraries, copy our application and start the script:

170203-google-architecture-ml3.jpg
 
Et voilà! The Docker image is ready to run anywhere: on our laptop, on an on-prem server or, as in our case, inside a Kubernetes cluster. Deploying to Container Engine is very simple with the Google Cloud Platform command-line tool: just three commands, which create the Kubernetes cluster, obtain the access credentials and run the application in a scalable and reliable way inside a ReplicationController.
The second link in the chain, that is, the component to which our application will send the tweets, is Google Pub/Sub, a fully managed middleware solution that implements a Publisher/Subscriber architecture in a reliable and scalable way.
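To make the Publisher/Subscriber pattern concrete, here is a toy in-memory broker in plain Python. It is not the Google Pub/Sub client library and offers none of its guarantees; `InMemoryBroker`, its methods and the topic and subscription names are hypothetical and exist only to show that the publisher never needs to know who consumes the messages.

```python
from collections import defaultdict, deque

class InMemoryBroker:
    """Toy publish/subscribe broker: each subscription gets its own queue."""

    def __init__(self):
        # topic name -> {subscription name: queue of pending messages}
        self._subscriptions = defaultdict(dict)

    def subscribe(self, topic, name):
        self._subscriptions[topic][name] = deque()

    def publish(self, topic, message):
        # Fan out: every subscription on the topic receives the message.
        for queue in self._subscriptions[topic].values():
            queue.append(message)

    def pull(self, topic, name):
        """Return the next pending message for a subscription, or None."""
        queue = self._subscriptions[topic][name]
        return queue.popleft() if queue else None

broker = InMemoryBroker()
broker.subscribe("tweets", "dataflow")
broker.subscribe("tweets", "archive")
broker.publish("tweets", {"id": 1, "text": "Black Friday!"})

first = broker.pull("tweets", "dataflow")    # the published message
second = broker.pull("tweets", "dataflow")   # None: nothing left to read
archived = broker.pull("tweets", "archive")  # the other subscription's copy
```

Decoupling the ingestion script from the processing pipeline in this way is what lets each side scale or restart independently.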
In the processing phase we use two more tools from the Google Cloud Platform suite:

  • Google Cloud Dataflow is an open-source Java SDK, now known as Apache Beam, for building parallel processing pipelines. Cloud Dataflow is also the fully managed service, running on Google infrastructure, that executes pipelines written with Apache Beam in an optimized way.
  • Google BigQuery is a fully managed Analytic Data Warehouse solution. Its astonishing performance, which we have had occasion to highlight several times, makes it an optimal choice for Data Analytics architectures.

The pipeline we are going to design is extremely simple. In practice it will do nothing more than transform the JSON structure that identifies each tweet, sent by the Twitter API and delivered by Pub/Sub, into a BigQuery record structure. Then, through the BigQuery Streaming API, each record will be written to a table so that the data can be analyzed immediately.
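The JSON-to-record step can be sketched as a single pure function. The column names below are an assumed table layout, not the schema used in the talk, and the sample payload is hand-written; only the standard library is used, so nothing here touches BigQuery itself.

```python
import json
from datetime import datetime, timezone

def tweet_to_row(raw_json):
    """Flatten one tweet payload into a dict with a fixed set of columns."""
    tweet = json.loads(raw_json)
    user = tweet.get("user") or {}
    # Assumes the payload carries a millisecond epoch timestamp as a string.
    created = datetime.fromtimestamp(
        int(tweet["timestamp_ms"]) / 1000, tz=timezone.utc
    )
    return {
        "id": int(tweet["id"]),
        "text": tweet.get("text", ""),
        "user_name": user.get("screen_name"),
        "location": user.get("location"),
        "created_at": created.isoformat(),
    }

# Hand-written sample payload, for illustration only.
raw = json.dumps({
    "id": 42,
    "text": "Black Friday queue is huge",
    "timestamp_ms": "1480032000000",
    "user": {"screen_name": "example_user", "location": "Wyoming"},
})
row = tweet_to_row(raw)
```

Keeping the transformation free of I/O makes it easy to unit-test before it is plugged into a pipeline.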
The pipeline code is extremely simple; this is in fact one of the strengths of Apache Beam compared with other processing paradigms such as MapReduce. All we have to do is create an object of type Pipeline and then repeatedly call the apply() method to transform the data as needed. It is interesting to note that the data is read and written using two I/O elements included in the SDK: PubSubIO and BigQueryIO. There is therefore no need to write boilerplate code to implement the integration between systems.
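The chained-apply() style can be mimicked with a few lines of plain Python. This toy `Pipeline` class is not the Apache Beam SDK (whose apply() works on distributed collections and runs on a separate runner); it is a single-process sketch, with made-up transforms, of how a pipeline is declared first and executed afterwards.

```python
class Pipeline:
    """Toy pipeline: records each transform, then runs them in order."""

    def __init__(self, source):
        self._source = source
        self._steps = []

    def apply(self, transform):
        self._steps.append(transform)
        return self  # returning self is what allows the calls to be chained

    def run(self):
        data = iter(self._source)
        for step in self._steps:
            data = step(data)
        return list(data)

# Each transform takes an iterable and returns a new (lazy) iterable.
def parse(lines):
    return (line.strip().lower() for line in lines)

def keep_non_empty(items):
    return (item for item in items if item)

def to_row(items):
    return ({"text": item, "length": len(item)} for item in items)

rows = (
    Pipeline(["Black Friday ", "", "DEALS"])
    .apply(parse)
    .apply(keep_non_empty)
    .apply(to_row)
    .run()
)
```

Adding a processing step means adding one more apply() call, which is why an existing pipeline can be extended with little change to the surrounding code.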

Machine learning

To visualize the results we use Google Data Studio, a tool in the Google Analytics suite that lets you build various kinds of charts from different data sources, which naturally include BigQuery.
We can then share the dashboards, or make them publicly accessible, exactly as we would with a Google Drive document.

170203-ml5.jpg
 
This chart shows the number of tweets collected from each state of the Union. It certainly has impact, but it is not very useful for our purpose. Indeed, after some exploratory data analysis, we realize that with the collected tweets alone we cannot do very “advanced” analysis. We therefore need to revise our processing procedure to try to infer some more “interesting” knowledge.
Google Cloud Platform helps us here by offering a set of APIs, based on Machine Learning algorithms, whose purpose is precisely to add a pinch of “intelligence” to our analysis process. In particular we will use the Natural Language API, which will be useful for retrieving the sentiment of each tweet, that is, a numerical indicator of how positive (or negative) the text of the message is.

The API is very simple to use: it takes a text (our tweet) as input and returns two parameters:

  • Polarity (FLOAT ranging from -1 to 1) expresses the mood of the text: positive values denote positive feelings.
  • Magnitude (FLOAT ranging from 0 to +inf) expresses the intensity of the feeling. Higher values denote stronger feelings (whether anger or joy).

Our own simplistic definition of “sentiment” is simply the product of these two values. This way we can assign a numerical value to each tweet and, hopefully, extract some interesting statistics from it!
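The product of the two values can be written as a small helper. The function name and the range checks are additions for illustration; the input pairs are invented numbers, not output from the Natural Language API.

```python
def sentiment_score(polarity, magnitude):
    """Combine polarity in [-1, 1] and magnitude in [0, inf) into one signed score."""
    if not -1.0 <= polarity <= 1.0:
        raise ValueError("polarity must be between -1 and 1")
    if magnitude < 0:
        raise ValueError("magnitude must be non-negative")
    return polarity * magnitude

# (polarity, magnitude) pairs made up for the example.
scores = [sentiment_score(p, m) for p, m in [(0.8, 2.0), (-0.5, 3.0), (0.0, 4.0)]]
```

The sign of the score follows the polarity and its size grows with the magnitude, so a strongly worded but neutral text (polarity 0) still scores 0.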
The Dataflow pipeline is modified to include this new step in addition to the previous flow. The change is very simple and, given Cloud Dataflow's programming model, allows considerable reuse of the existing code.

With this new data we can carry out much more interesting analyses, which tell us about the geographic and temporal distribution of “sentiment” about the Black Friday event.
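In the talk these aggregations are done on the BigQuery data; the same grouping logic can be sketched locally in plain Python. The row layout (`state`, `day`, `sentiment`) and the values are invented for the example.

```python
from collections import defaultdict

def mean_sentiment_by(rows, key):
    """Average the 'sentiment' field of the rows, grouped by the given key."""
    totals = defaultdict(lambda: [0.0, 0])  # key value -> [sum, count]
    for row in rows:
        bucket = totals[row[key]]
        bucket[0] += row["sentiment"]
        bucket[1] += 1
    return {k: total / count for k, (total, count) in totals.items()}

# Invented sample rows; a real run would read them from the results table.
rows = [
    {"state": "WY", "day": "2016-11-24", "sentiment": -1.0},
    {"state": "WY", "day": "2016-11-25", "sentiment": -0.5},
    {"state": "CA", "day": "2016-11-25", "sentiment": 1.5},
]
by_state = mean_sentiment_by(rows, "state")  # geographic distribution
by_day = mean_sentiment_by(rows, "day")      # temporal distribution
```

Grouping by a different key (a vendor name, for instance) gives the per-brand trend with the same function.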
The map that follows, for example, shows the average sentiment recorded in each US state; darker colors represent more negative sentiment (that red square in the middle is Wyoming).

170203-google-architecture-ml8.jpg
 
This other analysis shows instead the sentiment trend for the three major US vendors: Amazon, Walmart and Best Buy. Starting from this simple analysis, with a little drill-down into the data, we managed to pick up some interesting facts:

  • Twitter users did not appreciate Walmart's decision to bring the start of its sales forward to the day before Black Friday, the Thanksgiving Day national holiday. Walmart's popularity was in fact undermined by this decision from early November; after all, the protection of workers is a universal issue.
  • Amazon's promotional sales (which opened on 18 November, ahead of Black Friday) were harshly criticized by users at first, with a drop in popularity that bottomed out on the 22nd. Later, however, the online retail giant regained ground on Best Buy, which instead seems to have kept its good reputation intact throughout the period.

170203-google-architecture-ml9.jpg


          Google DevFest + Linux Day Roma, 22 October 2016   


Themed sessions

  • Angular2
  • BigData
  • DevOps
  • Machine Learning
  • Mobile

Angular 2

The Web is advancing and meeting mobile on every platform: Angular 2 is a very powerful JavaScript framework for building desktop and mobile web applications. The new version is simpler to use but forces important up-front choices (not just plain old JavaScript). What are the advantages? Which architectural and learning strategies make sense?
Codelab on the Web and native Mobile parts, with a real mini project.
Prerequisites
Bring a laptop with Node installed. For native mobile development you also need NativeScript and an SDK (Android and/or iOS).
Registered attendees will receive the exact list of software to install, with instructions.
The best and luckiest registered attendees will win a T-shirt, a notepad and a JetBrains license of their choice.

BigData

What does Data Science mean? Why is it so successful right now, and why are all innovative companies looking for experts in analyzing large amounts of data?
The main techniques, and the not typically IT skills to develop, will be presented.
The evolution from MapReduce to Apache Spark and to the main services, including Google's new services for streaming huge data flows, explained (in English) by Mete of Google.
Codelab with Databricks and the best resources for learning.
Prerequisites
Just bring a laptop with Node installed. Everything is in the Cloud.
The best and luckiest registered attendees will win a T-shirt, a notepad and a JetBrains license of their choice.

DevOps

Developers of systems able to automate integration between different, distributed environments, particularly for the Cloud.
Lightweight virtualization with Docker, and techniques for creating and scaling containers and images, building pipelines and achieving a lean, fast, automated process.
Organizing and orchestrating systems with Google Kubernetes (in English)

Prerequisites

Bring a laptop with Docker installed: https://www.docker.com/products/docker
The best and luckiest registered attendees will win a T-shirt, a notepad and a JetBrains license of their choice.

Machine Learning

A practical introduction to Machine Learning in Python with scikit-learn.
Why Google (and many other successful companies) use ML in practically all of their application systems. Fields of application and the different uses.
Three realistic use cases covering classification, clustering and recommendation.

Prerequisites

Bring a laptop with Python and the associated libraries installed (Sklearn in particular). The simplest way is with Anaconda, a distribution that includes everything. https://www.continuum.io/downloads
Registered attendees will receive the exact list of software to install, with instructions.
The best and luckiest registered attendees will win a T-shirt, a notepad and a JetBrains license of their choice.

Mobile

Not just Android and, above all, not just smartphones.
Which approach and techniques to use to build a simple app for Android and iOS.
Android TV: what it is, how to find your way around, what can be done.
Different devices and platforms: what the future of a mobile developer will look like and which opportunities are the most interesting.
A treasure hunt with 3D virtual reality headsets.

Prerequisites

The lab is not for beginners. You must have the Android or iOS SDK installed.
Registered attendees will receive the exact list of software to install, with instructions.
The best and luckiest registered attendees will win a T-shirt, a notepad and a JetBrains license of their choice.

CoderDojo

CoderDojos are free clubs devoted to teaching computer programming to children. CoderDojo is an open, free movement organized into hundreds of independent clubs around the world. Each Dojo runs its activities on a non-profit basis, following the international Charter drawn up by the international CoderDojo Foundation (for information: www.coderdojo.com). The clubs' training activities revolve around play, mutual exchange and peer learning, under the one fundamental rule of every dojo: Above all, Be Cool.
During the event 30 children aged 8 to 13 will learn to program by playing together.

 


          Data Scientist - Engility Corporation - Reston, VA   
Cloudera and/or Hadoop Experience with web protocols, SOAP, RSS and other publishing tools. Engility is seeking Data Scientists to support an exciting mission...
From Engility Corporation - Tue, 16 May 2017 06:50:15 GMT - View all Reston, VA jobs
          O'Reilly - Learning Path From Python Programming to Data Science   


English | Size: 4.46 GB
Category: CBTS

Python has become the language of choice of data scientists for performing data analysis, visualization, and machine learning. If you're looking forward to implementing Python in your data science projects to enhance data discovery, then this is the perfect Learning Path for you. Starting out at the basic level, this Learning Path will take you through all the stages of data science in a step-by-step manner

          Industry Spotlight: How data science improves ALM   

If you’re an agile team, you may still be planning, developing, testing and deploying by instinct. But what if you bring data science into the picture? Enter HPE Predictive Analytics, which can surface everything from accurate planning estimates in agile projects to efficiencies in defect detection for continuous testing. SD Times spoke with Collin Chau, a … continue reading



          Best Practical Class Room Training For Data Science & Big Data/Data Analytics in Sequelgate - Hyderabad, India   
SequelGate is one of the best training institutes for Data Science & Big Data/Data Analytics training. We have been providing classroom and online training as well as corporate training. All Our Training Sessions are Comple...
          Data Scientist 6 month contract Madison, WI - See the USA!   
Beacon Hill Staffing Group, LLC Madison, WI
          Data Scientist - Drop - Toronto, ON   
Through our mobile app, users supercharge their debit and credit cards to automatically earn points on their every day spending at places such as Starbucks, Tim...
From Drop - Thu, 01 Jun 2017 02:38:43 GMT - View all Toronto, ON jobs
          Data Scientist   
Bay Area Techworkers San Ramon, CA