25 AI & Data Science Articles from Feb 2024

Generative AI can improve -- not replace -- predictive analytics

The NGA is supercharging its use of commercial satellite imagery & analytics with a new program dubbed “’Luno’”

China is building its own Starlink—even as questions surround Musk's constellation

This machine learning study tests the transformer’s ability of length generalization using the task of addition of two integers

Understanding transformers, how they've advanced LLMs—and what may replace them

Scale AI to set the Pentagon’s path for testing and evaluating large language models

How to solve binary classification problems using Bayesian methods in Python 

A quick rundown of the impact AI will have on data roles across the organization

Some of the top R packages every data scientist should be familiar with them 

Python Libraries for Geospatial Data Visualization: Transform Your Maps into Stories 

A list of premier YouTube channels exploring large language models

Python code commenting as a data scientist

New intelligence related to Russia’s attempts to develop a space-based antisatellite nuclear weapon 

The excitement surrounds large language models to the detriment of other equally valuable machine learning methodologies 

10 Prominent Data Science Predictions 2024

Bayesian Analysis with Python

Tech Companies turned Ukraine into an AI War Lab

The pace of innovation in the space sector is picking up thanks in part to AI and machine learning 

What a data scientist looks like in 2032 is likely to be starkly different than today

What an AI-powered future of data science looks like

Sony AI’s tech predictions for the year ahead 

10 emerging data science trends 

A machine learning engineer and data scientist has applied for more than 1,000 roles without any success

An empirical analysis about whether ML models make more mistakes when making predictions on outliers 

How a Surge in Satellites Will Revolutionize Intelligence

Tech Companies Turned Ukraine Into an AI War Lab

The collaboration between foreign tech companies and the Ukrainian armed forces, who say they have a software engineer deployed with each battalion, is driving a new kind of experimentation in military AI. The software processes raw intelligence from sources including drones, satellites, and Ukrainians on the ground, as well as radar that can see through clouds and thermal images that can detect troop movements and artillery fire. AI-enabled models can then present military officials with the most effective options to target and enemy positions.

Read more from TIME

15 Articles about Data Science & AI published in January

AI Definitions in Simple Language

AI Definitions

Agents - Unlike AI prompts requiring user conversations, AI agents work in the background. Users provide a goal (from researching competitors to buying a car) and the agent acts independently, generating task list and starting to work. 

Artificial General Intelligence (AGI) - AI that possesses human-level intelligence that can evaluate complex situations, apply common sense, and learn and adapt.  Beyond the goal of AGI lies the more speculative notion of "sentient AI," the idea that these programs might cross some boundary to become aware of their own existence and even develop their own wishes and feelings. 

AI Evolution

  1. Generative AI sounds like a person.

  2. AGI (artificial general intelligence) reasons like a person.

  3. Sentient AI thinks it's a person.

AI model collapse - The idea that AI can eat itself by training on internet data until it runs out of fresh data and trains on it’s on product or the product of another AI. Thus, errors and bias are magnified and rare data is more likely to be lost.

AI winter  - A period where funding and interest in the field subsided considerably.

*Algorithms - Direct, specific instructions for computers created by a human through coding that tells the computer how to perform a task. This set of rules has a finite number of steps that instruct the computer how to perform a task. More specifically, it is code that follows the algorithmic logic of “if”, “then”, and “else.”  

See the Entire List

19 Articles about Data Science & AI from Nov 2023

The Bitter Lesson

There’s a famous essay in the field of machine learning known as “The Bitter Lesson,” which notes that decades of research prove that the best way to improve AI systems is not by trying to engineer intelligence but by simply throwing more computer power and data at the problem. The lesson is bitter because it shows that machine scale beats human curation. And the same might be true of the web. Read more at The Verge

24 August articles about Data Science, AI, & Space

US spy satellite agency isn’t so silent about a coming launch that will allow it to access potential threats by continually track other objects in geosynchronous orbit 

The top geospatial intelligence brands in the world

China’s Constant Spying On Australian Drills From Space A Sign Of Shifting Orbital Balance

What is a liquid neural network, really?

7 ChatGPT Prompts To be a Better Data Scientist

What are LLMs bad at? Reference lists

“Space science is such a rarefied field that the developers don’t have the security skills to do a rigorous shakedown of a satellite”

GenAI Is Making Data Science More Accessible

5 Things You Need to Know When Building LLM Applications

What a hijacked satellite could do

Finding: “The larger the satellite the more vulnerable it was” to hacking

A study into the feasibility of hacking low-Earth orbit satellites has revealed that it's worryingly easy to do

Four types of learning in machine learning explained

Five essential Python packages for effectively handling and visualizing valuable insights from geospatial data

Stability AI known for its text-to-image generation model called Stable Diffusion has now released a

code generator called StableCode

Researchers say they have developed an optical neural network that can “significantly reduce the size and processing time of image sensors”

The importance of data cleaning in data science —what it is, the benefits of using it, & the commonly used tools

IBM and NASA open source an AI model for geospatial data analysis

AI startup Sweetspot is a search engine using LLMs to look for specific U.S. government contracts

How can Data Scientists use ChatGPT for developing Machine Learning Models?

“Five mistakes I made while switching to data science career”

“Liquid neural networks, a novel type of deep learning architecture offer a compact, adaptable and efficient solution to certain AI problems”   

Physicists have found that deep-learning AI technology can accurately quantify the amount of entanglement in a given system 

Scientists have trained a machine learning model in outer space

24 Data Science & AI Articles from July 2023

The future space economy could encompass activities that currently aren’t being pursued at scale, such as in-orbit manufacturing, power generation, & space mining, as well as scalable human spaceflight 

Will the future be filled with “networks of autonomous drones, deployed around the globe, helping humans keep conflict in check … or maybe the skies will darken with attack swarms”

A basic explanation of geospatial data & geospatial technology—how they are used and their limitations

Our hesitation, perceived or otherwise, to move forward with military applications of artificial intelligence will be punished

Our Oppenheimer Moment: The Creation of AI Weapons 

The AI-powered, totally autonomous future of war Is here

Experts imagine what artificial intelligence could mean for the future of satellites, space entrepreneurship, and government defense systems 

How much coding is needed in a data science career?

10 Specific Predictions about AI

A look at what sets Meta’s Llama 2 apart from its predecessor & other large language models—here’s the technical details & implications for data scientists

US sharpens military space race plan as Space Force is challenged to compete with China  

A review of major data science and AI developments during the first half of 2023

What’s missing from ChatGPT and other LLMs?

How does Bayesian inference work when estimating noisy interactions?

A US Army project called "Real-Time Threat Forecasting" hopes to create AI that can forecast enemy actions just minutes before the enemy actually does it—and continuously update that forecast as adversaries change their tactics

OpenAI rolls out a ChatGPT Plus feature called the Code Interpreter that can write and execute python code, and can work with file uploads

A primer on large language models

The latest trends in artificial intelligence and deep learning from the metaverse to quantum computing

“I don't think generative AI will displace predictive analytics"

Understanding the difference between advanced and predictive analytics

The advantages of Causal AI over traditional machine learning

Most of the large language models developed in China are nearly 2 years behind the US—a gap that would be a challenge to close even if American firms had to adjust to regulation

A Chinese satellite manufacturer and constellation operator says it has successfully demonstrated space-to-ground high-speed laser communications— transmitting data 10x faster thanks to lasers

The engineering applications of machine learning and predictive analytics

27 Data Science & AI articles from June 2023

An argument for bigger quantum neural networks

In-orbit demonstration of a re-trainable machine learning payload for processing optical imagery

7 ways ChatGPT makes you code better and faster

Are data scientists still needed in the age of generative AI? Not according to this opinion piece

Making Predictions: A Beginner’s Guide to Linear Regression in Python

Air Force studying ‘military applications’ for artificial intelligence like ChatGPT  

24 articles worth reading about the dangers of AI (beyond security issues)

Open-source AI chatbots are booming

A hacking conference (DEF CON 31) has invited hackers to find bugs and biases in AI  

9 articles worth reading about the security dangers of AI

Neural Networks need data even fake data to learn: Why researchers turn to synthetic data to train their artificial intelligence systems

China tests first-ever low-Earth orbit constellation to rival SpaceX's Starlink

Intelligence analysts confront the reality of deepfakes  

The NGA is hailing the value of AI tools & machine learning to analyze 1000s of satellite images

How the rise of low Earth orbit satellites can disrupt how militaries fight

Space Force reconsiders the use of the Global Positioning System constellation

USGIF white paper on GEOINT opportunities created by AI related to synthetic training data

A look at how the commercial satellite economy got to where it is today

Mastering the art of data storytelling: A guide for data scientists

A system based on Google DeepMind’s AlphaZero AI can create algorithms that will sort data faster than algorithms built by people

A visual introduction to neural networks

Asking ChatGPT to write you a malicious code

Mutating malware can be built using the ChatGPT

New US spy satellites to track Chinese, Russian threats in orbit

Five ways to help your data science team collaborate more effectively

Many commercial-satellite operators are still creating overly ambitious plans

NGA: AI has come a long way but “not good enough” to justify a pause in development

26 articles about Data Science & AI published this month

A tutorial on how to perform a deep learning task in R

Why North Korea's satellite launch attempt may be 'first of many'

A deep dive into GPT models: evolution & performance comparison

Data science is “about building, training and maintaining AI systems”

Is data science evolving into a branch of contemporary AI?

NRO hopes AI & machine learning will help it as it is “awash in satellites and their data”

New app aims to streamline ordering satellite images from more than a dozen companies

UC Berkeley researchers weigh in on neural networks

Israel aiming to become an “AI superpower" by streamlined combat decision-making

Five key components that contribute to the successful scaling of data science projects

“China’s embrace of AI for warfare has touched off alarm bells everywhere from Silicon Valley to the Pentagon”

BlackSky suggests China may be hacking Western satellites using laser directed-energy weapons

OpenAI uses its GPT-4 language model to write explanations for the behavior of neurons

HuggingChat Python API is a free & open source alternative to commercial chat offerings such as ChatGPT

Python Pandas is an open-source toolkit for data scientists using the Python—here’s what it can do

Never neglect to monitor your machine-learning models

China lands mysterious spaceplane after 276 days in orbit

Some essential statistical concepts applicable in data science and machine learning

Pentagon & intell agencies are making it clear they plan to use such tools as ChatGPT

A simple introduction to Geospatial Intelligence (GEOINT)

What exactly is data science and how did it get its start?

The USGIF has set up a new working group focused on “Space Situational Awareness”

Can ChatGPT work as a personalized tutor for learning data science concepts?

Solving complex AI Tasks with HuggingGPT

Liquid neural networks could generalize to scenarios that they had never seen

Four different approaches to data analytics

33 Data Science & AI articles from April 2023

Drones equipped with liquid neural networks edged out other AI systems when navigating unknown territory

A pitch for using cultural consensus theory to mitigate large language model bias  

An inside look at geospatial intelligence with the CEO of the US Geospatial Intelligence Foundation

The base rate fallacy and its impact on data science

Some examples of how AI is advancing current space efforts

China is building sophisticated cyber-weapons to "seize control" of enemy satellites (FT subscription)

We might see as many as 65,000 satellites in orbit by 2030

“The disruptive nature of the 4IR, which brings both complex risks and unprecedented opportunities”

"The Chinese military could soon deploy a high-altitude spy drone that travels at least three times the speed of sound” 

How many satellites are orbiting around earth?

The core topics you need to focus on to become an AI Data Scientist

China is trying to get around export restrictions to acquire military-relevant technology, such as "launchers with intelligence, surveillance, reconnaissance and communication satellites”

Launch providers must make tricky decisions on how to ramp up capacity as the space economy expands

An overview of some the most influential Deep Learning papers of the last decade

AutoGPT basics

KD Nuggets offers predictions for AI in the next decade 

The 10 most innovative space companies of 2023

An advanced satellite surveillance imagery system—the LAPIS time-series video

60 ChatGPT prompts for data science with ratings

“Claims the NGA unlawfully bypassed their commercial product to sink funds into a $376 million project”

“The new DoD satellite acquisition model favors a spiral development timeline” 

When data scientists are working with sparse data, there are several machine learning models to help

“A national scarcity of geodesists could threaten critical intelligence community missions”

“Geodesy is an essentially nonexistent expertise in the US”

China’s military aims to launch 13K satellites in the race for low-earth orbit dominance

Why you don’t need big data to train machine learning

The Top 19 skills you need to know in 2023 to be a data scientist 

Why the chances of being able to fully explain AI may become impossible for humans to comprehend

Activation functions can be used to design neural networks that achieve better performance on any dataset 

Can neural networks be optimized for certain tasks? MIT researchers think so

How rapid growth in drone use and EU Regulations will accelerate demand for satellite connectivity 

How to Use ChatGPT to Improve Your Data Science Skills

A short video explanation of geospatial intelligence (GEOINT) in simple terms

The Potential of AI using Liquid Neural Networks

Large language models like ChatGPT and Dall-E have billions of parameters, and each improved model increases in size and complexity. Researchers at an MIT lab believe artificial intelligence can make a leap forward by going smaller. Their experiments show liquid neural networks beat other systems when navigating in unknown environments. “Liquid neural networks could generalize to scenarios that they had never seen, without any fine-tuning, and could perform this task seamlessly and reliably.” They also open the proverbial black box of the system’s decision-making process, which could help to root out bias and other undesirable elements in an AI model. The results have immediate implications for robotics, navigation systems, smart mobility, and beyond toward predicting financial and medical events. Read more here.

29 Data Science & Geospatial Articles from March 2023

Smaller, simpler neural network models are always more suitable for real-world applications

“Russia has expressed its willingness to target space assets, including commercial communications systems, adding to the U.S. urgency of developing warfighting tactics.”

US vs China—a video about the race to launch the next generation of space telescopes

China is preparing to launch its first satellites for a national low Earth orbit broadband megaconstellation to challenge SpaceX’s Starlink

Pentagon Prepares for Space Warfare as Potential Threats From China, Russia Grow

“The ideal size and intricacy of neural networks remain a matter of debate in the AI community, raising the question: Does neural network complexity matter?”

Remote sensing companies try to capture bigger piece of satellite imaging market

What data scientists need to know about machine learning

A list of free data science courses—from web scraping, statistics/probability, data analytics, SQL to business intelligence

The value of predictive models — cartography when data is very scarce

Quantum computers are a security threat before they even exist thanks to the encryption-breaking threats it posses

Space Force Wants $60 Million for Ultra-Quick Satellite Launches—with just 24 hour notice  

“The era of small satellites in Low-Earth Orbit is upon us”: Satellite manufacturers look to benefit from a multi-orbit future

China launches second classified high resolution remote sensing satellite

China’s secret naval base in Cambodia, through satellite imagery

Four machine learning trends to watch in 2023

Valuable GitHub repositories for data engineering

OpenAI’s price cut is “a warning sign that this may be a business with few producers"

“The launch of ChatGPT & Whisper APIs is expected to have a profound impact on the community of developers”

Documents detail 65-year effort to monitor an increasingly crowded orbital environment: A report on the US space surveillance network

Chinese research institutes are working to construct a quantum communications network using satellites in low and medium-to-high Earth orbits

The paradox that explains why “too much aggregation of data can become useless and start to introduce bias”

31 Generative AI Tools for text, images, & more with descriptions

A Chinese satellite launched in 2018 has been inspecting other nations' spacecraft high above Earth in geostationary orbit

Debating the rules of a conflict in orbit

Data Cleaning with Python Cheat Sheet

Diving into the world of quantum machine learning by exploring an advanced project utilizing a sample dataset

A systematic approach to retraining deep-learning artificial intelligence algorithms to deal with different situations

The difference between the roles of questions versus decisions in data science

20 Data Science articles from February 2023

Five statistical paradoxes that data scientists should be aware of in order to do accurate analysis

What Pentagon leaders say they have learned from a year of battle in Ukraine:"The power of information is winning”

Software to sow doubts as you meta-analyze  

Machine learning is vulnerable to a wide variety of attacks. How the adversary can disrupt model training and even introduce backdoors

How Pandas alternatives—Polars, DuckDB, Vaex, and Modin—stack up to one of the most popular libraries in Python

Six of the most important types of machine learning algorithm 

“Big Data is real, but most people may not need to worry about it”

The ChatGPT prompts any data scientist must use

No, chatbots aren’t sentient. Here’s how their underlying technology works

5 Common Data Analytics Types Explained in Laymen’s Terms

Using the metaverse to virtually assemble and test AI war machines for the US military

Researchers discover a more flexible approach to machine learning—liquid neural nets

The evolving role of the data engineer

Top Predictive Analytics Trends in 2023

Even the pentagon Is using ChatGPT—the DoD’s used it to write a press release about a new counter-drone task force

How NGA Is integrating commercial analytic services into agency workflows

Python string matching without complex RegEx Syntax

Six python libraries especially useful to data engineers and natural language processing

Can ChatGPT write better code than Data Scientist? 

Researchers say ChatGPT can “weed out errors with sample code and fix it better than existing programs designed to do the same.”

25 Data Science Articles from Dec 2022

A Pandas DataFrame cheatsheet for exploratory analysis & data manipulation 

Five ways that data roles will change in 2023 related to Chief Data Officers

AI & machine learning are “top of mind for the Army, especially as it pertains to protecting its assets in space”

10 weird things about SpaceX's more than 3,000 Starlink satellites (and that number keeps growing)

Initial specific steps toward launching a machine learning project 

Adobe has just released a remarkable and free AI-powered enhanced speech tool

The four biggest trends they expect to shape the AI landscape in 2023

Synthetic data applications, limitations & vulnerabilities

A guide to the roles and responsibilities on a data migration team

A tech journalist goes back to high school to find out what OpenAI’s Chatbot can pass AP Lit

The current limitations of AI’s military impact & where tech could one day spark “revolutionary changes” 

How Bayesian network structure learning can incorporate missing data 

The NGA has plans to develop an overarching cloud-based enterprise management system capable of automating its data collection and dissemination and ultimately replacing the overall Foundation GEOINT storage and management process 

A new paper on “Localization and classification of space objects using EfficientDet detector for space situational awareness”

Potential uses of ChatGPT for data scientists

McKinsey on the state of AI since the research firm began tracking it five years ago

A new collaborative effort is designed to “support interoperable open map data as a shared asset that can strengthen mapping services worldwide”

Different kinds of geospatial specialists are needed in different situations

China outpaces efforts by U.S. intelligence agencies to harness power of publicly available data 

The Space Dev Agency’s first major satellite launch has been delayed again

A look under the hood: How does ChatGPT work internally? 

An AI method from MIT and IBM research “improves the training and inference performance of deep learning models on large graphs”

Some basics about the new AI called ChatGPT 

Why Neural Network explainability is important, how to do it, & the tools for it

“The FCC approved part of SpaceX’s application for the second generation of the Starlink constellation, which will allow SpaceX to deploy up to 7,500 satellites”