How to Create Pandas DataFrame with Random Data


You can use the following basic syntax to create a pandas DataFrame that is filled with random integers:

df = pd.DataFrame(np.random.randint(0,100,size=(10, 3)), columns=list('ABC'))

This particular example creates a DataFrame with 10 rows and 3 columns where each value in the DataFrame is a random integer between 0 and 100.

The following examples shows how to use this syntax in practice.

Example 1: Create Pandas DataFrame with Random Data

The following code shows how to create a pandas DataFrame with 10 rows and 3 columns where each value in the DataFrame is a random integer between 0 and 100:

import pandas as pd
import numpy as np

#create DataFrame
df = pd.DataFrame(np.random.randint(0,100,size=(10, 3)), columns=list('ABC')) 

#view DataFrame
print(df)

    A   B   C
0  72  70  27
1  87  85   7
2   4  42  84
3  85  87  63
4  79  72  30
5  96  99  79
6  26  47  90
7  35  69  56
8  42  47   0
9  97   4  59

Note that each time you run this code, the random integers in the DataFrame will be different.

If you’d like to create a reproducible example where the random integers are the same each time, you can use the following piece of code immediately before you create the DataFrame:

np.random.seed(0)

Now each time you run the code, the random integers in the DataFrame will be the same.

Example 2: Add Column of Random Data to Existing DataFrame

Suppose we have the following existing pandas DataFrame:

import pandas as pd

#create DataFrame
df = pd.DataFrame({'team': ['A', 'B', 'C', 'D', 'E', 'F', 'G', 'H'],
                   'points': [18, 22, 19, 14, 14, 11, 20, 28],
                   'assists': [5, 7, 7, 9, 12, 9, 9, 4],
                   'rebounds': [11, 8, 10, 6, 6, 5, 9, 12]})

#view DataFrame
print(df)

  team  points  assists  rebounds
0    A      18        5        11
1    B      22        7         8
2    C      19        7        10
3    D      14        9         6
4    E      14       12         6
5    F      11        9         5
6    G      20        9         9
7    H      28        4        12

We can use the following code to add a new column called “rand” that contains random integers between 0 and 100:

import numpy as np

#add 'rand' column that contains 8 random integers between 0 and 100
df['rand'] = np.random.randint(0,100,size=(8, 1))

#view updated DataFrame
print(df)

  team  points  assists  rebounds  rand
0    A      18        5        11    47
1    B      22        7         8    64
2    C      19        7        10    82
3    D      14        9         6    99
4    E      14       12         6    88
5    F      11        9         5    49
6    G      20        9         9    29
7    H      28        4        12    19

Notice that the new column “rand” has been added to the existing DataFrame.

Additional Resources

The following tutorials explain how to perform other common operations in pandas:

How to Impute Missing Values in Pandas
How to Replace NaN Values with Zero in Pandas
How to Check if Cell is Empty in Pandas

One Reply to “How to Create Pandas DataFrame with Random Data”

  1. This manner, learnt representations ought to seize each population patterns and topic-stage
    variability. Comprehensive experiments reveal that the learnt representations of said relationship capture precious data from the intrinsic properties of the subject’s brain networks and result in improved accuracy in classifying a large population of heavy drinkers
    and non-drinkers from the Human Connectome Project.
    Sensor information can supplement logs obtained from
    in-recreation knowledge to offer extra info for predictive fashions and probably enhance their
    efficiency. A of each topic is extracted from the dMRI and structural MRI data.
    The graph estimation/regression task learns a parsimonious illustration of the population-level SC-FC relationship, and the
    classification activity makes use of subject labels as
    extra inputs for supervised learning. All in all, the novelty of our work is in studying low-dimensional, latent
    representations that capture the intrinsic attributes of the
    mind SC-FC coupling across a big population, while being discriminative for topic-stage classification. Brain connectomes, graph illustration learning, graph convolutional community, graph
    sign processing, encoder-decoder system, graph classification.

Leave a Reply

Your email address will not be published. Required fields are marked *