Check Missing dataX.isnull()X.notnull()Delete Missing DataX.drop_na() -- delete any row that has missing dataX.drop_na(how = all) - if all column in a row has missing data.X.drop_na(axis=1) - will delete a column that has missing dataX.drop_na(axis=1, how=all) - will delete a column if all value in column has missing data.Impute missing valueX.fillna(99)...
Thursday, 29 September 2022
4. Data Analysis - Data Ingestion
Data IngestionData Ingestionpd.read_csv("XYZ.csv", )pd.read_table("XYZ.csv", sep=",")pd.read_table("XYZ.csv", sep=",", header=None) -- pandas will provide header column.pd.read_table("XYZ.csv", sep=",", names=['a', 'b', 'c', 'd', 'e']) --- provide column names.pd.read_table("XYZ.csv", sep=",", names=['a', 'b', 'c', 'd', 'e'], index_col="Names") ---...
Wednesday, 28 September 2022
3. Data Analysis - Pandas Dataframe
Pandas Dataframe creationDataframe creation using dictionary ( with only column values)data1 = {State:["Karnataka", "Jharkhand"], Year:["2021", "2022"], Name:['ABC', 'DEF]}X= pd.DataFrame(data1) ------- dataframe creation with all featuresX= pd.DataFrame(data1, columns=["State", "Year"]) ---- dataframe creation with 2 featuresX= pd.DataFrame(data1,...
Tuesday, 27 September 2022
2. Data Analysis - Pandas Series
Series CreationSeries creation with default indexX= pandas.series([10,20,30,40]) -- by passing a listX.index(), X.values()print(X[0], X[[0,2,3]] , X[1:3] --- Access the series valueSeries creation with labeled indexX = pd.series([10,20,30,40], Index = ['l1', 'l2', 'l3', 'l4'])X['l1'], X[['l1', 'l2']], X['l2':'l4']Series creation using dictionarypd.Series(dict1)...
Monday, 26 September 2022
1. Data Analysis - NumPy Operations
Numpy Operations Numpy Array Creation - X = np.array[[10,20,30,40]]X = np.zeros(10)X= np.ones(10)X = np.empty (10)X=np.arange([1,11]) ---- will create 10 element array starting from element=1 to 10Array Creation using data typeX = np.array([10,20,30,40], dtype = np.floar64)Changing data typeX = X.astype(np.int32)Arithmetic...