Counting duplicate values in Pandas DataFrame

You can use groupby with function size. Then I reset index with rename column 0 to count. print df Month LSOA code Longitude Latitude Crime type 0 2015-01 E01000916 -0.106453 51.518207 Bicycle theft 1 2015-01 E01000914 -0.111497 51.518226 Burglary 2 2015-01 E01000914 -0.111497 51.518226 Burglary 3 2015-01 E01000914 -0.111497 51.518226 Other theft 4 2015-01 E01000914 … Read more

Get group id back into pandas dataframe

A lot of handy things are stored in the DataFrameGroupBy.grouper object. For example: >>> df = pd.DataFrame({‘Name’: [‘foo’, ‘bar’] * 3, ‘Rank’: np.random.randint(0,3,6), ‘Val’: np.random.rand(6)}) >>> grouped = df.groupby([“Name”, “Rank”]) >>> grouped.grouper. grouped.grouper.agg_series grouped.grouper.indices grouped.grouper.aggregate grouped.grouper.labels grouped.grouper.apply grouped.grouper.levels grouped.grouper.axis grouped.grouper.names grouped.grouper.compressed grouped.grouper.ngroups grouped.grouper.get_group_levels grouped.grouper.nkeys grouped.grouper.get_iterator grouped.grouper.result_index grouped.grouper.group_info grouped.grouper.shape grouped.grouper.group_keys grouped.grouper.size grouped.grouper.groupings grouped.grouper.sort grouped.grouper.groups and so: … Read more

How to check if a pandas dataframe contains only numeric values column-wise?

You can check that using to_numeric and coercing errors: pd.to_numeric(df[‘column’], errors=”coerce”).notnull().all() For all columns, you can iterate through columns or just use apply df.apply(lambda s: pd.to_numeric(s, errors=”coerce”).notnull().all()) E.g. df = pd.DataFrame({‘col’ : [1,2, 10, np.nan, ‘a’], ‘col2’: [‘a’, 10, 30, 40 ,50], ‘col3’: [1,2,3,4,5.0]}) Outputs col False col2 False col3 True dtype: bool

Plotting Pandas Multiindex Bar Chart

import pandas as pd data = pd.DataFrame([ (‘Q1′,’Blue’,100), (‘Q1′,’Green’,300), (‘Q2′,’Blue’,200), (‘Q2′,’Green’,350), (‘Q3′,’Blue’,300), (‘Q3′,’Green’,400), (‘Q4′,’Blue’,400), (‘Q4′,’Green’,450), ], columns=[‘quarter’, ‘company’, ‘value’] ) data = data.set_index([‘quarter’, ‘company’]).value data.unstack().plot(kind=’bar’, stacked=True) If you don’t want to stack your bar chart: data.unstack().plot(kind=’bar’)

Grouped Bar graph Pandas

Using pandas: import pandas as pd groups = [[23,135,3], [123,500,1]] group_labels = [‘views’, ‘orders’] # Convert data to pandas DataFrame. df = pd.DataFrame(groups, index=group_labels).T # Plot. pd.concat( [ df.mean().rename(‘average’), df.min().rename(‘min’), df.max().rename(‘max’) ], axis=1, ).plot.bar()

Hata!: SQLSTATE[HY000] [1045] Access denied for user 'divattrend_liink'@'localhost' (using password: YES)