Using skore with scikit-learn compatible estimators#

This example shows how to use skore with scikit-learn compatible estimators.

Any model that can be used with the scikit-learn API can be used with skore. Use evaluate() to create a report from any estimator that implements fit and predict methods (or only predict, if the estimator is already fitted).

Note

When computing the ROC AUC or ROC curve for a classification task, the estimator must have a predict_proba method.
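
As a quick sanity check before requesting probability-based metrics, one can test for the method directly. A standalone sketch using two built-in scikit-learn estimators (not part of this example's pipeline):

```python
from sklearn.linear_model import LogisticRegression
from sklearn.svm import LinearSVC

# LogisticRegression exposes predict_proba, so ROC AUC and the ROC
# curve can be computed from its probability estimates.
print(hasattr(LogisticRegression(), "predict_proba"))  # True

# LinearSVC only provides decision_function, so probability-based
# metrics are not available for it.
print(hasattr(LinearSVC(), "predict_proba"))  # False
```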

In this example, we showcase a gradient boosting model (XGBoost) and a custom estimator.

Note that this example is not exhaustive; many other scikit-learn compatible models can be used with skore.

Generate a classification dataset#

To illustrate the compatibility with scikit-learn estimators, we first generate a synthetic binary classification dataset with only 1,000 samples.

import pandas as pd
import skrub
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=1_000, random_state=42)
X = pd.DataFrame(X, columns=[f"Feature_{i}" for i in range(X.shape[1])])
skrub.TableReport(X)

Gradient-boosted decision trees with XGBoost#

While skore is designed to be fully compatible with the classifiers and regressors from the scikit-learn library, it is also compatible with any classifier or regressor that follows the scikit-learn API, as defined in the scikit-learn documentation.
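
To verify that an estimator actually follows this API, scikit-learn ships a conformance test suite. A minimal sketch, shown here on a built-in estimator (the same call works for any compatible class):

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.utils.estimator_checks import check_estimator

# check_estimator runs scikit-learn's API conformance checks and
# raises an error on the first failed check.
check_estimator(KNeighborsClassifier())
print("KNeighborsClassifier follows the scikit-learn API")
```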

Here, we showcase a gradient-boosted decision tree model from the XGBoost library, which follows exactly this paradigm.

from skore import evaluate
from xgboost import XGBClassifier

xgb = XGBClassifier(n_estimators=50, max_depth=3, learning_rate=0.1, random_state=42)

xgb_report = evaluate(xgb, X, y, splitter=0.2, pos_label=1)
xgb_report
XGBClassifier(base_score=None, booster=None, callbacks=None,
              colsample_bylevel=None, colsample_bynode=None,
              colsample_bytree=None, device=None, early_stopping_rounds=None,
              enable_categorical=False, eval_metric=None, feature_types=None,
              feature_weights=None, gamma=None, grow_policy=None,
              importance_type=None, interaction_constraints=None,
              learning_rate=0.1, max_bin=None, max_cat_threshold=None,
              max_cat_to_onehot=None, max_delta_step=None, max_depth=3,
              max_leaves=None, min_child_weight=None, missing=nan,
              monotone_constraints=None, multi_strategy=None, n_estimators=50,
              n_jobs=None, num_parallel_tree=None, ...)


We get the same report as when using a scikit-learn classifier, and we can access its different elements.

xgb_report.metrics.summarize().frame()
Metric             XGBClassifier
Accuracy                0.900000
Precision               0.989899
Recall                  0.837607
ROC AUC                 0.980126
Log loss                0.218888
Brier score             0.064364
Fit time (s)            0.039216
Predict time (s)        0.001271
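
For reference, the classification metrics in this summary correspond to standard scikit-learn scorers. A minimal standalone sketch on toy labels (unrelated to the dataset above):

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score

y_true = [0, 1, 1, 0, 1]
y_pred = [0, 1, 0, 0, 1]

# 4 of 5 labels are correct.
print(accuracy_score(y_true, y_pred))   # 0.8
# Both predicted positives are true positives.
print(precision_score(y_true, y_pred))  # 1.0
# 2 of the 3 actual positives are recovered.
print(recall_score(y_true, y_pred))
```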


We can easily get the summary of metrics, and also, for example, a ROC curve plot:

_ = xgb_report.metrics.roc().plot()
[Figure: ROC curve for XGBClassifier (positive label: 1, data source: test set)]

We can also inspect our model:

_ = xgb_report.inspection.permutation_importance().plot()
[Figure: Permutation importance of XGBClassifier on the test set]

Custom model#

Now, we showcase how to create a custom estimator that follows the scikit-learn requirements.

Here, we create a nearest neighbor classifier:

import numpy as np
from sklearn.base import BaseEstimator, ClassifierMixin
from sklearn.metrics import euclidean_distances
from sklearn.utils.multiclass import unique_labels
from sklearn.utils.validation import check_is_fitted, validate_data


class CustomClassifier(ClassifierMixin, BaseEstimator):
    """A minimal 1-nearest-neighbor classifier."""

    def __init__(self):
        pass

    def fit(self, X, y):
        # Validate the input and memorize the training set.
        X, y = validate_data(self, X, y)
        self.classes_ = unique_labels(y)
        self.X_ = X
        self.y_ = y
        return self

    def predict(self, X):
        # Assign each sample the label of its closest training point.
        check_is_fitted(self)
        X = validate_data(self, X, reset=False)
        closest = np.argmin(euclidean_distances(X, self.X_), axis=1)
        return self.y_[closest]
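
To sanity-check the nearest-neighbor rule used above, here is a tiny standalone sketch of the same argmin-over-distances logic on hand-made data (independent of the class itself):

```python
import numpy as np
from sklearn.metrics import euclidean_distances

# Training set: two well-separated points with known labels.
X_train = np.array([[0.0, 0.0], [10.0, 10.0]])
y_train = np.array([0, 1])

# Each query point receives the label of its closest training point.
X_query = np.array([[1.0, 1.0], [9.0, 9.0]])
closest = np.argmin(euclidean_distances(X_query, X_train), axis=1)
print(y_train[closest])  # [0 1]
```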


Conclusion#

This example demonstrated how skore can be used with scikit-learn compatible estimators, allowing practitioners to apply consistent reporting and visualization tools across different models.

See also

For an example of wrapping Large Language Models (LLMs) to be compatible with scikit-learn APIs, see the tutorial on Quantifying LLMs Uncertainty with Conformal Predictions. The article demonstrates how to wrap models like Mistral-7B-Instruct in a scikit-learn-compatible interface.

Total running time of the script: (0 minutes 2.915 seconds)

Gallery generated by Sphinx-Gallery