Data

A. 3D Regular Household Items Time-Series Dataset

With the rapid development of e-commerce and the retail industry, the variety and quantity of household items have continuously increased. Efficiently arranging and packing these regularly shaped three-dimensional items to maximize space utilization has become a critical issue in logistics management. Many companies record specific information about each item's size, weight, volume, and its handling in storage and transportation, aiming to optimize picking and distribution processes.

This dataset contains a total of 16,767 such order records, including the processing time of each order, the dimensions, volume, and weight of each item, and the quantity of items to be picked, sorted in a real-time sequence. Time-series sorted test data can better simulate the bin packing process in actual operations, making the test results more realistic. Some algorithms may perform poorly in real-world applications due to their inability to adapt to sequence requirements, or they may experience performance degradation or errors when faced with large volumes of continuous input data. Time-series sorted data helps verify whether algorithms can effectively handle specific constraints, such as requiring items to be loaded in a specific order or certain items not being allowed to be placed on top of others. This data can help identify bottlenecks and improvement points in algorithms, for example, by analyzing phases with low packing efficiency to find ways to improve the algorithm and enhance overall efficiency.

You can click here to download sample files.

Length Width Height Vol/CS Gross weight Case_pick_finish_time
35 20.3 20.8 29.5568 6.324 2022/06/07, 00:24:37
33.7 19.1 17 10.94239 5.874 2022/06/07, 00:24:47
35.3 26.7 24.1 22.714491 7.983 2022/06/07, 00:24:59
35 20.3 20.8 29.5568 6.327 2022/06/07, 00:25:12

B. 3D Regular Office Supplies Occupancy Dataset

In stationery stores, we often see a variety of office supplies arranged on shelves in a way that is both aesthetically pleasing and convenient for access. This arrangement requires consideration of each item's size, weight, and shape, aiming to maximize the use of limited space to accommodate more products. Efficiently arranging and packing these regularly shaped three-dimensional office supplies to maximize space utilization is also a significant challenge. Some office supply companies record information such as the name, dimensions, volume, weight, and occupancy rate of each office supply item in orders.

The dataset contains a total of 6,849 such order records, including the dimensions, volume, weight, and occupancy rate of each office supply item in the order. Notably, the dataset also includes the names of office supplies, providing rich occupancy information that helps optimize packing strategies. For example, many identical types of items can be grouped together to facilitate categorization and management. The occupancy rate information in the dataset helps evaluate the performance of algorithms in maximizing space utilization. Different office supplies may have different priorities and constraints during packing. The occupancy data can help verify whether an algorithm can handle these practical constraints, such as whether high-occupancy items need to be loaded first, or certain items cannot be packed together with others. The occupancy dataset for office supplies offers a more realistic testing environment for 3D bin packing algorithms, helping researchers evaluate and improve algorithm performance in real-world applications.

You can click here to download sample files.

name Length Width Height Proportion
0 Sealing tape 24.5 35.5 39 0.71
1 Neutral Pen 0.5mm bullet 32.5 42.5 33.5 0.54
2 Solid glue (white) 27.5 35.5 20 0.48
3 Marker (Black) 32 54 30.5 0.46

C. 3D Regular Long Board Dataset

In production and logistics management, long strips of board materials often face problems such as low space utilization efficiency and difficulty in arrangement when loading and placing due to their long sides. The packing requirements of this type of material are high, and algorithms are required to optimize space more accurately to maximize utilization. The long strip plate data set contains data of long strip plates of various sizes, and each data record the length, width, and height information of the plate.

The main feature of this dataset is that the material has a long side, which makes it relatively difficult to load in practice, especially when space is limited. The algorithm needs to pay special attention to the placement order and relative position of long materials to avoid wasting space and ensure the stability of loading. The long sheet material dataset provides a challenging test scenario for the packing algorithm, which is particularly suitable for evaluating the performance of the algorithm when processing materials with long sides. By analyzing these data, researchers can better optimize the algorithm and improve the loading efficiency and space utilization when facing such special-shaped materials.

You can click here to download sample files.

Length Width Height
0 62 35.5 13
1 273 26 6
2 85 61 12
3 151 62 8

Simulation Environment

To bridge the gap between simulation and real-world applications, we implemented three progressively realistic settings using the PyBullet physics engine:

These three settings form a tiered evaluation framework that allows us to analyze how different packing strategies perform under increasingly realistic constraints—from ideal geometric logic to full physical execution with robotics. This design helps reveal the robustness, adaptability, and real-world viability of each algorithm.

Our simulation environment is built with the PyBullet physics engine. The robotic arm used in Setting 3 is configured based on a 6-DOF UR5 model. During the simulation, the robot is required to perform the pick-and-place action from the loading area to the target position using inverse kinematics (IK) and motion planning.

We consider both geometric and physical constraints: geometrically, boxes must remain within the container without overlap; physically, we enforce gravity, friction, collision response, and ensure the robotic arm avoids self-collision and environmental obstacles.

The system is implemented in Python and supports both direct geometric simulation and full physics-based interaction. A unified interface encapsulates the simulation engine, robotic control, and evaluation metrics. Users can launch simulations with different settings via a single entry point, allowing convenient algorithm testing and scenario switching.

To enhance the reusability and scalability of our bin packing simulation environment, we have encapsulated the entire system into a Python package named packsim, now available on PyPI. Users can install it using the following command:

pip install packsim

After installation, the simulation process can be invoked in Python as follows:

from packsim import simulate result = simulate({ 'setting': 1, 'data': 'flat_long', 'method': 'PCT', 'test_data_config': 0, 'gui': 1, 'config': '/rl/pypi/default.yaml', 'action_path': '/rl/pypi/action.json', 'planning_time_path': '/rl/pypi/planning_time.json', 'save_path': '/rl/pypi' })

By calling the simulate function and passing a dictionary of parameters, users can execute a complete simulation process. The input parameters include the simulation setting, dataset name, test configuration ID, visualization option, path to the default configuration file, action sequence file, planning time log, and result save path. Upon completion, the system returns evaluation results including packing efficiency, decision trajectories, and execution time for subsequent algorithm benchmarking and performance analysis.

Before running the simulation, users need to generate action.json and planning_time.json using the Submit module, and configure the default.yaml file accordingly—especially the data path for testing.
You can click here to download sample files.

This module supports fast switching of algorithms and dataset configurations, making it a versatile tool for testing and comparing online bin packing algorithms. It offers high compatibility and strong visualization capabilities.