<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[CloudCraftAI with Jintao]]></title><description><![CDATA[I love open source and love to share.
I'm the maintainer of #Kubernetes Ingress-NGINX | Microsoft MVP | CNCF Ambassador]]></description><link>https://blog.moelove.info</link><generator>RSS for Node</generator><lastBuildDate>Sat, 18 Apr 2026 13:36:59 GMT</lastBuildDate><atom:link href="https://blog.moelove.info/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[Why I Failed to Build a Lego-Style Coding Agent]]></title><description><![CDATA[I wanted it simple. I made it simple. Then I discovered that making it actually useful meant adding feature after feature. What started as building blocks became an entire castle.

The Beginning: A Simple Idea
On November 30, 2025, I made my first co...]]></description><link>https://blog.moelove.info/why-i-failed-to-build-a-lego-style-coding-agent</link><guid isPermaLink="true">https://blog.moelove.info/why-i-failed-to-build-a-lego-style-coding-agent</guid><category><![CDATA[coding]]></category><category><![CDATA[agentic AI]]></category><category><![CDATA[llm]]></category><category><![CDATA[AWS]]></category><dc:creator><![CDATA[Jintao Zhang]]></dc:creator><pubDate>Tue, 13 Jan 2026 15:03:32 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1768316550433/d7548021-5d1a-4d0a-bdc2-29da7ad3b8a6.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote>
<p>I wanted it simple. I made it simple. Then I discovered that making it actually <em>useful</em> meant adding feature after feature. What started as building blocks became an entire castle.</p>
</blockquote>
<h2 id="heading-the-beginning-a-simple-idea">The Beginning: A Simple Idea</h2>
<p>On November 30, 2025, I made my first commit: <code>amcp agent init</code>. In the README, I described it like this:</p>
<blockquote>
<p>A Lego-style coding agent CLI with built-in tools (grep, read files, bash execution) and MCP server integration for extended capabilities.</p>
</blockquote>
<p><strong>Lego-style</strong>—that was my north star. I envisioned a coding agent that worked like Lego bricks:</p>
<ul>
<li><p><strong>Minimal core</strong>: Just <code>grep</code>, <code>read_file</code>, and <code>bash</code>—the essentials</p>
</li>
<li><p><strong>Composable</strong>: Extend capabilities through the MCP protocol</p>
</li>
<li><p><strong>Lightweight</strong>: Only <strong>2,482 lines</strong> of Python</p>
</li>
<li><p><strong>Few dependencies</strong>: Just <code>typer</code>, <code>rich</code>, <code>pydantic</code>, <code>mcp</code>, and <code>openai</code></p>
</li>
</ul>
<p>And I succeeded. The initial AMCP was a clean, focused CLI tool:</p>
<pre><code class="lang-bash">src/amcp/
├── agent.py       <span class="hljs-comment"># 620 lines - Main agent loop</span>
├── tools.py       <span class="hljs-comment"># 511 lines - Tool definitions</span>
├── chat.py        <span class="hljs-comment"># 579 lines - Conversation handling</span>
├── cli.py         <span class="hljs-comment"># 265 lines - CLI entry point</span>
├── config.py      <span class="hljs-comment"># 169 lines - Configuration loading</span>
├── mcp_client.py  <span class="hljs-comment"># 102 lines - MCP integration</span>
└── readfile.py    <span class="hljs-comment">#  47 lines - File reading</span>
</code></pre>
<p>Simple. Beautiful. Complete. Or so I thought.</p>
<h2 id="heading-turning-point-1-the-context-window-explodes">Turning Point #1: The Context Window Explodes</h2>
<p>Two weeks after launch (December 14, 2025), I hit my first real problem: <strong>context window overflow</strong>. When an agent works on complex tasks, conversation history grows indefinitely. My initial solution was brutal—just keep the last 20 messages. But that meant the agent would "forget" critical context from earlier in the session. I had no choice but to add <code>compaction.py</code> (+155 lines):</p>
<pre><code class="lang-python"><span class="hljs-comment"># 467d72b: feat: add context compaction</span>
<span class="hljs-class"><span class="hljs-keyword">class</span> <span class="hljs-title">Compactor</span>:</span>
    <span class="hljs-string">"""Intelligently compress conversation history while preserving key information."""</span>
</code></pre>
<p>This was the first "mandatory brick." Without it, the agent couldn't complete complex, multi-step tasks.</p>
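<p>The stub above hides the interesting part. As a rough illustration of what "intelligently compress" can mean in practice (keep the system prompt and the most recent messages, collapse everything older into a single summary message), here is a minimal sketch. The names and signature are illustrative, not AMCP's actual API, and <code>summarize</code> stands in for an LLM call:</p>

```python
# Illustrative compaction sketch; not AMCP's actual implementation.
from typing import Callable

Message = dict  # e.g. {"role": "user", "content": "..."}

def compact(history: list[Message],
            summarize: Callable[[list[Message]], str],
            keep_recent: int = 20) -> list[Message]:
    """Collapse everything except the system prompt and the most recent
    messages into one summary message."""
    if len(history) <= keep_recent + 1:
        return history
    system, rest = history[0], history[1:]
    old, recent = rest[:-keep_recent], rest[-keep_recent:]
    summary = {"role": "system",
               "content": f"Summary of earlier conversation: {summarize(old)}"}
    return [system, summary, *recent]
```

<p>The real trade-off lives in <code>summarize</code>: a cheap heuristic loses detail, while an LLM-generated summary costs an extra model call on every compaction.</p>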
<h2 id="heading-turning-point-2-not-everyone-uses-openai">Turning Point #2: Not Everyone Uses OpenAI</h2>
<p>Two days later (December 16, 2025), reality knocked again: not everyone uses OpenAI.</p>
<pre><code class="lang-bash">a9455d5: feat: add support <span class="hljs-keyword">for</span> ACP
fb6b08c: add anthropic and open_response LLM format support
</code></pre>
<p>This added <strong>2,709 lines of code</strong>:</p>
<ul>
<li><p><code>acp_agent.py</code>: 752 lines (Agent Client Protocol support)</p>
</li>
<li><p>Anthropic Claude integration</p>
</li>
<li><p>OpenAI Responses API format support</p>
</li>
</ul>
<p>What started as a simple <code>openai.chat.completions.create()</code> call was now a multi-headed abstraction layer. Different APIs, different formats, different quirks—all needing adaptation.</p>
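<p>The shape of that adaptation layer can be sketched in a few lines. The field placement below follows the public OpenAI and Anthropic APIs (system prompt inside the message list vs. a top-level parameter, plus Anthropic's required <code>max_tokens</code>), but the payloads are heavily simplified:</p>

```python
# Why multi-LLM support needs an adapter layer: the same logical request
# must be reshaped per provider. Payloads here are heavily simplified.
def to_openai(system: str, messages: list[dict]) -> dict:
    # OpenAI's chat API puts the system prompt inside the messages list.
    return {"messages": [{"role": "system", "content": system}, *messages]}

def to_anthropic(system: str, messages: list[dict]) -> dict:
    # Anthropic's Messages API takes the system prompt as a top-level
    # parameter and requires max_tokens on every request.
    return {"system": system, "messages": messages, "max_tokens": 4096}
```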
<h2 id="heading-turning-point-3-one-agent-isnt-enough">Turning Point #3: One Agent Isn't Enough</h2>
<p>On Christmas Day 2025, I realized that complex tasks need multiple agents working together:</p>
<pre><code class="lang-bash">e61bba1: add multiple agents
5e58fc3: add TaskTool and EventBus
</code></pre>
<p>This added <strong>2,246 lines of code</strong>:</p>
<ul>
<li><p><code>multi_agent.py</code>: 375 lines</p>
</li>
<li><p><code>message_queue.py</code>: 531 lines</p>
</li>
<li><p><code>event_bus.py</code> (eventually grew to 635 lines)</p>
</li>
<li><p><code>task.py</code> (now 859 lines)</p>
</li>
</ul>
<p>The jump from single-agent to multi-agent was a qualitative shift. My Lego bricks were becoming a castle.</p>
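<p>The core of the EventBus idea fits in a dozen lines; the 635 lines come from everything around it (async delivery, filtering, error isolation). A minimal synchronous sketch, with illustrative names:</p>

```python
# Minimal publish/subscribe sketch of the EventBus idea.
from collections import defaultdict
from typing import Any, Callable

class EventBus:
    def __init__(self) -> None:
        self._subscribers: dict[str, list[Callable[[Any], None]]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Callable[[Any], None]) -> None:
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, payload: Any) -> None:
        # Deliver to every handler registered for this topic.
        for handler in self._subscribers[topic]:
            handler(payload)
```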
<h2 id="heading-turning-point-4-users-need-extensibility">Turning Point #4: Users Need Extensibility</h2>
<p>Between December 28, 2025 and January 2, 2026, I added two extensibility systems:</p>
<pre><code class="lang-bash">d746a8c: feat: add Hooks system <span class="hljs-keyword">for</span> extensible agent behavior (v0.5.0)
17eeeb3: feat: add skills and commands system <span class="hljs-keyword">for</span> agent extensibility
</code></pre>
<p>The Hooks system (+886 lines) lets users inject custom logic before and after tool execution. The Skills and Commands system (+836 lines) enables dynamic capability loading. I initially thought these were "nice to have." But when I started actually using AMCP for real work:</p>
<ul>
<li><p>Dangerous operations needed validation → PreToolUse hooks</p>
</li>
<li><p>Everyone on the team had their own workflows → Commands</p>
</li>
<li><p>Different projects needed different expertise → Skills</p>
</li>
</ul>
<p>Features I thought were optional turned out to be essential.</p>
<h2 id="heading-turning-point-5-production-needs-a-server">Turning Point #5: Production Needs a Server</h2>
<p>January 7-9, 2026 brought the biggest refactor:</p>
<pre><code class="lang-bash">7f0ac35: feat: init C/S architecture
52c93c2: feat(server): complete Phase 2 - streaming &amp; events
af7ce69: feat: complete Phase 3 - CLI Client SDK
f923591: feat: protocol and sessions
</code></pre>
<p>This added <strong>3,266+ lines of code</strong>:</p>
<ul>
<li><p>HTTP/WebSocket server</p>
</li>
<li><p>Session management</p>
</li>
<li><p>Event broadcasting system</p>
</li>
<li><p>Multi-client support</p>
</li>
<li><p>Protocol adaptation layer</p>
</li>
</ul>
<p>Why? Because:</p>
<ul>
<li><p>IDE integration requires persistent connections</p>
</li>
<li><p>Multiple clients need shared sessions</p>
</li>
<li><p>Real-time streaming requires WebSocket</p>
</li>
<li><p>Enterprise deployment requires a service architecture</p>
</li>
</ul>
<h2 id="heading-the-numbers-tell-the-story">The Numbers Tell the Story</h2>
<p>Here's a before-and-after comparison:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<th>Metric</th><th>v0.1.0 (Initial)</th><th>v0.8.0 (Current)</th></tr>
</thead>
<tbody>
<tr>
<td>Lines of Code</td><td>2,482</td><td><strong>20,176</strong> (8x growth)</td></tr>
<tr>
<td>Python Files</td><td>8</td><td><strong>53</strong> (6.6x growth)</td></tr>
<tr>
<td>Dependencies</td><td>7</td><td><strong>14+</strong> (2x growth)</td></tr>
<tr>
<td>Directory Depth</td><td>1 level</td><td><strong>4 levels</strong> (added server/, client/, protocol/, prompts/)</td></tr>
<tr>
<td>Development Time</td><td>—</td><td>40 days</td></tr>
<tr>
<td>Commits</td><td>1</td><td><strong>58</strong></td></tr>
</tbody>
</table>
</div><h2 id="heading-growth-timeline">Growth Timeline</h2>
<pre><code class="lang-bash">2025-11-30  ████                           Initial version (2.5K lines)
2025-12-14  █████                          + Context Compaction
2025-12-16  ████████                       + ACP + Multi-LLM Support
2025-12-25  ████████████                   + Multi-Agent + EventBus
2025-12-28  ██████████████                 + Hooks System
2026-01-02  ████████████████               + Skills &amp; Commands
2026-01-07  ████████████████████           + C/S Architecture
2026-01-09  █████████████████████████      Current version (20K+ lines)
</code></pre>
<h2 id="heading-why-simple-doesnt-last">Why "Simple" Doesn't Last</h2>
<p>Looking back at 40 days of development, I've identified four reasons why simplicity was impossible to maintain:</p>
<h3 id="heading-1-reality-is-more-complex-than-your-imagination">1. Reality Is More Complex Than Your Imagination</h3>
<p>I initially thought <code>read_file + grep + bash</code> would be enough. Reality disagreed:</p>
<ul>
<li><p>Large files need chunked reading → smart readfile modes</p>
</li>
<li><p>Large-scale edits are error-prone → apply_patch tool</p>
</li>
<li><p>Complex refactoring needs planning → todo tool</p>
</li>
<li><p>Dangerous operations need confirmation → permission system</p>
</li>
</ul>
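<p>To make the first item concrete: "chunked reading" can be as simple as returning a window of lines plus a header telling the model where it is in the file. A minimal sketch with illustrative parameter names:</p>

```python
# Illustrative chunked-read sketch for large files.
def read_file_chunk(path: str, offset: int = 0, limit: int = 200) -> str:
    """Return at most `limit` lines starting at line `offset` (0-based),
    prefixed with a header locating the chunk within the file."""
    with open(path, encoding="utf-8", errors="replace") as f:
        lines = f.readlines()
    chunk = lines[offset:offset + limit]
    header = f"[lines {offset}-{offset + len(chunk) - 1} of {len(lines)}]\n"
    return header + "".join(chunk)
```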
<h3 id="heading-2-user-needs-are-incremental">2. User Needs Are Incremental</h3>
<p>At first, I was the only user. Then others started using it:</p>
<ul>
<li><p>"Can you support Claude?" → Multi-LLM support</p>
</li>
<li><p>"Can I use this in Zed?" → ACP protocol</p>
</li>
<li><p>"Can multiple agents collaborate?" → Multi-Agent architecture</p>
</li>
<li><p>"Can I customize workflows?" → Skills &amp; Commands</p>
</li>
<li><p>"Can I deploy this as a service?" → C/S architecture Every request was reasonable. Every feature became necessary.</p>
</li>
</ul>
<h3 id="heading-3-production-requires-robustness">3. Production Requires Robustness</h3>
<p>Toys can be simple. Production systems must:</p>
<ul>
<li><p>Handle edge cases</p>
</li>
<li><p>Manage resource lifecycles</p>
</li>
<li><p>Support concurrent access</p>
</li>
<li><p>Provide monitoring and debugging</p>
</li>
<li><p>Guarantee type safety</p>
</li>
</ul>
<p>These "non-functional requirements" often require more code than the features themselves.</p>
<h3 id="heading-4-composability-requires-infrastructure">4. Composability Requires Infrastructure</h3>
<p>Here's the irony: to achieve true "Lego-style" composability, you need:</p>
<ul>
<li><p>A unified tool interface → <code>BaseTool</code> abstraction</p>
</li>
<li><p>A message passing mechanism → <code>EventBus</code></p>
</li>
<li><p>Lifecycle hooks → <code>Hooks</code> system</p>
</li>
<li><p>Dynamic loading → <code>Skills</code> system</p>
</li>
<li><p>Configuration management → Complex config layer</p>
</li>
</ul>
<p><strong>The infrastructure for composability is itself a source of complexity.</strong></p>
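<p>The unified tool interface is what makes the rest composable: the agent loop only sees a name, a spec for the LLM, and an execute method, whether the tool is built-in or MCP-provided. A minimal sketch (class names are illustrative, not AMCP's actual classes):</p>

```python
# Illustrative unified tool interface sketch.
from abc import ABC, abstractmethod

class BaseTool(ABC):
    name: str
    description: str

    @abstractmethod
    def execute(self, **kwargs) -> str: ...

    def spec(self) -> dict:
        # Minimal function-calling spec the LLM sees.
        return {"name": self.name, "description": self.description}

class GrepTool(BaseTool):
    name = "grep"
    description = "Search lines matching a pattern."

    def execute(self, pattern: str, text: str) -> str:
        return "\n".join(l for l in text.splitlines() if pattern in l)
```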
<h2 id="heading-what-i-learned">What I Learned</h2>
<h3 id="heading-1-lego-style-is-a-philosophy-not-a-destination">1. "Lego-Style" Is a Philosophy, Not a Destination</h3>
<p>Lego bricks look simple. But Lego the company has thousands of different parts, strict quality standards, and a sophisticated design system behind those "simple" blocks. AMCP is still "Lego-style"—its design still prioritizes composability. But achieving composability requires significant complexity.</p>
<h3 id="heading-2-complexity-is-necessary-but-must-be-managed">2. Complexity Is Necessary, But Must Be Managed</h3>
<p>The codebase grew 8x, but if you look closely, complexity is distributed:</p>
<ul>
<li><p>Core agent logic remains relatively simple</p>
</li>
<li><p>Complexity is encapsulated in individual modules</p>
</li>
<li><p>Modules communicate through clean interfaces</p>
</li>
</ul>
<p><strong>Complexity is inevitable, but it can be isolated.</strong></p>
<h3 id="heading-3-incremental-evolution-is-the-right-path">3. Incremental Evolution Is the Right Path</h3>
<p>If I had tried to design a 20,000-line system from day one, I would have:</p>
<ul>
<li><p>Built features nobody actually needed</p>
</li>
<li><p>Optimized the wrong things too early</p>
</li>
<li><p>Lost the ability to iterate quickly</p>
</li>
</ul>
<p>By starting simple and solving only "the most painful problem right now," AMCP evolved into something actually useful.</p>
<h2 id="heading-conclusion-did-i-really-fail">Conclusion: Did I Really "Fail"?</h2>
<p>So, did I actually fail? If the goal was to stay at 2,500 lines of code—yes, I failed spectacularly. But if the goal was to build a coding agent that <strong>actually works in production</strong>—then I succeeded. The price of success was accepting that complexity is unavoidable. <strong>"Lego-style" isn't about simplicity. It's about the right abstractions, clear boundaries, and composable design.</strong> AMCP still has those.</p>
<p><em>Written on January 10, 2026. AMCP v0.8.0 | 58 commits | 20,176 lines of code</em></p>
]]></content:encoded></item><item><title><![CDATA[From Deprecated npm Classic Tokens to OIDC Trusted Publishing: A CI/CD Troubleshooting Journey]]></title><description><![CDATA[In January 2026, I encountered a series of cryptic authentication errors while publishing an npm package. This post documents the complete journey from problem discovery to final resolution—hopefully saving others from the same headaches.

Background...]]></description><link>https://blog.moelove.info/from-deprecated-npm-classic-tokens-to-oidc-trusted-publishing-a-cicd-troubleshooting-journey</link><guid isPermaLink="true">https://blog.moelove.info/from-deprecated-npm-classic-tokens-to-oidc-trusted-publishing-a-cicd-troubleshooting-journey</guid><category><![CDATA[npm]]></category><category><![CDATA[npm publish]]></category><category><![CDATA[GitHub]]></category><category><![CDATA[github-actions]]></category><dc:creator><![CDATA[Jintao Zhang]]></dc:creator><pubDate>Sun, 04 Jan 2026 01:06:30 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1767489154215/3f14334b-608e-4ac0-8070-0b1e6caa1fa9.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote>
<p>In January 2026, I encountered a series of cryptic authentication errors while publishing an npm package. This post documents the complete journey from problem discovery to final resolution—hopefully saving others from the same headaches.</p>
</blockquote>
<h2 id="heading-background">Background</h2>
<p>I maintain an npm package called <a target="_blank" href="https://www.npmjs.com/package/amp-acp">amp-acp</a>, an adapter that bridges Amp Code to the Agent Client Protocol (ACP). The project uses GitHub Actions for automated releases: pushing a <code>v*</code> tag triggers automatic publishing to npm and creates a GitHub Release.</p>
<h2 id="heading-the-problem">The Problem</h2>
<p>Starting with v0.3.1, every publish attempt failed. The GitHub Actions logs showed:</p>
<pre><code class="lang-bash">npm error code ENEEDAUTH
npm error need auth This <span class="hljs-built_in">command</span> requires you to be logged <span class="hljs-keyword">in</span> to https://registry.npmjs.org/
npm error need auth You need to authorize this machine using `npm adduser`
</code></pre>
<p>Even more confusing was this warning:</p>
<pre><code class="lang-bash">npm notice Security Notice: Classic tokens have been revoked. 
Granular tokens are now limited to 90 days and require 2FA by default. 
Update your CI/CD workflows to avoid disruption. 
Learn more https://gh.io/all-npm-classic-tokens-revoked
</code></pre>
<h2 id="heading-root-cause-analysis">Root Cause Analysis</h2>
<h3 id="heading-the-end-of-npm-classic-tokens">The End of npm Classic Tokens</h3>
<p>After investigation, I discovered that <strong>npm permanently deprecated all Classic Tokens on December 9, 2025</strong>. According to the <a target="_blank" href="https://github.blog/changelog/2025-12-09-npm-classic-tokens-revoked-session-based-auth-and-cli-token-management-now-available/">GitHub official announcement</a>:</p>
<ul>
<li><p>All existing npm classic tokens have been permanently revoked</p>
</li>
<li><p>Classic tokens can no longer be created or restored</p>
</li>
<li><p>New Granular tokens have a maximum validity of 90 days and require 2FA by default</p>
</li>
</ul>
<p>This means <strong>the traditional approach of storing</strong> <code>NPM_TOKEN</code> in GitHub Secrets is no longer viable (at least not as convenient as before).</p>
<h3 id="heading-the-new-authentication-method-oidc-trusted-publishing">The New Authentication Method: OIDC Trusted Publishing</h3>
<p>npm's recommended solution is <strong>OIDC Trusted Publishing</strong>. This OpenID Connect-based authentication mechanism offers several advantages:</p>
<ol>
<li><p><strong>No token management</strong> – No need to create, store, or rotate tokens</p>
</li>
<li><p><strong>Enhanced security</strong> – Uses short-lived, cryptographically signed, workflow-specific credentials</p>
</li>
<li><p><strong>Automatic provenance</strong> – Automatically generates provenance statements, providing build-origin transparency</p>
</li>
<li><p><strong>Industry standard</strong> – Aligns with PyPI, RubyGems, <a target="_blank" href="http://crates.io">crates.io</a>, and other major package registries</p>
</li>
</ol>
<h2 id="heading-troubleshooting-log">Troubleshooting Log</h2>
<h3 id="heading-attempt-1-upgrading-npm-version">Attempt 1: Upgrading npm Version</h3>
<p>Initially, I assumed the issue was an outdated npm version, so I added this to the workflow:</p>
<pre><code class="lang-yaml"><span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Update</span> <span class="hljs-string">npm</span> <span class="hljs-string">to</span> <span class="hljs-string">latest</span>
  <span class="hljs-attr">run:</span> <span class="hljs-string">npm</span> <span class="hljs-string">install</span> <span class="hljs-string">-g</span> <span class="hljs-string">npm@latest</span>
</code></pre>
<p><strong>Result: Failed</strong> ❌</p>
<h3 id="heading-attempt-2-removing-registry-url">Attempt 2: Removing registry-url</h3>
<p>Someone suggested removing the <code>registry-url</code> parameter from <code>actions/setup-node</code>:</p>
<pre><code class="lang-yaml"><span class="hljs-bullet">-</span> <span class="hljs-attr">uses:</span> <span class="hljs-string">actions/setup-node@v4</span>
  <span class="hljs-attr">with:</span>
    <span class="hljs-attr">node-version:</span> <span class="hljs-string">'22'</span>
    <span class="hljs-comment"># Removed registry-url</span>
</code></pre>
<p><strong>Result: Failed</strong> ❌</p>
<h3 id="heading-attempt-3-setting-nodeauthtoken-to-empty-string">Attempt 3: Setting NODE_AUTH_TOKEN to Empty String</h3>
<p>Based on some outdated resources, I tried setting <code>NODE_AUTH_TOKEN</code> to an empty string:</p>
<pre><code class="lang-yaml"><span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Publish</span> <span class="hljs-string">to</span> <span class="hljs-string">npm</span>
  <span class="hljs-attr">run:</span> <span class="hljs-string">npm</span> <span class="hljs-string">publish</span> <span class="hljs-string">--access</span> <span class="hljs-string">public</span>
  <span class="hljs-attr">env:</span>
    <span class="hljs-attr">NODE_AUTH_TOKEN:</span> <span class="hljs-string">''</span>
</code></pre>
<p><strong>Result: Failed</strong> ❌</p>
<p>Here's the critical misconception: setting an empty <code>NODE_AUTH_TOKEN</code> actually <strong>prevents</strong> OIDC from working, because npm attempts to use the empty token instead of OIDC.</p>
<h3 id="heading-attempt-4-completely-removing-nodeauthtoken">Attempt 4: Completely Removing NODE_AUTH_TOKEN</h3>
<p>I finally realized that for OIDC Trusted Publishing, <code>NODE_AUTH_TOKEN</code> should not be set at all:</p>
<pre><code class="lang-yaml"><span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Publish</span> <span class="hljs-string">to</span> <span class="hljs-string">npm</span>
  <span class="hljs-attr">run:</span> <span class="hljs-string">npm</span> <span class="hljs-string">publish</span> <span class="hljs-string">--access</span> <span class="hljs-string">public</span>
  <span class="hljs-comment"># <span class="hljs-doctag">Note:</span> no env section</span>
</code></pre>
<p><strong>Result: Partial success</strong> ⚠️</p>
<p>This time OIDC authentication started working (logs showed <code>Signed provenance statement</code>), but a new error appeared:</p>
<pre><code class="lang-bash">npm error 422 Unprocessable Entity - PUT https://registry.npmjs.org/amp-acp - 
Error verifying sigstore provenance bundle: Failed to validate repository information: 
package.json: <span class="hljs-string">"repository.url"</span> is <span class="hljs-string">""</span>, expected to match 
<span class="hljs-string">"https://github.com/tao12345666333/amp-acp"</span> from provenance
</code></pre>
<h3 id="heading-attempt-5-final-success-adding-the-repository-field">Attempt 5 (Final Success): Adding the repository Field</h3>
<p>It turns out npm's Provenance validation requires <code>package.json</code> to include a <code>repository</code> field matching the GitHub repository:</p>
<pre><code class="lang-json">{
  <span class="hljs-attr">"name"</span>: <span class="hljs-string">"amp-acp"</span>,
  <span class="hljs-attr">"version"</span>: <span class="hljs-string">"0.3.7"</span>,
  <span class="hljs-attr">"repository"</span>: {
    <span class="hljs-attr">"type"</span>: <span class="hljs-string">"git"</span>,
    <span class="hljs-attr">"url"</span>: <span class="hljs-string">"https://github.com/tao12345666333/amp-acp"</span>
  }
}
</code></pre>
<p><strong>Result: Success!</strong> ✅</p>
<h2 id="heading-the-correct-configuration">The Correct Configuration</h2>
<h3 id="heading-1-configure-trusted-publisher-on-npmjscomhttpnpmjscom">1. Configure Trusted Publisher on <a target="_blank" href="http://npmjs.com">npmjs.com</a></h3>
<p>First, configure Trusted Publisher on the npm website:</p>
<ol>
<li><p>Navigate to <a target="_blank" href="https://www.npmjs.com/package/YOUR_PACKAGE/settings"><code>https://www.npmjs.com/package/YOUR_PACKAGE/settings</code></a></p>
</li>
<li><p>Find the "Trusted Publisher" section</p>
</li>
<li><p>Select "GitHub Actions"</p>
</li>
<li><p>Fill in the following:</p>
<ul>
<li><p><strong>Organization/User</strong>: Your GitHub username or organization name</p>
</li>
<li><p><strong>Repository</strong>: Your repository name</p>
</li>
<li><p><strong>Workflow filename</strong>: The workflow file name (e.g., <code>release.yml</code>)</p>
</li>
<li><p><strong>Environment</strong>: (Optional) If using GitHub Environments</p>
</li>
</ul>
</li>
</ol>
<h3 id="heading-2-github-actions-workflow-configuration">2. GitHub Actions Workflow Configuration</h3>
<pre><code class="lang-yaml"><span class="hljs-attr">name:</span> <span class="hljs-string">Release</span>

<span class="hljs-attr">on:</span>
  <span class="hljs-attr">push:</span>
    <span class="hljs-attr">tags:</span>
      <span class="hljs-bullet">-</span> <span class="hljs-string">'v*'</span>

<span class="hljs-attr">permissions:</span>
  <span class="hljs-attr">id-token:</span> <span class="hljs-string">write</span>   <span class="hljs-comment"># Required for OIDC authentication</span>
  <span class="hljs-attr">contents:</span> <span class="hljs-string">write</span>   <span class="hljs-comment"># Required for creating GitHub Release</span>

<span class="hljs-attr">jobs:</span>
  <span class="hljs-attr">release:</span>
    <span class="hljs-attr">runs-on:</span> <span class="hljs-string">ubuntu-latest</span>
    <span class="hljs-attr">steps:</span>
      <span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Checkout</span>
        <span class="hljs-attr">uses:</span> <span class="hljs-string">actions/checkout@v4</span>

      <span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Setup</span> <span class="hljs-string">Node.js</span>
        <span class="hljs-attr">uses:</span> <span class="hljs-string">actions/setup-node@v4</span>
        <span class="hljs-attr">with:</span>
          <span class="hljs-attr">node-version:</span> <span class="hljs-string">'22'</span>
          <span class="hljs-attr">registry-url:</span> <span class="hljs-string">'https://registry.npmjs.org'</span>

      <span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Update</span> <span class="hljs-string">npm</span> <span class="hljs-string">to</span> <span class="hljs-string">latest</span>
        <span class="hljs-attr">run:</span> <span class="hljs-string">npm</span> <span class="hljs-string">install</span> <span class="hljs-string">-g</span> <span class="hljs-string">npm@latest</span>

      <span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Install</span> <span class="hljs-string">dependencies</span>
        <span class="hljs-attr">run:</span> <span class="hljs-string">npm</span> <span class="hljs-string">ci</span>

      <span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Publish</span> <span class="hljs-string">to</span> <span class="hljs-string">npm</span>
        <span class="hljs-attr">run:</span> <span class="hljs-string">npm</span> <span class="hljs-string">publish</span> <span class="hljs-string">--access</span> <span class="hljs-string">public</span>
        <span class="hljs-comment"># <span class="hljs-doctag">Note:</span> Do NOT set NODE_AUTH_TOKEN!</span>

      <span class="hljs-bullet">-</span> <span class="hljs-attr">name:</span> <span class="hljs-string">Create</span> <span class="hljs-string">GitHub</span> <span class="hljs-string">Release</span>
        <span class="hljs-attr">uses:</span> <span class="hljs-string">softprops/action-gh-release@v2</span>
        <span class="hljs-attr">with:</span>
          <span class="hljs-attr">generate_release_notes:</span> <span class="hljs-literal">true</span>
</code></pre>
<h3 id="heading-3-required-packagejson-fields">3. Required package.json Fields</h3>
<pre><code class="lang-json">{
  <span class="hljs-attr">"name"</span>: <span class="hljs-string">"your-package-name"</span>,
  <span class="hljs-attr">"version"</span>: <span class="hljs-string">"x.y.z"</span>,
  <span class="hljs-attr">"repository"</span>: {
    <span class="hljs-attr">"type"</span>: <span class="hljs-string">"git"</span>,
    <span class="hljs-attr">"url"</span>: <span class="hljs-string">"https://github.com/YOUR_USERNAME/YOUR_REPO"</span>
  }
}
</code></pre>
<h2 id="heading-key-takeaways">Key Takeaways</h2>
<ol>
<li><p><strong>npm Classic Tokens are dead</strong> – As of December 9, 2025, all classic tokens are permanently invalidated</p>
</li>
<li><p><strong>OIDC Trusted Publishing is the new standard</strong> – No token management, enhanced security, built-in provenance</p>
</li>
<li><p><strong>Do not set NODE_AUTH_TOKEN</strong> – For OIDC, this environment variable should not be set at all</p>
</li>
<li><p><strong>Configure Trusted Publisher on</strong> <a target="_blank" href="http://npmjs.com"><strong>npmjs.com</strong></a> – This step is often overlooked</p>
</li>
<li><p><strong>package.json must include the repository field</strong> – Required for provenance validation</p>
</li>
<li><p><strong>Ensure id-token: write permission</strong> – Otherwise, OIDC token generation will fail</p>
</li>
<li><p><strong>npm CLI version requirement</strong> – Requires npm 11.5.1 or later</p>
</li>
</ol>
<h2 id="heading-faq">FAQ</h2>
<h3 id="heading-q-can-i-use-oidc-to-publish-the-first-version-of-a-new-package">Q: Can I use OIDC to publish the first version of a new package?</h3>
<p>A: No. The first version must be published manually or using a traditional token. Trusted Publisher can only be configured afterward.</p>
<h3 id="heading-q-can-i-use-oidc-with-self-hosted-runners">Q: Can I use OIDC with self-hosted runners?</h3>
<p>A: Currently, only GitHub/GitLab-hosted runners are supported. Self-hosted runners are not yet supported.</p>
<h3 id="heading-q-why-doesnt-setting-nodeauthtoken-to-an-empty-string-work">Q: Why doesn't setting NODE_AUTH_TOKEN to an empty string work?</h3>
<p>A: An empty string is still a value—npm will attempt to use it rather than falling back to OIDC. Only when this variable is completely unset will npm automatically use OIDC.</p>
<h3 id="heading-q-what-should-i-do-if-provenance-validation-fails">Q: What should I do if provenance validation fails?</h3>
<p>A: Verify that <code>repository.url</code> in <code>package.json</code> exactly matches the GitHub repository URL (including case sensitivity).</p>
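<p>Because this comparison is exact, it can be worth checking in CI before <code>npm publish</code> runs. A small preflight sketch: the expected repository would normally come from CI (for example GitHub Actions' <code>GITHUB_REPOSITORY</code> variable), but is shown here as a parameter; the normalization of <code>git+</code>/<code>.git</code> forms is an assumption about common <code>repository.url</code> spellings:</p>

```python
# Preflight sketch: does package.json's repository.url match the repo
# that the provenance statement will claim?
import json

def repository_matches(package_json: str, expected_repo: str) -> bool:
    """expected_repo is 'owner/name', e.g. 'tao12345666333/amp-acp'."""
    repo = json.loads(package_json).get("repository") or {}
    url = repo.get("url", "") if isinstance(repo, dict) else repo
    # Normalize common spellings: git+https prefix and .git suffix.
    url = url.removeprefix("git+").removesuffix(".git")
    return url == f"https://github.com/{expected_repo}"
```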
<h2 id="heading-references">References</h2>
<ul>
<li><p><a target="_blank" href="https://docs.npmjs.com/trusted-publishers">npm Trusted Publishing Documentation</a></p>
</li>
<li><p><a target="_blank" href="https://github.blog/changelog/2025-12-09-npm-classic-tokens-revoked-session-based-auth-and-cli-token-management-now-available/">GitHub Changelog: npm classic tokens revoked</a></p>
</li>
<li><p><a target="_blank" href="https://docs.npmjs.com/generating-provenance-statements">npm Provenance Introduction</a></p>
</li>
</ul>
<hr />
<p><em>Written on January 4, 2026, based on the publishing experience of amp-acp project from v0.3.1 to v0.3.7.</em></p>
]]></content:encoded></item><item><title><![CDATA[The Ultimate Guide to Kubernetes Networking and Multi-Tenant Gateways]]></title><description><![CDATA[Introducing of Kubernetes’s Network
Kubernetes, as we know, is a powerful platform for automating the deployment, scaling, and operations of application containers across clusters of hosts. However, for these containers, or rather 'Pods' as they are ...]]></description><link>https://blog.moelove.info/the-ultimate-guide-to-kubernetes-networking-and-multi-tenant-gateways</link><guid isPermaLink="true">https://blog.moelove.info/the-ultimate-guide-to-kubernetes-networking-and-multi-tenant-gateways</guid><category><![CDATA[Kubernetes]]></category><category><![CDATA[kong]]></category><category><![CDATA[gateway]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[APIs]]></category><dc:creator><![CDATA[Jintao Zhang]]></dc:creator><pubDate>Fri, 19 Jul 2024 18:23:39 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1721644764112/9c634349-d42b-42ee-b140-4ee7438aea37.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2 id="heading-introducing-of-kubernetess-network">Introducing Kubernetes Networking</h2>
<p>Kubernetes, as we know, is a powerful platform for automating the deployment, scaling, and operations of application containers across clusters of hosts. However, for these containers, or rather 'Pods' as they are known within the Kubernetes realm, to effectively communicate and serve their purpose, a robust networking model is essential. Let's delve into the components that constitute networking in Kubernetes.</p>
<p>Firstly, we have "Communication between Pods on the same Node." This is the most basic level of networking within Kubernetes. Pods running on the same physical or virtual machine need to communicate with each other. Kubernetes uses the networking namespace and other isolation mechanisms provided by the underlying operating system to ensure that these Pods can connect efficiently and securely.</p>
<p>Moving a step further, we encounter the scenario of "Communication between Pods on different Nodes." As our applications scale, they span across multiple Nodes, creating a need for cross-node communication. Kubernetes abstracts the complexity involved in this process, ensuring Pods can communicate seamlessly, irrespective of their physical location within the cluster.</p>
<p>Next, let's talk about "Service." A Service in Kubernetes is an abstraction that defines a logical set of Pods and a policy by which to access them. This abstraction enables the decoupling of backend Pod implementations from the frontend services that access them. Services allow for stable, reliable communication pathways between different components of an application.</p>
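<p>As a minimal sketch (the name <code>my-app</code> and the ports here are hypothetical, not from any specific deployment), a Service that selects Pods labeled <code>app: my-app</code> and exposes them on port 80 might look like this:</p>
<pre><code class="lang-yaml">apiVersion: v1
kind: Service
metadata:
  name: my-app
spec:
  selector:
    app: my-app      # forwards traffic to Pods carrying this label
  ports:
  - port: 80         # port exposed by the Service
    targetPort: 8080 # port the Pods actually listen on
</code></pre>
<p>Clients inside the cluster can then reach the backing Pods through the Service's stable virtual IP, regardless of how the individual Pods come and go.</p>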
<p>"Ingress and Egress" mechanisms in Kubernetes facilitate the management of incoming and outgoing traffic to your cluster. Ingress controls how external traffic reaches your services within the cluster, allowing you to define accessible URLs, load balance traffic, terminate SSL, and offer name-based virtual hosting. Egress, on the other hand, controls the outbound traffic from your Pods to the external world, ensuring that only authorized connections are made, thus enhancing the security of your cluster. Although Kubernetes does not have native Egress resources, some other components provide this implementation, such as Calico.</p>
<p>Lastly, we have "NetworkPolicy." This is a specification of how groups of Pods are allowed to communicate with each other and other network endpoints. NetworkPolicies are crucial for enforcing security rules and ensuring that only the intended traffic can flow between Pods, Services, and the external world.</p>
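<p>For illustration, a NetworkPolicy that only allows Pods labeled <code>app: frontend</code> to reach Pods labeled <code>app: backend</code> on TCP port 8080 could be written as follows (all labels and ports are hypothetical):</p>
<pre><code class="lang-yaml">apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-frontend-to-backend
spec:
  podSelector:
    matchLabels:
      app: backend   # the Pods this policy protects
  policyTypes:
  - Ingress
  ingress:
  - from:
    - podSelector:
        matchLabels:
          app: frontend
    ports:
    - protocol: TCP
      port: 8080
</code></pre>
<p>Note that such a policy only takes effect if the cluster's CNI plugin actually implements NetworkPolicy enforcement.</p>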
<p>These are the basic concepts that will be encountered when discussing Kubernetes networking. However, these concepts require some specific components to implement their functions.</p>
<h2 id="heading-kubernetes-networking-components">Kubernetes Networking Components</h2>
<p>The following are some of the main components.</p>
<p><strong>CNI Implementations:</strong></p>
<p>Let's start with the foundation of Kubernetes networking - the Container Network Interface, or CNI. The CNI is a specification and a set of tools that facilitate the configuration of network interfaces for containers. Kubernetes relies on CNI implementations to provide in-pod networking capabilities. These implementations, such as Calico, Flannel, and Cilium, to name a few, allow for a pluggable framework that supports a wide range of networking features including network segmentation, policy enforcement, and overlay networks. Choosing the right CNI implementation is critical as it influences the network performance, security, and scalability of your Kubernetes cluster.</p>
<p>For example, a few years ago Flannel, though widely used, did not provide network policy support. In clusters using that CNI plugin, NetworkPolicy resources could still be created, but they simply had no effect.</p>
<p>However, in addition to the native NetworkPolicy of Kubernetes, Calico and Cilium have also implemented their own enhanced versions of network policy.</p>
<p><strong>kube-proxy:</strong></p>
<p>Moving on, we encounter kube-proxy, a vital component that runs on each node in the Kubernetes cluster. kube-proxy is responsible for maintaining network rules on nodes. These rules allow network communication to your Pods from network sessions inside or outside of your cluster. Essentially, kube-proxy enables the forwarding of TCP and UDP packets to pods based on the IP and port number of the incoming request. Through services, kube-proxy abstracts the IP addresses of individual pods, ensuring that the internal network structure is hidden from the external clients.</p>
<p><strong>DNS:</strong></p>
<p>In the realm of Kubernetes, DNS plays a pivotal role in service discovery. It allows Pods to locate each other and external services through human-readable names. When a Kubernetes cluster is created, a DNS service is automatically deployed, assigning DNS names to other Kubernetes services. This simplifies communication within the cluster, as Pods can reach each other and external resources using these DNS names, enhancing the overall application architecture's flexibility and robustness.</p>
<p><strong>Controller:</strong></p>
<p>Lastly, let's delve into the controller, particularly focusing on IPAM, and the Endpoint/EndpointSlice/Service mechanisms.</p>
<ul>
<li><p><strong>IPAM (IP Address Management):</strong> This functionality is crucial for managing the assignment of IP addresses to Pods and Services. Efficient IPAM practices ensure that there is no conflict between the IPs assigned within the cluster, maintaining a smooth network operation.</p>
</li>
<li><p><strong>Endpoint/EndpointSlice/Service:</strong> These components work together to manage how external and internal communications reach the Pods. A Service defines a logical set of Pods and a policy by which to access them. The Endpoints and EndpointSlices keep track of the IP addresses of the Pods that are associated with each Service. This system ensures that network traffic is efficiently routed to the appropriate Pods, allowing for scalable and reliable application deployments.</p>
</li>
</ul>
<p>Although these components may appear complex, they all follow the same basic model.</p>
<h2 id="heading-kubernetes-networking-model">Kubernetes Networking Model</h2>
<p>The kubernetes networking model is ingeniously designed to ensure seamless communication within the cluster, which is pivotal for deploying scalable and resilient applications. Let's explore the core principles that define this networking model.</p>
<p><strong>Each Pod has its own IP address:</strong></p>
<p>The first principle to understand is that in Kubernetes, each Pod is assigned its own unique IP address. This is a significant departure from traditional deployments where containers or applications within a host might share the host's IP address. This unique IP per Pod simplifies application configuration and inter-service communication. It means that each Pod can be treated as if it's a physical host on the network, making network policies and communications more straightforward to manage.</p>
<p><strong>All Pods can communicate across Nodes without the need for NAT</strong></p>
<p><strong>Components on the Node (e.g., kubelet) can communicate with all Pods on the Node:</strong></p>
<p>These two rules build on the communication scenarios introduced earlier; together they define how Pods are reached within the cluster.</p>
<h2 id="heading-how-to-access-applications-deployed-in-kubernetes">How to Access Applications Deployed in Kubernetes</h2>
<p>Deploying applications is only part of the journey. An equally crucial aspect is how these applications, once deployed, are made accessible to the outside world.</p>
<p><strong>NodePort:</strong></p>
<p>Let's start with NodePort, a simple yet powerful way to access your applications. When you configure a service as NodePort, Kubernetes exposes that service on every Node’s IP at a static port (the NodePort). The service then becomes reachable at any Node's IP combined with the high port number assigned specifically for that service. This means that anyone who can reach a Node, either within the internal network or from the outside world, can access the service by hitting that Node's IP at the designated port.</p>
<p>This method is particularly useful for development environments or smaller setups where direct access to each node is manageable. However, it's worth noting that managing access through NodePort can become challenging in environments with a large number of Nodes, as it requires keeping track of multiple IP addresses and ports.</p>
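<p>A hypothetical NodePort Service might look like this (the <code>nodePort</code> value is optional; if omitted, Kubernetes picks one from the default 30000-32767 range):</p>
<pre><code class="lang-yaml">apiVersion: v1
kind: Service
metadata:
  name: my-app
spec:
  type: NodePort
  selector:
    app: my-app
  ports:
  - port: 80         # in-cluster port of the Service
    targetPort: 8080 # port on the Pods
    nodePort: 30080  # static port opened on every Node
</code></pre>
<p>The service is then reachable at <code>http://&lt;any-node-ip&gt;:30080</code>.</p>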
<p><strong>LoadBalancer:</strong></p>
<p>Moving on to a more sophisticated and widely used approach – LoadBalancer. This method integrates Kubernetes services with existing cloud providers' load balancers. When you create a service of type LoadBalancer, Kubernetes provisions a cloud load balancer for your service, and directs external traffic to it. This external load balancer then automatically routes the traffic to your Kubernetes Pods, regardless of which Node they're running on.</p>
<p>The beauty of using LoadBalancer is its simplicity and scalability. It abstracts away the complexity of dealing with individual Node IPs and ports, providing a single entry point – the load balancer's IP – to access your service. This makes it an ideal choice for production environments where reliability and scalability are paramount.</p>
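<p>A hypothetical LoadBalancer Service is equally concise; only the <code>type</code> field distinguishes it from other Service types:</p>
<pre><code class="lang-yaml">apiVersion: v1
kind: Service
metadata:
  name: my-app
spec:
  type: LoadBalancer # the cloud provider provisions an external load balancer
  selector:
    app: my-app
  ports:
  - port: 80
    targetPort: 8080
</code></pre>
<p>Once provisioned, the external IP appears in the Service's <code>status.loadBalancer</code> field and becomes the single entry point for clients.</p>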
<p>Whether you opt for NodePort for its simplicity and direct access, or LoadBalancer for its robustness and scalability, Kubernetes offers flexible solutions to suit different application access needs.</p>
<p>But you may be curious, in most cases we talk about accessing applications inside the cluster through Ingress from outside the cluster. What exactly is Ingress?</p>
<h2 id="heading-what-is-ingress">What is Ingress</h2>
<p><strong>Ingress: A Specification</strong></p>
<p>At its core, Ingress is not merely a tool or a component; it is a powerful API object that provides a sophisticated method for handling external access to the services in a Kubernetes cluster. It allows you to define flexible, HTTP-based routing rules that direct traffic to different services based on the request's details. Let's delve into the key elements that make up an Ingress specification:</p>
<p><strong>Host:</strong></p>
<p>The 'host' field in an Ingress specification is what allows us to define domain names that will be routed to specific services within our cluster. This means that you can have different domain names or subdomains pointing to different parts of your application, all managed through a single Ingress resource. This is particularly useful for applications that require multiple entry points or for hosting multiple services under the same cluster.</p>
<p><strong>Paths:</strong></p>
<p>Next, we have 'paths.' This element works in conjunction with 'host' to provide even finer control over the routing of incoming requests. By specifying paths, you can direct traffic not just based on the domain name, but also based on the URL path. For instance, requests to <code>example.com/api</code> could be routed to one service, while requests to <code>example.com/blog</code> could be directed to another. This allows for a highly customizable and efficient distribution of traffic to the various components of your applications.</p>
<p><strong>Certificates:</strong></p>
<p>Lastly, an essential aspect of modern web applications is security, particularly the use of SSL/TLS certificates to encrypt traffic. Ingress facilitates this by allowing you to specify certificates for each host, ensuring that all communication is securely encrypted. This integration of SSL/TLS certificates with Ingress means that you can manage the security of your services at the same point where you're managing their accessibility, simplifying the overall configuration and maintenance of your applications.</p>
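<p>Putting the three elements together, a sketch of an Ingress implementing the <code>example.com/api</code> and <code>example.com/blog</code> routing described above might look like this (the Service names and the TLS Secret are hypothetical):</p>
<pre><code class="lang-yaml">apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: example
spec:
  tls:
  - hosts:
    - example.com
    secretName: example-com-tls # Secret holding the TLS certificate and key
  rules:
  - host: example.com
    http:
      paths:
      - path: /api
        pathType: Prefix
        backend:
          service:
            name: api-service
            port:
              number: 80
      - path: /blog
        pathType: Prefix
        backend:
          service:
            name: blog-service
            port:
              number: 80
</code></pre>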
<p><strong>However, simply creating an Ingress resource is not enough for it to work.</strong></p>
<h2 id="heading-how-does-ingress-work">How does Ingress work?</h2>
<p><strong>The Ingress Controller: Translating Rules into Action</strong></p>
<p>At the heart of the Ingress mechanism lies the Ingress controller. This isn't just any controller; it's the maestro of network traffic, orchestrating how requests from outside the Kubernetes cluster are handled and directed. But how does it achieve this?</p>
<p>The Ingress controller continuously monitors the Kubernetes API for Ingress resources - the rules and paths that you define for routing external traffic. Upon detecting an Ingress resource, the controller springs into action, translating these high-level Ingress rules into specific, actionable dataplane rules. This translation process is where the abstract becomes concrete, turning our defined paths and domains into precise instructions on how traffic should flow.</p>
<p><strong>The Dataplane: Carrying Traffic to its Destination</strong></p>
<p>With the rules translated, we now turn to the dataplane - the physical and logical network paths that actual traffic flows through. The dataplane is where the rubber meets the road, or more aptly, where the packet meets the pod.</p>
<p>In the context of Ingress, the dataplane is responsible for carrying the external traffic, now governed by the rules set forth by the Ingress controller. This can involve a variety of operations, such as SSL termination, load balancing, and content-based routing, all performed with the goal of ensuring that incoming requests are delivered to the correct service within the cluster.</p>
<p>Usually, we create a service for the dataplane and then expose it to external access by using NodePort or LoadBalancer as mentioned earlier.</p>
<h2 id="heading-limitations-of-ingress">Limitations of Ingress</h2>
<p><strong>Limited Expressiveness</strong></p>
<p>One of the core limitations of Ingress lies in its limited expressiveness. Ingress rules are designed to be straightforward, focusing primarily on HTTP and HTTPS routing. This simplicity, while beneficial for ease of use and understanding, means that Ingress cannot natively handle more complex routing scenarios. For example, Ingress does not inherently support advanced load balancing algorithms, TCP or UDP traffic routing, or canary deployments out of the box.</p>
<p>This limitation can pose challenges for applications requiring more sophisticated traffic management and routing capabilities.</p>
<p><strong>Reliance on Annotations for Controller Implementation Extensions</strong></p>
<p>Another significant limitation is the way Ingress extends its functionality - through annotations. Each Ingress controller, such as Ingress-NGINX, Kong Ingress controller, or HAProxy, may implement additional features that are not part of the standard Ingress specification. These features are often enabled or configured via annotations in the Ingress resource.</p>
<p>While annotations provide a flexible way to extend Ingress capabilities, they also introduce variability and complexity. Different Ingress controllers might support different sets of annotations, leading to a lack of standardization across implementations. This can result in portability issues when moving applications between clusters using different Ingress controllers. Furthermore, relying heavily on annotations can make Ingress resources cluttered and harder to manage, especially as the number of annotations grows to accommodate more complex configurations.</p>
<h2 id="heading-advantages-of-gateway-api">Advantages of Gateway API</h2>
<p>Considering these limitations of Ingress, the Kubernetes community decided to make some changes and thus began the design of the Gateway API.</p>
<p>Gateway API offers several key advantages that make it an attractive choice for managing network traffic in Kubernetes:</p>
<p><strong>Role-oriented:</strong></p>
<p>Gateway API is designed with distinct roles for different types of users. This allows for a more fine-grained and role-specific way of controlling access to resources, improving security, and making the management of permissions more straightforward. For example, cluster operators can control infrastructure-related aspects of gateways and routes, while developers can manage application-level configurations.</p>
<p><strong>More Universal:</strong></p>
<p>Unlike Ingress, which primarily focuses on HTTP/HTTPS traffic, Gateway API is protocol-agnostic and can handle a broader range of traffic types, including TCP and UDP. This makes it a more universal solution for managing all types of network traffic within a Kubernetes cluster.</p>
<p><strong>Vendor-neutral:</strong></p>
<p>Gateway API is designed to be vendor-neutral, meaning it doesn't favor any specific networking solution. This ensures that it can be used consistently across different environments and networking solutions, enhancing portability and reducing vendor lock-in. This neutrality also fosters a more collaborative ecosystem, as improvements and extensions can benefit all users, regardless of their specific networking implementation.</p>
<h2 id="heading-role-oriented">Role-oriented</h2>
<p>In the context of the Kubernetes Gateway API, being "role-oriented" means that the API is designed with distinct roles for different types of users. This approach allows for a more fine-grained control over resources, improving security, and making the management of permissions more straightforward.</p>
<p>For example, the roles could be divided between cluster operators and application developers. Cluster operators control infrastructure-related aspects of gateways and routes, such as selecting the type of load balancer used, setting up SSL/TLS certificates, and managing access logs. On the other hand, application developers manage application-level configurations, such as specifying the routes for their applications, setting up path-based routing, and defining request and response headers.</p>
<p>This role-oriented design allows each user to focus on their area of expertise, without needing to worry about the details that are outside of their purview. It also ensures that only authorized users can make changes to specific aspects of the configuration, enhancing the overall security of the system.</p>
<h2 id="heading-why-the-managed-gateway-is-needed">Why the Managed Gateway is Needed</h2>
<p>Let's move on to the next section and discuss why a Managed Gateway is needed. As you can guess from its name, it simplifies our work.</p>
<h2 id="heading-why-multi-tenant-gateways-are-needed">Why Multi-Tenant Gateways Are Needed</h2>
<p>The Managed Gateway is essential for several reasons:</p>
<ul>
<li><p><strong>Simplified Management:</strong> Managed Gateways handle the underlying infrastructure, freeing up development teams to focus on application logic. This simplifies the management of the gateway, as the complexities of configuration, maintenance, and upgrades are handled by the provider.</p>
</li>
<li><p><strong>Scalability:</strong> Managed Gateways offer automatic scaling capabilities. They can scale up to handle high traffic loads and scale back down during quieter periods, ensuring optimal resource usage.</p>
</li>
<li><p><strong>Reliability:</strong> Managed Gateways provide high availability and fault tolerance. They are designed to maintain service continuity, even in the face of network disruptions or hardware failures.</p>
</li>
<li><p><strong>Security:</strong> Managed Gateways often come with built-in security features such as SSL encryption, and identity-based access control. This ensures that your applications are secure and compliant with industry standards.</p>
</li>
<li><p><strong>Isolation:</strong> Tenant traffic is isolated and protected, reducing the blast radius when issues occur.</p>
</li>
</ul>
<h2 id="heading-general-solution">General solution</h2>
<p>To handle this scenario, platform teams typically deploy multiple Ingress controllers in the same cluster. Here are two common solutions:</p>
<ul>
<li><p><strong>Differentiation through IngressClass:</strong> IngressClass is a Kubernetes resource that allows users to specify the type of Ingress controller they want to use in their applications. By defining different IngressClasses, users can associate their Ingress resources with specific Ingress controllers. This is beneficial as it allows for a clear separation and management of different application traffic within the cluster, each handled by its own Ingress controller.</p>
</li>
<li><p><strong>Differentiation through namespaces:</strong> Another method to differentiate between multiple Ingress controllers is by using namespaces. Kubernetes namespaces provide a scope for names and allow users to divide cluster resources between multiple users or teams. By deploying different Ingress controllers in different namespaces, users can ensure that each controller only manages the traffic for the applications within its own namespace. This approach enhances the security and organization of the cluster, as it provides a clear separation of concerns between different applications and their respective Ingress controllers.</p>
</li>
</ul>
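<p>As a sketch of the IngressClass approach (the class name and hostnames are illustrative; <code>k8s.io/ingress-nginx</code> is the controller name used by the Ingress-NGINX project), a team could bind its Ingress resources to a dedicated controller like this:</p>
<pre><code class="lang-yaml">apiVersion: networking.k8s.io/v1
kind: IngressClass
metadata:
  name: team-a
spec:
  controller: k8s.io/ingress-nginx # only this controller reconciles the class
---
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: team-a-app
spec:
  ingressClassName: team-a # ignored by controllers watching other classes
  rules:
  - host: a.example.com
    http:
      paths:
      - path: /
        pathType: Prefix
        backend:
          service:
            name: team-a-svc
            port:
              number: 80
</code></pre>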
<h2 id="heading-pain-point">Pain point</h2>
<p>While deploying multiple Ingress controllers in the same cluster may seem like a viable solution, it introduces several pain points:</p>
<ul>
<li><p><strong>Increased Complexity:</strong> Managing multiple Ingress controllers increases the complexity of your Kubernetes cluster. Each controller has its own configuration and behavior, leading to potential inconsistencies and conflicts. This can make it hard to predict how the controllers will interact and complicate debugging efforts when issues arise.</p>
</li>
<li><p><strong>Resource Wastage:</strong> Each Ingress controller requires its own resources to run. Deploying multiple controllers may lead to resource wastage, especially if some controllers are not fully utilized. This inefficiency can lead to increased costs and reduced performance for your cluster.</p>
</li>
<li><p><strong>Increased Maintenance:</strong> With multiple Ingress controllers, you need to maintain each one separately. This includes monitoring, updating, troubleshooting, and securing each controller. This increased maintenance can take significant time and effort.</p>
</li>
<li><p><strong>Lack of Centralized Control:</strong> Having multiple Ingress controllers can make it difficult to achieve centralized control and visibility over your network traffic. Without a single point of control, it can be challenging to manage traffic routing consistently and apply uniform security policies across all controllers.</p>
</li>
<li><p><strong>Potential Security Risks:</strong> Each Ingress controller has its own security features and configurations. If not properly managed, having multiple controllers can introduce security risks, as each controller could potentially be a separate point of vulnerability in your cluster.</p>
</li>
</ul>
<h2 id="heading-kong-gateway-operator">Kong Gateway Operator</h2>
<p>Today, let me share an innovative open-source project created by Kong Inc., known as the <a target="_blank" href="https://konghq.com/products/kong-gateway-operator"><strong>Kong Gateway Operator</strong></a>. This project is not just another tool in the tech space; it's a game-changer designed to address some of the most pressing challenges we face in managing API gateways.</p>
<p><img src="https://prd-mktg-konghq-com.imgix.net/images/2024/03/65f9d38b-graphic-hero-kong-gateway-operator.png?auto=format&amp;fit=max&amp;w=2560" alt /></p>
<p>In our journey towards digital transformation, we often encounter bottlenecks that hinder our progress. The Kong Gateway Operator is here to eliminate those bottlenecks by offering:</p>
<ol>
<li><p><strong>Full Lifecycle Management</strong>: Managing the lifecycle of Kong Gateway has never been easier. From deployment to retirement, the Kong Gateway Operator ensures a smooth journey. Moreover, it supports a blue/green strategy for upgrading Kong Gateway, making transitions seamless and minimizing downtime.</p>
</li>
<li><p><strong>Elastic Scaling</strong>: Cost-efficiency is paramount in today's tech landscape. The Kong Gateway Operator leverages Horizontal Pod Autoscaling (HPA) and latency measurements to dynamically scale your resources. This not only ensures optimal performance but also significantly reduces operational costs.</p>
</li>
<li><p><strong>Automatic Certificate Rotation</strong>: Security is a top priority, and managing certificates can be a cumbersome task. Thanks to the integration with cert-manager, the Kong Gateway Operator automates certificate rotation, ensuring your applications are always secure without the manual hassle.</p>
</li>
<li><p><strong>AI Gateway Support</strong>: In an era where AI and LLM (Large Language Models) applications are becoming increasingly prevalent, the Kong Gateway Operator sets the stage by prioritizing AI gateway support. This feature is designed to streamline the development and deployment of AI-driven applications, making it easier for developers to integrate cutting-edge technologies into their solutions.</p>
</li>
</ol>
<h2 id="heading-how-to-deploy-a-managed-gateway-using-kgo-in-civo">How to Deploy a Managed Gateway using KGO in Civo</h2>
<h3 id="heading-create-kubernetes-cluster">Create Kubernetes cluster</h3>
<p>Clusters on Civo run <a target="_blank" href="https://k3s.io/">k3s</a>, a lightweight distribution that has passed Kubernetes conformance certification.</p>
<p>To create a cluster on Civo, you can use the Civo dashboard or <a target="_blank" href="https://www.civo.com/docs/overview/civo-cli">install Civo's CLI</a>. Creating clusters through the CLI is very convenient; here I have already installed and configured it.</p>
<pre><code class="lang-bash">➜  ~ civo k8s create --merge  --save  --wait
Merged with main kubernetes config: /home/tao/.kube/config

Access your cluster with:
kubectl config use-context red-nightingale-6cf01085
kubectl get node
The cluster red-nightingale-6cf01085 (9752f456-f316-4bdf-95fe-82c9b08db61b) has been created in 1 min 9 sec
</code></pre>
<p><strong>Deploying clusters on Civo is very fast, which is one of the reasons why I like Civo the most.</strong></p>
<h3 id="heading-install-kgo">Install KGO</h3>
<p>KGO can be installed through Helm, and the CRDs of Kubernetes Gateway API are already included in KGO's Helm chart, so there is no need to perform a separate installation step.</p>
<ul>
<li>Add kong Helm repo.</li>
</ul>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ helm repo add kong https://charts.konghq.com
<span class="hljs-string">"kong"</span> has been added to your repositories
(⎈|red-nightingale-6cf01085:default)➜  ~ helm repo update
Hang tight <span class="hljs-keyword">while</span> we grab the latest from your chart repositories...
...Successfully got an update from the <span class="hljs-string">"kong"</span> chart repository
Update Complete. ⎈Happy Helming!⎈
</code></pre>
<ul>
<li>Install KGO using Helm.</li>
</ul>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ helm upgrade --install kgo kong/gateway-operator -n kong-system --create-namespace --<span class="hljs-built_in">set</span> image.tag=1.3

Release <span class="hljs-string">"kgo"</span> does not exist. Installing it now.
NAME: kgo
LAST DEPLOYED: Sat Jul 20 01:40:28 2024
NAMESPACE: kong-system
STATUS: deployed
REVISION: 1
TEST SUITE: None
NOTES:
kgo-gateway-operator has been installed. Check its status by running:

  kubectl --namespace kong-system  get pods

For more details, please refer to the following documents:

* https://docs.konghq.com/gateway-operator/latest/get-started/kic/create-gateway/
* https://docs.konghq.com/gateway-operator/latest/get-started/konnect/deploy-data-plane/
</code></pre>
<h3 id="heading-create-gateway">Create Gateway</h3>
<p>To use the Gateway API resources to configure your routes, you need to create a GatewayClass instance and create a Gateway resource that listens on the ports that you need.</p>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ <span class="hljs-built_in">echo</span> <span class="hljs-string">'
kind: GatewayConfiguration
apiVersion: gateway-operator.konghq.com/v1beta1
metadata:
 name: kong
 namespace: default
spec:
 dataPlaneOptions:
   deployment:
     podTemplateSpec:
       spec:
         containers:
         - name: proxy
           image: kong:3.7.1
           readinessProbe:
             initialDelaySeconds: 1
             periodSeconds: 1
 controlPlaneOptions:
   deployment:
     podTemplateSpec:
       spec:
         containers:
         - name: controller
           image: kong/kubernetes-ingress-controller:3.2.0
           env:
           - name: CONTROLLER_LOG_LEVEL
             value: debug
---
kind: GatewayClass
apiVersion: gateway.networking.k8s.io/v1beta1
metadata:
 name: kong
spec:
 controllerName: konghq.com/gateway-operator
 parametersRef:
   group: gateway-operator.konghq.com
   kind: GatewayConfiguration
   name: kong
   namespace: default
---
kind: Gateway
apiVersion: gateway.networking.k8s.io/v1beta1
metadata:
 name: kong
 namespace: default
spec:
 gatewayClassName: kong
 listeners:
 - name: http
   protocol: HTTP
   port: 80

'</span> | kubectl apply -f -

gatewayconfiguration.gateway-operator.konghq.com/kong created
gatewayclass.gateway.networking.k8s.io/kong created
gateway.gateway.networking.k8s.io/kong created
</code></pre>
<p>You can verify whether the Gateway has been successfully created by using the following command.</p>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ kubectl get Gateway -A
NAMESPACE   NAME   CLASS   ADDRESS         PROGRAMMED   AGE
default     kong   kong    74.220.29.143   True         3m35s

(⎈|red-nightingale-6cf01085:default)➜  ~ <span class="hljs-built_in">export</span> PROXY_IP=$(kubectl get gateway kong -n default -o jsonpath=<span class="hljs-string">'{.status.addresses[0].value}'</span>)
(⎈|red-nightingale-6cf01085:default)➜  ~ curl <span class="hljs-variable">$PROXY_IP</span>                                                                                   
{
  <span class="hljs-string">"message"</span>:<span class="hljs-string">"no Route matched with those values"</span>,
  <span class="hljs-string">"request_id"</span>:<span class="hljs-string">"382d5a15ac243b3183b5b239a5e3ae77"</span>
}
</code></pre>
<p>At the same time, you can also see that KGO has created deployments for CP and DP in the default namespace.</p>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ kubectl get deploy          
NAME                            READY   UP-TO-DATE   AVAILABLE   AGE
dataplane-kong-tkbct-snkrk      1/1     1            1           9m5s
controlplane-kong-lv62b-tf6xl   1/1     1            1           9m5s
</code></pre>
<h3 id="heading-create-another-gateway">Create another Gateway</h3>
<p>To verify KGO's multi-tenant capability, let's create a new namespace and a Gateway resource within it.</p>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ kubectl create ns moe        
namespace/moe created

(⎈|red-nightingale-6cf01085:default)➜  ~ <span class="hljs-built_in">echo</span> <span class="hljs-string">'
kind: Gateway
apiVersion: gateway.networking.k8s.io/v1beta1
metadata:
 name: moe
 namespace: moe
spec:
 gatewayClassName: kong
 listeners:
 - name: http
   protocol: HTTP
   port: 80

'</span> | kubectl apply -f -
</code></pre>
<p>We can check whether the Gateway has been deployed correctly by viewing the status of the Gateway resources, or verify it by requesting the Gateway's IP as in the previous section.</p>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ kubectl get Gateway -A
NAMESPACE   NAME   CLASS   ADDRESS         PROGRAMMED   AGE
default     kong   kong    74.220.29.143   True         14m
moe         moe    kong    74.220.28.102   True         8m6s
</code></pre>
<p>Of course, a more intuitive way would be to check the status of kocp and kodp.</p>
<pre><code class="lang-bash">(⎈|red-nightingale-6cf01085:default)➜  ~ kubectl get kodp,kocp -A
NAMESPACE   NAME                                               READY
default     dataplane.gateway-operator.konghq.com/kong-tkbct   True
moe         dataplane.gateway-operator.konghq.com/moe-bfwnl    True

NAMESPACE   NAME                                                  READY   PROVISIONED
default     controlplane.gateway-operator.konghq.com/kong-lv62b   True    True
moe         controlplane.gateway-operator.konghq.com/moe-9dgjc    True    True
</code></pre>
<p>After the Gateway is ready, you can refer to the <a target="_blank" href="https://docs.konghq.com/gateway-operator/1.3.x/get-started/kic/create-route/">KGO official documentation</a> to use Kubernetes Gateway API to create routes.</p>
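<p>As a hedged sketch (the route name, backend Service name, and port below are hypothetical, not taken from the KGO docs), an HTTPRoute attached to the <code>moe</code> Gateway created above might look like this:</p>
<pre><code class="lang-bash">cat &lt;&lt;EOF | kubectl apply -f -
apiVersion: gateway.networking.k8s.io/v1beta1
kind: HTTPRoute
metadata:
  name: echo
  namespace: moe
spec:
  parentRefs:
  - name: moe            # the Gateway created above
  rules:
  - matches:
    - path:
        type: PathPrefix
        value: /echo
    backendRefs:
    - name: echo         # hypothetical backend Service
      port: 1027
EOF
</code></pre>
<p>Once a route like this is applied, requests to the Gateway's IP under <code>/echo</code> should reach the backend Service instead of returning the "no Route matched" message.</p>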
<h2 id="heading-summary">Summary</h2>
<p>In this article, I introduced why we need to use a multi-tenant Gateway in Kubernetes clusters, and also demonstrated how to deploy the Gateway through KGO in Civo's Kubernetes cluster.</p>
<p>KGO provides a wide range of capabilities. Please feel free to explore them! <a target="_blank" href="https://konghq.com/products/kong-gateway-operator">https://konghq.com/products/kong-gateway-operator</a></p>
]]></content:encoded></item><item><title><![CDATA[How to reduce the cost of GitHub Actions]]></title><description><![CDATA[I'll cover how to reduce the cost of GitHub Actions, and give some advice.

According to G2's statistical report, GitHub Actions is the easiest-to-use CI/CD tool, and more and more people like it.

Since GitHub Actions is GitHub's native CI/CD tool, ...]]></description><link>https://blog.moelove.info/how-to-reduce-the-cost-of-github-actions</link><guid isPermaLink="true">https://blog.moelove.info/how-to-reduce-the-cost-of-github-actions</guid><category><![CDATA[GitHub]]></category><category><![CDATA[GitHub Actions]]></category><category><![CDATA[Open Source]]></category><category><![CDATA[WeMakeDevs]]></category><category><![CDATA[finops]]></category><dc:creator><![CDATA[Jintao Zhang]]></dc:creator><pubDate>Thu, 26 Jan 2023 20:05:36 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1674763354973/d3fc5240-5e60-478b-9533-4ef6f1c2dc87.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote>
<p>I'll cover how to reduce the cost of GitHub Actions, and give some advice.</p>
</blockquote>
<p>According to G2's <a target="_blank" href="https://www.g2.com/categories/continuous-integration?tab=easiest_to_use">statistical report</a>, GitHub Actions is the easiest-to-use CI/CD tool, and more and more people like it.</p>
<p><img src="https://s2.loli.net/2023/01/24/zG5gTWm1lHxBMK2.png" alt="2023-01-24 08-12-54屏幕截图.png" /></p>
<p>Since GitHub Actions is GitHub's native CI/CD tool, tens of thousands of Actions can be used directly in the marketplace, and it is free for public repositories. More and more projects are switching their CI tools to GitHub Actions.</p>
<p>I also really like GitHub Actions and use it for almost all my GitHub-hosted repositories.</p>
<p>But recently a project I was working on hit the GitHub Actions quota limit, which made me spend some time looking into its cost.</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://twitter.com/zhangjintao9020/status/1616077513125691399?s=20&amp;t=we9o9FxhgSEOXzY39czbJg">https://twitter.com/zhangjintao9020/status/1616077513125691399?s=20&amp;t=we9o9FxhgSEOXzY39czbJg</a></div>
<p> </p>
<h2 id="heading-why-is-the-quota-exhausted">Why is the quota exhausted?</h2>
<p>Recently I found an interesting project: <a target="_blank" href="https://github.com/upptime/upptime">upptime/upptime: ⬆️ Free uptime monitor and status page powered by GitHub</a></p>
<p>I wanted to use it to monitor some of the services I have developed and build a status page. Since this involves some API configuration that I don't want to make public, I forked the project into a private repository. After a simple configuration, it worked fine.</p>
<p>Since I wanted more data, I tweaked the CI schedule configuration to make these tasks run more frequently.</p>
<pre><code class="lang-yaml"><span class="hljs-attr">workflowSchedule:</span>
  <span class="hljs-attr">graphs:</span> <span class="hljs-string">"0 * * * *"</span>
  <span class="hljs-attr">responseTime:</span> <span class="hljs-string">"0 * * * *"</span>
  <span class="hljs-attr">staticSite:</span> <span class="hljs-string">"0 * * * *"</span>
  <span class="hljs-attr">summary:</span> <span class="hljs-string">"0 * * * *"</span>
  <span class="hljs-attr">updateTemplate:</span> <span class="hljs-string">"0 * * * *"</span>
  <span class="hljs-attr">updates:</span> <span class="hljs-string">"0 * * * *"</span>
  <span class="hljs-attr">uptime:</span> <span class="hljs-string">"*/5 * * * *"</span>
</code></pre>
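<p>As a quick sanity check on the schedule above (standard cron field order: minute, hour, day of month, month, day of week), the <code>*/5 * * * *</code> entry fires every 5 minutes:</p>
<pre><code class="lang-bash"># 60/5 = 12 runs per hour for the uptime workflow
echo $(( 60 / 5 * 24 ))   # 288 runs per day
</code></pre>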
<p>According to the <a target="_blank" href="https://docs.github.com/en/billing/managing-billing-for-github-actions/about-billing-for-github-actions">billing documentation for GitHub Actions</a>, GitHub Actions for public repositories is Free, but there is a quota limit for private repositories.</p>
<blockquote>
<p>GitHub Actions usage is free for standard GitHub-hosted runners in public repositories, and for Self-hosted runners. For private repositories, each GitHub account receives a certain amount of free minutes and storage for use with GitHub-hosted runners, depending on the product used with the account. Any usage beyond the included amounts is controlled by spending limits.</p>
</blockquote>
<p>Soon I received a quota reminder email from GitHub, reminding me that the quota was about to be used up.</p>
<p>This got me thinking about how to solve it.</p>
<h2 id="heading-cost-of-using-github-actions">Cost of using GitHub Actions</h2>
<p>Making the repository public is the most straightforward way, but I explained above why it cannot be made public, so I had to find another solution.</p>
<p>Paying for GitHub Actions is also a very straightforward solution.</p>
<p>Before deciding to pay for it, I want to estimate the cost. GitHub provides a <a target="_blank" href="https://github.com/pricing/calculator">Pricing Calculator</a>, which can easily estimate costs.</p>
<p>Since I modified the CI's scheduling configuration, the most frequent task now runs every 5 minutes.</p>
<p>I used <a target="_blank" href="https://meercode.io/">Meercode</a> to collect the running data of GitHub Actions in this repository. It provides some dashboards by default:</p>
<p><img src="https://s2.loli.net/2023/01/25/m5S3ZAtdpNjQz9y.png" alt="2023-01-25 11-29-12 screenshot.png" /></p>
<p>It also allows users to create custom dashboards, so I created my own. If you are interested in Meercode, please let me know in the comments.</p>
<p><img src="https://s2.loli.net/2023/01/25/5bUsJnwmurC1fxl.png" alt="ci-dashboard.png" /></p>
<p>As can be seen from the figure above, each task takes no more than 0.5 minutes, and there are no more than 12 tasks per hour. Using the price calculator, the approximate <strong>cost is $35 per month</strong>.</p>
<p><img src="https://s2.loli.net/2023/01/25/DGQ3ie2zW6MTjC1.png" alt="2023-01-25 11-25-13 screenshot.png" /></p>
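<p>A back-of-envelope check of that estimate (the numbers below are my own rough math, not from the calculator, assuming the standard $0.008/min rate for a Linux GitHub-hosted runner and ignoring the included free minutes):</p>
<pre><code class="lang-bash"># 12 jobs/hour x 0.5 min/job = 6 billable minutes per hour,
# over roughly 730 hours in a month
minutes=$(( 12 * 730 / 2 ))
echo "$minutes"    # 4380 minutes/month
awk -v m="$minutes" 'BEGIN { printf "~$%.2f/month\n", m * 0.008 }'
</code></pre>
<p>That lands right around the calculator's $35/month figure.</p>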
<h2 id="heading-ways-to-save-costs">Ways to save costs</h2>
<p>Since my repository mainly runs uptime CI jobs, which consume few resources but run frequently, I wondered whether I could save costs with a self-hosted runner.</p>
<p>I compared the prices of 3 lower-priced cloud service providers:</p>
<ul>
<li><p><a target="_blank" href="https://www.civo.com/">Civo</a></p>
</li>
<li><p><a target="_blank" href="https://www.digitalocean.com/pricing/droplets">DigitalOcean</a></p>
</li>
<li><p><a target="_blank" href="https://www.vultr.com/pricing/">Vultr</a></p>
</li>
</ul>
<p>Among them, both Civo and Vultr provide 1C1G instances at $5/month, and DigitalOcean instances with the same specifications are priced at $6/month.</p>
<p>I finally chose <a target="_blank" href="https://www.civo.com/">Civo</a>, which is a <em>cloud-native service provider</em>, and there is an introduction on its homepage:</p>
<blockquote>
<p>Transparent pricing from just $5 a month</p>
</blockquote>
<p>Civo provides a variety of services, such as Kubernetes (based on k3s), or compute instances.</p>
<p>The <em>Extra Small</em> instance type is 1C1G with 1TB of traffic, and if you choose the Kubernetes service, you do not need to pay for the control plane (same as Azure AKS). Even the larger specs look cheap.</p>
<p><img src="https://s2.loli.net/2023/01/25/jzmTs6X7PR9QVMx.png" alt="2023-01-25 17-04-35 screenshot.png" /></p>
<p>I have tried both its Kubernetes service and its compute instances, and they both work fine.</p>
<p><img src="https://s2.loli.net/2023/01/25/2ur4YJ6AlvKgm8O.png" alt="2023-01-25 17-08-43 screenshot.png" /></p>
<h3 id="heading-using-compute-instances">Using compute instances</h3>
<p>Deploying the GitHub Actions runner on a Linux compute instance is simple: just add a new runner to the project at <code>https://github.com/&lt;Your name&gt;/&lt;Project name&gt;/settings/actions/runners/new</code>.</p>
<p>That page lists the complete deployment steps; just follow them.</p>
<p>My installation process is as follows:</p>
<pre><code class="lang-bash">civo@polished-bush-99d8-1926a1:~$ mkdir actions-runner &amp;&amp; <span class="hljs-built_in">cd</span> actions-runner
civo@polished-bush-99d8-1926a1:~/actions-runner$ curl -o actions-runner-linux-x64-2.301.1.tar.gz -L https://github.com/actions/runner/releases/download/v2.301.1/actions-runner-linux-x64-2.301.1.tar.gz
civo@polished-bush-99d8-1926a1:~/actions-runner$ <span class="hljs-built_in">echo</span> <span class="hljs-string">"3ee9c3b83de642f919912e0594ee2601835518827da785d034c1163f8efdf907  actions-runner-linux-x64-2.301.1.tar.gz"</span> | shasum -a 256 -c
actions-runner-linux-x64-2.301.1.tar.gz: OK                                                                     
civo@polished-bush-99d8-1926a1:~/actions-runner$ tar xzf ./actions-runner-linux-x64-2.301.1.tar.gz              
civo@polished-bush-99d8-1926a1:~/actions-runner$ ./config.sh --url https://github.com/MoeLove/monitoring --token <span class="hljs-variable">$TOKEN</span>
</code></pre>
<p>After the execution is complete, some files will be added to the current directory. Execute <code>./run.sh</code> to start the GitHub Actions runner.</p>
<pre><code class="lang-bash">civo@polished-bush-99d8-1926a1:~/actions-runner$ ls
_diag  _work  actions-runner-linux-x64-2.301.1.tar.gz  bin  config.sh  env.sh  externals  run-helper.cmd.template  run-helper.sh  run-helper.sh.template  run.sh  safe_sleep.sh  svc.sh
</code></pre>
<p>If you want the runner to run stably in the background, you can execute <code>./svc.sh install</code> to install it as a systemd service and manage its lifecycle through systemd.</p>
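<p>Based on the helper script bundled with the runner (exact subcommands may vary between runner versions), the sequence is roughly:</p>
<pre><code class="lang-bash">sudo ./svc.sh install    # creates a systemd unit for the runner
sudo ./svc.sh start
sudo ./svc.sh status     # verify the service is running
</code></pre>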
<h3 id="heading-using-kubernetes">Using Kubernetes</h3>
<p>Civo does not charge for the Kubernetes control plane, only for worker nodes. The advantage of using Kubernetes is that runners can automatically scale up and down in the cluster, and I can easily create multiple runners for different projects.</p>
<p>Since GitHub does not officially provide a way to deploy a self-hosted runner on Kubernetes, I used the <a target="_blank" href="https://github.com/actions/actions-runner-controller">Actions Runner Controller (ARC)</a> project, which allows rapid deployment of self-hosted runners through <code>Runner</code> custom resources.</p>
<p>The deployment process is clearly described in the <a target="_blank" href="https://github.com/actions/actions-runner-controller/blob/master/docs/quickstart.md">documentation</a>. The following is my deployment process.</p>
<pre><code class="lang-sh"><span class="hljs-comment"># deploy cert-manager</span>
(MoeLove) ➜ kubectl apply -f https://github.com/cert-manager/cert-manager/releases/download/v1.11.0/cert-manager.yaml

<span class="hljs-comment"># deploy ARC</span>
(MoeLove) ➜ helm repo add actions-runner-controller https://actions-runner-controller.github.io/actions-runner-controller
(MoeLove) ➜ helm upgrade --install --namespace actions-runner-system --create-namespace \
  --<span class="hljs-built_in">set</span>=authSecret.create=<span class="hljs-literal">true</span> \
  --<span class="hljs-built_in">set</span>=authSecret.github_token=<span class="hljs-string">"REPLACE_YOUR_TOKEN_HERE"</span> \
  --<span class="hljs-built_in">wait</span> actions-runner-controller actions-runner-controller/actions-runner-controller

<span class="hljs-comment"># create runner</span>
(MoeLove) ➜ cat &lt;&lt;EOF | kubectl apply -f -
apiVersion: actions.summerwind.dev/v1alpha1
kind: RunnerDeployment
metadata:
  name: moelove-runner
spec:
  replicas: 1
  template:
    spec:
      repository: MoeLove/monitoring
EOF
</code></pre>
<p>After installation, the following results are achieved:</p>
<div class="embed-wrapper"><div class="embed-loading"><div class="loadingRow"></div><div class="loadingRow"></div></div><a class="embed-card" href="https://twitter.com/zhangjintao9020/status/1616251840429002755?s=20&amp;t=SM0rFgSfq8b03CD11dhdNw">https://twitter.com/zhangjintao9020/status/1616251840429002755?s=20&amp;t=SM0rFgSfq8b03CD11dhdNw</a></div>
<p> </p>
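<p>ARC can also scale the runner count automatically. As a hedged sketch (the thresholds here are illustrative; check the ARC documentation for your version), a <code>HorizontalRunnerAutoscaler</code> targeting the <code>RunnerDeployment</code> above might look like this:</p>
<pre><code class="lang-bash">cat &lt;&lt;EOF | kubectl apply -f -
apiVersion: actions.summerwind.dev/v1alpha1
kind: HorizontalRunnerAutoscaler
metadata:
  name: moelove-runner-autoscaler
spec:
  scaleTargetRef:
    name: moelove-runner   # the RunnerDeployment created above
  minReplicas: 1
  maxReplicas: 3
  metrics:
  - type: PercentageRunnersBusy
    scaleUpThreshold: "0.75"
    scaleDownThreshold: "0.25"
EOF
</code></pre>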
<h2 id="heading-self-hosted-vs-github-managed">Self-hosted vs GitHub-managed</h2>
<p>Above, I introduced how I used Meercode to measure key CI metrics and estimate the cost of GitHub Actions. Given my scenario of low resource consumption but high run frequency, I chose a self-hosted runner.</p>
<p>So when is it more appropriate to choose a GitHub-managed runner? What are the benefits of GitHub-managed?</p>
<p>The GitHub-managed runner has the following advantages:</p>
<ul>
<li><p><strong>Support for multiple operating systems</strong>: In addition to providing Linux systems, GitHub-managed runner also supports macOS and Windows, but most cloud providers do not provide macOS environments. (I used to put some Mac minis as servers in the data center for specific scenarios)</p>
</li>
<li><p><strong>VM-level isolation</strong>: According to the GitHub Actions documentation, a GitHub-hosted runner creates a fresh VM for each job, which provides strong security and isolation guarantees. With a self-hosted runner, jobs run through the binary share the host environment, and jobs run through ARC are isolated only at the Pod level, which can raise certain security issues.</p>
</li>
<li><p><strong>Low Maintenance Costs</strong>: In any large system, maintenance is expensive. If only personal projects or a few repositories use a self-hosted runner, the maintenance cost is relatively controllable; once it grows, it introduces a lot of complexity. The GitHub-managed runner is maintained by GitHub.</p>
</li>
</ul>
<p>There are also two products that offer self-hosted runner services:</p>
<ul>
<li><p><a target="_blank" href="https://actuated.dev?umt_source=blog.moelove.info">Actuated</a></p>
</li>
<li><p><a target="_blank" href="https://cirun.io?umt_source=blog.moelove.info">cirun</a></p>
</li>
</ul>
<p>They reduce the cost of runner maintenance and management and provide more secure isolation and support for Arm-based environments. cirun also provides GPU runner support.</p>
<p>If you have the above requirements, you may also wish to consider these services.</p>
<h2 id="heading-summarize">Summary</h2>
<p>In general, reducing the cost of GitHub Actions takes the following steps.</p>
<ul>
<li><p>Visualization/Observability: Estimate costs using actual data.</p>
</li>
<li><p>Compare multiple vendors/solutions: Different vendors offer different pricing for different scenarios or products, and you can choose according to your actual situation.</p>
</li>
<li><p>Security and maintenance costs also need to be considered.</p>
</li>
</ul>
<p>If you are interested in my articles, please subscribe to my Newsletter!</p>
]]></content:encoded></item><item><title><![CDATA[My Rust journey and how to learn Rust]]></title><description><![CDATA[I'll share my Rust journey, how I learned Rust and some free Rust learning resources.

Rust has become more and more popular. Through the StackOverflow 2022 Developer Survey, we can see that many people are interested in Rust.

Rust is on its seventh...]]></description><link>https://blog.moelove.info/my-rust-journey-and-how-to-learn-rust</link><guid isPermaLink="true">https://blog.moelove.info/my-rust-journey-and-how-to-learn-rust</guid><category><![CDATA[Rust]]></category><category><![CDATA[#prometheus]]></category><category><![CDATA[cloud native]]></category><category><![CDATA[WeMakeDevs]]></category><category><![CDATA[WebAssembly]]></category><dc:creator><![CDATA[Jintao Zhang]]></dc:creator><pubDate>Tue, 17 Jan 2023 11:27:40 GMT</pubDate><enclosure url="https://cdn.hashnode.com/res/hashnode/image/upload/v1673954421780/58ec55f9-907a-4ff4-b9c7-7b65713680cf.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<blockquote>
<p>I'll share my Rust journey, how I learned Rust and some free Rust learning resources.</p>
</blockquote>
<p>Rust has become more and more popular. Through the <a target="_blank" href="https://survey.stackoverflow.co/2022/#most-loved-dreaded-and-wanted-language-want">StackOverflow 2022 Developer Survey</a>, we can see that many people are interested in Rust.</p>
<blockquote>
<p>Rust is on its seventh year as the most loved language with 87% of developers saying they want to continue using it.</p>
<p>Rust also ties with Python as the most wanted technology with TypeScript running a close second</p>
</blockquote>
<ul>
<li><p>Most Wanted</p>
<p>  <img src="https://s2.loli.net/2023/01/14/mW5X6EgYLSfB3Qk.png" alt="2023-01-14 01-16-33屏幕截图.png" /></p>
</li>
<li><p>Most Loved vs. Dreaded</p>
<p>  <img src="https://s2.loli.net/2023/01/14/69Wgk1zcHN8TQeb.png" alt="2023-01-14 01-16-07屏幕截图.png" /></p>
</li>
</ul>
<p>But Rust has quite a steep learning curve.</p>
<p><img src="https://camo.githubusercontent.com/1d24e64022fd725f1896890b3ce14c560f075dc1f80f0b0baae3ece8981c882a/68747470733a2f2f70617065722d6174746163686d656e74732e64726f70626f782e636f6d2f735f353445314239364546464546443239343536323930324443354239393731443335434436423635304243383744313230303341333041343635313737363230315f313538363531343237353631385f696d6167652e706e67" alt="pic from Rust User Team Samara - &amp;Meetup1" /></p>
<p>This made me want to share my Rust journey, why I chose Rust, and how to learn Rust.</p>
<h2 id="heading-getting-connected-with-rust">Getting connected with Rust</h2>
<p>I had heard about Rust when it was first released, and my impression was that it was a system programming language that could replace C/C++ and was safe enough. But I didn't learn and use it. (I've only used it to write Hello World!)</p>
<p><mark>Back in time to 5 years ago, I was leading the transformation of the company's infrastructure into a cloud-native stack.</mark></p>
<p>I needed to build a monitoring stack based entirely on Prometheus to replace the company's self-developed monitoring software, which had more than 10 years of history, as well as several other monitoring tools such as Nagios, Zabbix, and Graphite.</p>
<p>Yes, you read that right, we were using a lot of monitoring software. There are a few reasons for this:</p>
<ul>
<li><p>A single software cannot meet all needs</p>
</li>
<li><p>The team was scattered, and most of the time new software was introduced just to meet a specific need rather than to solve the underlying problem</p>
</li>
</ul>
<p>Anyway, these are historical reasons.</p>
<p>As I mentioned above, we had a set of self-developed monitoring software with more than 10 years of history; as you can see, our infrastructure was slow to iterate.</p>
<p>And because we run our own physical data centers, many of our servers are old machines that have not been updated. (This is one of the reasons why I used Rust later)</p>
<p>I first replaced the monitoring stack in a newly launched small data center with about 400 machines, and the results were good: Prometheus monitored all the servers in this small data center and the various services running on them, with dashboards created in Grafana and alert notifications set up through Alertmanager.</p>
<p>Later, I rolled out these changes in two more data centers, and overall it went relatively smoothly; Kubernetes monitoring was also completed during this process.</p>
<p>But when it was implemented in the last data center, I faced the biggest challenge.</p>
<p><a target="_blank" href="https://github.com/prometheus/node_exporter/"><strong>node_exporter</strong></a> <strong>failed to start on some machines, and some machines crashed automatically after running for some time.</strong></p>
<p>I started to investigate this issue. For the automatic crash issue, I temporarily fixed it by adding a restart script.</p>
<p>I'm mainly concerned with why node_exporter won't start. <mark>I found that the operating system of this part of the machine is CentOS 5, and the kernel is 2.6.18.</mark></p>
<p>I found that there are already similar issues in the community: <a target="_blank" href="https://github.com/prometheus/node_exporter/issues/691">https://github.com/prometheus/node_exporter/issues/691</a></p>
<p><strong>At the same time, I also noticed that the Go documentation clearly stated that CentOS 5 is not supported, and a kernel of at least version 2.6.32 or above is required.</strong></p>
<p>(I forgot the minimum dependencies when I checked, but through the <a target="_blank" href="https://web.archive.org/web/20170916192117/https://github.com/golang/go/wiki/MinimumRequirements">web archive</a>, I see that the minimum kernel version required in 2017 is 2.6.23)</p>
<p>After some searching, I also saw something like <a target="_blank" href="https://dave.cheney.net/2013/06/18/how-to-install-go-1-1-on-centos-5">How to install Go 1.1 on CentOS 5.9</a>, but at the same time, some known issues are mentioned in the article.</p>
<p>So I'm not going to keep fighting it.</p>
<p><strong><mark>I want to re-implement one by myself</mark></strong><mark>, which can also solve the above automatic crash problem.</mark></p>
<p><strong>In the end, I used Rust to implement a tool similar to node_exporter and completed the upgrade and transformation of the monitoring system.</strong></p>
<p><strong>This is where my journey started with Rust in production.</strong></p>
<p>Next, let me introduce why I chose Rust.</p>
<h2 id="heading-why-choose-rust">Why choose Rust</h2>
<p>I have introduced some background above. At that time, the easiest choice would have been Python, which is simple enough and has a rich ecosystem. I also have many years of Python development experience, so I could quickly build the tools I needed.</p>
<p>The reasons for not choosing Python are:</p>
<ul>
<li><p>Not all of these machines had a Python environment, and the Python versions varied. I was also asked to avoid modifying the environment on these machines as much as possible;</p>
</li>
<li><p>Since I might make modifications later, distributing updates would not be convenient;</p>
</li>
</ul>
<p>Then I rethought my goal:</p>
<ul>
<li>It should compile into a binary executable for easy distribution and deployment (I used Ansible for unified deployment).</li>
</ul>
<p>So a more suitable option is C/C++/Rust.</p>
<p>I have more experience in C development and a little experience in C++, and all three languages can easily meet my first requirement.</p>
<p>When most people compare Rust and C/C++, they are comparing their performance and safety.</p>
<p>In my use case at the time, I don't think the results in the other two languages would have been worse than in Rust, although performance and safety were still considerations. In fact, since I was just starting to learn Rust, my Rust implementation might well have been worse than a C one.</p>
<p><strong>But I wanted more of a challenge and to try something new, and in terms of Prometheus monitoring, the C/C++ ecosystem is not very active. Also, I believed Rust would see great development in the future.</strong></p>
<p>So in the end I chose Rust.</p>
<h2 id="heading-how-i-learned-rust">How I learned Rust</h2>
<p>Rust is not simple, and it's not quite the same as other languages, so some practices that work in other languages may not work in Rust.</p>
<p><mark>Since I have a specific problem that needs to be solved, I need to implement a </mark> <a target="_blank" href="https://github.com/prometheus/node_exporter/"><mark>node_exporter</mark></a> <mark> to complete the transformation of the monitoring stack. So I learned Rust through the learning-by-doing mode.</mark></p>
<p>I first took a quick look at the following:</p>
<ul>
<li><p><a target="_blank" href="https://doc.rust-lang.org/stable/book/">The Rust Programming Language</a>: This book is very comprehensive; I didn't read it cover to cover at first. Instead, I used it to understand the main concepts and some common usages in Rust.</p>
</li>
<li><p><a target="_blank" href="https://doc.rust-lang.org/rust-by-example/">Rust By Example</a>: There are many examples here, and you can also increase your familiarity with Rust by practicing these examples;</p>
</li>
<li><p><a target="_blank" href="https://doc.rust-lang.org/std/index.html">Rust std lib docs</a>: Documentation of the standard library; skim it to get familiar with some keywords, modules, etc. It is not necessary to read it in its entirety at first.</p>
</li>
</ul>
<p>This way I quickly implemented a basic node_exporter alternative, then continued to iterate, applied it to the production environment, and completed the Prometheus monitoring stack.</p>
<p>Later, I continued to implement some small tools in Rust, learned its best practices, and studied some open-source projects implemented in Rust to deepen my Rust experience.</p>
<h2 id="heading-recommend-some-rust-learning-resources">Recommend some Rust learning resources</h2>
<p>There are many learning resources for Rust now. In addition to the ones I listed above, I recommend the following free content:</p>
<ul>
<li><p><a target="_blank" href="https://learn.microsoft.com/en-us/training/paths/rust-first-steps/">Take your first steps with Rust - Training | Microsoft Learn</a></p>
</li>
<li><p><a target="_blank" href="https://github.com/rust-lang/rustlings">rust-lang/rustlings: Small exercises to get you used to reading and writing Rust code!</a></p>
</li>
</ul>
<p>videos:</p>
<ul>
<li><p><a target="_blank" href="https://www.youtube.com/watch?v=zF34dRivLOw&amp;utm_source=blog.moelove.info&amp;utm_medium=content">Rust Crash Course | Rustlang - YouTube</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/watch?v=T_KrYLW4jw8&amp;list=PLzMcBGfZo4-nyLTlSRBvo0zjSnCnqjHYQ&amp;utm_source=blog.moelove.info&amp;utm_medium=content">Rust Tutorial - YouTube</a></p>
</li>
<li><p><a target="_blank" href="https://www.youtube.com/playlist?list=PLlrxD0HtieHjbTjrchBwOVks_sr8EVW1x&amp;utm_source=blog.moelove.info&amp;utm_medium=content">Rust for Beginners - YouTube</a></p>
</li>
</ul>
<h2 id="heading-summarize">Summary</h2>
<p>This is how my Rust journey started, and it continues.</p>
<p>Although I focus on Cloud Native and Kubernetes-related technologies and now mostly write Go, I still write some tools in Rust and use Rust for WebAssembly.</p>
<p>In the future, I will also share relevant content. If you are interested in my articles, welcome to subscribe to my Newsletter!</p>
]]></content:encoded></item><item><title><![CDATA[Opportunities and Challenges of Technological Evolution in Cloud Native]]></title><description><![CDATA[Nowadays, Cloud Native is becoming increasingly popular, and the CNCF defines Cloud Native as:

Based on a modern and dynamic environment, aka cloud environment.

With containerization as the fundamental technology, including Service Mesh, immutable ...]]></description><link>https://blog.moelove.info/opportunities-and-challenges-of-technological-evolution-in-cloud-native</link><guid isPermaLink="true">https://blog.moelove.info/opportunities-and-challenges-of-technological-evolution-in-cloud-native</guid><category><![CDATA[Cloud]]></category><category><![CDATA[cloud native]]></category><category><![CDATA[Kubernetes]]></category><category><![CDATA[#ServiceMesh]]></category><category><![CDATA[Apache APISIX]]></category><dc:creator><![CDATA[Jintao Zhang]]></dc:creator><pubDate>Thu, 15 Dec 2022 17:22:59 GMT</pubDate><content:encoded><![CDATA[<p>Nowadays, Cloud Native is becoming increasingly popular, and the CNCF defines Cloud Native as:</p>
<ul>
<li><p>Based on a modern and dynamic environment, aka cloud environment.</p>
</li>
<li><p>With containerization as the fundamental technology, including Service Mesh, immutable infrastructure, declarative API, etc.</p>
</li>
<li><p>Key features include autoscaling, manageability, observability, automation, frequent change, etc.</p>
</li>
</ul>
<p>According to the CNCF 2021 survey, there is a very significant number (over 62,000) of contributors in the Kubernetes community. With the current technology trend, more and more companies are investing in Cloud Native and moving to the cloud early. Why are companies embracing Cloud Native as they grow, and what does Cloud Native mean for them?</p>
<h2 id="heading-technical-advantages-of-cloud-native">Technical Advantages of Cloud Native</h2>
<p>The popularity of Cloud Native comes from its advantages at the technical level. Cloud Native technology has two main aspects: containerization, led by Docker, and container orchestration, led by Kubernetes.</p>
<p>Docker introduced container images to the technology world, making the container image a standardized delivery unit. In fact, containerization technology existed before Docker; take LXC (<a target="_blank" href="https://linuxcontainers.org/">Linux Containers</a>, 2008) as a relatively recent example. Compared to Docker, LXC is less popular because Docker provides container images, which are more standardized and easier to migrate. Docker also created DockerHub, a public service that has become the world's largest container image repository. In addition, containerization can achieve a certain degree of resource isolation, covering not only CPU and memory but also the network stack, which makes it easier to deploy multiple copies of an application on the same machine.</p>
<p>Kubernetes became popular due to the booming of Docker. The container orchestration technology, led by Kubernetes, provides several important capabilities, such as fault self-healing, resource scheduling, and service orchestration. Kubernetes has a built-in DNS-based service discovery mechanism, and thanks to its scheduling architecture, it can be scaled very quickly to achieve service orchestration.</p>
<p>Now more and more companies are actively embracing Kubernetes and transforming their applications to run on it. The Cloud Native we are talking about is actually premised on Kubernetes, the cornerstone of Cloud Native technology.</p>
<p><img src="https://static.apiseven.com/2022/10/01/63384eaab1218.png" alt="img1.PNG" /></p>
<h3 id="heading-containerization-advantages">Containerization Advantages</h3>
<ol>
<li>Standardized Delivery</li>
</ol>
<p>Container images have become a standardized delivery unit. With containerization technology, users can complete delivery through a container image instead of shipping binaries or source code. Thanks to the packaging mechanism of the container image, the same image starts the service and produces the same behavior in any container runtime.</p>
<ol start="2">
<li>Portable, Lightweight, and Cost-saving</li>
</ol>
<p>Containerization achieves a degree of isolation through the Linux kernel's capabilities, which in turn makes applications easier to migrate. Moreover, containers run applications directly, which is technically lighter than virtualization: there is no need for a guest OS in each virtual machine. All applications share the host kernel, which saves cost, and the larger the deployment, the greater the savings.</p>
<ol start="3">
<li>Convenient Resource Management</li>
</ol>
<p>When starting a container, you can set the CPU, memory, or disk I/O resources the container service may use, which allows for better planning and deployment of resources when starting application instances through containers.</p>
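<p>To make the planning aspect concrete, here is a minimal, purely illustrative sketch (not how kube-scheduler or any real runtime works) of checking whether a set of container resource requests fits within a node's capacity. All names and numbers are made up.</p>

```python
# Illustrative only: check whether a set of container resource requests fits
# a node's allocatable capacity, the kind of planning that resource limits
# enable. (Real schedulers such as kube-scheduler do far more than this.)
def fits(node_allocatable: dict, requests: list) -> bool:
    for resource, capacity in node_allocatable.items():
        if sum(r.get(resource, 0) for r in requests) > capacity:
            return False
    return True

node = {"cpu_millicores": 4000, "memory_mib": 8192}
pods = [{"cpu_millicores": 1500, "memory_mib": 2048},
        {"cpu_millicores": 1000, "memory_mib": 4096}]
print(fits(node, pods))  # True: 2500m CPU and 6144 MiB fit within the node
```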
<h3 id="heading-container-orchestration-advantages">Container Orchestration Advantages</h3>
<ol>
<li>Simplify the Workflow</li>
</ol>
<p>In Kubernetes, application deployment is easier to manage than in Docker because Kubernetes uses declarative configuration. For example, a user simply declares in a configuration file which container image the application uses and which service ports are exposed, and Kubernetes carries out the corresponding operations without further manual management. This greatly simplifies the workflow.</p>
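<p>As an illustration of declarative configuration, the sketch below expresses a minimal Deployment manifest as plain data: you only declare the desired image, ports, and replica count, and Kubernetes works out the operations needed to reach that state. The names and image here are hypothetical; <code>kubectl apply</code> accepts JSON as well as YAML.</p>

```python
import json

# A minimal Deployment manifest expressed as data: you declare the image and
# desired replicas; Kubernetes performs the operations to reach that state.
# (Names and image are hypothetical; kubectl accepts JSON as well as YAML.)
deployment = {
    "apiVersion": "apps/v1",
    "kind": "Deployment",
    "metadata": {"name": "web"},
    "spec": {
        "replicas": 3,
        "selector": {"matchLabels": {"app": "web"}},
        "template": {
            "metadata": {"labels": {"app": "web"}},
            "spec": {"containers": [{
                "name": "web",
                "image": "nginx:1.25",
                "ports": [{"containerPort": 80}],
            }]},
        },
    },
}
print(json.dumps(deployment, indent=2))  # feed to `kubectl apply -f -`
```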
<ol start="2">
<li>Improve Efficiency and Save Costs</li>
</ol>
<p>Another advantageous feature of Kubernetes is failover. When a node in a Kubernetes cluster crashes, Kubernetes automatically reschedules the applications on it to other healthy nodes and gets them up and running again. The entire recovery process requires no human intervention, so it not only improves operation and maintenance efficiency but also saves time and cost.</p>
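<p>This failover behavior boils down to a control loop that compares the desired state with the observed state. The toy sketch below is far simpler than any real Kubernetes controller and all names are hypothetical, but it shows the idea: pods on failed nodes are dropped and replacements are placed on healthy nodes until the desired replica count is restored.</p>

```python
# A toy sketch of the failover idea: a control loop compares desired vs.
# actually-running replicas and reschedules pods from failed nodes onto
# healthy ones. Real Kubernetes controllers are far more involved.
def reconcile(desired: int, pods: list, healthy_nodes: list) -> list:
    """Return a pod list with pods on failed nodes replaced on healthy nodes."""
    surviving = [p for p in pods if p["node"] in healthy_nodes]
    while len(surviving) < desired and healthy_nodes:
        # Place each replacement pod on the least-loaded healthy node.
        node = min(healthy_nodes, key=lambda n: sum(p["node"] == n for p in surviving))
        surviving.append({"name": f"web-{len(surviving)}", "node": node})
    return surviving

pods = [{"name": "web-0", "node": "node-a"}, {"name": "web-1", "node": "node-b"}]
print(reconcile(2, pods, ["node-a", "node-c"]))  # web-1 is recreated on node-c
```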
<p>With the rise of Docker and Kubernetes, you will see that their emergence has brought great innovation and opportunity to application delivery. Container images, as standard delivery units, shorten the delivery process and make it easier to integrate with CI/CD systems.</p>
<p>Now that application delivery is becoming faster, how is application architecture following the Cloud Native trend?</p>
<h2 id="heading-application-architecture-evolution-from-monoliths-microservice-to-service-mesh">Application Architecture Evolution: from Monoliths and Microservices to Service Mesh</h2>
<p>The evolution of application architecture starts with the monolithic architecture. As application size and requirements grew, the monolithic architecture no longer met the needs of collaborative team development, so distributed architectures were gradually introduced.</p>
<p>Among distributed architectures, the most popular is the microservice architecture. It splits a service into multiple modules that communicate with each other, handle service registration and discovery, and provide common capabilities such as rate limiting and circuit breaking.</p>
<p>In addition, a microservice architecture comes with various patterns. For example, the database-per-service pattern gives each microservice its own database; it prevents database-level problems from spreading across services, but may introduce more database instances to manage.</p>
<p>Another is the API Gateway pattern, in which a gateway receives the ingress traffic of the cluster or the whole microservice architecture and distributes it to the right APIs. This is one of the most widely used patterns, and gateway products like Spring Cloud Gateway or Apache APISIX can be applied.</p>
<p>These popular architectures are gradually extending into Cloud Native architectures. Can a microservice architecture become Cloud Native simply by building each microservice as a container image and migrating it directly to Kubernetes?</p>
<p>In theory it seems possible, but in practice there are challenges. In a Cloud Native microservice architecture, the components not only need to run in containers, but must also cover other aspects such as service registration, discovery, and configuration.</p>
<p>The migration also involves business-level transformation and adaptation, since common logic such as authentication, authorization, and observability-related capabilities (logging, monitoring, etc.) must be migrated to K8s. Therefore, moving from the original physical-machine deployment to the K8s platform is much more complex than it appears.</p>
<p>In this case, we can use the Sidecar model to abstract and simplify the scenario above.</p>
<p>Typically, the Sidecar model takes the form of a Sidecar Proxy. The architecture evolves from the left side of the diagram below to the right side by sinking generic capabilities (such as authentication, authorization, and security) into the Sidecar. As the diagram shows, this model reduces what each team must maintain from multiple components to just two things: the application and the Sidecar. At the same time, since the Sidecar itself contains the common components, the business side does not need to maintain them, which neatly solves the microservice communication problem.</p>
<p><img src="https://static.apiseven.com/2022/10/01/63384eaa17798.png" alt="img2.PNG" /></p>
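<p>The injection step itself can be pictured as a simple transformation of the pod spec. The sketch below is purely illustrative: real meshes such as Istio inject the proxy via a mutating admission webhook, and the container names and images here are hypothetical. The control plane appends a proxy container next to the business container.</p>

```python
import copy

# A toy illustration of Sidecar injection: the control plane takes a pod spec
# and appends a proxy container that handles generic concerns (auth, mTLS,
# observability) so the business container does not have to. Names/images are
# hypothetical; real injection (e.g. Istio's) uses mutating admission webhooks.
def inject_sidecar(pod_spec: dict, proxy_image: str = "proxy:latest") -> dict:
    injected = copy.deepcopy(pod_spec)  # leave the original spec untouched
    injected["containers"].append({"name": "sidecar-proxy", "image": proxy_image})
    return injected

pod = {"containers": [{"name": "app", "image": "shop-api:1.0"}]}
print([c["name"] for c in inject_sidecar(pod)["containers"]])  # ['app', 'sidecar-proxy']
```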
<p>To avoid configuring each Sidecar separately and reinventing the wheel for every microservice, the process can be implemented through a control plane, or control-plane injection, which gradually evolved into today's Service Mesh.</p>
<p>A Service Mesh usually requires two components: a control plane and a data plane. The control plane distributes configuration and executes the related logic; Istio is currently the most popular choice. On the data plane, you can choose an API gateway like Apache APISIX for traffic forwarding and service communication. Thanks to APISIX's high performance and scalability, custom requirements and custom logic are also possible. The following shows the architecture of the Service Mesh solution with Istio + APISIX.</p>
<p><img src="https://static.apiseven.com/2022/10/01/63384ea7257a2.png" alt="img3.PNG" /></p>
<p>The advantage of this solution is that when you migrate from a previous microservice architecture to a Cloud Native architecture, adopting a Service Mesh directly lets you avoid massive changes on the business side.</p>
<h2 id="heading-technical-challenges-of-cloud-native">Technical Challenges of Cloud Native</h2>
<p>The previous sections covered some of the advantages of the current Cloud Native trend at the technical level. However, every coin has two sides: while these technologies bring fresh opportunities, they also introduce new challenges.</p>
<h3 id="heading-problems-caused-by-containerization-and-k8s">Problems Caused by Containerization and K8s</h3>
<p>At the beginning of this article, we mentioned that containerization uses a shared kernel. The shared kernel makes containers lightweight but weakens isolation: if a container escape occurs, the host may be attacked. To meet these security challenges, technologies such as secure containers have been introduced.</p>
<p>In addition, although container images provide a standardized delivery method, they are vulnerable to attacks such as supply chain attacks.</p>
<p>Similarly, introducing K8s brings its own component-security challenges. More components mean a larger attack surface, as well as additional vulnerabilities in the underlying components and dependencies. At the infrastructure level, migrating from traditional physical or virtual machines to K8s involves transformation costs and extra labor for cluster data backups, periodic upgrades, and certificate renewals.</p>
<p>Also, in the Kubernetes architecture, the apiserver is the core component of the cluster and handles all internal and external traffic. To avoid perimeter security issues, protecting the apiserver becomes a key question; for example, we can use Apache APISIX to protect it.</p>
<h3 id="heading-security">Security</h3>
<p>The use of new technologies requires additional attention at the security level:</p>
<ul>
<li><p><strong>At the network security level</strong>, fine-grained traffic control can be implemented with Network Policy, or connections can be encrypted (e.g., with mTLS) to form a zero-trust network.</p>
</li>
<li><p><strong>At the data security level</strong>, K8s provides the Secret resource for handling confidential data, but it is not actually secure: the contents of a Secret are merely Base64-encoded, not encrypted, so anyone who can read the resource can decode them. This applies especially to etcd, where Secrets can be read directly by anyone with access to etcd.</p>
</li>
<li><p><strong>At the permission security level</strong>, unreasonable RBAC settings can let an attacker use an associated token to communicate with the apiserver and carry out an attack. Such permission misconfigurations are mostly seen in controller and operator scenarios.</p>
</li>
</ul>
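<p>The point about Secrets is easy to demonstrate: Base64 is an encoding, not encryption, so recovering the plaintext takes a single decode call. The value below is made up.</p>

```python
import base64

# Demonstrates why a Secret's Base64 encoding is not encryption: anyone who
# can read the resource can recover the plaintext with a single decode call.
encoded = base64.b64encode(b"s3cr3t-password").decode()  # as stored in the Secret
print(encoded)                                           # czNjcjN0LXBhc3N3b3Jk
print(base64.b64decode(encoded))                         # b's3cr3t-password'
```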
<p><img src="https://static.apiseven.com/2022/10/01/63384e9f5ca7b.png" alt="img4.png" /></p>
<h3 id="heading-observability">Observability</h3>
<p>Most Cloud Native scenarios involve observability-related operations such as logging and monitoring.</p>
<p>In K8s, the common way to collect logs is to aggregate them directly on each K8s node. For logs to be collected this way, applications need to write them to standard output or standard error.</p>
<p>However, if the business makes no changes and keeps writing all application logs to files inside the container, then each instance needs a Sidecar for log collection, which makes the deployment architecture extremely complex.</p>
<p>Back at the architecture governance level, selecting a monitoring solution in a Cloud Native environment also poses challenges. A wrong selection makes the subsequent cost of use very high, and the losses can be huge if the direction is wrong.</p>
<p>There are also capacity issues at the monitoring level. When deploying an application in K8s, you can simply configure resource limits to cap the resources the application can use. However, in a K8s environment it is still rather easy to over-commit resources, over-utilize them, and run out of memory under these conditions.</p>
<p>In addition, when the entire cluster or a node runs out of resources, K8s evicts workloads, meaning pods already running on a node are evicted and rescheduled to other nodes. If a cluster's resources are tight, such a node storm can easily cause the entire cluster to crash.</p>
<h3 id="heading-application-evolution-and-multi-cluster-pattern">Application Evolution and Multi-cluster Pattern</h3>
<p>At the application architecture evolution level, the core issue is service discovery.</p>
<p>K8s provides a DNS-based service discovery mechanism by default, but if the business mixes cloud workloads with existing legacy workloads, handling that situation with a DNS-based service discovery mechanism becomes more complicated.</p>
<p>Meanwhile, for enterprises that choose Cloud Native technology, business growth gradually pushes them toward multi-node and then multi-cluster deployments, which raises multi-cluster issues.</p>
<p>For example, when we want to offer customers higher availability through multiple clusters, we face the orchestration of services across clusters, multi-cluster load distribution and configuration synchronization, and deployment strategies for clusters in multi-cloud and hybrid cloud scenarios. These are some of the challenges ahead.</p>
<p>In general, the evolution of architecture and technology in the Cloud Native era brings us both opportunities and challenges.</p>
]]></content:encoded></item></channel></rss>